[X86][SSE] Combine v16i8 SHL by constants to multiplies (2eced71e) · Commits · Lorenzo Albano / LLVM bpEVL

Commit 2eced71e authored Jul 08, 2018 by Simon Pilgrim

[X86][SSE] Combine v16i8 SHL by constants to multiplies

Pre-AVX512 (which can perform a quick extend/shift/truncate), extending to 2 v8i16 for the PMULLW and then truncating is more performant than relying on the generic PBLENDVB vXi8 shift path and uses a similar amount of mask constant pool data.

Differential Revision: https://reviews.llvm.org/D48963

llvm-svn: 336513

parent 1795870b

Hide whitespace changes

Inline Side-by-side

Please register or to comment