X86: When lowering v8i32 himuls use the correct shuffle masks for AVX2. (d6f1733a) · Commits · Lorenzo Albano / LLVM bpEVL

Commit d6f1733a authored Jul 09, 2014 by Benjamin Kramer

X86: When lowering v8i32 himuls use the correct shuffle masks for AVX2.

Turns out my trick of using the same masks for SSE4.1 and AVX2 didn't work out
as we have to blend two vectors. While there remove unecessary cross-lane moves
from the shuffles so the backend can lower it to palignr instead of vperm.

Fixes PR20118, a miscompilation of vector sdiv by constant on AVX2.

llvm-svn: 212611

parent afe4b250

Hide whitespace changes

Inline Side-by-side

Please register or to comment