Commit 5fe851b6 authored Jul 02, 2019 by Matt Arsenault

AMDGPU: Custom lower vector_shuffle for v4i16/v4f16

Ordinarily it is lowered as a build_vector of each extract_vector_elt,
which in turn get lowered to bitcasts and bit shifts. Very little
understands the lowered extract pattern, resulting in much worse
code. We treat concat_vectors of v2i16 as legal, so prefer that.
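
As a rough illustration only (a standalone model, not the code in this
patch), the idea can be sketched as splitting the 4-element mask into two
pairs and checking whether each pair reads one aligned v2i16 from a source.
The names pickElt, pairIsContiguous, shuffleV4 and ShuffleResult are
hypothetical, and the alignment rule (even start index, consecutive
elements) is an assumption about what makes a pair usable as a whole
subvector. Undef mask entries are not modeled.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using V4 = std::array<uint16_t, 4>; // models a v4i16 value

// Hypothetical helper: read element Idx of the 8-element concatenation
// Lhs:Rhs, the way a two-operand shuffle mask indexes its sources.
static uint16_t pickElt(const V4 &Lhs, const V4 &Rhs, int Idx) {
  assert(Idx >= 0 && Idx < 8 && "undef mask entries are not modeled");
  return Idx < 4 ? Lhs[Idx] : Rhs[Idx - 4];
}

// Hypothetical helper: true if the mask pair starting at Elt reads two
// consecutive elements beginning at an even index, i.e. one aligned
// v2i16 that never straddles the two sources (assumed rule).
static bool pairIsContiguous(const std::array<int, 4> &Mask, int Elt) {
  return Mask[Elt] % 2 == 0 && Mask[Elt + 1] == Mask[Elt] + 1;
}

struct ShuffleResult {
  V4 Value;            // the shuffled elements
  int SubvectorHalves; // halves that could be taken as a whole v2i16
};

// Model of the shuffle: each half of the result is either one whole
// v2i16 piece of a source or is assembled from two separate elements.
// The result halves are then joined, like a concat_vectors of v2i16.
static ShuffleResult shuffleV4(const V4 &Lhs, const V4 &Rhs,
                               const std::array<int, 4> &Mask) {
  ShuffleResult R{{}, 0};
  for (int Elt = 0; Elt < 4; Elt += 2) {
    if (pairIsContiguous(Mask, Elt))
      ++R.SubvectorHalves; // this half needs no per-element extract
    // The values are the same either way; only the operation used to
    // produce the half differs.
    R.Value[Elt] = pickElt(Lhs, Rhs, Mask[Elt]);
    R.Value[Elt + 1] = pickElt(Lhs, Rhs, Mask[Elt + 1]);
  }
  return R;
}
```

In this model a mask such as <0,1,6,7> yields two whole-subvector halves,
while <1,2,6,7> yields one, with the first half built from individual
elements.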

llvm-svn: 364959

parent e6768d61
