[x86] split more v8f32/v8i32 shuffles in lowering (fff62827) · Commits · Roger Ferrer / llvm-epi

Commit fff62827 authored Feb 18, 2019 by Sanjay Patel

[x86] split more v8f32/v8i32 shuffles in lowering

Similar to D57867 - this is a small patch with lots of test diffs.
With half-vector-width narrowing potential, using an extract + 128-bit vshufps
is a win because it replaces a 256-bit shuffle with a 128-bit shufle.

This seems like it should be a win even for targets with 'fast-variable-shuffle',
but we are intentionally deferring that to an independent change to make sure
that is true.

Differential Revision: https://reviews.llvm.org/D58181

llvm-svn: 354279

parent 9d800a13

Expand all Hide whitespace changes

Inline Side-by-side

Please register or to comment