Commit 35625464 authored Jan 29, 2020 by Craig Topper

[X86] Fix the cost model for v16i16->v16i32 zero_extend/sign_extend with AVX2

We seem to be inheriting the cost from sse4.1. But if we have 256-bit registers we should be able to do this with just one extract to split the 16i16 and two v8i16->v8i32 operations so our cost should be 3 not 4.

Differential Revision: https://reviews.llvm.org/D73646

parent 228ea1a4

Show whitespace changes

Inline Side-by-side

Please to comment