Commit def62697 authored May 25, 2021 by Simon Pilgrim

[CostModel][X86] Improve accuracy of 256-bit non-uniform vector shifts on AVX1

As determined from llvm-mca analysis, AVX1-capable targets have a higher throughput for VPBLENDVB and shuffle ops, making it cheaper to perform shift+shuffle/select shift patterns.
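
For readers unfamiliar with the term, a "non-uniform" shift is one where each vector lane has its own shift amount. The scalar C++ sketch below illustrates the general shift+select idea under stated assumptions: 16 lanes of 16 bits (256 bits total), left shifts only, and shift amounts below 16. The names `Vec`, `kLanes`, `shl_reference` and `shl_shift_select` are hypothetical and do not come from this commit or from LLVM. It is not the X86 backend's actual lowering, only a model of why such a sequence is built from uniform shifts plus per-lane selects (the role a blend such as VPBLENDVB would play).

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Assumption for illustration: 16 x 16-bit lanes, i.e. one 256-bit vector.
constexpr std::size_t kLanes = 16;
using Vec = std::array<std::uint16_t, kLanes>;

// Reference semantics: lane i is shifted left by its own amount amt[i].
// Amounts are assumed to be < 16.
Vec shl_reference(const Vec& v, const Vec& amt) {
    Vec out{};
    for (std::size_t i = 0; i < kLanes; ++i)
        out[i] = static_cast<std::uint16_t>(v[i] << amt[i]);
    return out;
}

// Shift+select model: for each bit of the shift amount, shift every lane by
// the same uniform amount (1, 2, 4, 8), then keep the shifted value only in
// lanes whose amount has that bit set.
Vec shl_shift_select(const Vec& v, const Vec& amt) {
    Vec cur = v;
    for (unsigned bit = 0; bit < 4; ++bit) {
        const unsigned step = 1u << bit;
        Vec shifted{};
        for (std::size_t i = 0; i < kLanes; ++i)  // uniform shift of all lanes
            shifted[i] = static_cast<std::uint16_t>(cur[i] << step);
        for (std::size_t i = 0; i < kLanes; ++i)  // per-lane select (blend)
            cur[i] = ((amt[i] >> bit) & 1u) ? shifted[i] : cur[i];
    }
    return cur;
}
```

In this model the cost of the sequence is dominated by the select steps, which is why a higher blend/shuffle throughput would make the pattern cheaper in a cost model.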

parent 8e83ff58
