Skip to content
Commit f56a5751 authored by Craig Topper's avatar Craig Topper
Browse files

[X86] Don't use a pmaddwd for vXi32 multiply if the inputs are zero extends...

[X86] Don't use a pmaddwd for vXi32 multiply if the inputs are zero extends from i8 or smaller without SSE4.1. Prefer to shrink the mul instead.

The zero extend will require two stages of unpacks to implement. So its better to shrink the multiply using pmullw and then extend that result back to v4i32 using a single unpack.

llvm-svn: 347149
parent ab778149
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment