Skip to content
  • Elena Demikhovsky's avatar
    I optimized the following patterns: · 0ccdd131
    Elena Demikhovsky authored
     sext <4 x i1> to <4 x i64>
     sext <4 x i8> to <4 x i64>
     sext <4 x i16> to <4 x i64>
     
    I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns:
     (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT)))
     
     The sext_in_reg (v4i32 x) may be lowered to shl+sar operations.
     The "sar" does not exist on 64-bit operation, so lowering sext_in_reg (v4i64 x) has no vector solution.
    
    I also added a cost of this operations to the AVX costs table.
    
    llvm-svn: 175619
    0ccdd131
Loading