Skip to content
  1. Oct 18, 2012
  2. Oct 17, 2012
  3. Oct 16, 2012
    • Michael Gottesman's avatar
      [InstCombine] Teach InstCombine how to handle an obfuscated splat. · 02a1141e
      Michael Gottesman authored
      An obfuscated splat is where the frontend poorly generates code for a splat
      using several different shuffles to create the splat, i.e.,
      
        %A = load <4 x float>* %in_ptr, align 16
        %B = shufflevector <4 x float> %A, <4 x float> undef, <4 x i32> <i32 0, i32 0, i32 undef, i32 undef>
        %C = shufflevector <4 x float> %B, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 4, i32 undef>
        %D = shufflevector <4 x float> %C, <4 x float> %A, <4 x i32> <i32 0, i32 1, i32 2, i32 4>
      
      llvm-svn: 166061
      02a1141e
    • Chad Rosier's avatar
    • Jakub Staszak's avatar
      Simplify code. No functionality change. · 8f46e914
      Jakub Staszak authored
      llvm-svn: 166053
      8f46e914
    • Michael Liao's avatar
      Check .rela instead of ELF64 for the compensation vaue resetting · d6f3168a
      Michael Liao authored
      llvm-svn: 166051
      d6f3168a
    • Jakub Staszak's avatar
      80-col fixup. · 25dcab1e
      Jakub Staszak authored
      llvm-svn: 166050
      25dcab1e
    • Michael Liao's avatar
      Teach DAG combine to fold (trunc (fptoXi x)) to (fptoXi x) · 19006206
      Michael Liao authored
      llvm-svn: 166049
      19006206
    • Rafael Espindola's avatar
      Switch back to the old coalescer for now to fix the 32 bit bit · b58be2c5
      Rafael Espindola authored
      llvm+clang+compiler-rt bootstrap.
      
      llvm-svn: 166046
      b58be2c5
    • Jakub Staszak's avatar
    • Michael Liao's avatar
      Support v8f32 to v8i8/vi816 conversion through custom lowering · 02ca3454
      Michael Liao authored
      - Add custom FP_TO_SINT on v8i16 (and v8i8 which is legalized as v8i16 due to
        vector element-wise widening) to reduce DAG combiner and its overhead added
        in X86 backend.
      
      llvm-svn: 166036
      02ca3454
    • Bill Schmidt's avatar
      This patch addresses PR13949. · 48081cad
      Bill Schmidt authored
      For the PowerPC 64-bit ELF Linux ABI, aggregates of size less than 8
      bytes are to be passed in the low-order bits ("right-adjusted") of the
      doubleword register or memory slot assigned to them.  A previous patch
      addressed this for aggregates passed in registers.  However, small
      aggregates passed in the overflow portion of the parameter save area are
      still being passed left-adjusted.
      
      The fix is made in PPCTargetLowering::LowerCall_Darwin_Or_64SVR4 on the
      caller side, and in PPCTargetLowering::LowerFormalArguments_64SVR4 on
      the callee side.  The main fix on the callee side simply extends
      existing logic for 1- and 2-byte objects to 1- through 7-byte objects,
      and correcting a constant left over from 32-bit code.  There is also a
      fix to a bogus calculation of the offset to the following argument in
      the parameter save area.
      
      On the caller side, again a constant left over from 32-bit code is
      fixed.  Additionally, some code for 1, 2, and 4-byte objects is
      duplicated to handle the 3, 5, 6, and 7-byte objects for SVR4 only.  The
      LowerCall_Darwin_Or_64SVR4 logic is getting fairly convoluted trying to
      handle both ABIs, and I propose to separate this into two functions in a
      future patch, at which time the duplication can be removed.
      
      The patch adds a new test (structsinmem.ll) to demonstrate correct
      passing of structures of all seven sizes.  Eight dummy parameters are
      used to force these structures to be in the overflow portion of the
      parameter save area.
      
      As a side effect, this corrects the case when aggregates passed in
      registers are saved into the first eight doublewords of the parameter
      save area:  Previously they were stored left-justified, and now are
      properly stored right-justified.  This requires changing the expected
      output of existing test case structsinregs.ll.
      
      llvm-svn: 166022
      48081cad
Loading