Skip to content
  1. Aug 06, 2013
  2. Aug 05, 2013
  3. Aug 04, 2013
    • Benjamin Kramer's avatar
      X86: Turn fp selects into mask operations. · 5bc180c1
      Benjamin Kramer authored
      double test(double a, double b, double c, double d) { return a<b ? c : d; }
      
      before:
      _test:
      	ucomisd	%xmm0, %xmm1
      	ja	LBB0_2
      	movaps	%xmm3, %xmm2
      LBB0_2:
      	movaps	%xmm2, %xmm0
      
      after:
      _test:
      	cmpltsd	%xmm1, %xmm0
      	andpd	%xmm0, %xmm2
      	andnpd	%xmm3, %xmm0
      	orpd	%xmm2, %xmm0
      
      Small speedup on Benchmarks/SmallPT
      
      llvm-svn: 187706
      5bc180c1
    • Tim Northover's avatar
      X86: correct tail return address calculation · ecc018c7
      Tim Northover authored
      Due to the weird and wondeful usual arithmetic conversions, some
      calculations involving negative values were getting performed in
      uint32_t and then promoted to int64_t, which is really not a good
      idea.
      
      Patch by Katsuhiro Ueno.
      
      llvm-svn: 187703
      ecc018c7
  4. Aug 01, 2013
  5. Jul 31, 2013
  6. Jul 29, 2013
  7. Jul 26, 2013
  8. Jul 24, 2013
  9. Jul 16, 2013
    • Juergen Ributzka's avatar
      [X86] Use min/max to optimze unsigend vector comparison on X86 · 3d527d80
      Juergen Ributzka authored
      Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required
      instructions. This trick also works for UGT/ULT, but there is no advantage in
      doing so. It wouldn't reduce the number of instructions and it would actually
      reduce performance.
      
      Reviewer: Ben
      
      radar:5972691
      
      llvm-svn: 186432
      3d527d80
  10. Jul 15, 2013
  11. Jul 14, 2013
  12. Jul 12, 2013
  13. Jul 09, 2013
    • Stephen Lin's avatar
      AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all · 73de7bf5
      Stephen Lin authored
      in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in
      order to resolve the following issues with fmuladd (i.e. optional FMA)
      intrinsics:
      
      1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd
      intrinsics even if the subtarget does not support FMA instructions, leading
      to laughably bad code generation in some situations.
      
      2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128,
      resulting in a call to a software fp128 FMA implementation.
      
      3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types
      like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize,
      etc. to types that support hardware FMAs.
      
      The function has also been slightly renamed for consistency and to force a
      merge/build conflict for any out-of-tree target implementing it. To resolve,
      see comments and fixed in-tree examples.
      
      llvm-svn: 185956
      73de7bf5
  14. Jul 08, 2013
  15. Jul 07, 2013
  16. Jul 06, 2013
  17. Jul 04, 2013
  18. Jul 03, 2013
  19. Jun 26, 2013
  20. Jun 22, 2013
  21. Jun 07, 2013
  22. May 30, 2013
    • Andrew Trick's avatar
      Order CALLSEQ_START and CALLSEQ_END nodes. · ad6d08ac
      Andrew Trick authored
      Fixes PR16146: gdb.base__call-ar-st.exp fails after
      pre-RA-sched=source fixes.
      
      Patch by Xiaoyi Guo!
      
      This also fixes an unsupported dbg.value test case. Codegen was
      previously incorrect but the test was passing by luck.
      
      llvm-svn: 182885
      ad6d08ac
  23. May 25, 2013
  24. May 22, 2013
  25. May 21, 2013
  26. May 18, 2013
Loading