  1. Sep 21, 2011
  2. Sep 20, 2011
  3. Sep 19, 2011
  4. Sep 15, 2011
  5. Sep 14, 2011
  6. Sep 13, 2011
  7. Sep 12, 2011
  8. Sep 10, 2011
    • [disable-iv-rewrite] Allow WidenIV to handle NSW/NUW operations better. · c7868bf0
      Andrew Trick authored
      
      Don't immediately give up when an add operation can't be trivially
      sign/zero-extended within a loop. If it has NSW/NUW flags, generate a
      new expression with sign extended (non-recurrent) operand. As before,
      if SCEV says that all sign extends are loop invariant, then we can
      widen the operation.
      
      llvm-svn: 139453
      c7868bf0
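The reason the NSW flag makes widening safe can be checked with a small model. The sketch below is not LLVM code; it assumes 8-bit narrow values, and the helper names (`sext`, `add_wraps`, `widening_is_exact`) are hypothetical. It shows that sign-extending the result of a narrow add matches adding the sign-extended operands exactly when the narrow add does not overflow, which is the property an NSW flag asserts.

```python
def sext(value, from_bits):
    """Interpret the low `from_bits` bits of `value` as a signed integer."""
    value &= (1 << from_bits) - 1
    sign = 1 << (from_bits - 1)
    return value - (1 << from_bits) if value & sign else value


def add_wraps(a, b, bits):
    """Two's-complement add in `bits` bits.

    Returns (wrapped_result, signed_overflow_happened).
    """
    exact = a + b
    wrapped = sext(exact, bits)
    return wrapped, wrapped != exact


def widening_is_exact(a, b, narrow_bits=8):
    """True when sext(a + b) equals sext(a) + sext(b).

    `a` and `b` are signed values that fit in `narrow_bits` bits, so the
    Python integers already play the role of the sign-extended operands.
    """
    narrow_sum, _ = add_wraps(a, b, narrow_bits)
    wide_sum = a + b
    return narrow_sum == wide_sum
```

With 8-bit operands, `widening_is_exact(100, 20)` holds because 120 fits in a signed byte, while `widening_is_exact(100, 100)` fails because the narrow add wraps.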
  9. Sep 09, 2011
  10. Sep 06, 2011
    • Andrew Trick's avatar
      1eee7f12
    • Use IRBuilder. · c10e52a0
      Devang Patel authored
      llvm-svn: 139156
      c10e52a0
    • Try again at r138809 (make DSE more aggressive in removing dead stores at the end of a function), now with less deleting stores before memcpy's. · 58704ee4
      Owen Anderson authored
      
      llvm-svn: 139150
      58704ee4
    • Split the init.trampoline intrinsic, which currently combines GCC's init.trampoline and adjust.trampoline intrinsics, into two intrinsics like in GCC. · a098436b
      Duncan Sands authored
      While having one combined intrinsic is tempting, it is
      not natural because typically the trampoline initialization needs to
      be done in one function, and the result of adjust trampoline is needed
      in a different (nested) function.  To get around this, llvm-gcc hacks the
      nested function lowering code to insert an additional parent variable
      holding the adjust.trampoline result that can be accessed from the child
      function.  Dragonegg doesn't have the luxury of tweaking GCC code, so it
      stored the result of adjust.trampoline in the memory GCC set aside for
      the trampoline itself (this is always available in the child function),
      and set up some new memory (using an alloca) to hold the trampoline.
      Unfortunately, this breaks Go, which allocates trampoline memory on the
      heap and wants to use it even after the parent has exited (!).  Rather
      than doing even more hacks to get Go working, it seemed best to just use
      two intrinsics like in GCC.  Patch mostly by Sanjoy Das.
      
      llvm-svn: 139140
      a098436b
  11. Sep 05, 2011
  12. Sep 04, 2011
  13. Sep 02, 2011
    • Update comments to reflect reality. · a336e705
      Bill Wendling authored
      llvm-svn: 139023
      a336e705
    • Enable SCEV-based unrolling by default. · 31b941a6
      Andrew Trick authored
      This changes loop unrolling to use the same mechanism for trip count
      computation as indvars. This is a stronger check that tends to unroll
      more loops. A very common side-effect is that many single iteration
      loops will be removed sooner. The real goal was simply to remove
      dependence on canonical IVs.
      
      x86 is break even.
      ARM performance changes to expect (+ is good):
      External/SPEC/CFP2000/183.equake/183.equake +13%
      SingleSource/Benchmarks/Dhrystone/fldry     +21%
      MultiSource/Applications/spiff/spiff         +3%
      SingleSource/Benchmarks/Stanford/Puzzle     -14%
      
      The Puzzle regression is actually an improvement in loop optimization
      that defeats GVN: rdar://problem/10065079.
      
      llvm-svn: 139009
      31b941a6
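To illustrate why a trip count need not depend on a canonical induction variable (one that starts at 0 and steps by 1), here is a simplified sketch. It is not SCEV's algorithm; it assumes a loop of the form `i = start; while i < end: i += step` with a positive step and no overflow, and the function names are hypothetical. The count comes from a closed form over the start, bound, and stride, and is checked against a direct simulation.

```python
def trip_count(start, end, step):
    """Closed-form iteration count of: i = start; while i < end: i += step."""
    if step <= 0:
        raise ValueError("this sketch only handles positive steps")
    if start >= end:
        return 0  # the loop body never runs
    # Ceiling division of the distance by the stride.
    return (end - start + step - 1) // step


def simulate(start, end, step):
    """Count iterations by actually running the loop."""
    count = 0
    i = start
    while i < end:
        count += 1
        i += step
    return count
```

For example, `trip_count(3, 20, 4)` gives 5 without rewriting the loop to count from zero, and `trip_count(5, 6, 1)` gives 1, the single-iteration case that the commit message says tends to be removed sooner.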
    • Compare type size instead of type _store_ size to make sure that BitCastInst will be valid. · 7470fb01
      Jakub Staszak authored
      This fixes PR10820.
      
      llvm-svn: 139005
      7470fb01
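The difference between the two sizes can be seen in a small model of integer types. This sketch does not call LLVM; it assumes that a type's store size is its bit width rounded up to whole bytes and that a bitcast requires identical bit widths, and the function names are hypothetical stand-ins. Under those assumptions, a store-size comparison accepts pairs such as i1 and i8 that a bitcast cannot legally connect.

```python
def type_size_in_bits(int_width):
    """Bit width of an integer type iN."""
    return int_width


def type_store_size_in_bits(int_width):
    """Bits written by a store of iN: the width rounded up to whole bytes."""
    return ((int_width + 7) // 8) * 8


def same_store_size(src_width, dst_width):
    """The weaker check: compares store sizes only."""
    return type_store_size_in_bits(src_width) == type_store_size_in_bits(dst_width)


def bitcast_is_valid(src_width, dst_width):
    """The stricter check: a bitcast needs identical bit widths."""
    return type_size_in_bits(src_width) == type_size_in_bits(dst_width)
```

Here `same_store_size(1, 8)` is true because both types occupy one byte when stored, yet `bitcast_is_valid(1, 8)` is false, so comparing type sizes is the check that rules out the invalid cast.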