Commits · f4f47ccd1299a83c62680a4871cc46ffec9fa2c4 · Roger Ferrer / llvm-epi-0.8

Oct 05, 2011

GVN does simple propagation of conditions: when it sees a conditional · f4f47ccd

Duncan Sands authored Oct 05, 2011

branch "br i1 %x, label %if_true, label %if_false" then it replaces
"%x" with "true" in places only reachable via the %if_true arm, and
with "false" in places only reachable via the %if_false arm. Except
that actually it doesn't: if value numbering shows that %y is equal
to %x then, yes, %y will be turned into true/false in this way, but
any occurrences of %x itself are not transformed. Fix this. What's
more, it's often the case that %x is an equality comparison such as
"%x = icmp eq %A, 0", in which case every occurrence of %A that is
only reachable via the %if_true arm can be replaced with 0. Implement
this and a few other variations on this theme. This reduces the number
of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger
impact on Ada code, typically reducing the number of lines of bitcode
by around 0.4% by removing repeated compiler generated checks. Passes
the LLVM nightly testsuite and the Ada ACATS testsuite.

llvm-svn: 141177
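
The propagation described in this message can be pictured with a small toy model. The Python below is only a sketch under stated assumptions, not the GVN implementation: values are plain strings, constants are Python ints, and the helper names `facts_on_edge` and `rewrite` are hypothetical. The `and`/`or` cases are a guess at the "variations on this theme"; the message does not list them. The substitutions are valid only in code reachable solely through the chosen arm.

```python
# Toy model of branch-condition propagation; not LLVM's API.
# `definitions` maps a value name to (opcode, lhs, rhs).

def facts_on_edge(cond, definitions, taken):
    """Return {name: replacement} valid where `cond` is known to equal `taken`."""
    facts = {}
    worklist = [(cond, taken)]
    while worklist:
        name, value = worklist.pop()
        if name in facts:
            continue
        facts[name] = value
        if name not in definitions:
            continue
        opcode, lhs, rhs = definitions[name]
        # "icmp eq" being true, or "icmp ne" being false, means lhs == rhs.
        equal_known = (opcode == "icmp eq" and value is True) or (
            opcode == "icmp ne" and value is False)
        if equal_known and isinstance(rhs, int):
            facts[lhs] = rhs
        elif (opcode, value) in (("and", True), ("or", False)):
            # Assumed variation: a true "and" (false "or") fixes both operands.
            for operand in (lhs, rhs):
                if isinstance(operand, str):
                    worklist.append((operand, value))
    return facts


def rewrite(block, facts):
    """Substitute known facts into the operands of (result, opcode, operands)."""
    return [(res, op, tuple(facts.get(x, x) for x in operands))
            for res, op, operands in block]


defs = {"%x": ("icmp eq", "%A", 0)}
true_arm = [("%r", "add", ("%A", "%B")), ("%s", "select", ("%x", "%r", "%B"))]
print(rewrite(true_arm, facts_on_edge("%x", defs, True)))
```

On the false arm of the same branch, only `%x` itself gets a replacement, because a failed equality says nothing about the value of `%A`.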

Generalize GVN's conditional propagation logic slightly: · e90dd058

Duncan Sands authored Oct 05, 2011

it's OK for the false/true destination to have multiple
predecessors as long as the extra ones are dominated by
the branch destination.

llvm-svn: 141176
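
A rough way to picture the relaxed condition: the fact learned on a branch edge still holds in the destination block if every other predecessor of that block is itself dominated by the destination (for example, a loop back edge). The sketch below is an assumption-laden toy, not the code from this commit: the CFG is a plain dict, a simple iterative dominator computation stands in for LLVM's dominator tree, and the function names are hypothetical.

```python
# Toy CFG check; `cfg` maps each block name to its list of successors.

def dominators(cfg, entry):
    """Return (dom, preds): dominator sets and predecessor lists per block."""
    blocks = set(cfg)
    preds = {b: [] for b in blocks}
    for block, successors in cfg.items():
        for succ in successors:
            preds[succ].append(block)
    dom = {b: set(blocks) for b in blocks}
    dom[entry] = {entry}
    changed = True
    while changed:
        changed = False
        for b in blocks - {entry}:
            incoming = [dom[p] for p in preds[b]]
            new = {b} | (set.intersection(*incoming) if incoming else set())
            if new != dom[b]:
                dom[b] = new
                changed = True
    return dom, preds


def condition_holds_in(cfg, entry, src, dst):
    """True if a fact known on edge src->dst can be assumed throughout dst."""
    if cfg[src].count(dst) != 1:
        return False  # both arms reach dst, so the edge tells us nothing
    dom, preds = dominators(cfg, entry)
    return all(p == src or dst in dom[p] for p in preds[dst])


cfg = {
    "entry": ["if_true", "if_false"],
    "if_true": ["body"],
    "body": ["if_true", "exit"],   # back edge into if_true
    "if_false": ["exit"],
    "exit": [],
}
print(condition_holds_in(cfg, "entry", "entry", "if_true"))
print(condition_holds_in(cfg, "entry", "if_false", "exit"))
```

Here `if_true` has an extra predecessor (`body`), but `body` can only be reached through `if_true`, so the branch condition is still known there. The `exit` block is reachable from both arms, so nothing can be assumed.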

Oct 04, 2011

LSR should avoid redundant edge splitting. · 8de329a9

Andrew Trick authored Oct 04, 2011

This handles the case in which LSR rewrites an IV user that is a phi and
splits critical edges originating from a switch.
Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely"

llvm-svn: 141059

Oct 01, 2011

Inlining and unrolling heuristics should be aware of free truncs. · f7656015

Andrew Trick authored Oct 01, 2011

We want heuristics to be based on accurate data, but more importantly
we don't want llvm to behave randomly. A benign trunc inserted by an
upstream pass should not cause wild swings in optimization
level. See PR11034. It's a general problem with threshold-based
heuristics, but we can make it less bad.

llvm-svn: 140919
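
The instability this message describes is easy to reproduce with a toy threshold heuristic. The sketch below is hypothetical and much simpler than LLVM's cost code: instructions are just opcode strings, every `trunc` is assumed to be free, and the names `estimated_size` and `should_unroll` are made up for illustration.

```python
# Toy size-based heuristic; opcode names and thresholds are illustrative only.

FREE_OPCODES = frozenset({"trunc"})  # assumption: all truncs cost nothing


def estimated_size(instructions, free=FREE_OPCODES):
    """Count the instructions that are expected to cost something."""
    return sum(1 for opcode in instructions if opcode not in free)


def should_unroll(body, trip_count, threshold, free=FREE_OPCODES):
    """Unroll only if the fully unrolled size stays within the threshold."""
    return estimated_size(body, free) * trip_count <= threshold


body = ["load", "trunc", "add", "store"]
print(should_unroll(body, trip_count=4, threshold=12))
print(should_unroll(body, trip_count=4, threshold=12, free=frozenset()))
```

With the `trunc` counted, the same loop tips over the threshold, which is the kind of swing the change is meant to reduce.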

Sep 30, 2011
- Fold two identical set lookups into one. No functionality change. · a3e7ffda
  Nick Lewycky authored Sep 29, 2011
```
llvm-svn: 140821
```
- When eliminating unnecessary retain+autorelease on return values, · 4ac148dc
  Dan Gohman authored Sep 29, 2011
```
handle the case where the retain is in a different basic block.
rdar://10210274.

llvm-svn: 140815
```
- Don't eliminate objc_retainBlock calls on stack objects if the · 2053a5dd
  Dan Gohman authored Sep 29, 2011
```
objc_retainBlock call is potentially responsible for copying
the block to the heap to extend its lifetime. rdar://10209613.

llvm-svn: 140814
```
Sep 29, 2011

typo + pasto · 168dfffd
Andrew Trick authored Sep 29, 2011
```
llvm-svn: 140769
```

LSR: rewrite inner loops only. · bc6de90a

Andrew Trick authored Sep 29, 2011

Rewriting the entire loop nest now requires -enable-lsr-nested.
See PR11035 for some performance data.
A few unit tests specifically test nested LSR, and are now under a flag.

llvm-svn: 140762

Sep 28, 2011
- indvars should hoist [sz]ext because licm is not rerun. · e0e30532
  Andrew Trick authored Sep 28, 2011
```
llvm-svn: 140670
```
Sep 27, 2011
- Stop emitting instructions with the name "tmp" they eat up memory and have to... · 547b6c5e
  Benjamin Kramer authored Sep 27, 2011
```
Stop emitting instructions with the name "tmp": they eat up memory and have to be uniqued, without any benefit.

If someone prefers %tmp42 to %42, run instnamer.

llvm-svn: 140634
```
- Split the landing pad basic block with the correct function. Also merge the · 90f90da1
  Bill Wendling authored Sep 27, 2011
```
split landingpad instructions into a PHI node.
PR11016

llvm-svn: 140592
```
- Disable LSR retry by default. · 58124391
  Andrew Trick authored Sep 27, 2011
```
Disabling aggressive LSR saves compilation time, and with the new
indvars behavior usually improves performance.

llvm-svn: 140590
```
- LSR, one of the new Cost::isLoser() checks did not get merged in the previous checkin. · 8868faec
  Andrew Trick authored Sep 26, 2011
```
llvm-svn: 140583
```
- LSR cost metric minor fix and verification. · 784729d4
  Andrew Trick authored Sep 26, 2011
```
The minor heuristic bug was noticed by inspection. I added the
isLoser/isValid helpers because they will become more
important with subsequent checkins.

llvm-svn: 140580
```
Sep 24, 2011

LSR minor bug fix in RateRegister. · 8b2fe2f7

Andrew Trick authored Sep 23, 2011

No test case. Noticed by inspection and I doubt it ever affects the
outcome of the overall heuristic, let alone final codegen.

llvm-svn: 140431

Sep 22, 2011
- PR10987: add a missed safety check to isSafePHIToSpeculate in scalarrepl. · f9b785f1
  Eli Friedman authored Sep 22, 2011
```
llvm-svn: 140327
```
Sep 21, 2011

Make sure IPSCCP never marks a tracked call as overdefined in... · 1815b688

Eli Friedman authored Sep 20, 2011

Make sure IPSCCP never marks a tracked call as overdefined in SCCPSolver::ResolvedUndefsIn.  If we do, we can end up in a situation where a function is resolved to return a constant, but the caller is marked overdefined, which confuses the code later.

<rdar://problem/9956541> (again).

llvm-svn: 140210

Sep 15, 2011
- Reapply r139759. Disable IV rewriting by default. See PR10916. · 74111ee0
  Andrew Trick authored Sep 15, 2011
```
llvm-svn: 139842
```
Sep 14, 2011
- Don't mark objc_retainBlock as nounwind. It calls user copy constructors · fca43c21
  Dan Gohman authored Sep 14, 2011
```
which could theoretically throw.

llvm-svn: 139710
```
- objc_retainBlock is not NoModRef because it can update forwarding pointers · d4b5e3a4
  Dan Gohman authored Sep 14, 2011
```
in memory relevant to the optimizer. rdar://10050579.

llvm-svn: 139708
```
Sep 13, 2011
- [indvars] Revert r139579 until 401.bzip -arch i386 miscompilation is fixed. PR10920. · f9f68b81
  Andrew Trick authored Sep 13, 2011
```
llvm-svn: 139583
```
- Disable IV rewriting by default. See PR10916. · 061d811c
  Andrew Trick authored Sep 13, 2011
```
llvm-svn: 139579
```
- [indvars] Fix bugs in floating point IV range checks noticed by inspection. · 3de5b8e4
  Andrew Trick authored Sep 13, 2011
```
llvm-svn: 139574
```
- Add comment to clarify the behavior of a helper in DSE. · 72a93e5e
  Eli Friedman authored Sep 13, 2011
```
llvm-svn: 139571
```
- Correct grammar. · a93ab13e
  Eli Friedman authored Sep 13, 2011
```
llvm-svn: 139565
```
Sep 12, 2011

Change a bunch of isVolatile() checks to check for atomic load/store as well. · 7c5dc122

Eli Friedman authored Sep 12, 2011

No tests; these changes aren't really interesting in the sense that the logic is the same for volatile and atomic.

I believe this completes all of the changes necessary for the optimizer to handle loads and stores correctly. I'm going to try and come up with some additional testing, though.

llvm-svn: 139533

Rename -disable-iv-rewrite to -enable-iv-rewrite=false in preparation for default change. · 183013d8
Andrew Trick authored Sep 12, 2011
```
llvm-svn: 139517
```

Sep 10, 2011

[disable-iv-rewrite] Allow WidenIV to handle NSW/NUW operations · c7868bf0

Andrew Trick authored Sep 10, 2011

better.

Don't immediately give up when an add operation can't be trivially
sign/zero-extended within a loop. If it has NSW/NUW flags, generate a
new expression with sign extended (non-recurrent) operand. As before,
if SCEV says that all sign extends are loop invariant, then we can
widen the operation.

llvm-svn: 139453
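
The reason the NSW flag helps can be checked with plain arithmetic: when a narrow signed add cannot overflow, sign-extending the result gives the same value as sign-extending the operands and adding in the wide type. The Python below is a minimal sketch of that identity using 8-bit and 16-bit widths chosen for illustration; the helper names are hypothetical and this is not the WidenIV code.

```python
# Toy fixed-width arithmetic; operands are assumed to fit in the narrow type.

def to_signed(value, bits):
    """Interpret the low `bits` bits of `value` as a two's-complement integer."""
    value &= (1 << bits) - 1
    return value - (1 << bits) if value >> (bits - 1) else value


def add(a, b, bits):
    """Wrapping signed addition at the given width."""
    return to_signed(a + b, bits)


def add_has_nsw(a, b, bits):
    """True when the signed add does not wrap at this width."""
    return add(a, b, bits) == a + b


def widening_is_exact(a, b, narrow=8, wide=16):
    """True when sext(a + b) equals sext(a) + sext(b) in the wide type."""
    narrow_then_extend = add(a, b, narrow)  # sext preserves the signed value
    extend_then_add = add(a, b, wide)
    return narrow_then_extend == extend_then_add


print(add_has_nsw(100, 27, 8), widening_is_exact(100, 27))
print(add_has_nsw(100, 28, 8), widening_is_exact(100, 28))
```

For 8-bit operands the two predicates agree on every input pair, which is why the flag is enough to justify moving the sign extension onto the operands.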

Sep 09, 2011
- Comment formatting. · 465f42ff
  Andrew Trick authored Sep 09, 2011
```
llvm-svn: 139375
```
Sep 06, 2011
- Add -verify-indvars for imperfect SCEV trip count verification after indvars. · 1eee7f12
  Andrew Trick authored Sep 06, 2011
```
llvm-svn: 139169
```
- Use IRBuilder. · c10e52a0
  Devang Patel authored Sep 06, 2011
```
llvm-svn: 139156
```
- Try again at r138809 (make DSE more aggressive in removing dead stores at the... · 58704ee4
  Owen Anderson authored Sep 06, 2011
```
Try again at r138809 (make DSE more aggressive in removing dead stores at the end of a function), now with less deleting stores before memcpy's.

llvm-svn: 139150
```
Sep 04, 2011
- Use Duncan's patch to delete the instructions in reverse order (minus the... · 321fb377
  Bill Wendling authored Sep 04, 2011
```
Use Duncan's patch to delete the instructions in reverse order (minus the landingpad and terminator).

llvm-svn: 139090
```
Sep 02, 2011

Update comments to reflect reality. · a336e705
Bill Wendling authored Sep 02, 2011
```
llvm-svn: 139023
```

Enable SCEV-based unrolling by default. · 31b941a6

Andrew Trick authored Sep 02, 2011

This changes loop unrolling to use the same mechanism for trip count
computation as indvars. This is a stronger check that tends to unroll
more loops. A very common side-effect is that many single iteration
loops will be removed sooner. The real goal was simply to remove
dependence on canonical IVs.

x86 is break even.
ARM performance changes to expect (+ is good):
External/SPEC/CFP2000/183.equake/183.equake +13%
SingleSource/Benchmarks/Dhrystone/fldry     +21%
MultiSource/Applications/spiff/spiff         +3%
SingleSource/Benchmarks/Stanford/Puzzle     -14%

The Puzzle regression is actually an improvement in loop optimization
that defeats GVN: rdar://problem/10065079.

llvm-svn: 139009
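
To illustrate why a trip count computed from the induction variable's recurrence does not need a canonical IV, here is a minimal sketch. It assumes a loop of the form `for (i = start; i < bound; i += step)` with a positive step and no overflow; ScalarEvolution handles far more cases than this, and the function names are hypothetical.

```python
# Toy trip-count computation for the recurrence {start,+,step} tested against
# `bound` at the top of the loop. Overflow is ignored in this sketch.

def trip_count(start, step, bound):
    """Number of times the loop body runs."""
    if step <= 0:
        raise ValueError("this sketch only handles a positive step")
    if start >= bound:
        return 0
    return (bound - start + step - 1) // step  # ceiling division


def fully_unroll(body, count):
    """Replace the loop with `count` straight-line copies of its body."""
    return list(body) * count


print(trip_count(0, 4, 10))
print(fully_unroll(["load", "add", "store"], trip_count(7, 1, 8)))
```

When the count is known to be 1, the unrolled result is just the body itself, which corresponds to the single-iteration loops the message says are removed sooner.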

Compare type size instead of type _store_ size to make sure that BitCastInst · 7470fb01
Jakub Staszak authored Sep 02, 2011
```
will be valid. This fixes PR10820.

llvm-svn: 139005
```
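
The distinction matters because the store size is the type size rounded up to whole bytes, so two integer types can share a store size while having different bit widths, and a bitcast needs the bit widths to match. A minimal sketch for integer types only, with hypothetical helper names standing in for the data layout queries:

```python
# Toy size queries for iN integer types, identified by their bit width N.

def type_size_in_bits(bits):
    """Exact width of the type."""
    return bits


def store_size_in_bits(bits):
    """Width rounded up to a whole number of bytes, as used for stores."""
    return (bits + 7) // 8 * 8


def bitcast_is_valid(src_bits, dst_bits):
    """A bitcast must not change the number of bits."""
    return type_size_in_bits(src_bits) == type_size_in_bits(dst_bits)


# i1 and i8 are stored in the same number of bits but are not bitcastable.
print(store_size_in_bits(1) == store_size_in_bits(8))
print(bitcast_is_valid(1, 8))
```

Comparing store sizes would accept the `i1`/`i8` pair, which is the kind of mismatch that comparing type sizes rules out.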

Sep 01, 2011
- Change worklist driven deletion to be an iterative process. · bf8280ff
  Bill Wendling authored Sep 01, 2011
```
Duncan noticed this!

llvm-svn: 138967
```
- Fix an issue with the IR sink pass found by inspection. (I'm not sure anyone... · 71f5c2f1
  Eli Friedman authored Sep 01, 2011
```
Fix an issue with the IR sink pass found by inspection.  (I'm not sure anyone is actually using this, but might as well fix it since I found the issue.)

llvm-svn: 138965
```
Aug 31, 2011

Make sure we aren't deleting the landingpad instruction. · 770d0f07

Bill Wendling authored Aug 31, 2011

The landingpad instruction is required in the landing pad block. Because we're
not deleting terminating instructions, the invoke may still jump to here (see
Transforms/SCCP/2004-11-16-DeadInvoke.ll). Remove all uses of the landingpad
instruction, but keep it around until code-gen can remove the basic block.

llvm-svn: 138890
