- Dec 05, 2008
-
Anton Korobeynikov authored
See PR3160 for details
llvm-svn: 60604
-
Chris Lattner authored
llvm-svn: 60594
-
Chris Lattner authored
llvm-svn: 60588
-
- Dec 04, 2008
-
Devang Patel authored
This fixes many bugs. I will add more test cases in a separate check-in. Some day, the code that manipulates the CFG and updates dominator info could use refactoring help.
llvm-svn: 60554
-
Chris Lattner authored
llvm-svn: 60534
-
Chris Lattner authored
llvm-svn: 60514
-
- Dec 03, 2008
-
Dale Johannesen authored
llvm-svn: 60508
-
Dale Johannesen authored
llvm-svn: 60506
-
Chris Lattner authored
llvm-svn: 60501
-
Dale Johannesen authored
llvm-svn: 60494
-
Chris Lattner authored
1) Have it fold "br undef", which does occur with surprising frequency as jump threading iterates.
2) Teach jump threading to delete dead blocks. This removes the successor edges, reducing the in-edges of other blocks, allowing recursive simplification.
3) Fold things like:
     br COND, BBX, BBY
   BBX:
     br COND, BBZ, BBW
   which also happens because jump threading iterates.
llvm-svn: 60470
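For illustration only (this sketch is not from the commit, and the block and value names are hypothetical), here is roughly what pattern (3) looks like in IR. Since %BBX is reachable only via the true edge from %entry, its branch on the same %cond can be folded to an unconditional branch, and a block that becomes unreachable (like %BBW below) can then be deleted as in (2):

entry:
  br i1 %cond, label %BBX, label %BBY
BBX:                                      ; reached only when %cond is true
  br i1 %cond, label %BBZ, label %BBW     ; folds to "br label %BBZ"
BBW:                                      ; now dead; deleted per (2)
  br i1 undef, label %BBZ, label %BBY     ; "br undef" per (1): fold to either successor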
-
- Dec 02, 2008
-
Dale Johannesen authored
llvm-svn: 60442
-
Dale Johannesen authored
llvm-svn: 60431
-
Chris Lattner authored
straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads.

Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc.

The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default.

The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads.

This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though.
llvm-svn: 60408
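For illustration only (hypothetical names, not from the commit): in the sketch below, the load in %merge is partially redundant -- it is available along the edge from %then (as %v1) but not along the edge from %else. Load PRE would insert a load of %P at the end of %else, after which the load in %merge is fully redundant and can be replaced by a phi of the two loads:

then:
  %v1 = load i32* %P
  br label %merge
else:
  ; load PRE inserts "%v1.pre = load i32* %P" here
  br label %merge
merge:
  %v2 = load i32* %P      ; becomes: phi i32 [ %v1, %then ], [ %v1.pre, %else ]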
-
Bill Wendling authored
llvm-svn: 60403
-
Bill Wendling authored
llvm-svn: 60402
-
Bill Wendling authored
llvm-svn: 60401
-
Bill Wendling authored
constant. If X is a constant, then this is folded elsewhere.
- Added a note to Target/README.txt to indicate that we'd like to implement this when we're able.
llvm-svn: 60399
-
Bill Wendling authored
llvm-svn: 60398
-
Bill Wendling authored
- No need to do a swap on a canonicalized pattern. No functionality change.
llvm-svn: 60397
-
Chris Lattner authored
llvm-svn: 60395
-
Owen Anderson authored
a new value numbering set after splitting a critical edge. This increases the number of instances of PRE on 403.gcc from ~60 to ~570.
llvm-svn: 60393
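For illustration only (hypothetical names, not from the commit): in the sketch below, the add in %merge is partially redundant -- available via %a but not via %b. The edge %b -> %merge is critical (its source has two successors and its destination has two predecessors), so PRE has to split it and place the computation in the new block; the message above describes entering the value made available after such a split into a new value numbering set so that later PRE queries can find it:

a:
  %sum = add i32 %x, %y
  br label %merge
b:
  br i1 %c, label %merge, label %other    ; %b -> %merge is a critical edge
merge:
  %sum2 = add i32 %x, %y                  ; partially redundant with %sum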
-
- Dec 01, 2008
-
Dale Johannesen authored
figuring out the base of the IV. This produces better code in the example. (Addresses use (IV) instead of (BASE,IV) - a significant improvement on low-register machines like x86).
llvm-svn: 60374
-
Bill Wendling authored
llvm-svn: 60370
-
Bill Wendling authored
llvm-svn: 60369
-
Bill Wendling authored
integer is "minint". llvm-svn: 60366
-
Bill Wendling authored
don't have overlapping bits.
llvm-svn: 60344
-
Bill Wendling authored
llvm-svn: 60343
-
Bill Wendling authored
llvm-svn: 60341
-
Bill Wendling authored
Move pattern check outside of the if-then statement. This prevents us from fiddling with constants unless we have to.
llvm-svn: 60340
-
Chris Lattner authored
llvm-svn: 60339
-
Chris Lattner authored
that it isn't reallocated all the time. This is a tiny speedup for GVN: 3.90->3.88s
llvm-svn: 60338
-
Chris Lattner authored
llvm-svn: 60337
-
Chris Lattner authored
instead of std::sort. This shrinks the release-asserts LSR.o file by 1100 bytes of code on my system. We should start using array_pod_sort where possible.
llvm-svn: 60335
-
Chris Lattner authored
This is a lot cheaper and conceptually simpler.
llvm-svn: 60332
-
Chris Lattner authored
DeadInsts ivar, just use it directly.
llvm-svn: 60330
-
Chris Lattner authored
buggy rewrite, this notifies ScalarEvolution of a pending instruction about to be removed and then erases it, instead of erasing it then notifying.
llvm-svn: 60329
-
Chris Lattner authored
xor in testcase (or is a substring).
llvm-svn: 60328
-
Chris Lattner authored
new instructions it simplifies. Because we're threading jumps on edges with constants coming in from PHI's, we inherently are exposing a lot more constants to the new block. Folding them and deleting dead conditions allows the cost model in jump threading to be more accurate as it iterates.
llvm-svn: 60327
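For illustration only (hypothetical names, not from the commit): when jump threading threads the edge from %pred1 through %merge below, the duplicated copy of %cmp sees the constant 0 that the phi receives from %pred1. Constant-folding it to true turns the duplicated branch into an unconditional one -- the kind of simplification the cost model can then account for as threading iterates:

merge:
  %x = phi i32 [ 0, %pred1 ], [ %y, %pred2 ]
  %cmp = icmp eq i32 %x, 0
  br i1 %cmp, label %taken, label %nottaken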
-
Chris Lattner authored
instead of using FoldPHIArgBinOpIntoPHI. In addition to being more obvious, this also fixes a problem where instcombine wouldn't merge two phis that had different variable indices. This prevented instcombine from factoring big chunks of code in 403.gcc. For example:

insn_cuid.exit:
-  %tmp336 = load i32** @uid_cuid, align 4
-  %tmp337 = getelementptr %struct.rtx_def* %insn_addr.0.ph.i, i32 0, i32 3
-  %tmp338 = bitcast [1 x %struct.rtunion]* %tmp337 to i32*
-  %tmp339 = load i32* %tmp338, align 4
-  %tmp340 = getelementptr i32* %tmp336, i32 %tmp339
   br label %bb62

bb61:
-  %tmp341 = load i32** @uid_cuid, align 4
-  %tmp342 = getelementptr %struct.rtx_def* %insn, i32 0, i32 3
-  %tmp343 = bitcast [1 x %struct.rtunion]* %tmp342 to i32*
-  %tmp344 = load i32* %tmp343, align 4
-  %tmp345 = getelementptr i32* %tmp341, i32 %tmp344
   br label %bb62

bb62:
-  %iftmp.62.0.in = phi i32* [ %tmp345, %bb61 ], [ %tmp340, %insn_cuid.exit ]
+  %insn.pn2 = phi %struct.rtx_def* [ %insn, %bb61 ], [ %insn_addr.0.ph.i, %insn_cuid.exit ]
+  %tmp344.pn.in.in = getelementptr %struct.rtx_def* %insn.pn2, i32 0, i32 3
+  %tmp344.pn.in = bitcast [1 x %struct.rtunion]* %tmp344.pn.in.in to i32*
+  %tmp341.pn = load i32** @uid_cuid
+  %tmp344.pn = load i32* %tmp344.pn.in
+  %iftmp.62.0.in = getelementptr i32* %tmp341.pn, i32 %tmp344.pn
   %iftmp.62.0 = load i32* %iftmp.62.0.in

llvm-svn: 60325
-