Commits · a71d2cc88d9e2da576feeb8162a68d1827c4102a · Roger Ferrer / llvm-epi-0.8

Jan 04, 2011

Branch instructions don't produce values, so there's no need to generate a... · 41a1550e

Owen Anderson authored Jan 04, 2011

Branch instructions don't produce values, so there's no need to generate a value number for them.  This
avoids adding them to the various value numbering tables, resulting in a minor (~3%) speedup for GVN
on 403.gcc.

llvm-svn: 122819

41a1550e

Remove commented out code. · 22c53e27
Owen Anderson authored Jan 04, 2011
```
llvm-svn: 122817
```
22c53e27
Switch to the new style of asterisk placement. · b2a41e93
Cameron Zwarich authored Jan 04, 2011
```
llvm-svn: 122815
```
b2a41e93

Teach loop-idiom to turn a loop containing a memset into a larger memset · 8643810e

Chris Lattner authored Jan 04, 2011

when safe.

The testcase is basically this nested loop:
void foo(char *X) {
  for (int i = 0; i != 100; ++i) 
    for (int j = 0; j != 100; ++j)
      X[j+i*100] = 0;
}

which gets turned into a single memset now.  clang -O3 doesn't optimize
this yet though due to a phase ordering issue I haven't analyzed yet.

llvm-svn: 122806

8643810e
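
To illustrate the loop-idiom change in 8643810e, here is a minimal sketch of the equivalence the pass relies on. The function names `foo_loops`, `foo_memset` and `foo_versions_agree` are made up for this example, and the `memset` version is written by hand; it is not output from the pass.

```c
#include <assert.h>
#include <string.h>

enum { ROWS = 100, COLS = 100 };

/* The loop nest from the commit message: clears one byte per iteration. */
void foo_loops(char *X) {
  for (int i = 0; i != ROWS; ++i)
    for (int j = 0; j != COLS; ++j)
      X[j + i * COLS] = 0;
}

/* Hand-written equivalent: j + i*100 visits every offset in [0, 10000)
   exactly once, so a single 10000-byte memset has the same effect. */
void foo_memset(char *X) {
  memset(X, 0, ROWS * COLS);
}

/* Returns 1 if both versions leave identical buffers behind. */
int foo_versions_agree(void) {
  static char a[ROWS * COLS + 1], b[ROWS * COLS + 1];
  memset(a, 0x5A, sizeof a);
  memset(b, 0x5A, sizeof b);
  foo_loops(a);
  foo_memset(b);
  /* The extra sentinel byte must survive: neither version writes past 10000 bytes. */
  assert(a[ROWS * COLS] == 0x5A && b[ROWS * COLS] == 0x5A);
  return memcmp(a, b, sizeof a) == 0;
}
```

The inner loop alone is a 100-byte memset; the commit is about recognizing that the outer loop repeats that memset at a stride equal to its size, so the two can be merged.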

restructure this a bit. Initialize the WeakVH with "I", the · a62b01dc

Chris Lattner authored Jan 04, 2011

instruction *after* the store.  The store will always be deleted
if the transformation kicks in, so we'd do an N^2 scan of every
loop block.  Whoops.

llvm-svn: 122805

a62b01dc

Avoid finding loop back edges when we are not splitting critical edges in · f4e13699
Cameron Zwarich authored Jan 04, 2011
```
CodeGenPrepare (which is the default behavior).

llvm-svn: 122801
```
f4e13699

Address most of Duncan's review comments. Also, make LoopInstSimplify a simple · e9249693

Cameron Zwarich authored Jan 04, 2011

FunctionPass. It probably doesn't have a reason to be a LoopPass, as it will
probably drop the simple fixed point and either use RPO iteration or Duncan's
approach in instsimplify of only revisiting instructions that have changed.

The next step is to preserve LoopSimplify. This looks like it won't be too hard,
although the pass manager doesn't actually seem to respect when non-loop passes
claim to preserve LCSSA or LoopSimplify. This will have to be fixed.

llvm-svn: 122791

e9249693

use the very-handy getTruncateOrZeroExtend helper function, and · 0ba473c2
Chris Lattner authored Jan 04, 2011
```
stop setting NSW: signed overflow is possible.  Thanks to Dan
for pointing these out.

llvm-svn: 122790
```
0ba473c2
Fix comment. · 0839d393
Owen Anderson authored Jan 03, 2011
```
llvm-svn: 122788
```
0839d393

Use the new addEscapingValue callback to update GlobalsModRef when GVN adds... · d62d3722

Owen Anderson authored Jan 03, 2011

Use the new addEscapingValue callback to update GlobalsModRef when GVN adds PHIs of GEPs. For the moment,
have GlobalsModRef handle this conservatively by simply removing the value from its maps.

llvm-svn: 122787

d62d3722

Duncan deftly points out that readnone functions aren't · bde6ec1d
Chris Lattner authored Jan 03, 2011
```
invalidated by stores, so they can be handled as 'simple'
operations.

llvm-svn: 122785
```
bde6ec1d

Jan 03, 2011
- Simplify GVN's value expression structure, allowing the elimination of a lot of · 3a33d0cc
  Owen Anderson authored Jan 03, 2011
```
almost-but-not-quite-identical code.  No intended functionality change.

llvm-svn: 122760
```
  3a33d0cc
- strength reduce my previous patch a bit. The only instructions · 16ca19ff
  Chris Lattner authored Jan 03, 2011
```
that are allowed to have metadata operands are intrinsic calls,
and the only ones that take metadata currently return void.
Just reject all void instructions, which should not be value
numbered anyway.  To future proof things, add an assert to the
getHashValue impl for calls to check that metadata operands 
aren't present.

llvm-svn: 122759
```
  16ca19ff
- fix PR8895: metadata operands don't have a strong use of their · 142f1cd2
  Chris Lattner authored Jan 03, 2011
```
nested values, so they can change and drop to null, which can
change the hash and cause havoc.

It turns out that it isn't a good idea to value number stuff
with metadata operands anyway, so... don't.

llvm-svn: 122758
```
  142f1cd2
- Switch a worklist in CodeGenPrepare to SmallVector and increase the inline · 43cecb12
  Cameron Zwarich authored Jan 03, 2011
```
capacity on the Visited SmallPtrSet. On 403.gcc, this is about a 4.5% speedup of
CodeGenPrepare time (which itself is 10% of time spent in the backend).

This is progress towards PR8889.

llvm-svn: 122741
```
  43cecb12
- earlycse can do trivial within-a-block dead store · 9e5e9ed7
  Chris Lattner authored Jan 03, 2011
```
elimination as well.  This deletes 60 stores in 176.gcc
that largely come from bitfield code.

llvm-svn: 122736
```
  9e5e9ed7
- switch the load table to use a recycling bump pointer allocator, · 4b9a5257
  Chris Lattner authored Jan 03, 2011
```
speeding earlycse up by 6%.

llvm-svn: 122733
```
  4b9a5257
- now that loads are in their own table, we can implement · e0e32a9e
  Chris Lattner authored Jan 03, 2011
```
store->load forwarding.  This allows EarlyCSE to zap 600 more
loads from 176.gcc.

llvm-svn: 122732
```
  e0e32a9e
- split loads and calls into separate tables. Loads are now just indexed · 92bb0f9f
  Chris Lattner authored Jan 03, 2011
```
by their pointer instead of using MemoryValue to wrap it.

llvm-svn: 122731
```
  92bb0f9f
- various cleanups, no functionality change. · 4cb36541
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122729
```
  4cb36541
- Teach EarlyCSE to do trivial CSE of loads and read-only calls. · b9a8efc9
  Chris Lattner authored Jan 03, 2011
```
On 176.gcc, this catches 13090 loads and calls, and increases the
number of simple instructions CSE'd from 29658 to 36208.

llvm-svn: 122727
```
  b9a8efc9
- rename InstValue to SimpleValue, add some comments. · 79d83067
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122725
```
  79d83067
- CMake: Add missing source file. · edb5bcdd
  Michael J. Spencer authored Jan 03, 2011
```
llvm-svn: 122724
```
  edb5bcdd
- Allocate nodes for the scoped hash table from a recycling bump pointer · d815f69b
  Chris Lattner authored Jan 03, 2011
```
allocator.  This speeds up early cse by about 20%.

llvm-svn: 122723
```
  d815f69b
- reduce redundancy in the hashing code and other misc cleanups. · 02a9776b
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122720
```
  02a9776b
- Add a new loop-instsimplify pass, with the intention of replacing the instance · cab9a0ab
  Cameron Zwarich authored Jan 03, 2011
```
of instcombine that is currently in the middle of the loop pass pipeline. This
commit only checks in the pass; it will hopefully be enabled by default later.

llvm-svn: 122719
```
  cab9a0ab
- fix some pastos · 0844c76f
  Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122718
```
  0844c76f
- add DEBUG and -stats output to earlycse. · 8fac5db2
  Chris Lattner authored Jan 02, 2011
```
Teach it to CSE the rest of the non-side-effecting instructions.

llvm-svn: 122716
```
  8fac5db2
- Enhance earlycse to do CSE of casts, instsimplify and die. · 18ae5436
  Chris Lattner authored Jan 02, 2011
```
Add a testcase.

llvm-svn: 122715
```
  18ae5436
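
The EarlyCSE commits above (b9a8efc9, e0e32a9e, 9e5e9ed7) describe three simplifications: CSE of repeated loads, store->load forwarding, and trivial dead store elimination within a block. EarlyCSE works on LLVM IR, so the following is only a source-level analogy, with hypothetical function names, of what each simplification removes.

```c
#include <assert.h>

/* As written: one dead store, one forwardable load, one redundant load. */
int sum_twice_as_written(int *p, int v) {
  *p = 1;       /* dead store: overwritten below with no read in between */
  *p = v;
  int a = *p;   /* load directly after a store to the same pointer */
  int b = *p;   /* second load of the same pointer, nothing stored in between */
  return a + b;
}

/* Hand-simplified form: the dead store is gone, the first load is replaced
   by the stored value, and the second load reuses the first. */
int sum_twice_simplified(int *p, int v) {
  *p = v;
  return v + v;
}

/* Returns 1 if both forms compute the same result and leave *p the same. */
int forms_agree(void) {
  for (int v = -3; v <= 3; ++v) {
    int x = 99, y = 99;
    int r1 = sum_twice_as_written(&x, v);
    int r2 = sum_twice_simplified(&y, v);
    assert(x == v && y == v);
    if (r1 != r2)
      return 0;
  }
  return 1;
}
```

All three rewrites are valid here only because nothing between the memory operations can write to `*p`; an intervening store or call that may modify memory would block them.
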
Jan 02, 2011

split dom frontier handling stuff out to its own DominanceFrontier header, · bf0aa927
Chris Lattner authored Jan 02, 2011
```
so that Dominators.h is *just* domtree.  Also prune #includes a bit.

llvm-svn: 122714
```
bf0aa927
sketch out a new early cse pass. No functionality yet. · 704541bb
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122713
```
704541bb

fix a miscompilation of tramp3d-v4: when forming a memcpy, we have to make · 9c69406f

Chris Lattner authored Jan 02, 2011

sure that the loop we're promoting into a memcpy doesn't mutate the input
of the memcpy.  Before we were just checking that the dest of the memcpy
wasn't mod/ref'd by the loop.

llvm-svn: 122712

9c69406f
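
The tramp3d-v4 code behind 9c69406f is not shown in the commit message, so the sketch below uses a made-up loop (`smear_loop`) to illustrate the general hazard: a copy loop whose stores change the bytes it reads in later iterations does not behave like a block copy.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Each store to buf[i + 1] becomes the value loaded in the next iteration,
   so the loop smears buf[0] across the whole buffer. */
void smear_loop(char *buf, size_t n) {
  for (size_t i = 0; i + 1 < n; ++i)
    buf[i + 1] = buf[i];
}

/* A block copy of the same source and destination ranges instead shifts the
   original contents by one byte. memmove is used because the ranges overlap. */
void shift_block(char *buf, size_t n) {
  if (n > 1)
    memmove(buf + 1, buf, n - 1);
}

/* Returns 1 if the two functions produce different results on "abcd". */
int loop_differs_from_block_copy(void) {
  char a[4] = { 'a', 'b', 'c', 'd' };
  char b[4] = { 'a', 'b', 'c', 'd' };
  smear_loop(a, 4);
  shift_block(b, 4);
  assert(memcmp(a, "aaaa", 4) == 0);
  assert(memcmp(b, "aabc", 4) == 0);
  return memcmp(a, b, 4) != 0;
}
```

Checking only that the loop does not otherwise touch the destination would miss this case, which is why the source range has to be checked as well.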

If a loop iterates exactly once (has backedge count = 0) then don't · 5702a43c
Chris Lattner authored Jan 02, 2011
```
mess with it.  We'd rather peel/unroll it than convert all of its 
stores into memsets.

llvm-svn: 122711
```
5702a43c

enhance loop idiom recognition to scan *all* unconditionally executed · 8455b6e4

Chris Lattner authored Jan 02, 2011

blocks in a loop, instead of just the header block.  This makes it more
aggressive, able to handle Duncan's Ada examples.

llvm-svn: 122704

8455b6e4

make inSubLoop much more efficient. · 0cdc6f62
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122703
```
0cdc6f62

rip out isExitBlockDominatedByBlockInLoop, calling DomTree::dominates instead. · 27497ece

Chris Lattner authored Jan 02, 2011

isExitBlockDominatedByBlockInLoop is a relic of the days when domtree was 
*just* a tree and didn't have DFS numbers.  Checking DFS numbers is faster
and easier than "limiting the search of the tree".

llvm-svn: 122702

27497ece
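
As background for 27497ece, this is a minimal sketch of how DFS numbers answer a dominance query in constant time. It shows the general technique, not LLVM's DominatorTree code; the tree shape and all names (`idom`, `dfs_in`, `dfs_out`, `dominates`) are assumptions made for the example.

```c
#include <assert.h>

enum { N = 6 };

/* Hypothetical dominator tree given as immediate-dominator links.
   Node 0 is the root; 1 and 2 are its children; 3 and 4 hang off 1; 5 off 2. */
static const int idom[N] = { -1, 0, 0, 1, 1, 2 };
static int dfs_in[N], dfs_out[N];

/* Number a subtree in depth-first order, recording entry and exit times. */
static int number_subtree(int node, int next) {
  dfs_in[node] = next++;
  for (int child = 0; child < N; ++child)
    if (idom[child] == node)
      next = number_subtree(child, next);
  dfs_out[node] = next++;
  return next;
}

void assign_dfs_numbers(void) { number_subtree(0, 0); }

/* a dominates b iff b's DFS interval is nested inside a's interval. */
int dominates(int a, int b) {
  return dfs_in[a] <= dfs_in[b] && dfs_out[b] <= dfs_out[a];
}

/* Reference answer: walk up the tree from b looking for a. */
int dominates_by_walk(int a, int b) {
  for (int n = b; n != -1; n = idom[n])
    if (n == a)
      return 1;
  return 0;
}

/* Returns 1 if the interval test matches the tree walk for every pair. */
int queries_agree(void) {
  assign_dfs_numbers();
  assert(dominates(1, 4) && !dominates(1, 5));
  for (int a = 0; a < N; ++a)
    for (int b = 0; b < N; ++b)
      if (dominates(a, b) != dominates_by_walk(a, b))
        return 0;
  return 1;
}
```

The interval test is two integer comparisons regardless of tree depth, whereas the walk is linear in the depth of `b`.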

add a list of opportunities for future improvement. · 0469e01c
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122701
```
0469e01c

Allow loop-idiom to run on multiple BB loops, but still only scan the loop · ddf58010

Chris Lattner authored Jan 02, 2011

header for now for memset/memcpy opportunities.  It turns out that loop-rotate
is successfully rotating loops, but *DOESN'T MERGE THE BLOCKS*, turning "for 
loops" into 2 basic block loops that loop-idiom was ignoring.

With this fix, we form many *many* more memcpy and memsets than before, including
on the "history" loops in the viterbi benchmark, which look like this:

        for (j=0; j<MAX_history; ++j) {
          history_new[i][j+1] = history[2*i][j];
        }

Transforming these loops into memcpy's speeds up the viterbi benchmark from
11.98s to 3.55s on my machine.  Woo.

llvm-svn: 122685

ddf58010
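
A sketch of the transformation described in ddf58010, applied by hand to a loop shaped like the viterbi "history" loop. The element type (`int`), the array sizes (`MAX_history = 8`, `STATES = 4`) and the function names are assumptions; the commit message does not give them.

```c
#include <assert.h>
#include <string.h>

enum { MAX_history = 8, STATES = 4 }; /* hypothetical sizes */

static int history[2 * STATES][MAX_history + 1];
static int history_new[STATES][MAX_history + 1];

/* Loop form, as in the commit message. */
void copy_row_loop(int i) {
  for (int j = 0; j < MAX_history; ++j)
    history_new[i][j + 1] = history[2 * i][j];
}

/* Hand-written memcpy form: MAX_history consecutive elements are copied
   between two distinct arrays, so the ranges cannot overlap. */
void copy_row_memcpy(int i) {
  memcpy(&history_new[i][1], &history[2 * i][0],
         MAX_history * sizeof history[0][0]);
}

/* Returns 1 if both forms fill history_new identically. */
int row_copies_agree(void) {
  int expected[STATES][MAX_history + 1];
  for (int r = 0; r < 2 * STATES; ++r)
    for (int c = 0; c <= MAX_history; ++c)
      history[r][c] = r * 100 + c;

  memset(history_new, 0, sizeof history_new);
  for (int i = 0; i < STATES; ++i)
    copy_row_loop(i);
  memcpy(expected, history_new, sizeof expected);

  memset(history_new, 0, sizeof history_new);
  for (int i = 0; i < STATES; ++i)
    copy_row_memcpy(i);

  assert(expected[1][1] == 200); /* history[2][0] landed in row 1, column 1 */
  return memcmp(expected, history_new, sizeof expected) == 0;
}
```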

remove debugging code. · 5b5a043d
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122683
```
5b5a043d
add some -stats output. · 12f91bef
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122682
```
12f91bef