Commits · 0446bb23f89f59a4de1a2c037edc60e4e31e8f75 · Roger Ferrer / llvm-epi-0.8

Jan 03, 2011
- add a testcase for readonly call CSE · 0446bb23
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122730
```
  0446bb23
- various cleanups, no functionality change. · 4cb36541
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122729
```
  4cb36541
- Add spliceFunction to the CallGraph interface. This allows users to efficiently · 0f87ca77
  Nick Lewycky authored Jan 03, 2011
```
update a callGraph when performing the common operation of splicing the body to
a new function and updating all callers (such as via RAUW).

No users yet, though this is intended for DeadArgumentElimination as part of
PR8887.

llvm-svn: 122728
```
  0f87ca77
- Teach EarlyCSE to do trivial CSE of loads and read-only calls. · b9a8efc9
  Chris Lattner authored Jan 03, 2011
```
On 176.gcc, this catches 13090 loads and calls, and increases the
number of simple instructions CSE'd from 29658 to 36208.

llvm-svn: 122727
```
  b9a8efc9
- add a handy typedef. · effec5f9
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122726
```
  effec5f9
- rename InstValue to SimpleValue, add some comments. · 79d83067
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122725
```
  79d83067
- CMake: Add missing source file. · edb5bcdd
  Michael J. Spencer authored Jan 03, 2011
```
llvm-svn: 122724
```
  edb5bcdd
- Allocate nodes for the scoped hash table from a recyling bump pointer · d815f69b
  Chris Lattner authored Jan 03, 2011
```
allocator.  This speeds up early cse by about 20%

llvm-svn: 122723
```
  d815f69b
- really get this working with a custom allocator. · 2f1c34f1
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122722
```
  2f1c34f1
- Enhance ScopedHashTable to allow it to take an allocator argument. · d11fd779
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122721
```
  d11fd779
- reduce redundancy in the hashing code and other misc cleanups. · 02a9776b
  Chris Lattner authored Jan 03, 2011
```
llvm-svn: 122720
```
  02a9776b
- Add a new loop-instsimplify pass, with the intention of replacing the instance · cab9a0ab
  Cameron Zwarich authored Jan 03, 2011
```
of instcombine that is currently in the middle of the loop pass pipeline. This
commit only checks in the pass; it will hopefully be enabled by default later.

llvm-svn: 122719
```
  cab9a0ab
- fix some pastos · 0844c76f
  Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122718
```
  0844c76f
- add DEBUG and -stats output to earlycse. · 8fac5db2
  Chris Lattner authored Jan 02, 2011
```
Teach it to CSE the rest of the non-side-effecting instructions.

llvm-svn: 122716
```
  8fac5db2
- Enhance earlycse to do CSE of casts, instsimplify and die. · 18ae5436
  Chris Lattner authored Jan 02, 2011
```
Add a testcase.

llvm-svn: 122715
```
  18ae5436
Jan 02, 2011

split dom frontier handling stuff out to its own DominanceFrontier header, · bf0aa927
Chris Lattner authored Jan 02, 2011
```
so that Dominators.h is *just* domtree.  Also prune #includes a bit.

llvm-svn: 122714
```
bf0aa927
sketch out a new early cse pass. No functionality yet. · 704541bb
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122713
```
704541bb

fix a miscompilation of tramp3d-v4: when forming a memcpy, we have to make · 9c69406f

Chris Lattner authored Jan 02, 2011

sure that the loop we're promoting into a memcpy doesn't mutate the input
of the memcpy.  Before we were just checking that the dest of the memcpy
wasn't mod/ref'd by the loop.

llvm-svn: 122712

9c69406f

If a loop iterates exactly once (has backedge count = 0) then don't · 5702a43c
Chris Lattner authored Jan 02, 2011
```
mess with it.  We'd rather peel/unroll it than convert all of its 
stores into memsets.

llvm-svn: 122711
```
5702a43c

Try to reuse the value when lowering memset. · 25e6e06e

Benjamin Kramer authored Jan 02, 2011

This allows us to compile:
  void test(char *s, int a) {
    __builtin_memset(s, a, 15);
  }
into 1 mul + 3 stores instead of 3 muls + 3 stores.

llvm-svn: 122710

25e6e06e

Funciton -> Function · 7293698e
Peter Collingbourne authored Jan 02, 2011
```
llvm-svn: 122709
```
7293698e
Unkown -> Unknown · ed12ffb5
Peter Collingbourne authored Jan 02, 2011
```
llvm-svn: 122708
```
ed12ffb5

Lower the i8 extension in memset to a multiply instead of a potentially long... · 2fdea4c8

Benjamin Kramer authored Jan 02, 2011

Lower the i8 extension in memset to a multiply instead of a potentially long series of shifts and ors.

We could implement a DAGCombine to turn x * 0x0101 back into logic operations
on targets that doesn't support the multiply or it is slow (p4) if someone cares
enough.

Example code:
  void test(char *s, int a) {
      __builtin_memset(s, a, 4);
  }
before:
  _test:                                  ## @test
    movzbl  8(%esp), %eax
    movl  %eax, %ecx
    shll  $8, %ecx
    orl %eax, %ecx
    movl  %ecx, %eax
    shll  $16, %eax
    orl %ecx, %eax
    movl  4(%esp), %ecx
    movl  %eax, 4(%ecx)
    movl  %eax, (%ecx)
    ret
after:
  _test:                                  ## @test
    movzbl  8(%esp), %eax
    imull $16843009, %eax, %eax   ## imm = 0x1010101
    movl  4(%esp), %ecx
    movl  %eax, 4(%ecx)
    movl  %eax, (%ecx)
    ret

llvm-svn: 122707

2fdea4c8

A workaround for a bug in cmake 2.8.3 diagnosed on PR 8885. · 68b7bb95
Oscar Fuentes authored Jan 02, 2011
```
llvm-svn: 122706
```
68b7bb95
Also remove functions that use complex constant expressions in terms of · 5361b841
Nick Lewycky authored Jan 02, 2011
```
another function.

llvm-svn: 122705
```
5361b841

enhance loop idiom recognition to scan *all* unconditionally executed · 8455b6e4

Chris Lattner authored Jan 02, 2011

blocks in a loop, instead of just the header block.  This makes it more
aggressive, able to handle Duncan's Ada examples.

llvm-svn: 122704

8455b6e4

make inSubLoop much more efficient. · 0cdc6f62
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122703
```
0cdc6f62

rip out isExitBlockDominatedByBlockInLoop, calling DomTree::dominates instead. · 27497ece

Chris Lattner authored Jan 02, 2011

isExitBlockDominatedByBlockInLoop is a relic of the days when domtree was 
*just* a tree and didn't have DFS numbers.  Checking DFS numbers is faster
and easier than "limiting the search of the tree".

llvm-svn: 122702

27497ece

add a list of opportunities for future improvement. · 0469e01c
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122701
```
0469e01c
update a bunch of entries. · 51415d26
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122700
```
51415d26

Fix PR8702 by not having LoopSimplify claim to preserve LCSSA form. As described · 64f1c0dc

Duncan Sands authored Jan 02, 2011

in the PR, the pass could break LCSSA form when inserting preheaders.  It probably
would be easy enough to fix this, but since currently we always go into LCSSA form
after running this pass, doing so is not urgent.

llvm-svn: 122695

64f1c0dc

Remove an unused member function. · 7e23b722
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122693
```
7e23b722
Propagate to parent scope changes made to CMAKE_CXX_FLAGS. · 3aea80b5
Oscar Fuentes authored Jan 02, 2011
```
llvm-svn: 122692
```
3aea80b5
Fix a typo in a variable name. · 7f59c65e
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122691
```
7f59c65e
Move a load into the only branch where it is used and eliminate a temporary. · 42b1c9dd
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122690
```
42b1c9dd
Add the explanatory comment from r122680's commit message to the code itself. · 718d32cc
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122689
```
718d32cc
Tidy up indentation. · 69b67c01
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122688
```
69b67c01
Fix a typo, which should also fix the failure on llvm-x86_64-linux-checks. · d7f02cc5
Cameron Zwarich authored Jan 02, 2011
```
llvm-svn: 122687
```
d7f02cc5
Remove obsolete comments. · 8e8f492f
Francois Pichet authored Jan 02, 2011
```
llvm-svn: 122686
```
8e8f492f

Allow loop-idiom to run on multiple BB loops, but still only scan the loop · ddf58010

Chris Lattner authored Jan 02, 2011

header for now for memset/memcpy opportunities.  It turns out that loop-rotate
is successfully rotating loops, but *DOESN'T MERGE THE BLOCKS*, turning "for 
loops" into 2 basic block loops that loop-idiom was ignoring.

With this fix, we form many *many* more memcpy and memsets than before, including
on the "history" loops in the viterbi benchmark, which look like this:

        for (j=0; j<MAX_history; ++j) {
          history_new[i][j+1] = history[2*i][j];
        }

Transforming these loops into memcpy's speeds up the viterbi benchmark from
11.98s to 3.55s on my machine.  Woo.

llvm-svn: 122685

ddf58010