Commits · 704541bb232c63481e508b6aa1f837928336b622 · Lorenzo Albano / LLVM bpEVL

Jan 02, 2011

sketch out a new early cse pass. No functionality yet. · 704541bb
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122713
```
704541bb

fix a miscompilation of tramp3d-v4: when forming a memcpy, we have to make · 9c69406f

Chris Lattner authored Jan 02, 2011

sure that the loop we're promoting into a memcpy doesn't mutate the input
of the memcpy.  Before we were just checking that the dest of the memcpy
wasn't mod/ref'd by the loop.

llvm-svn: 122712

9c69406f

If a loop iterates exactly once (has backedge count = 0) then don't · 5702a43c
Chris Lattner authored Jan 02, 2011
```
mess with it.  We'd rather peel/unroll it than convert all of its 
stores into memsets.

llvm-svn: 122711
```
5702a43c
Also remove functions that use complex constant expressions in terms of · 5361b841
Nick Lewycky authored Jan 02, 2011
```
another function.

llvm-svn: 122705
```
5361b841

enhance loop idiom recognition to scan *all* unconditionally executed · 8455b6e4

Chris Lattner authored Jan 02, 2011

blocks in a loop, instead of just the header block.  This makes it more
aggressive, able to handle Duncan's Ada examples.

llvm-svn: 122704

8455b6e4

make inSubLoop much more efficient. · 0cdc6f62
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122703
```
0cdc6f62

rip out isExitBlockDominatedByBlockInLoop, calling DomTree::dominates instead. · 27497ece

Chris Lattner authored Jan 02, 2011

isExitBlockDominatedByBlockInLoop is a relic of the days when domtree was 
*just* a tree and didn't have DFS numbers.  Checking DFS numbers is faster
and easier than "limiting the search of the tree".

llvm-svn: 122702

27497ece

add a list of opportunities for future improvement. · 0469e01c
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122701
```
0469e01c

Fix PR8702 by not having LoopSimplify claim to preserve LCSSA form. As described · 64f1c0dc

Duncan Sands authored Jan 02, 2011

in the PR, the pass could break LCSSA form when inserting preheaders.  It probably
would be easy enough to fix this, but since currently we always go into LCSSA form
after running this pass, doing so is not urgent.

llvm-svn: 122695

64f1c0dc

Allow loop-idiom to run on multiple BB loops, but still only scan the loop · ddf58010

Chris Lattner authored Jan 02, 2011

header for now for memset/memcpy opportunities.  It turns out that loop-rotate
is successfully rotating loops, but *DOESN'T MERGE THE BLOCKS*, turning "for 
loops" into 2 basic block loops that loop-idiom was ignoring.

With this fix, we form many *many* more memcpy and memsets than before, including
on the "history" loops in the viterbi benchmark, which look like this:

        for (j=0; j<MAX_history; ++j) {
          history_new[i][j+1] = history[2*i][j];
        }

Transforming these loops into memcpy's speeds up the viterbi benchmark from
11.98s to 3.55s on my machine.  Woo.

llvm-svn: 122685

ddf58010

remove debugging code. · 5b5a043d
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122683
```
5b5a043d
add some -stats output. · 12f91bef
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122682
```
12f91bef

improve loop rotation to use CodeMetrics to analyze the · 679572e5

Chris Lattner authored Jan 02, 2011

size of a loop header instead of its own code size estimator.
This allows it to handle bitcasts etc more precisely.

llvm-svn: 122681

679572e5

teach loop idiom recognition to form memcpy's from simple loops. · 85b6d81d
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122678
```
85b6d81d

Remove functions from the FnSet when one of their callee's is being merged. This · 4e250c82

Nick Lewycky authored Jan 02, 2011

maintains the guarantee that the DenseSet expects two elements it contains to
not go from inequal to equal under its nose.

As a side-effect, this also lets us switch from iterating to a fixed-point to
actually maintaining a work queue of functions to look at again, and we don't
add thunks to our work queue so we don't need to detect and ignore them.

llvm-svn: 122677

4e250c82

Jan 01, 2011
- fix a globalopt crash on two Adobe-C++ testcases that the recent · 1903c42b
  Chris Lattner authored Jan 01, 2011
```
loop idiom pass exposed.

llvm-svn: 122674
```
  1903c42b
- add a validity check that was missed, fixing a crash on the · a3514441
  Chris Lattner authored Jan 01, 2011
```
new testcase.

llvm-svn: 122662
```
  a3514441
- improve validity check to handle constant-trip-count loops more · 91a44358
  Chris Lattner authored Jan 01, 2011
```
aggressively.  In practice, this doesn't help anything though,
see the todo.

llvm-svn: 122660
```
  91a44358
- implement the "no aliasing accesses in loop" safety check. This pass · 8b3baf6d
  Chris Lattner authored Jan 01, 2011
```
should be correct now.

llvm-svn: 122659
```
  8b3baf6d
Dec 31, 2010
- Simplify this pass by using a depth-first iterator to ensure that all · 2c440fa4
  Duncan Sands authored Dec 31, 2010
```
operands are visited before the instructions themselves.

llvm-svn: 122647
```
  2c440fa4
- Zap dead instructions harder. · 6cc7126e
  Duncan Sands authored Dec 31, 2010
```
llvm-svn: 122645
```
  6cc7126e
Dec 30, 2010
- Make a bunch of symbols internal. · 570dd787
  Benjamin Kramer authored Dec 30, 2010
```
llvm-svn: 122642
```
  570dd787
Dec 28, 2010
- simplify this, isBytewiseValue handles the extra check. We still · 65a699d4
  Chris Lattner authored Dec 28, 2010
```
check for "multiple of a byte" in size to make it clear that the
>> 3 below is safe.

llvm-svn: 122604
```
  65a699d4
- Silence gcc warning about an unused variable when doing a release build. · 5cf10e69
  Duncan Sands authored Dec 28, 2010
```
llvm-svn: 122593
```
  5cf10e69
Dec 27, 2010

fix some issues Frits noticed, add AliasAnalysis as a dependency · cb18bfa3
Chris Lattner authored Dec 27, 2010
```
llvm-svn: 122585
```
cb18bfa3
BuildLibCalls: Nuke EmitMemCpy, EmitMemMove and EmitMemSet. They are dead and... · 84bd73c5
Benjamin Kramer authored Dec 27, 2010
```
BuildLibCalls: Nuke EmitMemCpy, EmitMemMove and EmitMemSet. They are dead and superseded by IRBuilder.

llvm-svn: 122576
```
84bd73c5
SimplifyLibCalls: Use IRBuilder to simplify code. · 7cba269d
Benjamin Kramer authored Dec 27, 2010
```
llvm-svn: 122575
```
7cba269d
have loop-idiom nuke instructions that feed stores that get removed. · b9fe685b
Chris Lattner authored Dec 27, 2010
```
llvm-svn: 122574
```
b9fe685b

implement enough of the memset inference algorithm to recognize and insert · 29e14edc

Chris Lattner authored Dec 26, 2010

memsets.  This is still missing one important validity check, but this is enough
to compile stuff like this:

void test0(std::vector<char> &X) {
  for (std::vector<char>::iterator I = X.begin(), E = X.end(); I != E; ++I)
    *I = 0;
}

void test1(std::vector<int> &X) {
  for (long i = 0, e = X.size(); i != e; ++i)
    X[i] = 0x01010101;
}

With:
 $ clang t.cpp -S -o - -O2 -emit-llvm | opt -loop-idiom | opt -O3 | llc 

to:

__Z5test0RSt6vectorIcSaIcEE:            ## @_Z5test0RSt6vectorIcSaIcEE
## BB#0:                                ## %entry
	subq	$8, %rsp
	movq	(%rdi), %rax
	movq	8(%rdi), %rsi
	cmpq	%rsi, %rax
	je	LBB0_2
## BB#1:                                ## %bb.nph
	subq	%rax, %rsi
	movq	%rax, %rdi
	callq	___bzero
LBB0_2:                                 ## %for.end
	addq	$8, %rsp
	ret
...
__Z5test1RSt6vectorIiSaIiEE:            ## @_Z5test1RSt6vectorIiSaIiEE
## BB#0:                                ## %entry
	subq	$8, %rsp
	movq	(%rdi), %rax
	movq	8(%rdi), %rdx
	subq	%rax, %rdx
	cmpq	$4, %rdx
	jb	LBB1_2
## BB#1:                                ## %for.body.preheader
	andq	$-4, %rdx
	movl	$1, %esi
	movq	%rax, %rdi
	callq	_memset
LBB1_2:                                 ## %for.end
	addq	$8, %rsp
	ret

llvm-svn: 122573

29e14edc

Dec 26, 2010
- start using irbuilder to make mem intrinsics in a few passes. · 6cf8d6cc
  Chris Lattner authored Dec 26, 2010
```
llvm-svn: 122572
```
  6cf8d6cc
- sketch more of this out. · 7c5f9c35
  Chris Lattner authored Dec 26, 2010
```
llvm-svn: 122567
```
  7c5f9c35
- move isBytewiseValue out to ValueTracking.h/cpp · 9cb1035f
  Chris Lattner authored Dec 26, 2010
```
llvm-svn: 122565
```
  9cb1035f
- actually add the file... · 81ae3f29
  Chris Lattner authored Dec 26, 2010
```
llvm-svn: 122563
```
  81ae3f29
- Start of a pass for recognizing memset and memcpy idioms. · 2ef535a4
  Chris Lattner authored Dec 26, 2010
```
No functionality yet.

llvm-svn: 122562
```
  2ef535a4
- Simplify code. · 30342fb1
  Benjamin Kramer authored Dec 26, 2010
```
llvm-svn: 122561
```
  30342fb1
Dec 25, 2010
- don't lose TD info · d729d0dc
  Chris Lattner authored Dec 25, 2010
```
llvm-svn: 122556
```
  d729d0dc
- switch the inliner alignment enforcement stuff to use the · 20fca483
  Chris Lattner authored Dec 25, 2010
```
getOrEnforceKnownAlignment function, which simplifies the code
and makes it stronger.

llvm-svn: 122555
```
  20fca483
- Move getOrEnforceKnownAlignment out of instcombine into Transforms/Utils. · 6fcd32e7
  Chris Lattner authored Dec 25, 2010
```
llvm-svn: 122554
```
  6fcd32e7
Dec 24, 2010

Fix a thinko pointed out by Frits van Bommel: looking through global variables... · b90b2f06
Benjamin Kramer authored Dec 24, 2010
```
Fix a thinko pointed out by Frits van Bommel: looking through global variables in isBytewiseValue is not safe.

llvm-svn: 122550
```
b90b2f06

MemCpyOpt: Turn memcpys from a constant into a memset if possible. · ea9152e5

Benjamin Kramer authored Dec 24, 2010

This allows us to compile "int cst[] = {-1, -1, -1};" into
  movl  $-1, 16(%rsp)
  movq  $-1, 8(%rsp)
instead of
  movl  _cst+8(%rip), %eax
  movl  %eax, 16(%rsp)
  movq  _cst(%rip), %rax
  movq  %rax, 8(%rsp)

llvm-svn: 122548

ea9152e5