Commits · 0cec5cb92cfbc723cb7197dbc32febde77c7e920 · Roger Ferrer / llvm-epi

Apr 15, 2004

Change the canonical induction variable that we insert. · 0cec5cb9

Chris Lattner authored Apr 15, 2004

Instead of producing code like this:

Loop:
  X = phi 0, X2
  ...

  X2 = X + 1
  if (X != N-1) goto Loop

We now generate code that looks like this:

Loop:
  X = phi 0, X2
  ...

  X2 = X + 1
  if (X2 != N) goto Loop

This has two big advantages:
  1. The trip count of the loop is now explicit in the code, allowing
     the direct implementation of Loop::getTripCount()
  2. This reduces register pressure in the loop, and allows X and X2 to be
     put into the same register.

As a consequence of the second point, the code we generate for loops went
from:

.LBB2:  # no_exit.1
	...
        mov %EDI, %ESI
        inc %EDI
        cmp %ESI, 2
        mov %ESI, %EDI
        jne .LBB2 # PC rel: no_exit.1

To:

.LBB2:  # no_exit.1
	...
        inc %ESI
        cmp %ESI, 3
        jne .LBB2 # PC rel: no_exit.1

... which has two fewer moves, and uses one less register.

llvm-svn: 12961

0cec5cb9

Apr 14, 2004
- ADd a trivial instcombine: load null -> null · 6679e46b
  Chris Lattner authored Apr 14, 2004
```
llvm-svn: 12940
```
  6679e46b
Apr 13, 2004
- Add SCCP support for constant folding calls, implementing: · ff9362a8
  Chris Lattner authored Apr 13, 2004
```
test/Regression/Transforms/SCCP/calltest.ll

llvm-svn: 12921
```
  ff9362a8
- Constant propagation should remove the dead instructions · d0dc6d52
  Chris Lattner authored Apr 13, 2004
```
llvm-svn: 12917
```
  d0dc6d52
- Fix LoopSimplify/2004-04-13-LoopSimplifyUpdateDomFrontier.ll · 89e959bb
  Chris Lattner authored Apr 13, 2004
```
LoopSimplify was not updating dominator frontiers correctly in some cases.

llvm-svn: 12890
```
  89e959bb
- Refactor code a bit to make it simpler and eliminate the goto · a6e22814
  Chris Lattner authored Apr 13, 2004
```
llvm-svn: 12888
```
  a6e22814
- This patch addresses PR35: Loop simplify should reconstruct nested loops. · 84170529
  Chris Lattner authored Apr 13, 2004
```
This is fairly straight-forward, but was a real nightmare to get just
perfect.  aarg.  :)

llvm-svn: 12884
```
  84170529
Apr 12, 2004
- Add support for removing invoke instructions · 494a6854
  Chris Lattner authored Apr 12, 2004
```
llvm-svn: 12858
```
  494a6854
Apr 11, 2004
- Fix a bug in my select transformation · 24cf0200
  Chris Lattner authored Apr 11, 2004
```
llvm-svn: 12826
```
  24cf0200
- Update the value numbering interface. · f16fe720
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12824
```
  f16fe720
- Implement InstCombine/select.ll:test13* · 623fba11
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12821
```
  623fba11
- Implement InstCombine/add.ll:test20 · cf4a996c
  Chris Lattner authored Apr 10, 2004
```
Canonicalize add of sign bit constant into a xor

llvm-svn: 12819
```
  cf4a996c
Apr 10, 2004
- Rewrite the GCSE pass to be *substantially* simpler, a bit more efficient, · 69c49005
  Chris Lattner authored Apr 10, 2004
```
and a bit more powerful

llvm-svn: 12817
```
  69c49005
- Fix spurious warning in release mode · f9d96651
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12816
```
  f9d96651
- Simplify code a bit, and fix a bug that was breaking perlbmk · d95ef7ef
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12814
```
  d95ef7ef
- Fix a bug in my checkin last night that was breaking programs using invoke. · 7ebfe61d
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12813
```
  7ebfe61d
- Fix previous patch · 5093213c
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12811
```
  5093213c
- Correctly update counters · 6149ac89
  Chris Lattner authored Apr 10, 2004
```
llvm-svn: 12810
```
  6149ac89
- Simplify code a bit, and use alias analysis to allow us to delete unused · cfa1adcd
  Chris Lattner authored Apr 10, 2004
```
call and invoke instructions that are known to not write to memory.

llvm-svn: 12807
```
  cfa1adcd
- Implement select.ll:test12* · 56e4d3d8
  Chris Lattner authored Apr 09, 2004
```
This transforms code like this:

   %C = or %A, %B
   %D = select %cond, %C, %A
into:
   %C = select %cond, %B, 0
   %D = or %A, %C

Since B is often a constant, the select can often be eliminated.  In any case,
this reduces the usage count of A, allowing subsequent optimizations to happen.

This xform applies when the operator is any of:
  add, sub, mul, or, xor, and, shl, shr

llvm-svn: 12800
```
  56e4d3d8
Apr 09, 2004
- Fold binary operators with a constant operand into select instructions · 183b336a
  Chris Lattner authored Apr 09, 2004
```
that have a constant operand.  This implements
add.ll:test19, shift.ll:test15*, and others that are not tested

llvm-svn: 12794
```
  183b336a
- Implement select.ll:test11 · cf7baf35
  Chris Lattner authored Apr 09, 2004
```
llvm-svn: 12793
```
  cf7baf35
Apr 08, 2004
- Implement InstCombine/cast-propagate.ll · e228ee58
  Chris Lattner authored Apr 08, 2004
```
llvm-svn: 12784
```
  e228ee58
- Implement InstCombine/select.ll:test[7-10] · 1c631e81
  Chris Lattner authored Apr 08, 2004
```
llvm-svn: 12769
```
  1c631e81
Apr 07, 2004
- Implement test/Regression/Transforms/InstCombine/getelementptr_index.ll · 2b2412d0
  Chris Lattner authored Apr 07, 2004
```
llvm-svn: 12762
```
  2b2412d0
Apr 05, 2004
- Fix a bug in yesterdays checkins which broke siod. siod is a great testcase! :) · 4d1fcf1d
  Chris Lattner authored Apr 05, 2004
```
llvm-svn: 12659
```
  4d1fcf1d
- Fix InstCombine/2004-04-04-InstCombineReplaceAllUsesWith.ll · 8953b90a
  Chris Lattner authored Apr 05, 2004
```
llvm-svn: 12658
```
  8953b90a
- Support getelementptr instructions which use uint's to index into structure · 69193f93
  Chris Lattner authored Apr 05, 2004
```
types and can have arbitrary 32- and 64-bit integer types indexing into
sequential types.

llvm-svn: 12653
```
  69193f93
Apr 02, 2004

Rewrite the indvars pass to use the ScalarEvolution analysis. · e61b67d7

Chris Lattner authored Apr 02, 2004

This also implements some new features for the indvars pass, including
linear function test replacement, exit value substitution, and it works with
a much more general class of induction variables and loops.

llvm-svn: 12620

e61b67d7

Apr 01, 2004
- Remove some assertions that are now bogus with the last patch I put in · 59fdf749
  Chris Lattner authored Apr 01, 2004
```
llvm-svn: 12595
```
  59fdf749
- Fix PR306: Loop simplify incorrectly updates dominator information · 146d0df5
  Chris Lattner authored Apr 01, 2004
```
Testcase: LoopSimplify/2004-04-01-IncorrectDomUpdate.ll

llvm-svn: 12592
```
  146d0df5
- Add warning · 61fab140
  Chris Lattner authored Mar 31, 2004
```
llvm-svn: 12573
```
  61fab140
Mar 30, 2004
- Implement select.ll:test[3-6] · 533bc497
  Chris Lattner authored Mar 30, 2004
```
llvm-svn: 12544
```
  533bc497
- Add a simple select instruction lowering pass · 059f3902
  Chris Lattner authored Mar 30, 2004
```
llvm-svn: 12540
```
  059f3902
Mar 26, 2004
- X % -1 == X % 1 == 0 · 56b50514
  Chris Lattner authored Mar 26, 2004
```
llvm-svn: 12520
```
  56b50514
Mar 25, 2004

Two changes: · 57c67b06

Chris Lattner authored Mar 25, 2004

#1 is to unconditionally strip constantpointerrefs out of
instruction operands where they are absolutely pointless and inhibit
optimization.  GRRR!

#2 is to implement InstCombine/getelementptr_const.ll

llvm-svn: 12519

57c67b06

Mar 19, 2004
- Teach the optimizer to delete zero sized alloca's (but not mallocs!) · abb77c99
  Chris Lattner authored Mar 19, 2004
```
llvm-svn: 12507
```
  abb77c99
Mar 17, 2004

Be more accurate · 684fa5ac
Chris Lattner authored Mar 17, 2004
```
llvm-svn: 12464
```
684fa5ac
Fix bug in previous checkin · a3783a57
Chris Lattner authored Mar 16, 2004
```
llvm-svn: 12458
```
a3783a57

Okay, so there is no reasonable way for tail duplication to update SSA form, · 95057f6a

Chris Lattner authored Mar 16, 2004

as it is making effectively arbitrary modifications to the CFG and we don't
have a domset/domfrontier implementations that can handle the dynamic updates.
Instead of having a bunch of code that doesn't actually work in practice,
just demote any potentially tricky values to the stack (causing the problem
to go away entirely).  Later invocations of mem2reg will rebuild SSA for us.

This fixes all of the major performance regressions with tail duplication
from LLVM 1.1.  For example, this loop:

---
int popcount(int x) {
  int result = 0;
  while (x != 0) {
    result = result + (x & 0x1);
    x = x >> 1;
  }
  return result;
}
---
Used to be compiled into:

int %popcount(int %X) {
entry:
	br label %loopentry

loopentry:		; preds = %entry, %no_exit
	%x.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ]		; <int> [#uses=3]
	%result.1.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ]		; <int> [#uses=2]
	%tmp.1 = seteq int %x.0, 0		; <bool> [#uses=1]
	br bool %tmp.1, label %loopexit, label %no_exit

no_exit:		; preds = %loopentry
	%tmp.4 = and int %x.0, 1		; <int> [#uses=1]
	%tmp.6 = add int %tmp.4, %result.1.0		; <int> [#uses=1]
	%tmp.9 = shr int %x.0, ubyte 1		; <int> [#uses=1]
	br label %loopentry

loopexit:		; preds = %loopentry
	ret int %result.1.0
}

And is now compiled into:

int %popcount(int %X) {
entry:
        br label %no_exit

no_exit:                ; preds = %entry, %no_exit
        %x.0.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ]          ; <int> [#uses=2]
        %result.1.0.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ]             ; <int> [#uses=1]
        %tmp.4 = and int %x.0.0, 1              ; <int> [#uses=1]
        %tmp.6 = add int %tmp.4, %result.1.0.0          ; <int> [#uses=2]
        %tmp.9 = shr int %x.0.0, ubyte 1                ; <int> [#uses=2]
        %tmp.1 = seteq int %tmp.9, 0            ; <bool> [#uses=1]
        br bool %tmp.1, label %loopexit, label %no_exit

loopexit:               ; preds = %no_exit
        ret int %tmp.6
}

llvm-svn: 12457

95057f6a