Commits · ac86efb6a37fef3f77ab5300f1f906458f794953 · Roger Ferrer / llvm-epi-0.8

Aug 08, 2005

Implement LoopStrengthReduce/share_code_in_preheader.ll by having one · c70bbc0c
Chris Lattner authored Aug 08, 2005
```
rewriter for all code inserted into the preheader, which is never flushed.

llvm-svn: 22702
```
c70bbc0c

Implement a simple optimization for the termination condition of the loop. · 9bfa6f87

Chris Lattner authored Aug 08, 2005

The termination condition actually wants to use the post-incremented value
of the loop, not a new indvar with an unusual base.

On PPC, for example, this allows us to compile
LoopStrengthReduce/exit_compare_live_range.ll to:

_foo:
        li r2, 0
.LBB_foo_1:     ; no_exit
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        cmpw cr0, r2, r4
        bne .LBB_foo_1  ; no_exit
        blr

instead of:

_foo:
        li r2, 1                ;; IV starts at 1, not 0
.LBB_foo_1:     ; no_exit
        li r5, 0
        stw r5, 0(r3)
        addi r5, r2, 1
        cmpw cr0, r2, r4
        or r2, r5, r5           ;; Reg-reg copy, extra live range
        bne .LBB_foo_1  ; no_exit
        blr

This implements LoopStrengthReduce/exit_compare_live_range.ll

llvm-svn: 22699

9bfa6f87

Aug 05, 2005

Make sure to clean CastedPointers after casts are potentially deleted. · 11e7a5ed
Chris Lattner authored Aug 05, 2005
```
This fixes LSR crashes on 301.apsi, 191.fma3d, and 189.lucas

llvm-svn: 22673
```
11e7a5ed

Modify how immediates are removed from base expressions to deal with the fact · 45f8b6e7

Chris Lattner authored Aug 04, 2005

that the symbolic evaluator is not always able to use subtraction to remove
expressions.  This makes the code faster, and fixes the last crash on 178.galgel.
Finally, add a statistic to see how many phi nodes are inserted.

On 178.galgel, we get the follow stats:

2562 loop-reduce  - Number of PHIs inserted
3927 loop-reduce  - Number of GEPs strength reduced

llvm-svn: 22662

45f8b6e7

Aug 04, 2005

* Refactor some code into a new BasedUser::RewriteInstructionToUseNewBase · a6d7c355

Chris Lattner authored Aug 04, 2005

  method.
* Fix a crash on 178.galgel, where we would insert expressions before PHI
  nodes instead of into the PHI node predecessor blocks.

llvm-svn: 22657

a6d7c355

Fix a case that caused this to crash on 178.galgel · 0f7c0fa2
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22653
```
0f7c0fa2

Teach LSR about loop-variant expressions, such as loops like this: · acc42c4d

Chris Lattner authored Aug 04, 2005

  for (i = 0; i < N; ++i)
    A[i][foo()] = 0;

here we still want to strength reduce the A[i] part, even though foo() is
l-v.

This also simplifies some of the 'CanReduce' logic.

This implements Transforms/LoopStrengthReduce/ops_after_indvar.ll

llvm-svn: 22652

acc42c4d

Remove some more dead code. · 456044b7
Nate Begeman authored Aug 04, 2005
```
llvm-svn: 22650
```
456044b7

Refactor this code substantially with the following improvements: · eaf24725

Chris Lattner authored Aug 04, 2005

  1. We only analyze instructions once, guaranteed
  2. AnalyzeGetElementPtrUsers has been ripped apart and replaced with
     something much simpler.

The next step is to handle expressions that are not all indvar+loop-invariant
values (e.g. handling indvar+loopvariant).

llvm-svn: 22649

eaf24725

refactor some code · 6f286b76
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22643
```
6f286b76
invert to if's to make the logic simpler · 65107490
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22641
```
65107490

When processing outer loops and we find uses of an IV in inner loops, make · a0102fbc

Chris Lattner authored Aug 04, 2005

sure to handle the use, just don't recurse into it.

This permits us to generate this code for a simple nested loop case:

.LBB_foo_0:     ; entry
        stwu r1, -48(r1)
        stw r29, 44(r1)
        stw r30, 40(r1)
        mflr r11
        stw r11, 56(r1)
        lis r2, ha16(L_A$non_lazy_ptr)
        lwz r30, lo16(L_A$non_lazy_ptr)(r2)
        li r29, 1
.LBB_foo_1:     ; no_exit.0
        bl L_bar$stub
        li r2, 1
        or r3, r30, r30
.LBB_foo_2:     ; no_exit.1
        lfd f0, 8(r3)
        stfd f0, 0(r3)
        addi r4, r2, 1
        addi r3, r3, 8
        cmpwi cr0, r2, 100
        or r2, r4, r4
        bne .LBB_foo_2  ; no_exit.1
.LBB_foo_3:     ; loopexit.1
        addi r30, r30, 800
        addi r2, r29, 1
        cmpwi cr0, r29, 100
        or r29, r2, r2
        bne .LBB_foo_1  ; no_exit.0
.LBB_foo_4:     ; return
        lwz r11, 56(r1)
        mtlr r11
        lwz r30, 40(r1)
        lwz r29, 44(r1)
        lwz r1, 0(r1)
        blr

instead of this:

_foo:
.LBB_foo_0:     ; entry
        stwu r1, -48(r1)
        stw r28, 44(r1)                   ;; uses an extra register.
        stw r29, 40(r1)
        stw r30, 36(r1)
        mflr r11
        stw r11, 56(r1)
        li r30, 1
        li r29, 0
        or r28, r29, r29
.LBB_foo_1:     ; no_exit.0
        bl L_bar$stub
        mulli r2, r28, 800           ;; unstrength-reduced multiply
        lis r3, ha16(L_A$non_lazy_ptr)   ;; loop invariant address computation
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        mulli r4, r29, 800           ;; unstrength-reduced multiply
        addi r3, r3, 8
        add r3, r4, r3
        li r4, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 0(r3)
        stfd f0, 0(r2)
        addi r5, r4, 1
        addi r2, r2, 8                 ;; multiple stride 8 IV's
        addi r3, r3, 8
        cmpwi cr0, r4, 100
        or r4, r5, r5
        bne .LBB_foo_2  ; no_exit.1
.LBB_foo_3:     ; loopexit.1
        addi r28, r28, 1               ;;; Many IV's with stride 1
        addi r29, r29, 1
        addi r2, r30, 1
        cmpwi cr0, r30, 100
        or r30, r2, r2
        bne .LBB_foo_1  ; no_exit.0
.LBB_foo_4:     ; return
        lwz r11, 56(r1)
        mtlr r11
        lwz r30, 36(r1)
        lwz r29, 40(r1)
        lwz r28, 44(r1)
        lwz r1, 0(r1)
        blr

llvm-svn: 22640

a0102fbc

Teach loop-reduce to see into nested loops, to pull out immediate values · fc624704

Chris Lattner authored Aug 03, 2005

pushed down by SCEV.

In a nested loop case, this allows us to emit this:

        lis r3, ha16(L_A$non_lazy_ptr)
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        li r3, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 8(r2)        ;; Uses offset of 8 instead of 0
        stfd f0, 0(r2)
        addi r4, r3, 1
        addi r2, r2, 8
        cmpwi cr0, r3, 100
        or r3, r4, r4
        bne .LBB_foo_2  ; no_exit.1

instead of this:

        lis r3, ha16(L_A$non_lazy_ptr)
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        addi r3, r3, 8
        li r4, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 0(r3)
        stfd f0, 0(r2)
        addi r5, r4, 1
        addi r2, r2, 8
        addi r3, r3, 8
        cmpwi cr0, r4, 100
        or r4, r5, r5
        bne .LBB_foo_2  ; no_exit.1

llvm-svn: 22639

fc624704

improve debug output · bb78c97e
Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22638
```
bb78c97e

Move from Stage 0 to Stage 1. · db23c74e

Chris Lattner authored Aug 03, 2005

Only emit one PHI node for IV uses with identical bases and strides (after
moving foldable immediates to the load/store instruction).

This implements LoopStrengthReduce/dont_insert_redundant_ops.ll, allowing
us to generate this PPC code for test1:

        or r30, r3, r3
.LBB_test1_1:   ; Loop
        li r2, 0
        stw r2, 0(r30)
        stw r2, 4(r30)
        bl L_pred$stub
        addi r30, r30, 8
        cmplwi cr0, r3, 0
        bne .LBB_test1_1        ; Loop

instead of this code:

        or r30, r3, r3
        or r29, r3, r3
.LBB_test1_1:   ; Loop
        li r2, 0
        stw r2, 0(r29)
        stw r2, 4(r30)
        bl L_pred$stub
        addi r30, r30, 8        ;; Two iv's with step of 8
        addi r29, r29, 8
        cmplwi cr0, r3, 0
        bne .LBB_test1_1        ; Loop

llvm-svn: 22635

db23c74e

Rename IVUse to IVUsersOfOneStride, use a struct instead of a pair to · 430d0022

Chris Lattner authored Aug 03, 2005

unify some parallel vectors and get field names more descriptive than
"first" and "second".  This isn't lisp afterall :)

llvm-svn: 22633

430d0022

Aug 03, 2005

Fix a nasty dangling pointer issue. The ScalarEvolution pass would keep a · 84e9baa9

Chris Lattner authored Aug 03, 2005

map from instruction* to SCEVHandles.  When we delete instructions, we have
to tell it about it.  We would run into nasty cases where new instructions
were reallocated at old instruction addresses and get the old map values.
Bad bad bad :(

llvm-svn: 22632

84e9baa9

Aug 02, 2005
- Like the comment says, do not insert cast instructions before phi nodes · 351b891c
  Chris Lattner authored Aug 02, 2005
```
llvm-svn: 22586
```
  351b891c
- add a comment, make a check more lenient · 75a44e15
  Chris Lattner authored Aug 02, 2005
```
llvm-svn: 22581
```
  75a44e15
- Simplify for loop, clear a per-loop map after processing each loop · dcce49e0
  Chris Lattner authored Aug 02, 2005
```
llvm-svn: 22580
```
  dcce49e0
- Add a comment · 9ef12942
  Chris Lattner authored Aug 02, 2005
```
Make LSR ignore GEP's that have loop variant base values, as we currently
cannot codegen them

llvm-svn: 22576
```
  9ef12942
- Fix an iterator invalidation problem · 564900e5
  Chris Lattner authored Aug 02, 2005
```
llvm-svn: 22575
```
  564900e5
Jul 30, 2005
- Keep tabs and trailing spaces out. · 546fd594
  Jeff Cohen authored Jul 30, 2005
```
llvm-svn: 22565
```
  546fd594
- Fix VC++ build problems. · c5009910
  Jeff Cohen authored Jul 30, 2005
```
llvm-svn: 22564
```
  c5009910
- Ack, typo · 17a0e2af
  Nate Begeman authored Jul 30, 2005
```
llvm-svn: 22560
```
  17a0e2af
- Commit a new LoopStrengthReduce pass that can use scalar evolutions and · e68bcd19
  Nate Begeman authored Jul 30, 2005
```
target data to decide which loop induction variables to strength reduce
and how to do so.  This work is mostly by Chris Lattner, with tweaks by
me to get it working on some of MultiSource.

llvm-svn: 22558
```
  e68bcd19
Apr 22, 2005
- Remove trailing whitespace · b1c9317b
  Misha Brukman authored Apr 21, 2005
```
llvm-svn: 21427
```
  b1c9317b
Mar 06, 2005
- fix a bug where we thought arguments were constants :( · 8c795594
  Chris Lattner authored Mar 06, 2005
```
llvm-svn: 20506
```
  8c795594
- Fix Regression/Transforms/LoopStrengthReduce/dont_insert_redundant_ops.ll, · 2ce303b4
  Chris Lattner authored Mar 06, 2005
```
hopefully not breaking too many other things.

llvm-svn: 20505
```
  2ce303b4
- implement Transforms/LoopStrengthReduce/invariant_value_first_arg.ll · 45403e50
  Chris Lattner authored Mar 06, 2005
```
llvm-svn: 20501
```
  45403e50
- minor simplifications of the code. · d3874fad
  Chris Lattner authored Mar 06, 2005
```
llvm-svn: 20497
```
  d3874fad
Mar 05, 2005
- Reformat comments to fix 80 columns. · 4abcea3a
  Jeff Cohen authored Mar 05, 2005
```
llvm-svn: 20467
```
  4abcea3a
- Reuse induction variables created for strength-reduced GEPs by other similar GEPs. · be37fa07
  Jeff Cohen authored Mar 05, 2005
```
llvm-svn: 20466
```
  be37fa07
Mar 04, 2005
- Add support for not strength reducing GEPs where the element size is a small · a2c59b74
  Jeff Cohen authored Mar 04, 2005
```
power of two.  This emphatically includes the zeroeth power of two.

llvm-svn: 20429
```
  a2c59b74
Mar 01, 2005

Fixed the following LSR bugs: · 8ea6f9e8

Jeff Cohen authored Mar 01, 2005

  * Loop invariant code does not dominate the loop header, but rather
    the end of the loop preheader.

  * The base for a reduced GEP isn't a constant unless all of its
    operands (preceding the induction variable) are constant.

  * Allow induction variable elimination for the simple case after all.

Also made changes recommended by Chris for properly deleting
instructions.

llvm-svn: 20383

8ea6f9e8

Feb 28, 2005
- Fix crash in LSR due to attempt to remove original induction variable. However, · dcaa48b5
  Jeff Cohen authored Feb 28, 2005
```
for reasons explained in the comments, I also deactivated this code as it needs
more thought.

llvm-svn: 20367
```
  dcaa48b5
Feb 27, 2005
- PHI nodes were incorrectly placed when more than one GEP is reduced in a loop. · fd63d3af
  Jeff Cohen authored Feb 27, 2005
```
llvm-svn: 20360
```
  fd63d3af
- First pass at improved Loop Strength Reduction. Still not yet ready for prime time. · 39751c3b
  Jeff Cohen authored Feb 27, 2005
```
llvm-svn: 20358
```
  39751c3b
Oct 18, 2004

Initial implementation of the strength reduction for GEP instructions in · b18121e6

Nate Begeman authored Oct 18, 2004

loops.  This optimization is not turned on by default yet, but may be run
with the opt tool's -loop-reduce flag.  There are many FIXMEs listed in the
code that will make it far more applicable to a wide range of code, but you
have to start somewhere :)

This limited version currently triggers on the following tests in the
MultiSource directory:
pcompress2: 7 times
cfrac: 5 times
anagram: 2 times
ks: 6 times
yacr2: 2 times

llvm-svn: 17134

b18121e6