Commits · 295ea906346b189706f504137886a9b115bc2fbf · Roger Ferrer / llvm-epi-0.8

Aug 04, 2005

Use the new subtarget support to automatically choose the correct ABI · 295ea906
Nate Begeman authored Aug 04, 2005
```
and asm printer for PowerPC if one is not specified.

llvm-svn: 22659
```
295ea906

* Refactor some code into a new BasedUser::RewriteInstructionToUseNewBase · a6d7c355

Chris Lattner authored Aug 04, 2005

  method.
* Fix a crash on 178.galgel, where we would insert expressions before PHI
  nodes instead of into the PHI node predecessor blocks.

llvm-svn: 22657

a6d7c355
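The PHI-related crash fix above can be pictured with a toy model. This is only a sketch under stated assumptions: `Block` and `insert_in_predecessor` are hypothetical names, not LLVM API, and a block is modelled as a list of instruction strings whose last entry is the terminator. A value consumed by a PHI node has to be computed in the matching predecessor block, ahead of that block's terminator, rather than in front of the PHI itself.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical toy model: a basic block is a list of instruction strings,
// and the last entry is assumed to be the terminator (branch or return).
using Block = std::vector<std::string>;

// Place `expr` in the predecessor block, just before its terminator, so the
// value is available on the edge that feeds the PHI node in the successor.
void insert_in_predecessor(Block &pred, const std::string &expr) {
  if (pred.empty()) {
    pred.push_back(expr);  // degenerate case: nothing to insert before
    return;
  }
  pred.insert(pred.end() - 1, expr);
}
```

The successor block is never touched, so its PHI nodes stay grouped at the top of the block.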

This should not run lli, that is for llvm-test. · ed87aa97
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22656
```
ed87aa97
None of these tests should require a working lli, they are codegen tests, · af783c58
Chris Lattner authored Aug 04, 2005
```
not execution tests.

llvm-svn: 22655
```
af783c58
Fix a case that caused this to crash on 178.galgel · 0f7c0fa2
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22653
```
0f7c0fa2

Teach LSR about loop-variant expressions, such as loops like this: · acc42c4d

Chris Lattner authored Aug 04, 2005

  for (i = 0; i < N; ++i)
    A[i][foo()] = 0;

Here we still want to strength-reduce the A[i] part, even though foo() is
loop-variant.

This also simplifies some of the 'CanReduce' logic.

This implements Transforms/LoopStrengthReduce/ops_after_indvar.ll

llvm-svn: 22652

acc42c4d
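At source level, the effect described in the commit above might look roughly like the following. This is a hand-written sketch, not output of the pass: the dimensions, the `Matrix` alias and the counter-based `foo` are assumptions made so the example is self-contained. The `A[i]` part becomes a row pointer that is bumped once per iteration, while the loop-variant `foo()` index is still evaluated every time.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

constexpr std::size_t kRows = 3, kCols = 4;  // hypothetical dimensions
using Matrix = std::array<std::array<int, kCols>, kRows>;

static std::size_t g_calls = 0;
// Stand-in for foo(): the result changes on every call, so it is loop-variant.
std::size_t foo() { return g_calls++ % kCols; }

// Indexed form: the address of A[i] is recomputed from i on every iteration.
Matrix clear_naive(Matrix A) {
  g_calls = 0;
  for (std::size_t i = 0; i < kRows; ++i)
    A[i][foo()] = 0;
  return A;
}

// Hand strength-reduced form: a row pointer replaces the multiply implied by
// A[i]; only the foo() part is still computed inside the loop.
Matrix clear_reduced(Matrix A) {
  g_calls = 0;
  auto *row = A.data();  // pointer induction variable over rows
  for (std::size_t i = 0; i < kRows; ++i, ++row)
    (*row)[foo()] = 0;
  return A;
}
```

Both functions write the same cells; the difference is only in how the row address is formed.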

This testcase now passes · faf17b43
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22651
```
faf17b43
Remove some more dead code. · 456044b7
Nate Begeman authored Aug 04, 2005
```
llvm-svn: 22650
```
456044b7

Refactor this code substantially with the following improvements: · eaf24725

Chris Lattner authored Aug 04, 2005

  1. We only analyze instructions once, guaranteed
  2. AnalyzeGetElementPtrUsers has been ripped apart and replaced with
     something much simpler.

The next step is to handle expressions that are not all indvar+loop-invariant
values (e.g. handling indvar+loopvariant).

llvm-svn: 22649

eaf24725

No, IDEFs shouldn't be JITed · 5adb830b
Andrew Lenharth authored Aug 04, 2005
```
llvm-svn: 22648
```
5adb830b
* Unbreak release build · a54e201e
Misha Brukman authored Aug 04, 2005
```
* Add comments to #endif pragmas for readability

llvm-svn: 22647
```
a54e201e
* Unbreak optimized build (noticed by Eric van Riet Paap) · 41acd5e0
Misha Brukman authored Aug 04, 2005
```
* Comment #endif clauses for readability

llvm-svn: 22646
```
41acd5e0
Add Subtarget support to PowerPC. Next up, using it. · 3bcfcd94
Nate Begeman authored Aug 04, 2005
```
llvm-svn: 22644
```
3bcfcd94
refactor some code · 6f286b76
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22643
```
6f286b76
this is not implemented by lsr yet · 9969c861
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22642
```
9969c861
invert two if's to make the logic simpler · 65107490
Chris Lattner authored Aug 04, 2005
```
llvm-svn: 22641
```
65107490

When processing outer loops and we find uses of an IV in inner loops, make · a0102fbc

Chris Lattner authored Aug 04, 2005

sure to handle the use, just don't recurse into it.

This permits us to generate this code for a simple nested loop case:

.LBB_foo_0:     ; entry
        stwu r1, -48(r1)
        stw r29, 44(r1)
        stw r30, 40(r1)
        mflr r11
        stw r11, 56(r1)
        lis r2, ha16(L_A$non_lazy_ptr)
        lwz r30, lo16(L_A$non_lazy_ptr)(r2)
        li r29, 1
.LBB_foo_1:     ; no_exit.0
        bl L_bar$stub
        li r2, 1
        or r3, r30, r30
.LBB_foo_2:     ; no_exit.1
        lfd f0, 8(r3)
        stfd f0, 0(r3)
        addi r4, r2, 1
        addi r3, r3, 8
        cmpwi cr0, r2, 100
        or r2, r4, r4
        bne .LBB_foo_2  ; no_exit.1
.LBB_foo_3:     ; loopexit.1
        addi r30, r30, 800
        addi r2, r29, 1
        cmpwi cr0, r29, 100
        or r29, r2, r2
        bne .LBB_foo_1  ; no_exit.0
.LBB_foo_4:     ; return
        lwz r11, 56(r1)
        mtlr r11
        lwz r30, 40(r1)
        lwz r29, 44(r1)
        lwz r1, 0(r1)
        blr

instead of this:

_foo:
.LBB_foo_0:     ; entry
        stwu r1, -48(r1)
        stw r28, 44(r1)                   ;; uses an extra register.
        stw r29, 40(r1)
        stw r30, 36(r1)
        mflr r11
        stw r11, 56(r1)
        li r30, 1
        li r29, 0
        or r28, r29, r29
.LBB_foo_1:     ; no_exit.0
        bl L_bar$stub
        mulli r2, r28, 800           ;; unstrength-reduced multiply
        lis r3, ha16(L_A$non_lazy_ptr)   ;; loop invariant address computation
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        mulli r4, r29, 800           ;; unstrength-reduced multiply
        addi r3, r3, 8
        add r3, r4, r3
        li r4, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 0(r3)
        stfd f0, 0(r2)
        addi r5, r4, 1
        addi r2, r2, 8                 ;; multiple stride 8 IV's
        addi r3, r3, 8
        cmpwi cr0, r4, 100
        or r4, r5, r5
        bne .LBB_foo_2  ; no_exit.1
.LBB_foo_3:     ; loopexit.1
        addi r28, r28, 1               ;;; Many IV's with stride 1
        addi r29, r29, 1
        addi r2, r30, 1
        cmpwi cr0, r30, 100
        or r30, r2, r2
        bne .LBB_foo_1  ; no_exit.0
.LBB_foo_4:     ; return
        lwz r11, 56(r1)
        mtlr r11
        lwz r30, 36(r1)
        lwz r29, 40(r1)
        lwz r28, 44(r1)
        lwz r1, 0(r1)
        blr

llvm-svn: 22640

a0102fbc
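The improved listing above advances the row base by 800 bytes per outer iteration and walks the inner loop with a single stride-8 pointer. A source-level picture of that shape, assuming the original loop copied `A[i][j+1]` into `A[i][j]` over rows of doubles (an inference from the loads and stores, not something the commit states), could be sketched as follows. The sizes and function names are hypothetical, and the call to `bar()` is left out.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

constexpr std::size_t kGridRows = 3, kGridCols = 4;  // hypothetical sizes
// One spare column so that reading A[i][j + 1] stays inside the row.
using Grid = std::array<std::array<double, kGridCols + 1>, kGridRows>;

// Indexed form: each access implies i * rowsize + j * 8 address arithmetic.
Grid shift_indexed(Grid A) {
  for (std::size_t i = 0; i < kGridRows; ++i)
    for (std::size_t j = 0; j < kGridCols; ++j)
      A[i][j] = A[i][j + 1];
  return A;
}

// Hand strength-reduced form: the outer loop bumps a row pointer, and the
// inner loop uses one element pointer with constant offsets 0 and 1.
Grid shift_reduced(Grid A) {
  auto *row = A.data();
  for (std::size_t i = 0; i < kGridRows; ++i, ++row) {
    double *p = row->data();
    for (std::size_t j = 0; j < kGridCols; ++j, ++p)
      p[0] = p[1];
  }
  return A;
}
```

The constant offset in `p[1]` plays the role of the `8(r3)` displacement in the assembly, so no second pointer is needed.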

Teach loop-reduce to see into nested loops, to pull out immediate values · fc624704

Chris Lattner authored Aug 03, 2005

pushed down by SCEV.

In a nested loop case, this allows us to emit this:

        lis r3, ha16(L_A$non_lazy_ptr)
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        li r3, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 8(r2)        ;; Uses offset of 8 instead of 0
        stfd f0, 0(r2)
        addi r4, r3, 1
        addi r2, r2, 8
        cmpwi cr0, r3, 100
        or r3, r4, r4
        bne .LBB_foo_2  ; no_exit.1

instead of this:

        lis r3, ha16(L_A$non_lazy_ptr)
        lwz r3, lo16(L_A$non_lazy_ptr)(r3)
        add r2, r2, r3
        addi r3, r3, 8
        li r4, 1
.LBB_foo_2:     ; no_exit.1
        lfd f0, 0(r3)
        stfd f0, 0(r2)
        addi r5, r4, 1
        addi r2, r2, 8
        addi r3, r3, 8
        cmpwi cr0, r4, 100
        or r4, r5, r5
        bne .LBB_foo_2  ; no_exit.1

llvm-svn: 22639

fc624704

improve debug output · bb78c97e
Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22638
```
bb78c97e

Scalar SSE: load +0.0 -> xorps/xorpd · 8d394eb7

Nate Begeman authored Aug 03, 2005

Scalar SSE: a < b ? c : 0.0 -> cmpss, andps
Scalar SSE: float -> i16 needs to be promoted

llvm-svn: 22637

8d394eb7
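The `a < b ? c : 0.0` pattern above relies on a compare producing an all-ones or all-zeros mask that is then ANDed with `c`. The scalar sketch below imitates that with integer bit operations; it assumes a 32-bit IEEE-754 `float`, and `select_or_zero` is a hypothetical name rather than anything in the backend.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Branch-free equivalent of (a < b ? c : 0.0f), assuming 32-bit IEEE floats.
float select_or_zero(float a, float b, float c) {
  // A less-than compare in the cmpss style yields all ones or all zeros.
  std::uint32_t mask = (a < b) ? 0xFFFFFFFFu : 0u;
  std::uint32_t bits;
  std::memcpy(&bits, &c, sizeof bits);  // reinterpret c as raw bits
  bits &= mask;                         // the andps step: keep c or clear it
  float out;
  std::memcpy(&out, &bits, sizeof out);
  return out;
}
```

This works because +0.0 is the all-zero bit pattern, which is also why a load of +0.0 can be replaced by XORing a register with itself.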

this now passes · 47b57322
Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22636
```
47b57322

Move from Stage 0 to Stage 1. · db23c74e

Chris Lattner authored Aug 03, 2005

Only emit one PHI node for IV uses with identical bases and strides (after
moving foldable immediates to the load/store instruction).

This implements LoopStrengthReduce/dont_insert_redundant_ops.ll, allowing
us to generate this PPC code for test1:

        or r30, r3, r3
.LBB_test1_1:   ; Loop
        li r2, 0
        stw r2, 0(r30)
        stw r2, 4(r30)
        bl L_pred$stub
        addi r30, r30, 8
        cmplwi cr0, r3, 0
        bne .LBB_test1_1        ; Loop

instead of this code:

        or r30, r3, r3
        or r29, r3, r3
.LBB_test1_1:   ; Loop
        li r2, 0
        stw r2, 0(r29)
        stw r2, 4(r30)
        bl L_pred$stub
        addi r30, r30, 8        ;; Two iv's with step of 8
        addi r29, r29, 8
        cmplwi cr0, r3, 0
        bne .LBB_test1_1        ; Loop

llvm-svn: 22635

db23c74e
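The two listings above differ only in how many pointer induction variables the loop carries. A rough source-level analogue is sketched below; the function names are hypothetical, a fixed trip count stands in for the `pred()` call of the test, and the vector is assumed to hold at least `2 * pairs` elements.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Before: two induction variables with the same base and the same stride.
std::vector<int> zero_two_ivs(std::vector<int> v, std::size_t pairs) {
  int *p = v.data(), *q = v.data();
  for (std::size_t i = 0; i < pairs; ++i) {
    p[0] = 0;
    q[1] = 0;
    p += 2;  // both pointers step by 8 bytes when int is 4 bytes
    q += 2;
  }
  return v;
}

// After: one induction variable; the second store is reached through a
// constant offset folded into the access.
std::vector<int> zero_one_iv(std::vector<int> v, std::size_t pairs) {
  int *p = v.data();
  for (std::size_t i = 0; i < pairs; ++i) {
    p[0] = 0;
    p[1] = 0;
    p += 2;
  }
  return v;
}
```

Dropping the second pointer saves one add per iteration and one register, which matches the difference between the two assembly listings.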

Alpha ABI specifies stack is always 16 byte aligned, and gcc does it, so I will too · 3a18a395
Andrew Lenharth authored Aug 03, 2005
```
llvm-svn: 22634
```
3a18a395
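Keeping a stack 16-byte aligned usually comes down to rounding each frame size up to the next multiple of 16. A minimal sketch of that arithmetic follows; `align_to_16` is a hypothetical helper, not code from the Alpha backend, and it ignores overflow for sizes near the maximum value.

```cpp
#include <cassert>
#include <cstdint>

// Round a byte count up to the next multiple of 16.
constexpr std::uint64_t align_to_16(std::uint64_t bytes) {
  return (bytes + 15) & ~std::uint64_t{15};
}
```

For example, a 40-byte frame would be padded to 48 bytes, while a 32-byte frame is left unchanged.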

Rename IVUse to IVUsersOfOneStride, use a struct instead of a pair to · 430d0022

Chris Lattner authored Aug 03, 2005

unify some parallel vectors and get field names more descriptive than
"first" and "second".  This isn't lisp after all :)

llvm-svn: 22633

430d0022

Aug 03, 2005
- Fix a nasty dangling pointer issue. The ScalarEvolution pass would keep a · 84e9baa9
  Chris Lattner authored Aug 03, 2005
```
map from instruction* to SCEVHandles.  When we delete instructions, we have
to tell it about it.  We would run into nasty cases where new instructions
were reallocated at old instruction addresses and get the old map values.
Bad bad bad :(

llvm-svn: 22632
```
  84e9baa9
- Fix this to test the backend we care about · 938ebaa2
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22631
```
  938ebaa2
- Fix an obvious bug in the Log2 stuff that broke SingleSource/UnitTests/2005-05-12-Int64ToFP · 1d465e89
  Chris Lattner authored Aug 03, 2005
```
last night.

llvm-svn: 22630
```
  1d465e89
- Fix PR611, codegen'ing SREM of FP operands to fmod or fmodf instead of · 81914425
  Chris Lattner authored Aug 03, 2005
```
the sequence used for integer ops

llvm-svn: 22629
```
  81914425
- The correct fix for PR612, which also fixes · 3de05cc9
  Chris Lattner authored Aug 03, 2005
```
Transforms/LowerInvoke/2005-08-03-InvokeWithPHIUse.ll

llvm-svn: 22628
```
  3de05cc9
- new testcase for PR612 · c519a7e0
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22627
```
  c519a7e0
- When inserting code, make sure not to insert it before PHI nodes. This · f8a81a98
  Chris Lattner authored Aug 03, 2005
```
fixes PR612 and Transforms/LowerInvoke/2005-08-03-InvokeWithPHI.ll

llvm-svn: 22626
```
  f8a81a98
- new testcase for PR612 · 89ba7922
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22625
```
  89ba7922
- Add a couple rlwinm tests for bitfield clears · b3b86d56
  Nate Begeman authored Aug 03, 2005
```
llvm-svn: 22624
```
  b3b86d56
- Update rlwimi tests to catch all the cases we care about · 134628b4
  Nate Begeman authored Aug 03, 2005
```
llvm-svn: 22623
```
  134628b4
- Testcase that used to crash simplifycfg · 0ca5d9cb
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22622
```
  0ca5d9cb
- Fix Transforms/SimplifyCFG/2005-08-03-PHIFactorCrash.ll, a problem that · d683bdd0
  Chris Lattner authored Aug 03, 2005
```
occurred while bugpointing another testcase

llvm-svn: 22621
```
  d683bdd0
- add support for Graphviz when viewing CFGs · 590642eb
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22620
```
  590642eb
- Fix grammar: apostrophe-s ('s) is possessive, not plural; also iff vs. if. · fce32858
  Misha Brukman authored Aug 03, 2005
```
llvm-svn: 22619
```
  fce32858
- Wrap comments to 80 cols, fix code sequence for CountLeadingZeros_64 on · 8aa621c4
  Chris Lattner authored Aug 03, 2005
```
non-ppc GCC 4.0 machines.  Patch by Jim Laskey!

llvm-svn: 22618
```
  8aa621c4
- minor capitalization thing, patch by Jim Laskey · 8dc82b79
  Chris Lattner authored Aug 03, 2005
```
llvm-svn: 22617
```
  8dc82b79
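The dangling-pointer fix in 84e9baa9 above concerns a cache keyed by instruction address. The sketch below reproduces the hazard in miniature and is not ScalarEvolution's code: `Inst`, `Analysis` and `forget` are hypothetical names, and reassigning an object in place stands in for a new instruction being allocated at a freed instruction's address.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

struct Inst { std::string text; };  // hypothetical stand-in for an instruction

class Analysis {
  // Results are cached by address, so an entry outlives the instruction
  // unless it is explicitly removed.
  std::unordered_map<const Inst *, std::string> cache_;

public:
  const std::string &get(const Inst *i) {
    auto it = cache_.find(i);
    if (it == cache_.end())
      it = cache_.emplace(i, "scev(" + i->text + ")").first;
    return it->second;
  }

  // Must be called whenever an instruction is deleted.
  void forget(const Inst *i) { cache_.erase(i); }
};
```

Without the `forget` call, a lookup for the new instruction returns the result cached for the old one, which is the failure the commit message describes.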