Commits · 55149d7835815899fc412cfe4b4b5eee50294b4d · Roger Ferrer / llvm-epi-0.8

Oct 05, 2005

Fix a bug in the local spiller, where we could take code like this: · 55149d78

Chris Lattner authored Oct 05, 2005

  store r12 -> [ss#2]
  R3 = load [ss#1]
  use R3
  R3 = load [ss#2]
  R4 = load [ss#1]

and turn it into this code:

  store R12 -> [ss#2]
  R3 = load [ss#1]
  use R3
  R3 = R12
  R4 = R3    <- oops!

The problem was that promoting R3 = load[ss#2] to a copy missed the fact that
the instruction invalidated R3 at that point.

llvm-svn: 23638

55149d78

silence some warnings · 05da0d96
Chris Lattner authored Oct 05, 2005
```
llvm-svn: 23637
```
05da0d96

implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes · a49e16fe

Chris Lattner authored Oct 05, 2005

with the dag combiner.  This speeds up espresso by 8%, reaching performance
parity with the dag-combiner-disabled llc.

llvm-svn: 23636

a49e16fe

fix some pastos · b11d1563
Chris Lattner authored Oct 05, 2005
```
llvm-svn: 23635
```
b11d1563

Add a new HandleNode class, which is used to handle (haha) cases in the · 06f1d0f7

Chris Lattner authored Oct 05, 2005

dead node elim and dag combiner passes where the root is potentially updated.
This fixes a fixme in the dag combiner.

llvm-svn: 23634

06f1d0f7

add a helper class · 5fc36727
Chris Lattner authored Oct 05, 2005
```
llvm-svn: 23633
```
5fc36727

Implement the code for PowerPC/inverted-bool-compares.ll, even though it · a6895d18

Chris Lattner authored Oct 05, 2005

that testcase still does not pass with the dag combiner.  This is because
not all forms of br* are folded yet.

Also, when we combine a node into another one, delete the node immediately
instead of waiting for the node to potentially come up in the future.

llvm-svn: 23632

a6895d18

make sure that -view-isel-dags is the input to the isel, not the input to · 6bd8fd09
Chris Lattner authored Oct 05, 2005
```
the second phase of dag combining

llvm-svn: 23631
```
6bd8fd09
Fix a crash compiling Olden/tsp · 746d50a0
Chris Lattner authored Oct 05, 2005
```
llvm-svn: 23630
```
746d50a0
Add some rules for building preprocessed files · bb087956
Chris Lattner authored Oct 05, 2005
```
llvm-svn: 23629
```
bb087956

Oct 04, 2005

refactor a bit of code. · 3b793c65

Chris Lattner authored Oct 04, 2005

When moving constant entries in 'Map' if the entry is the representative
constant for the abstractypemap, make sure to update it as well.  This
fixes the bcreader failures from last night on several C++ apps.

llvm-svn: 23628

3b793c65

Minor speedup to avoid array searches given a Use*. This speeds up bc reading · dff59118
Chris Lattner authored Oct 04, 2005
```
of the python test from 1:00 to 54s.

llvm-svn: 23627
```
dff59118
Change the signature of replaceUsesOfWithOnConstant. The bool was always · 7a1450db
Chris Lattner authored Oct 04, 2005
```
true dynamically.  Finally, pass the Use* that replaceAllUsesWith has into
the method for future use.

llvm-svn: 23626
```
7a1450db
Change the signature of replaceUsesOfWithOnConstant to take a Use* and not · 51887163
Chris Lattner authored Oct 04, 2005
```
take the bool.  The bool is always true dynamically.

llvm-svn: 23625
```
51887163

For large constants (e.g. arrays and structs with many elements) just · 935aa922

Chris Lattner authored Oct 04, 2005

creating the keys and doing comparisons to index into 'Map' takes a lot
of time.  For these large constants, keep an inverse map so that 'remove'
and move operations are much faster.

This speeds up a release build of the bc reader on Eric's nasty python
bytecode file from 1:39 to 1:00s.

llvm-svn: 23624

935aa922

minor cleanup/fastpath for the bcreader. This speeds up the bcreader · 5bbf60a5
Chris Lattner authored Oct 04, 2005
```
from 1:41 -> 1:39 on the large python .bc file in a release build.

llvm-svn: 23623
```
5bbf60a5
Reverting to version - until problem isolated. · 327d4298
Jim Laskey authored Oct 04, 2005
```
llvm-svn: 23622
```
327d4298
Add a forward def · d1a5bc8d
Chris Lattner authored Oct 04, 2005
```
llvm-svn: 23621
```
d1a5bc8d

Fix some faulty logic in the libcall inserter. · 5da6908d

Nate Begeman authored Oct 04, 2005

Since calls return more than one value, don't bail if one of their uses
happens to be a node that's not an MVT::Other when following the chain
from CALLSEQ_START to CALLSEQ_END.

Once we've found a CALLSEQ_START, we can just return; there's no need to
tail-recurse further up the graph.

Most importantly, just because something only has one use doesn't mean we
should use it's one use to follow from start to end.  This faulty logic
caused us to follow a chain of one-use FP operations back to a much earlier
call, putting a cycle in the graph from a later start to an earlier end.

This is a better fix that reverting to the workaround committed earlier
today.

llvm-svn: 23620

5da6908d

implement the struct version of the array speedup, speeding up the · 8760ec73
Chris Lattner authored Oct 04, 2005
```
testcase a bit more from 1:48 -> 1.40.

llvm-svn: 23619
```
8760ec73
Fix DemoteRegToStack on an invoke. This fixes PR634. · 20b0754c
Chris Lattner authored Oct 04, 2005
```
llvm-svn: 23618
```
20b0754c

Add back a workaround that fixes some breakages from chris's last change. · 54fb5002

Nate Begeman authored Oct 04, 2005

Neither of us have yet figured out why this code is necessary, but stuff
breaks if its not there.  Still tracking this down...

llvm-svn: 23617

54fb5002

Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive · 4c3b2b53
Chris Lattner authored Oct 03, 2005
```
and more correct than use_empty().  This fixes PR635 and
SimplifyCFG/2005-10-02-InvokeSimplify.ll

llvm-svn: 23616
```
4c3b2b53
new testcase for PR635 · a6e98f2e
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23615
```
a6e98f2e

Change ConstantArray::replaceUsesOfWithOnConstant to attempt to update · b64419ac

Chris Lattner authored Oct 03, 2005

constant arrays in place instead of reallocating them and replaceAllUsesOf'ing
the result.  This speeds up a release build of the bcreader from:

136.987u 120.866s 4:24.38
to
49.790u 49.890s 1:40.14

... a 2.6x speedup parsing a large python bc file.

llvm-svn: 23614

b64419ac

Oct 03, 2005

move some methods, no other changes · c4062ba6
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23613
```
c4062ba6
minor microoptimizations · 0144fadc
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23612
```
0144fadc

Use a map to cache the ModuleType information, so we can do logarithmic · bad09e71

Chris Lattner authored Oct 03, 2005

lookups instead of linear time lookups.  This speeds up bc parsing of a
large file from

137.834u 118.256s 4:27.96
to
132.611u 114.436s 4:08.53

with a release build.

llvm-svn: 23611

bad09e71

Refactor gathering node info and emission. · 409a6b20
Jim Laskey authored Oct 03, 2005
```
llvm-svn: 23610
```
409a6b20
clean up this code a bit, no functionality change · 57b21f9f
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23609
```
57b21f9f
Speed up the asm printer a lot by not printing formatted LLVM asm output · afef68ba
Chris Lattner authored Oct 03, 2005
```
for globals

llvm-svn: 23608
```
afef68ba
Break the body of the loop out into a new method · 5f096e28
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23606
```
5f096e28
Fix case of path · 16874595
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23605
```
16874595

Make IVUseShouldUsePostIncValue more aggressive when the use is a PHI. In · f07a587c

Chris Lattner authored Oct 03, 2005

particular, it should realize that phi's use their values in the pred block
not the phi block itself.  This change turns our em3d loop from this:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r2, 0
        b LBB_test_6    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        or r2, r6, r6
        lwz r6, 0(r3)
        cmpw cr0, r6, r5
        beq cr0, LBB_test_6     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r2, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; endif.loopexit.loopexit_crit_edge
        addi r3, r2, 1
        blr
LBB_test_6:     ; loopexit
        or r3, r2, r2
        blr

into:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r2, 0
        b LBB_test_5    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        or r2, r6, r6
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        or r2, r6, r6
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; loopexit
        or r3, r2, r2
        blr


Unfortunately, this is actually worse code, because the register coallescer
is getting confused somehow.  If it were doing its job right, it could turn the
code into this:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r6, 0
        b LBB_test_5    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; loopexit
        or r3, r6, r6
        blr

... which I'll work on next. :)

llvm-svn: 23604

f07a587c

Refactor some code into a function · e4ed42a4
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23603
```
e4ed42a4

This break is bogus and I have no idea why it was there. Basically it prevents · 360928db

Chris Lattner authored Oct 03, 2005

memoizing code when IV's are used by phinodes outside of loops.  In a simple
example, we were getting this code before (note that r6 and r7 are isomorphic
IV's):

        li r6, 0
        or r7, r6, r6
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        or r2, r7, r7
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r2, r7, 1
        addi r7, r7, 1
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit

Now we get:

        li r6, 0
LBB_test_3:     ; no_exit
        or r2, r6, r6
        lwz r6, 0(r3)
        cmpw cr0, r6, r5
        beq cr0, LBB_test_6     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r2, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit

this was noticed in em3d.

llvm-svn: 23602

360928db

when checking if we should move a split edge block outside of a loop, · 8fcce170

Chris Lattner authored Oct 03, 2005

check the presplit pred, not the post-split pred.  This was causing us
to make the wrong decision in some cases, leaving the critical edge block
in the loop.

llvm-svn: 23601

8fcce170

This member can be const too · 77676d5b
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23600
```
77676d5b

Oct 02, 2005
- put the right labels on the data · e51d6a9f
  Chris Lattner authored Oct 02, 2005
```
llvm-svn: 23599
```
  e51d6a9f
- Fix a problem where the legalizer would run out of stack space on extremely · 9cfccfb5
  Chris Lattner authored Oct 02, 2005
```
large basic blocks because it was purely recursive.  This switches it to an
iterative/recursive hybrid.

llvm-svn: 23596
```
  9cfccfb5