Commits · b7a3894e7c54a3f1a7a6abcd699c81c482b47ae3 · Roger Ferrer / llvm-epi-0.8

Oct 11, 2005
- Fix another lsr-is-nondeterministic case · b7a3894e
  Chris Lattner authored Oct 11, 2005
```
llvm-svn: 23695
```
  b7a3894e
Oct 10, 2005
- Make MaskedValueIsZero a bit more aggressive · 03b9eb50
  Chris Lattner authored Oct 09, 2005
```
llvm-svn: 23677
```
  03b9eb50
Oct 09, 2005
- Fix funky xcode indentation · 62010c45
  Chris Lattner authored Oct 09, 2005
```
llvm-svn: 23674
```
  62010c45
- Hrm, you didn't see this. · eb4be8b9
  Chris Lattner authored Oct 09, 2005
```
llvm-svn: 23673
```
  eb4be8b9
- Fix a source of non-determinism in the backend: the order of processing · 4ea0a3ea
  Chris Lattner authored Oct 09, 2005
```
IV strides dependend on the pointer order of the strides in memory.
Non-determinism is bad.

llvm-svn: 23672
```
  4ea0a3ea
Oct 07, 2005
- Remove useless variable. · 572910c9
  Jeff Cohen authored Oct 07, 2005
```
llvm-svn: 23656
```
  572910c9
Oct 04, 2005
- Fix DemoteRegToStack on an invoke. This fixes PR634. · 20b0754c
  Chris Lattner authored Oct 04, 2005
```
llvm-svn: 23618
```
  20b0754c
- Clean up the code a bit. Use isInstructionTriviallyDead to be more aggressive · 4c3b2b53
  Chris Lattner authored Oct 03, 2005
```
and more correct than use_empty().  This fixes PR635 and
SimplifyCFG/2005-10-02-InvokeSimplify.ll

llvm-svn: 23616
```
  4c3b2b53
Oct 03, 2005

Make IVUseShouldUsePostIncValue more aggressive when the use is a PHI. In · f07a587c

Chris Lattner authored Oct 03, 2005

particular, it should realize that phi's use their values in the pred block
not the phi block itself.  This change turns our em3d loop from this:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r2, 0
        b LBB_test_6    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        or r2, r6, r6
        lwz r6, 0(r3)
        cmpw cr0, r6, r5
        beq cr0, LBB_test_6     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r2, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; endif.loopexit.loopexit_crit_edge
        addi r3, r2, 1
        blr
LBB_test_6:     ; loopexit
        or r3, r2, r2
        blr

into:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r2, 0
        b LBB_test_5    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        or r2, r6, r6
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        or r2, r6, r6
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; loopexit
        or r3, r2, r2
        blr


Unfortunately, this is actually worse code, because the register coallescer
is getting confused somehow.  If it were doing its job right, it could turn the
code into this:

_test:
        cmpwi cr0, r4, 0
        bgt cr0, LBB_test_2     ; entry.no_exit_crit_edge
LBB_test_1:     ; entry.loopexit_crit_edge
        li r6, 0
        b LBB_test_5    ; loopexit
LBB_test_2:     ; entry.no_exit_crit_edge
        li r6, 0
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit
LBB_test_5:     ; loopexit
        or r3, r6, r6
        blr

... which I'll work on next. :)

llvm-svn: 23604

f07a587c

Refactor some code into a function · e4ed42a4
Chris Lattner authored Oct 03, 2005
```
llvm-svn: 23603
```
e4ed42a4

This break is bogus and I have no idea why it was there. Basically it prevents · 360928db

Chris Lattner authored Oct 03, 2005

memoizing code when IV's are used by phinodes outside of loops.  In a simple
example, we were getting this code before (note that r6 and r7 are isomorphic
IV's):

        li r6, 0
        or r7, r6, r6
LBB_test_3:     ; no_exit
        lwz r2, 0(r3)
        cmpw cr0, r2, r5
        or r2, r7, r7
        beq cr0, LBB_test_5     ; loopexit
LBB_test_4:     ; endif
        addi r2, r7, 1
        addi r7, r7, 1
        addi r3, r3, 4
        addi r6, r6, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit

Now we get:

        li r6, 0
LBB_test_3:     ; no_exit
        or r2, r6, r6
        lwz r6, 0(r3)
        cmpw cr0, r6, r5
        beq cr0, LBB_test_6     ; loopexit
LBB_test_4:     ; endif
        addi r3, r3, 4
        addi r6, r2, 1
        cmpw cr0, r6, r4
        blt cr0, LBB_test_3     ; no_exit

this was noticed in em3d.

llvm-svn: 23602

360928db

when checking if we should move a split edge block outside of a loop, · 8fcce170

Chris Lattner authored Oct 03, 2005

check the presplit pred, not the post-split pred.  This was causing us
to make the wrong decision in some cases, leaving the critical edge block
in the loop.

llvm-svn: 23601

8fcce170

Oct 01, 2005
- Fix VC++ warnings. · f8a5e5ae
  Jeff Cohen authored Oct 01, 2005
```
llvm-svn: 23579
```
  f8a5e5ae
Sep 29, 2005
- Insert stores after phi nodes in the normal dest. This fixes · a554c947
  Chris Lattner authored Sep 29, 2005
```
LowerInvoke/2005-08-03-InvokeWithPHI.ll

llvm-svn: 23525
```
  a554c947
- Fold isascii into a simple comparison. This speeds up 197.parser by 7.4%, · 87ef943a
  Chris Lattner authored Sep 29, 2005
```
bringing the LLC time down to the CBE time.

llvm-svn: 23521
```
  87ef943a
- remove a bunch of unneeded stuff, or self evident comments · 5f6035fe
  Chris Lattner authored Sep 29, 2005
```
llvm-svn: 23519
```
  5f6035fe
- Implement a couple of memcmp folds from the todo list · c244e7c1
  Chris Lattner authored Sep 29, 2005
```
llvm-svn: 23517
```
  c244e7c1
Sep 28, 2005
- Constant fold llvm.sqrt · ea7214b2
  Chris Lattner authored Sep 28, 2005
```
llvm-svn: 23487
```
  ea7214b2
- add a note about a way to improve this code further, that I won't be getting · 3b63bb37
  Chris Lattner authored Sep 27, 2005
```
to right now.

llvm-svn: 23485
```
  3b63bb37
- Fix a regression in my previous patch, fixing GlobalOpt/2005-09-27-Crash.ll · eb953f0e
  Chris Lattner authored Sep 27, 2005
```
and PR632.

llvm-svn: 23484
```
  eb953f0e
Sep 27, 2005
- Avoid spilling stack slots... to stack slots. · e285f5ed
  Chris Lattner authored Sep 27, 2005
```
llvm-svn: 23478
```
  e285f5ed
- Completely rewrite 'correct' eh support. This changes how setjmp insertion · 87eb2493
  Chris Lattner authored Sep 27, 2005
```
is performed so it is only at most once per function that contains an invoke
instead of once per invoke in the function.  This patch has the following perks:

1. It fixes PR631, which complains about slowness.
2. If fixes PR240, which complains about non-volatile vars being live across
   setjmp/longjmps.
3. It improves (but does not fix) the jmpbuf alignment issue on itanium by not
   forcing the jmpbufs to always be 8-bytes off the alignment of the structure.
4. It speeds up 253.perlbmk from 338s to 13.70s (a 25x improvement!), making us
   now about 4% faster than GCC.

Further improvements are also possible.

llvm-svn: 23477
```
  87eb2493
- Make the pass name simpler · 92233d21
  Chris Lattner authored Sep 27, 2005
```
llvm-svn: 23476
```
  92233d21
- allow demotion to volatile values, add support for invoke · 16cd356f
  Chris Lattner authored Sep 27, 2005
```
llvm-svn: 23473
```
  16cd356f
- Add support for external calls that we know how to constant fold. This implements · 3d27e7f2
  Chris Lattner authored Sep 27, 2005
```
ctor-list-opt.ll:CTOR8

llvm-svn: 23465
```
  3d27e7f2
- Fix a bug where we would evaluate stores into linkonce objects which could be · 29b2780c
  Chris Lattner authored Sep 27, 2005
```
potentially replaced at link-time.

llvm-svn: 23463
```
  29b2780c
- Implement support for static constructors with calls in them. This is useful · 65a3a091
  Chris Lattner authored Sep 27, 2005
```
because gccas runs globalopt before inlining.

This implements ctor-list-opt.ll:CTOR7

llvm-svn: 23462
```
  65a3a091
- Refactor this code a bit, no functionality changes. · da1889b7
  Chris Lattner authored Sep 27, 2005
```
llvm-svn: 23460
```
  da1889b7
Sep 26, 2005
- Remove some dead code. ctor evaluation subsumes empty ctor elim · f2f89af6
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23453
```
  f2f89af6
- Add support for alloca, implementing ctor-list-opt.ll:CTOR6 · 6bf2cd57
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23452
```
  6bf2cd57
- Add a debug printout, fix a crash on kc++ · 46d9ff08
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23450
```
  46d9ff08
- Implement loads/stores through GEP's of globals. This implements · 46af55e0
  Chris Lattner authored Sep 26, 2005
```
ctor-list-opt.ll:CTOR5.

llvm-svn: 23449
```
  46af55e0
- Replace TraverseGEPInitializer with ConstantFoldLoadThroughGEPConstantExpr · 61ff32cd
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23447
```
  61ff32cd
- Eliminate GetGEPGlobalInitializer in favor of the more powerful · 02ae21e1
  Chris Lattner authored Sep 26, 2005
```
ConstantFoldLoadThroughGEPConstantExpr function in the utils lib.

llvm-svn: 23446
```
  02ae21e1
- Factor the GetGEPGlobalInitializer out of this pass and into Transforms/Utils · 0b011ec8
  Chris Lattner authored Sep 26, 2005
```
as ConstantFoldLoadThroughGEPConstantExpr.

llvm-svn: 23445
```
  0b011ec8
- Move the ConstantFoldLoadThroughGEPConstantExpr function out of the InstCombine · c13c7b93
  Chris Lattner authored Sep 26, 2005
```
pass.

llvm-svn: 23444
```
  c13c7b93
- add a comment · b009663e
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23442
```
  b009663e
- Add support for getelementptr, load, and correctly reject volatile stores. · 4b05c322
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23441
```
  4b05c322
- Add support for br/brcond/switch and phi · 3e9ea5ff
  Chris Lattner authored Sep 26, 2005
```
llvm-svn: 23439
```
  3e9ea5ff
- Add a simple interpreter to this code, allowing us to statically evaluate · 99e23fa7
  Chris Lattner authored Sep 26, 2005
```
global ctors that are simple enough.  This implements ctor-list-opt.ll:CTOR2.

llvm-svn: 23437
```
  99e23fa7