  1. Oct 04, 2005
    • Reverting to version - until problem isolated. · 327d4298
      Jim Laskey authored
      llvm-svn: 23622
    • Fix some faulty logic in the libcall inserter. · 5da6908d
      Nate Begeman authored
      Since calls return more than one value, don't bail if one of their uses
      happens to be a node that's not an MVT::Other when following the chain
      from CALLSEQ_START to CALLSEQ_END.
      
      Once we've found a CALLSEQ_START, we can just return; there's no need to
      tail-recurse further up the graph.
      
      Most importantly, just because something only has one use doesn't mean we
      should use its one use to follow from start to end.  This faulty logic
      caused us to follow a chain of one-use FP operations back to a much earlier
      call, putting a cycle in the graph from a later start to an earlier end.
      
      This is a better fix than reverting to the workaround committed earlier
      today.
      
      llvm-svn: 23620
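The commit message above describes a graph walk: to match a CALLSEQ_END with its CALLSEQ_START, follow only the token-chain operand upward, don't give up when an intermediate node also produces non-chain values, and stop as soon as CALLSEQ_START is reached rather than recursing further or hopping through a node's single use. Below is a minimal sketch of that corrected walk; the `Node`, `Opcode`, and `findCallSeqStart` names are hypothetical stand-ins, not the SelectionDAG API of the time.

```cpp
#include <cstdio>
#include <vector>

// Hypothetical, simplified DAG node: real SelectionDAG nodes carry value types,
// multiple results, and use lists; none of that is needed to show the walk.
enum class Opcode { CALLSEQ_START, CALLSEQ_END, LOAD, FADD, OTHER };

struct Node {
  Opcode op;
  std::vector<Node *> operands;  // by convention here, operands[0] is the chain operand
};

// Walk from a CALLSEQ_END back toward its matching CALLSEQ_START along the chain.
Node *findCallSeqStart(Node *n) {
  if (n == nullptr || n->op == Opcode::CALLSEQ_START)
    return n;  // found it (or ran off the chain): no need to recurse further up the graph
  // Intermediate nodes (calls, loads, FP ops) may produce non-chain values too,
  // but that is no reason to bail out -- keep following only the chain operand.
  if (n->operands.empty())
    return nullptr;
  return findCallSeqStart(n->operands[0]);
}

int main() {
  Node start{Opcode::CALLSEQ_START, {}};
  Node call{Opcode::LOAD, {&start}};       // stands in for a multi-valued node on the chain
  Node end{Opcode::CALLSEQ_END, {&call}};
  std::printf("matched CALLSEQ_START: %s\n",
              findCallSeqStart(&end) == &start ? "yes" : "no");
  return 0;
}
```

The faulty heuristic the message describes followed a node's single use instead of its chain operand, which is how a chain of one-use FP operations could lead back to a much earlier call and leave a cycle between a later start and an earlier end.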
    • Add back a workaround that fixes some breakages from Chris's last change. · 54fb5002
      Nate Begeman authored
      Neither of us has yet figured out why this code is necessary, but stuff
      breaks if it's not there.  Still tracking this down...
      
      llvm-svn: 23617
  2. Sep 19, 2005
    • Teach the local spiller to turn stack slot loads into register-register copies · 2f838f21
      Chris Lattner authored
      when possible, avoiding the load (and avoiding the copy if the value is already
      in the right register).
      
      This patch came about when I noticed code like the following being generated:
      
        store R17 -> [SS1]
        ...blah...
        R4 = load [SS1]
      
      This was causing an LSU reject on the G5.  This problem was due to the register
      allocator folding spill code into a reg-reg copy (producing the load), which
      prevented the spiller from being able to rewrite the load into a copy, despite
      the fact that the value was already available in a register.  In the case
      above, we now rip out the R4 load and replace it with an R4 = R17 copy.
      
      This speeds up several programs on X86 (which spills a lot :) ), e.g.
      smg2k from 22.39->20.60s, povray from 12.93->12.66s, 168.wupwise from
      68.54->53.83s (!), 197.parser from 7.33->6.62s (!), etc.  This may have a larger
      impact in some cases on the G5 (by avoiding LSU rejects), though it probably
      won't trigger as often (less spilling in general).
      
      Targets that implement folding of loads/stores into copies should implement
      the isLoadFromStackSlot hook to get this.
      
      llvm-svn: 23388
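The rewrite described above can be sketched compactly. The sketch below is not the actual spiller/VirtRegMap code and does not use the real isLoadFromStackSlot hook; `Op`, `Instr`, and `rewriteSpills` are hypothetical stand-ins. It remembers which register a stack slot currently mirrors and, when a later load from that slot is seen, turns the load into a register-register copy (or drops it if the value is already in the destination register), forgetting the mapping once the mirrored register is redefined.

```cpp
#include <cstdio>
#include <iterator>
#include <map>
#include <vector>

// Hypothetical, simplified machine-instruction model.
enum class Op { StoreToSlot, LoadFromSlot, Copy, Def, Nop };

struct Instr {
  Op op;
  int reg = -1;   // destination register (or the register being stored)
  int slot = -1;  // stack slot index, if any
  int src = -1;   // source register for a Copy
};

// Turn stack-slot loads into register-register copies when the stored value
// is still available in a register (simplified local-spiller logic).
void rewriteSpills(std::vector<Instr> &block) {
  std::map<int, int> slotToReg;  // stack slot -> register whose value it mirrors
  for (Instr &mi : block) {
    switch (mi.op) {
    case Op::StoreToSlot:
      slotToReg[mi.slot] = mi.reg;  // the slot now mirrors this register
      break;
    case Op::LoadFromSlot: {
      auto it = slotToReg.find(mi.slot);
      if (it == slotToReg.end())
        break;                      // value not known to still be in a register
      if (it->second == mi.reg)
        mi = {Op::Nop};             // already in the right register: drop the load
      else
        mi = {Op::Copy, mi.reg, -1, it->second};  // Rdst = copy Rsrc instead of a load
      break;
    }
    case Op::Def:
      // Redefining a register invalidates any slot that was mirroring it.
      for (auto it = slotToReg.begin(); it != slotToReg.end();)
        it = (it->second == mi.reg) ? slotToReg.erase(it) : std::next(it);
      break;
    default:
      break;
    }
  }
}

int main() {
  std::vector<Instr> block = {
      {Op::StoreToSlot, /*reg=*/17, /*slot=*/1},  // store R17 -> [SS1]
      {Op::Def, /*reg=*/5},                       // ...blah... (does not touch R17)
      {Op::LoadFromSlot, /*reg=*/4, /*slot=*/1},  // R4 = load [SS1]
  };
  rewriteSpills(block);
  std::printf("final instruction is now %s\n",
              block.back().op == Op::Copy ? "R4 = copy R17" : "still a load");
  return 0;
}
```

The commit points targets at the isLoadFromStackSlot hook so the spiller can recognize a folded load as a stack-slot reload in the first place; the sketch simply assumes that recognition has already happened.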