Commits · f02994d782458f99e00cf92925ed85e0c1560ad1 · Roger Ferrer / llvm-epi-0.8

Sep 14, 2005
- Fix some issues exposed by more testing. XORIS had the wrong operands · f006d15e
  Chris Lattner authored Sep 14, 2005
```
specified.  The various *imm operands defined by PPC are really all i32,
even though the actual immediate is restricted to a smaller value in it.

llvm-svn: 23352
```
  f006d15e
- Fix some bugs noticed by new checking code · 6b013fc9
  Chris Lattner authored Sep 14, 2005
```
llvm-svn: 23350
```
  6b013fc9
- Fix the regression last night compiling povray · a393e4d4
  Chris Lattner authored Sep 14, 2005
```
llvm-svn: 23348
```
  a393e4d4
- fix a major regression from my patch this afternoon · b42e962d
  Chris Lattner authored Sep 14, 2005
```
llvm-svn: 23347
```
  b42e962d
- we don't need this proto any longer · b011cb27
  Chris Lattner authored Sep 13, 2005
```
llvm-svn: 23342
```
  b011cb27
- move the #include for the generated code into the isel class body so we · 03e08eef
  Chris Lattner authored Sep 13, 2005
```
can use/define class methods

llvm-svn: 23339
```
  03e08eef
Sep 13, 2005
- Change the arg lowering code to use copyfromreg from vregs associated · 0f965a61
  Chris Lattner authored Sep 13, 2005
```
with incoming arguments instead of the pregs themselves.  This keeps
the scheduler from causing problems by moving a copyfromreg for an argument
to after a select_cc node (now it can, and bad things won't happen).

llvm-svn: 23334
```
  0f965a61
- This has been moved to the target-indep code · ee811329
  Chris Lattner authored Sep 13, 2005
```
llvm-svn: 23333
```
  ee811329
- This code is no longer needed, it is moved to the target-indep code · fb96e50b
  Chris Lattner authored Sep 13, 2005
```
llvm-svn: 23332
```
  fb96e50b
- If a function has liveins, and if the target requested that they be plopped · d4382f0a
  Chris Lattner authored Sep 13, 2005
```
into particular vregs, emit copies into the entry MBB.

llvm-svn: 23331
```
  d4382f0a
- Majik numbers are bad · 64685b4c
  Chris Lattner authored Sep 13, 2005
```
llvm-svn: 23330
```
  64685b4c
- Remove some dead vectors · aa6cbd90
  Chris Lattner authored Sep 13, 2005
```
llvm-svn: 23329
```
  aa6cbd90
- Add a simple xform to simplify array accesses with casts in the way. · 2a893296
  Chris Lattner authored Sep 13, 2005
```
This is useful for 178.galgel where resolution of dope vectors (by the
optimizer) causes the scales to become apparent.

llvm-svn: 23328
```
  2a893296
- Fix an issue where LSR would miss rewriting a use of an IV expression by a PHI... · fd018c8d
  Chris Lattner authored Sep 13, 2005
```
Fix an issue where LSR would miss rewriting a use of an IV expression by a PHI node that is not the original PHI.

This fixes up a dot-product loop in galgel, speeding it up from 18.47s to
16.13s.

llvm-svn: 23327
```
  fd018c8d
- Add a helper function, allowing us to simplify some code a bit, changing · 567b81f0
  Chris Lattner authored Sep 13, 2005
```
indentation, no functionality change

llvm-svn: 23325
```
  567b81f0
- Implement a simple xform to turn code like this: · 219175c8
  Chris Lattner authored Sep 12, 2005
```
  if () { store A -> P; } else { store B -> P; }

into a PHI node with one store, in the most trivial case.  This implements
load.ll:test10.

llvm-svn: 23324
```
  219175c8
- Another load-peephole optimization: do gcse when two loads are next to · e0bfdf14
  Chris Lattner authored Sep 12, 2005
```
each other.  This implements InstCombine/load.ll:test9

llvm-svn: 23322
```
  e0bfdf14
- Implement a trivial form of store->load forwarding where the store and the · b990f7d8
  Chris Lattner authored Sep 12, 2005
```
load are exactly consecutive.  This is picked up by other passes, but this
triggers thousands of times in fortran programs that use static locals
(and is thus a compile-time speedup).

llvm-svn: 23320
```
  b990f7d8
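
The two adjacent-instruction load peepholes above (store->load forwarding and reuse of a load that directly follows another load of the same pointer) can be modeled in a few lines of Python. This is only an illustrative sketch, not the InstCombine code: the tuple encoding of instructions and the name `peephole_loads` are assumptions made for this example, and volatile accesses are ignored.

```python
def peephole_loads(insts):
    """Rewrite a load that directly follows a store or load of the same pointer.

    Hypothetical instruction encoding (not LLVM's):
      ("store", value, ptr)   ("load", dest, ptr)   ("copy", dest, src)
    """
    out = []
    for i, inst in enumerate(insts):
        prev = insts[i - 1] if i > 0 else None
        same_ptr = prev is not None and len(prev) == 3 and prev[2] == inst[2]
        if inst[0] == "load" and same_ptr and prev[0] == "store":
            # store->load forwarding: reuse the value that was just stored
            out.append(("copy", inst[1], prev[1]))
        elif inst[0] == "load" and same_ptr and prev[0] == "load":
            # two loads next to each other: reuse the first load's result
            out.append(("copy", inst[1], prev[1]))
        else:
            out.append(inst)
    return out
```

Only the immediately preceding instruction is inspected, which matches the "exactly consecutive" restriction in the commit messages and keeps the check cheap.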
Sep 12, 2005

Fix a regression from last night, which caused this pass to create invalid · 8048b85e

Chris Lattner authored Sep 12, 2005

code for IV uses outside of loops that are not dominated by the latch block.
We should only convert these uses to use the post-inc value if they ARE
dominated by the latch block.

Also use a new LoopInfo method to simplify some code.

This fixes Transforms/LoopStrengthReduce/2005-09-12-UsesOutOutsideOfLoop.ll

llvm-svn: 23318

8048b85e

Add a new getLoopLatch() method. · b35df5f5
Chris Lattner authored Sep 12, 2005
```
llvm-svn: 23315
```
b35df5f5

_test: · a6764839

Chris Lattner authored Sep 12, 2005

        li r2, 0
LBB_test_1:     ; no_exit.2
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmpwi cr0, r2, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r2, 1
        stw r2, 0(r4)
        blr
[zion ~/llvm]$ cat > ~/xx
Uses of IVs outside of the loop should use the post-incremented version
of the IV, not the preincremented version.  This helps many loops (e.g. in sixtrack)
which used to generate code like this (this is the code from the
dont-hoist-simple-loop-constants.ll testcase):

_test:
        li r2, 0                 **** IV starts at 0
LBB_test_1:     ; no_exit.2
        or r5, r2, r2            **** Copy for loop exit
        li r2, 0
        stw r2, 0(r3)
        addi r3, r3, 4
        addi r2, r5, 1
        addi r6, r5, 2           **** IV+2
        cmpwi cr0, r6, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r5, 2       ****  IV+2
        stw r2, 0(r4)
        blr

And now generates code like this:

_test:
        li r2, 1               *** IV starts at 1
LBB_test_1:     ; no_exit.2
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmpwi cr0, r2, 701     *** IV.postinc + 0
        blt cr0, LBB_test_1
LBB_test_2:     ; loopexit.2.loopexit
        stw r2, 0(r4)          *** IV.postinc + 0
        blr

llvm-svn: 23313

a6764839
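
To check that the two listings in this commit message compute the same thing, the Python sketch below simulates both loops register by register. It is a model written for illustration only, not part of the patch; the function names and the `(stores, exit_value)` return tuple are assumptions of this example.

```python
def run_preinc_version():
    """Model of the older listing: uses outside the loop are IV + 2."""
    stores = 0
    r2 = 0                    # IV starts at 0
    while True:
        r5 = r2               # copy of the IV kept live for the loop exit
        stores += 1           # stw ..., 0(r3)
        r2 = r5 + 1
        r6 = r5 + 2           # IV + 2, needed only for the compare
        if not (r6 < 701):
            break
    return stores, r5 + 2     # exit block recomputes IV + 2


def run_postinc_version():
    """Model of the newer listing: uses outside the loop are IV.postinc + 0."""
    stores = 0
    r2 = 1                    # IV starts at 1
    while True:
        stores += 1           # stw ..., 0(r3)
        r2 = r2 + 1           # post-incremented IV
        if not (r2 < 701):
            break
    return stores, r2         # exit block stores r2 directly
```

Both functions perform the same number of stores and produce the same exit value, while the second needs neither the extra copy nor the extra add inside the loop.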

Sep 10, 2005

implement Transforms/LoopStrengthReduce/dont-hoist-simple-loop-constants.ll. · 530fe6ab

Chris Lattner authored Sep 10, 2005

We used to emit this code for it:

_test:
        li r2, 1     ;; Value tying up a register for the whole loop
        li r5, 0
LBB_test_1:     ; no_exit.2
        or r6, r5, r5
        li r5, 0
        stw r5, 0(r3)
        addi r5, r6, 1
        addi r3, r3, 4
        add r7, r2, r5  ;; should be addi r7, r5, 1
        cmpwi cr0, r7, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r6, 2
        stw r2, 0(r4)
        blr

now we emit this:

_test:
        li r2, 0
LBB_test_1:     ; no_exit.2
        or r5, r2, r2
        li r2, 0
        stw r2, 0(r3)
        addi r3, r3, 4
        addi r2, r5, 1
        addi r6, r5, 2   ;; whoa, fold those adds!
        cmpwi cr0, r6, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r5, 2
        stw r2, 0(r4)
        blr

more improvement coming.

llvm-svn: 23306

530fe6ab

PowerPC cannot truncstore i1 natively · 4309c3a7
Chris Lattner authored Sep 10, 2005
```
llvm-svn: 23304
```
4309c3a7
Allow targets to say they don't support truncstore i1 (which includes a mask · 2d454bf5
Chris Lattner authored Sep 10, 2005
```
when storing to an 8-bit memory location), as most don't.

llvm-svn: 23303
```
2d454bf5
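
The "mask" mentioned above can be pictured with a small model. The sketch assumes that a target without a native i1 truncating store ends up with an AND by 1 followed by an ordinary byte store; the helper name and the `bytearray` standing in for memory are illustrative and not LLVM code.

```python
def truncstore_i1(memory, addr, reg_value):
    """Store only the low bit of reg_value into the byte at addr."""
    memory[addr] = reg_value & 1   # the mask: the upper 7 bits of the byte end up zero
```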
Add a missing #include, patch courtesy of Baptiste Lepilleur. · bd39c1a4
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23302
```
bd39c1a4

Fix a problem Duraid encountered on Itanium where this folding: · 331b311f

Chris Lattner authored Sep 09, 2005

select (x < y), 1, 0 -> (x < y) was done incorrectly: the setcc returns i1 but the
select returns i32.  Add the zero extend as needed.

llvm-svn: 23301

331b311f
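
A minimal sketch of the corrected fold, assuming DAG nodes are modeled as `(opcode, type, operands...)` tuples. That encoding and the function name are hypothetical, chosen only to show where the zero extend goes.

```python
def fold_select_one_zero(cond, result_type):
    """Fold select(cond, 1, 0), where cond is an i1-typed setcc node.

    If the select itself produces i1 the condition can be used directly;
    for any wider result type the i1 must be zero-extended first.
    """
    if result_type == "i1":
        return cond
    return ("zext", result_type, cond)
```

For an i32 select this yields a `zext` node wrapping the setcc instead of the bare setcc, so the types on both sides of the fold agree.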

Fix a crash viewing dags that have target nodes in them · 16e5cb87
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23300
```
16e5cb87

Sep 09, 2005

I forgot that we always spill fp values as 64-bits. Implement spill folding · 0f2146bb
Chris Lattner authored Sep 09, 2005
```
for FP as well.  This triggers a couple dozen times on 177.mesa (for example).

llvm-svn: 23299
```
0f2146bb

Fix a problem that Nate noticed, where spill code was not getting coalesced · 712e78ee

Chris Lattner authored Sep 09, 2005

with copies, leading to code like this:

       lwz r4, 380(r1)
       or r10, r4, r4    ;; Last use of r4

By teaching the PPC backend how to fold spills into copies, we now get this
code:

       lwz r10, 380(r1)

wow. :)

This reduces a testcase Nate sent me from 1505 instructions to 1484.

Note that this could handle FP values but doesn't currently, for reasons
mentioned in the patch

llvm-svn: 23298

712e78ee
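
The effect of the fold can be sketched as a peephole over already-emitted instructions. This is a simplification for illustration: the commit says the backend is taught to fold spills into copies, whereas the hypothetical `fold_reload_into_copy` below works on a finished instruction list, with a made-up tuple encoding.

```python
def fold_reload_into_copy(insts):
    """Merge a reload with a directly following copy that kills the reloaded register.

    Hypothetical encoding (not LLVM's):
      ("lwz", dst, slot)                   reload from a stack slot
      ("copy", dst, src, src_is_killed)    register copy (or rD, rS, rS on PPC)
    """
    out = []
    i = 0
    while i < len(insts):
        cur = insts[i]
        nxt = insts[i + 1] if i + 1 < len(insts) else None
        if (cur[0] == "lwz" and nxt is not None and nxt[0] == "copy"
                and nxt[2] == cur[1] and nxt[3]):
            # reload straight into the copy's destination, drop the copy
            out.append(("lwz", nxt[1], cur[2]))
            i += 2
        else:
            out.append(cur)
            i += 1
    return out
```

The "last use" condition matters: if the reloaded register is still needed after the copy, the two instructions cannot be merged.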

code cleanup · f540c1a2
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23297
```
f540c1a2

Use continue in the use-processing loop to make it clear what the early exits · 14100037

Chris Lattner authored Sep 09, 2005

are, simplify logic, and cause things to not be nested as deeply.  This also
uses MRI->areAliases instead of an explicit loop.

No functionality change, just code cleanup.

llvm-svn: 23296

14100037

Last round of 2-node folds from SD.cpp. Will move on to 3 node ops such · 049b748c
Nate Begeman authored Sep 09, 2005
```
as setcc and select next.

llvm-svn: 23295
```
049b748c
remove debugging code *slaps head* · ce3662f2
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23294
```
ce3662f2

When spilling a live range that is used multiple times by one instruction, · c9053083

Chris Lattner authored Sep 09, 2005

only add a reload live range once for the instruction.  This is one step
towards fixing a regalloc pessimization that Nate noticed, but is later undone
by the spiller (so no code is changed).

llvm-svn: 23293

c9053083

Teach the code generator that rlwimi is commutable if the rotate amount · c37a2f13

Chris Lattner authored Sep 09, 2005

is zero.  This lets the register allocator elide some copies in some cases.

This implements CodeGen/PowerPC/rlwimi-commute.ll

llvm-svn: 23292

c37a2f13
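
Why a zero rotate makes rlwimi commutable can be checked with a small model of the instruction. The sketch below assumes the usual semantics, rA = (ROTL(rS, SH) & MASK(MB, ME)) | (rA & ~MASK(MB, ME)), and commutes by swapping the two registers and complementing the mask. The helper names are made up for this example, and the all-ones mask is left out because its complement (an empty mask) cannot be encoded with MB/ME.

```python
MASK32 = 0xFFFFFFFF


def rotl32(x, n):
    """Rotate a 32-bit value left by n bits."""
    n %= 32
    return ((x << n) | (x >> (32 - n))) & MASK32


def ppc_mask(mb, me):
    """Bits mb..me set in PowerPC numbering (bit 0 is the MSB); wraps if mb > me."""
    if mb <= me:
        bits = range(mb, me + 1)
    else:
        bits = list(range(mb, 32)) + list(range(0, me + 1))
    mask = 0
    for b in bits:
        mask |= 1 << (31 - b)
    return mask


def rlwimi(ra, rs, sh, mb, me):
    """Rotate rs left by sh and insert it into ra under the mask."""
    m = ppc_mask(mb, me)
    return (rotl32(rs, sh) & m) | (ra & ~m & MASK32)


def rlwimi_commuted(ra, rs, sh, mb, me):
    """Same instruction with the register operands swapped and the mask complemented."""
    return rlwimi(rs, ra, sh, (me + 1) % 32, (mb - 1) % 32)
```

With `sh == 0` both forms select the masked bits from `rs` and the remaining bits from `ra`, so they agree; with a nonzero rotate the swapped form rotates the wrong operand and the results differ.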

Introduce two new concepts: · 39b4d83f

Chris Lattner authored Sep 09, 2005

1. Add support for defining Pattern's, which can match expressions when there
   is no instruction that directly implements something.  Instructions usually
   implicitly define patterns.
2. Add support for defining SDNodeXForm's, which are node transformations.
   This separates the concept of a node xform out from the existing predicate
   support.

Using this new stuff, we add a few instruction patterns, one for testing, and
two for OR/XOR by an arbitrary immediate.

llvm-svn: 23286

39b4d83f
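
The OR/XOR-by-arbitrary-immediate patterns rest on a simple identity: a 32-bit immediate can be applied as a low-half operation followed by a high-half operation. The Python sketch below models that identity only. It is not the TableGen from the patch, and the helper names (`lo16`, `hi16`, `or_imm32`, `xor_imm32`) are assumptions of this example, with `lo16`/`hi16` standing in for the kind of node transformation an SDNodeXForm expresses.

```python
MASK32 = 0xFFFFFFFF


def lo16(imm):
    """Low 16 bits of a 32-bit immediate."""
    return imm & 0xFFFF


def hi16(imm):
    """High 16 bits of a 32-bit immediate, shifted down."""
    return (imm >> 16) & 0xFFFF


def ori(x, ui):
    return (x | ui) & MASK32            # OR with a 16-bit unsigned immediate


def oris(x, ui):
    return (x | (ui << 16)) & MASK32    # OR with the immediate shifted left by 16


def xori(x, ui):
    return (x ^ ui) & MASK32


def xoris(x, ui):
    return (x ^ (ui << 16)) & MASK32


def or_imm32(x, imm):
    """OR by an arbitrary 32-bit immediate as ori followed by oris."""
    return oris(ori(x, lo16(imm)), hi16(imm))


def xor_imm32(x, imm):
    """XOR by an arbitrary 32-bit immediate as xori followed by xoris."""
    return xoris(xori(x, lo16(imm)), hi16(imm))
```

Because the two halves touch disjoint bits, the order of the low and high operation does not matter.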

whitespace/comment changes, no functionality diffs · 4b09f3c6
Chris Lattner authored Sep 08, 2005
```
llvm-svn: 23283
```
4b09f3c6

Sep 08, 2005
- Move yet more folds over to the dag combiner from sd.cpp · 85c1cc45
  Nate Begeman authored Sep 08, 2005
```
llvm-svn: 23278
```
  85c1cc45
- Add a bunch of stuff needed for node type inference. Move 'BLR' down with · 0ec8fa08
  Chris Lattner authored Sep 08, 2005
```
the rest of the instructions, add comment markers to separate portions of
the file into logical parts

llvm-svn: 23277
```
  0ec8fa08
- add patterns for x?oris? · 76cb006e
  Chris Lattner authored Sep 08, 2005
```
llvm-svn: 23268
```
  76cb006e