Commits · e5b23a6d678f7214a39961d205d97b921470ede4 · Roger Ferrer / llvm-epi-0.8

Sep 18, 2005

Remove unintentionally committed code · e5b23a6d
Chris Lattner authored Sep 18, 2005
```
llvm-svn: 23382
```
e5b23a6d

implement shift.ll:test25. This compiles: · 27cb9dbd

Chris Lattner authored Sep 18, 2005

struct S { unsigned int i : 6, j : 11, k : 15; } b;
void plus3 (unsigned int x) {
  b.k += x;
}

to:

_plus3:
        lis r2, ha16(L_b$non_lazy_ptr)
        lwz r2, lo16(L_b$non_lazy_ptr)(r2)
        lwz r3, 0(r2)
        rlwinm r4, r3, 0, 0, 14
        add r4, r4, r3
        rlwimi r4, r3, 0, 15, 31
        stw r4, 0(r2)
        blr

instead of:

_plus3:
        lis r2, ha16(L_b$non_lazy_ptr)
        lwz r2, lo16(L_b$non_lazy_ptr)(r2)
        lwz r4, 0(r2)
        srwi r5, r4, 17
        add r3, r5, r3
        slwi r3, r3, 17
        rlwimi r3, r4, 0, 15, 31
        stw r3, 0(r2)
        blr

llvm-svn: 23381

27cb9dbd

Implement add.ll:test29. Codegening: · af517574

Chris Lattner authored Sep 18, 2005

struct S { unsigned int i : 6, j : 11, k : 15; } b;
void plus1 (unsigned int x) {
  b.i += x;
}

as:
_plus1:
        lis r2, ha16(L_b$non_lazy_ptr)
        lwz r2, lo16(L_b$non_lazy_ptr)(r2)
        lwz r4, 0(r2)
        add r3, r4, r3
        rlwimi r3, r4, 0, 0, 25
        stw r3, 0(r2)
        blr

instead of:

_plus1:
        lis r2, ha16(L_b$non_lazy_ptr)
        lwz r2, lo16(L_b$non_lazy_ptr)(r2)
        lwz r4, 0(r2)
        rlwinm r5, r4, 0, 26, 31
        add r3, r5, r3
        rlwimi r3, r4, 0, 0, 25
        stw r3, 0(r2)
        blr

llvm-svn: 23379

af517574

remove debug output · 027eaf01
Chris Lattner authored Sep 18, 2005
```
llvm-svn: 23377
```
027eaf01

Implement or.ll:test21. This teaches instcombine to be able to turn this: · 15212989

Chris Lattner authored Sep 18, 2005

struct {
   unsigned int bit0:1;
   unsigned int ubyte:31;
} sdata;

void foo() {
  sdata.ubyte++;
}

into this:

foo:
        add DWORD PTR [sdata], 2
        ret

instead of this:

foo:
        mov %EAX, DWORD PTR [sdata]
        mov %ECX, %EAX
        add %ECX, 2
        and %ECX, -2
        and %EAX, 1
        or %EAX, %ECX
        mov DWORD PTR [sdata], %EAX
        ret

llvm-svn: 23376

15212989

Sep 17, 2005
- Implement hook for ppc · 4d9cf680
  Chris Lattner authored Sep 17, 2005
  
  llvm-svn: 23374
  4d9cf680
Sep 16, 2005
- More DAG combining. Still need the branch instructions, and select_cc · 24a7eca2
  Nate Begeman authored Sep 16, 2005
  
  llvm-svn: 23371
  24a7eca2
Sep 15, 2005
- disable this for now · 0ebec066
  Chris Lattner authored Sep 15, 2005
  
  llvm-svn: 23366
  0ebec066
Sep 14, 2005
- Give all operands names · 9e4a4ee3
  Chris Lattner authored Sep 14, 2005
  
  llvm-svn: 23357
  9e4a4ee3
- give all operands names · 2e84be22
  Chris Lattner authored Sep 14, 2005
  
  llvm-svn: 23356
  2e84be22
- Fix some issues exposed by more testing. XORIS had the wrong operands · f006d15e
  Chris Lattner authored Sep 14, 2005
  
  specified. The various *imm operands defined by PPC are really all i32, even though the actual immediate is restricted to a smaller value in it. llvm-svn: 23352
  f006d15e
- Fix some bugs noticed by new checking code · 6b013fc9
  Chris Lattner authored Sep 14, 2005
  
  llvm-svn: 23350
  6b013fc9
- Fix the regression last night compiling povray · a393e4d4
  Chris Lattner authored Sep 14, 2005
  
  llvm-svn: 23348
  a393e4d4
- fix a major regression from my patch this afternoon · b42e962d
  Chris Lattner authored Sep 14, 2005
  
  llvm-svn: 23347
  b42e962d
- we don't need this proto any longer · b011cb27
  Chris Lattner authored Sep 13, 2005
  
  llvm-svn: 23342
  b011cb27
- move the #include for the generated code into the isel class body so we · 03e08eef
  Chris Lattner authored Sep 13, 2005
  
  can use/define class methods llvm-svn: 23339
  03e08eef
Sep 13, 2005
- Change the arg lowering code to use copyfromreg from vregs associated · 0f965a61
  Chris Lattner authored Sep 13, 2005
  
  with incoming arguments instead of the pregs themselves. This fixes the scheduler from causing problems by moving a copyfromreg for an argument to after a select_cc node (now it can, and bad things won't happen). llvm-svn: 23334
  0f965a61
- This has been moved to the target-indep code · ee811329
  Chris Lattner authored Sep 13, 2005
  
  llvm-svn: 23333
  ee811329
- This code is no longer needed, it is moved to the target-indep code · fb96e50b
  Chris Lattner authored Sep 13, 2005
  
  llvm-svn: 23332
  fb96e50b
- If a function has liveins, and if the target requested that they be plopped · d4382f0a
  Chris Lattner authored Sep 13, 2005
  
  into particular vregs, emit copies into the entry MBB. llvm-svn: 23331
  d4382f0a
- Majik numbers are bad · 64685b4c
  Chris Lattner authored Sep 13, 2005
  
  llvm-svn: 23330
  64685b4c
- Remove some dead vectors · aa6cbd90
  Chris Lattner authored Sep 13, 2005
  
  llvm-svn: 23329
  aa6cbd90
- Add a simple xform to simplify array accesses with casts in the way. · 2a893296
  Chris Lattner authored Sep 13, 2005
  
  This is useful for 178.galgel where resolution of dope vectors (by the optimizer) causes the scales to become apparent. llvm-svn: 23328
  2a893296
- Fix an issue where LSR would miss rewriting a use of an IV expression by a PHI... · fd018c8d
  Chris Lattner authored Sep 13, 2005
  
  Fix an issue where LSR would miss rewriting a use of an IV expression by a PHI node that is not the original PHI. This fixes up a dot-product loop in galgel, speeding it up from 18.47s to 16.13s. llvm-svn: 23327
  fd018c8d
- Add a helper function, allowing us to simplify some code a bit, changing · 567b81f0
  Chris Lattner authored Sep 13, 2005
  
  indentation, no functionality change llvm-svn: 23325
  567b81f0
- Implement a simple xform to turn code like this: · 219175c8
  Chris Lattner authored Sep 12, 2005
  
  if () { store A -> P; } else { store B -> P; } into a PHI node with one store, in the most trival case. This implements load.ll:test10. llvm-svn: 23324
  219175c8
- Another load-peephole optimization: do gcse when two loads are next to · e0bfdf14
  Chris Lattner authored Sep 12, 2005
  
  each other. This implements InstCombine/load.ll:test9 llvm-svn: 23322
  e0bfdf14
- Implement a trivial form of store->load forwarding where the store and the · b990f7d8
  Chris Lattner authored Sep 12, 2005
  
  load are exactly consequtive. This is picked up by other passes, but this triggers thousands of times in fortran programs that use static locals (and is thus a compile-time speedup). llvm-svn: 23320
  b990f7d8
Sep 12, 2005

Fix a regression from last night, which caused this pass to create invalid · 8048b85e

Chris Lattner authored Sep 12, 2005

code for IV uses outside of loops that are not dominated by the latch block.
We should only convert these uses to use the post-inc value if they ARE
dominated by the latch block.

Also use a new LoopInfo method to simplify some code.

This fixes Transforms/LoopStrengthReduce/2005-09-12-UsesOutOutsideOfLoop.ll

llvm-svn: 23318

8048b85e

Add a new getLoopLatch() method. · b35df5f5
Chris Lattner authored Sep 12, 2005
```
llvm-svn: 23315
```
b35df5f5

_test: · a6764839

Chris Lattner authored Sep 12, 2005

        li r2, 0
LBB_test_1:     ; no_exit.2
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmpwi cr0, r2, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r2, 1
        stw r2, 0(r4)
        blr
[zion ~/llvm]$ cat > ~/xx
Uses of IV's outside of the loop should use hte post-incremented version
of the IV, not the preincremented version.  This helps many loops (e.g. in sixtrack)
which used to generate code like this (this is the code from the
dont-hoist-simple-loop-constants.ll testcase):

_test:
        li r2, 0                 **** IV starts at 0
LBB_test_1:     ; no_exit.2
        or r5, r2, r2            **** Copy for loop exit
        li r2, 0
        stw r2, 0(r3)
        addi r3, r3, 4
        addi r2, r5, 1
        addi r6, r5, 2           **** IV+2
        cmpwi cr0, r6, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r5, 2       ****  IV+2
        stw r2, 0(r4)
        blr

And now generated code like this:

_test:
        li r2, 1               *** IV starts at 1
LBB_test_1:     ; no_exit.2
        li r5, 0
        stw r5, 0(r3)
        addi r2, r2, 1
        addi r3, r3, 4
        cmpwi cr0, r2, 701     *** IV.postinc + 0
        blt cr0, LBB_test_1
LBB_test_2:     ; loopexit.2.loopexit
        stw r2, 0(r4)          *** IV.postinc + 0
        blr

llvm-svn: 23313

a6764839

Sep 10, 2005

implement Transforms/LoopStrengthReduce/dont-hoist-simple-loop-constants.ll. · 530fe6ab

Chris Lattner authored Sep 10, 2005

We used to emit this code for it:

_test:
        li r2, 1     ;; Value tying up a register for the whole loop
        li r5, 0
LBB_test_1:     ; no_exit.2
        or r6, r5, r5
        li r5, 0
        stw r5, 0(r3)
        addi r5, r6, 1
        addi r3, r3, 4
        add r7, r2, r5  ;; should be addi r7, r5, 1
        cmpwi cr0, r7, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r6, 2
        stw r2, 0(r4)
        blr

now we emit this:

_test:
        li r2, 0
LBB_test_1:     ; no_exit.2
        or r5, r2, r2
        li r2, 0
        stw r2, 0(r3)
        addi r3, r3, 4
        addi r2, r5, 1
        addi r6, r5, 2   ;; whoa, fold those adds!
        cmpwi cr0, r6, 701
        blt cr0, LBB_test_1     ; no_exit.2
LBB_test_2:     ; loopexit.2.loopexit
        addi r2, r5, 2
        stw r2, 0(r4)
        blr

more improvement coming.

llvm-svn: 23306

530fe6ab

PowerPC cannot truncstore i1 natively · 4309c3a7
Chris Lattner authored Sep 10, 2005
```
llvm-svn: 23304
```
4309c3a7
Allow targets to say they don't support truncstore i1 (which includes a mask · 2d454bf5
Chris Lattner authored Sep 10, 2005
```
when storing to an 8-bit memory location), as most don't.

llvm-svn: 23303
```
2d454bf5
Add a missing #include, patch courtesy of Baptiste Lepilleur. · bd39c1a4
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23302
```
bd39c1a4

Fix a problem duraid encountered on itanium where this folding: · 331b311f

Chris Lattner authored Sep 09, 2005

select (x < y), 1, 0 -> (x < y) incorrectly: the setcc returns i1 but the
select returned i32.  Add the zero extend as needed.

llvm-svn: 23301

331b311f

Fix a crash viewing dags that have target nodes in them · 16e5cb87
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23300
```
16e5cb87

Sep 09, 2005

I forgot that we always spill fp values as 64-bits. Implement spill folding · 0f2146bb
Chris Lattner authored Sep 09, 2005
```
for FP as well.  This triggers a couple dozen times on 177.mesa (for example).

llvm-svn: 23299
```
0f2146bb

Fix a problem that Nate noticed, where spill code was not getting coallesced · 712e78ee

Chris Lattner authored Sep 09, 2005

with copies, leading to code like this:

       lwz r4, 380(r1)
       or r10, r4, r4    ;; Last use of r4

By teaching the PPC backend how to fold spills into copies, we now get this
code:

       lwz r10, 380(r1)

wow. :)

This reduces a testcase nate sent me from 1505 instructions to 1484.

Note that this could handle FP values but doesn't currently, for reasons
mentioned in the patch

llvm-svn: 23298

712e78ee

code cleanup · f540c1a2
Chris Lattner authored Sep 09, 2005
```
llvm-svn: 23297
```
f540c1a2