Commits · 1e89e36dcd6e2dac68725ab59c0b16396f36aea9 · Roger Ferrer / llvm-epi-0.8

Sep 02, 2005
- Fix some buggy logic where we would try to remove nodes with two operands · 1e89e36d
  Chris Lattner authored Sep 02, 2005
```
from the binary ops map, even if they had multiple results.  This latent bug
caused a few failures with the dag isel last night.

To prevent stuff like this from happening in the future, add some really
strict checking to make sure that the CSE maps always match up with reality!

llvm-svn: 23221
```
  1e89e36d
- Pull out Lowering in preparation for multiple ISels. Oh, and get rid of some stuff · 9690a4f3
  Andrew Lenharth authored Sep 02, 2005
```
llvm-svn: 23220
```
  9690a4f3
- Don't create zero sized stack objects even for array allocas with a zero · b0b4ec56
  Chris Lattner authored Sep 02, 2005
```
number of elements.

llvm-svn: 23219
```
  b0b4ec56
- Decouple fsqrt from gpul optimizations, implementing fsqrt.ll. · aa3b1fcc
  Chris Lattner authored Sep 02, 2005
```
Remove the -enable-gpopt option which is subsumed by feature flags.

llvm-svn: 23218
```
  aa3b1fcc
- new testcase to ensure fsqrt is generated for correct subtargets only, and · ffb99034
  Chris Lattner authored Sep 02, 2005
```
that the fsqrt feature works.

llvm-svn: 23217
```
  ffb99034
- Move a bunch of non-deprecated methods above the "deprecated line" · 38ad3f69
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23216
```
  38ad3f69
- Fix the release build, noticed by Eric van Riet Paap · b6cde17d
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23215
```
  b6cde17d
- Fix a problem that Dan Berlin noticed, where reassociation would not succeed · b5e381a8
  Chris Lattner authored Sep 02, 2005
```
in building maximal expressions before simplifying them.  In particular, in
cases like this:

X-(A+B+X)

the code would consider A+B+X to be a maximal expression (not understanding
that the single use '-' would be turned into a + later), simplify it (a noop)
then later get simplified again.

Each of these simplify steps is where the cost of reassociation comes from,
so this patch should speed up the already fast pass a bit.

Thanks to Dan for noticing this!

llvm-svn: 23214
```
  b5e381a8
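
The idea in the commit above can be illustrated with a toy model. This is only a sketch under stated assumptions, not the Reassociate pass itself: operands are plain strings with a sign, and `Term`, `negateAll`, `simplifySum`, and `reassociateExample` are hypothetical names invented for the illustration. It shows that once the subtraction is treated as an addition of negated operands, `X-(A+B+X)` becomes one maximal sum that needs a single simplification step.

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

// One operand of a flattened sum: a variable name and its sign (+1 or -1).
using Term = std::pair<std::string, int>;

// Hypothetical helper: negate every term, modelling -(A+B+X) -> -A + -B + -X.
std::vector<Term> negateAll(std::vector<Term> terms) {
  for (Term &t : terms)
    t.second = -t.second;
  return terms;
}

// Hypothetical helper: add up the coefficient of each variable and drop
// variables whose coefficients cancel to zero.
std::map<std::string, int> simplifySum(const std::vector<Term> &terms) {
  std::map<std::string, int> coeff;
  for (const Term &t : terms)
    coeff[t.first] += t.second;
  for (auto it = coeff.begin(); it != coeff.end();) {
    if (it->second == 0)
      it = coeff.erase(it);
    else
      ++it;
  }
  return coeff;
}

// X - (A + B + X): flatten into one maximal sum first, then simplify once.
std::map<std::string, int> reassociateExample() {
  std::vector<Term> sum;
  sum.emplace_back("X", 1);
  std::vector<Term> inner = negateAll({{"A", 1}, {"B", 1}, {"X", 1}});
  sum.insert(sum.end(), inner.begin(), inner.end());
  return simplifySum(sum);
}
```

In this model the two `X` terms cancel in the single call to `simplifySum`, leaving only `-A` and `-B`.
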
- Avoid creating garbage instructions, just move the old add instruction · 9fe263aa
  Chris Lattner authored Sep 02, 2005
```
to where we need it when converting -(A+B+C) -> -A + -B + -C.

llvm-svn: 23213
```
  9fe263aa
- new testcase for recent bugfix · b944d5a4
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23212
```
  b944d5a4
- add some assertions and fix problems where reassociate could access the · d1325da0
  Chris Lattner authored Sep 02, 2005
```
Ops vector out of range

llvm-svn: 23211
```
  d1325da0
- Fix VC++ build errors · a6dde996
  Jeff Cohen authored Sep 02, 2005
```
llvm-svn: 23210
```
  a6dde996
- Restore this patch now that the latent bug has been fixed · 763a3a0f
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23209
```
  763a3a0f
- Make sure to legalize assert[zs]ext's operand correctly · d9af1aab
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23208
```
  d9af1aab
- Revert the previous patch which causes a mysterious regression in toast. · 06d440f2
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23207
```
  06d440f2
- Teach live intervals to not crash on dead livein regs · 7138f914
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23206
```
  7138f914
- For values that are live across basic blocks and need promotion, use ANY_EXTEND · a66403db
  Chris Lattner authored Sep 02, 2005
```
instead of ZERO_EXTEND to eliminate extraneous extensions.  This eliminates
dead zero extensions on formal arguments and other cases on PPC, implementing
the newly tightened up test/Regression/CodeGen/PowerPC/small-arguments.ll test.

llvm-svn: 23205
```
  a66403db
- legalize ANY_EXTEND appropriately · 7753f175
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23204
```
  7753f175
- Add support for ANY_EXTEND and add a few minor folds for it · 8c393c21
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23203
```
  8c393c21
- Handle any_extend like zext · 210975cf
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23202
```
  210975cf
- Handle ANY_EXTEND like ZERO_EXTEND. Simplify the extend/truncate code on · 2493f0e5
  Chris Lattner authored Sep 02, 2005
```
the observation that it only has to handle i1 -> i64 and i64 -> i1.

llvm-svn: 23201
```
  2493f0e5
- Add a new ANY_EXTEND node, which operates like an extension but has undefined · 969af57d
  Chris Lattner authored Sep 02, 2005
```
top bits.

llvm-svn: 23200
```
  969af57d
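
A rough way to picture "an extension with undefined top bits" is the sketch below. It is a model on plain integers, not SelectionDAG code: `zeroExtend8`, `anyExtend8`, and `truncate8` are hypothetical helpers, and the `garbage` parameter is an assumption that stands in for whatever the wider register happened to hold.

```cpp
#include <cstdint>

// Zero extension: the upper 24 bits of the result are guaranteed to be 0.
uint32_t zeroExtend8(uint8_t v) { return v; }

// Model of an "any" extension: only the low 8 bits are meaningful. The
// upper bits are taken from `garbage` to mimic leftover register contents.
uint32_t anyExtend8(uint8_t v, uint32_t garbage) {
  return (garbage & 0xFFFFFF00u) | v;
}

// Truncation looks only at the low 8 bits, so it cannot observe the
// difference between the two extensions.
uint8_t truncate8(uint32_t v) { return static_cast<uint8_t>(v & 0xFFu); }
```

A consumer that only reads the low bits, such as a truncate, gets the same answer from either extension, which is why the cheaper one can be used for values that are merely being promoted.
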
- This should permit NOT and's, not even dead ones. · 97ac77fd
  Chris Lattner authored Sep 02, 2005
```
llvm-svn: 23199
```
  97ac77fd
- Implement small-arguments.ll:test3 by teaching the DAG optimizer that · 9ee867b9
  Chris Lattner authored Sep 01, 2005
```
the results of calls to functions returning small values are properly
sign/zero extended.

llvm-svn: 23198
```
  9ee867b9
- Another case that shouldn't have sign extends: functions returning short · 907123b1
  Chris Lattner authored Sep 01, 2005
```
are known to return properly sign extended values, no need for an explicit
extension.

llvm-svn: 23197
```
  907123b1
- Fix some code in the current node combining code, spotted when it was moved · d78d9754
  Nate Begeman authored Sep 01, 2005
```
over to DAGCombiner.cpp

1. Don't assume that SetCC returns i1 when folding (xor (setcc) constant)
2. Don't duplicate code in folding AND with AssertZext that is handled by
   MaskedValueIsZero

llvm-svn: 23196
```
  d78d9754
- Implement first round of feedback from chris (there's still a couple things · 2504fe26
  Nate Begeman authored Sep 01, 2005
```
left to do).

llvm-svn: 23195
```
  2504fe26
- Align functions to 16-byte boundaries, to eliminate noise in performance... · 68d15fdf
  Chris Lattner authored Sep 01, 2005
```
Align functions to 16-byte boundaries, to eliminate noise in performance measurements.  This improves the performance of 'treeadd' by about 20% with the dag
isel, restoring it to the pattern-isel level (which happens to get the alignment right).

llvm-svn: 23194
```
  68d15fdf
Sep 01, 2005

Local labels on darwin apparently start with just 'L', not .L like other · e40a3ccd

Chris Lattner authored Sep 01, 2005

platforms.  This reduces executable size and makes shark realize the actual
bounds of functions instead of showing each MBB as a function :)

llvm-svn: 23193

e40a3ccd

1. Use SubtargetFeatures in llc/lli. · 19058c39

Jim Laskey authored Sep 01, 2005

2. Propagate feature "string" to all targets.

3. Implement use of SubtargetFeatures in PowerPCTargetSubtarget.

llvm-svn: 23192

19058c39

This new class provides support for platform specific "features". The intent · 3fee6a51

Jim Laskey authored Sep 01, 2005

is to manage processor specific attributes from the command line.  See examples
of use in llc/lli and PowerPCTargetSubtarget.

llvm-svn: 23191

3fee6a51

Implement dynamic allocas correctly. In particular, because we were copying · a305d28c

Chris Lattner authored Sep 01, 2005

directly out of R1 (without using a CopyFromReg, which uses a chain), multiple
allocas were getting CSE'd together, producing bogus code.  For this:

int %foo(bool %X, int %A, int %B) {
        br bool %X, label %T, label %F
F:
        %G = alloca int
        %H = alloca int
        store int %A, int* %G
        store int %B, int* %H
        %R = load int* %G
        ret int %R
T:
        ret int 0
}

We were generating:

_foo:
        stwu r1, -16(r1)
        stw r31, 4(r1)
        or r31, r1, r1
        stw r1, 12(r31)
        cmpwi cr0, r3, 0
        bne cr0, .LBB_foo_2     ; T
.LBB_foo_1:     ; F
        li r2, 16
        subf r2, r2, r1   ;; One alloca
        or r1, r2, r2
        or r3, r1, r1
        or r1, r2, r2
        or r2, r1, r1
        stw r4, 0(r3)
        stw r5, 0(r2)
        lwz r3, 0(r3)
        lwz r1, 12(r31)
        lwz r31, 4(r31)
        lwz r1, 0(r1)
        blr
.LBB_foo_2:     ; T
        li r3, 0
        lwz r1, 12(r31)
        lwz r31, 4(r31)
        lwz r1, 0(r1)
        blr

Now we generate:

_foo:
        stwu r1, -16(r1)
        stw r31, 4(r1)
        or r31, r1, r1
        stw r1, 12(r31)
        cmpwi cr0, r3, 0
        bne cr0, .LBB_foo_2     ; T
.LBB_foo_1:     ; F
        or r2, r1, r1
        li r3, 16
        subf r2, r3, r2  ;; Alloca 1
        or r1, r2, r2
        or r2, r1, r1
        or r6, r1, r1
        subf r3, r3, r6  ;; Alloca 2
        or r1, r3, r3
        or r3, r1, r1
        stw r4, 0(r2)
        stw r5, 0(r3)
        lwz r3, 0(r2)
        lwz r1, 12(r31)
        lwz r31, 4(r31)
        lwz r1, 0(r1)
        blr
.LBB_foo_2:     ; T
        li r3, 0
        lwz r1, 12(r31)
        lwz r31, 4(r31)
        lwz r1, 0(r1)
        blr

This fixes Povray and SPASS with the dag isel, the last two failing cases.
Tomorrow we will hopefully turn it on by default! :)

llvm-svn: 23190

a305d28c
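
Why a missing chain lets two allocas collapse into one can be sketched with a toy CSE map. This is a simplified model, not the SelectionDAG implementation: `ToyDAG`, `getNode`, and the opcode strings are hypothetical. Nodes are identified purely by opcode and operand list, so two stack-pointer reads with identical operands become the same node, while reads that take different chain operands stay distinct.

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

// Toy DAG in which a node is identified by its opcode and operand ids.
struct ToyDAG {
  using Key = std::pair<std::string, std::vector<int>>;
  std::map<Key, int> cseMap;
  int nextId = 0;

  // Return the existing node for (opcode, operands) if there is one,
  // otherwise create a new node. This is the CSE step.
  int getNode(const std::string &opcode, const std::vector<int> &operands) {
    Key key{opcode, operands};
    auto it = cseMap.find(key);
    if (it != cseMap.end())
      return it->second;
    int id = nextId++;
    cseMap.emplace(key, id);
    return id;
  }
};

// Two chainless reads of the stack pointer look identical and get merged.
bool chainlessReadsMerge() {
  ToyDAG dag;
  int first = dag.getNode("ReadSP", {});
  int second = dag.getNode("ReadSP", {});
  return first == second;
}

// With a chain operand, the second read depends on the stack adjustment
// made by the first alloca, so its key differs and it is not merged.
bool chainedReadsStayDistinct() {
  ToyDAG dag;
  int entry = dag.getNode("EntryToken", {});
  int read1 = dag.getNode("CopyFromRegSP", {entry});
  int adjust = dag.getNode("AdjustSP", {read1});
  int read2 = dag.getNode("CopyFromRegSP", {adjust});
  return read1 != read2;
}
```
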

Fix a bug where we were using HA to get the high part, which seems like it · 293b3a68

Chris Lattner authored Sep 01, 2005

could cause a miscompile.  Fixing this didn't fix the two programs that fail
though.  :(

This also changes the implementation to follow the pattern selector more
closely, causing us to select 0 to li instead of lis.

llvm-svn: 23189

293b3a68
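
The commit does not show the affected code, so the following is only a sketch of the general difference between the "high" and "high adjusted" halves of a 32-bit constant on PowerPC. The helper names are hypothetical. The adjusted form compensates for a low half that will later be sign-extended (as by `addi`); pairing it with a zero-extending `ori` gives a wrong value whenever bit 15 of the constant is set.

```cpp
#include <cstdint>

// Low 16 bits, plain high 16 bits, and "high adjusted" 16 bits of x.
uint16_t lo16(uint32_t x) { return static_cast<uint16_t>(x & 0xFFFFu); }
uint16_t hi16(uint32_t x) { return static_cast<uint16_t>(x >> 16); }
uint16_t ha16(uint32_t x) {
  return static_cast<uint16_t>((x + 0x8000u) >> 16);
}

// Models lis + ori: the low half is zero-extended and OR'ed in.
uint32_t buildWithOri(uint16_t high, uint16_t low) {
  return (static_cast<uint32_t>(high) << 16) | low;
}

// Models lis + addi: the low half is sign-extended and added.
uint32_t buildWithAddi(uint16_t high, uint16_t low) {
  uint32_t signExtendedLow =
      static_cast<uint32_t>(static_cast<int32_t>(static_cast<int16_t>(low)));
  return (static_cast<uint32_t>(high) << 16) + signExtendedLow;
}
```

For `0x12348000`, `hi16` with `buildWithOri` and `ha16` with `buildWithAddi` both rebuild the constant, while `ha16` with `buildWithOri` yields `0x12358000`.
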

Do not select the operands being passed into SelectCC. It does this itself · 34182aff
Chris Lattner authored Sep 01, 2005
```
and selecting early prevents folding immediates into the cmpw* instructions

llvm-svn: 23188
```
34182aff
It is NDEBUG not _NDEBUG · 975f5c9f
Chris Lattner authored Sep 01, 2005
```
llvm-svn: 23186
```
975f5c9f
Add the rest of the currently implemented visit routines to the switch · e8f78d1a
Nate Begeman authored Sep 01, 2005
```
statement in visit().

llvm-svn: 23185
```
e8f78d1a

First pass at the DAG Combiner. It isn't used anywhere yet, but it should · 21158fc4

Nate Begeman authored Sep 01, 2005

be mostly functional.  It currently has all folds from SelectionDAG.cpp
that do not involve a condition code.

llvm-svn: 23184

21158fc4

Add regression test for efficient codegen of i32 x i32 -> hi32(i64) as · 13990fc2
Nate Begeman authored Sep 01, 2005
```
mulhs.

llvm-svn: 23183
```
13990fc2
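
For reference, the `i32 x i32 -> hi32(i64)` computation that the test expects to become a single high-multiply can be written out in plain C++ as below. `mulHighSigned` is a hypothetical name, and the sketch assumes that right-shifting a negative 64-bit value is an arithmetic shift, which is guaranteed from C++20 and is what mainstream compilers do before that.

```cpp
#include <cstdint>

// Signed 32x32 multiply that keeps only the upper 32 bits of the
// 64-bit product.
int32_t mulHighSigned(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) * static_cast<int64_t>(b);
  return static_cast<int32_t>(wide >> 32);
}
```
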
remove an inappropriate comment · b3d2e790
Chris Lattner authored Aug 31, 2005
```
llvm-svn: 23182
```
b3d2e790
If a function has live ins/outs, print them · d4d10fff
Chris Lattner authored Aug 31, 2005
```
llvm-svn: 23181
```
d4d10fff