Commits · 7a4c694ef7067e68d307b057901fccdf1aa265f6 · Roger Ferrer / llvm-epi-0.8

Nov 05, 2005

Turn sdiv into udiv if both operands have a clear sign bit. This occurs · dd0c1740

Chris Lattner authored Nov 05, 2005

a few times in crafty:

OLD:    %tmp.36 = div int %tmp.35, 8            ; <int> [#uses=1]
NEW:    %tmp.36 = div uint %tmp.35, 8           ; <uint> [#uses=0]
OLD:    %tmp.19 = div int %tmp.18, 8            ; <int> [#uses=1]
NEW:    %tmp.19 = div uint %tmp.18, 8           ; <uint> [#uses=0]
OLD:    %tmp.117 = div int %tmp.116, 8          ; <int> [#uses=1]
NEW:    %tmp.117 = div uint %tmp.116, 8         ; <uint> [#uses=0]
OLD:    %tmp.92 = div int %tmp.91, 8            ; <int> [#uses=1]
NEW:    %tmp.92 = div uint %tmp.91, 8           ; <uint> [#uses=0]

Which all turn into shrs.

llvm-svn: 24190

dd0c1740

Turn srem -> urem when neither input has their sign bit set. This triggers · e9ff0eaf

Chris Lattner authored Nov 05, 2005

8 times in vortex, allowing the srems to be turned into shrs:

OLD: %tmp.104 = rem int %tmp.5.i37, 16 ; <int> [#uses=1]
NEW: %tmp.104 = rem uint %tmp.5.i37, 16 ; <uint> [#uses=0]
OLD: %tmp.98 = rem int %tmp.5.i24, 16 ; <int> [#uses=1]
NEW: %tmp.98 = rem uint %tmp.5.i24, 16 ; <uint> [#uses=0]
OLD: %tmp.91 = rem int %tmp.5.i19, 8 ; <int> [#uses=1]
NEW: %tmp.91 = rem uint %tmp.5.i19, 8 ; <uint> [#uses=0]
OLD: %tmp.88 = rem int %tmp.5.i14, 8 ; <int> [#uses=1]
NEW: %tmp.88 = rem uint %tmp.5.i14, 8 ; <uint> [#uses=0]
OLD: %tmp.85 = rem int %tmp.5.i9, 1024 ; <int> [#uses=2]
NEW: %tmp.85 = rem uint %tmp.5.i9, 1024 ; <uint> [#uses=0]
OLD: %tmp.82 = rem int %tmp.5.i, 512 ; <int> [#uses=2]
NEW: %tmp.82 = rem uint %tmp.5.i1, 512 ; <uint> [#uses=0]
OLD: %tmp.48.i = rem int %tmp.5.i.i161, 4 ; <int> [#uses=1]
NEW: %tmp.48.i = rem uint %tmp.5.i.i161, 4 ; <uint> [#uses=0]
OLD: %tmp.20.i2 = rem int %tmp.5.i.i, 4 ; <int> [#uses=1]
NEW: %tmp.20.i2 = rem uint %tmp.5.i.i, 4 ; <uint> [#uses=0]

it also occurs 9 times in gcc, but with odd constant divisors (1009 and 61)
so the payoff isn't as great.

llvm-svn: 24189

e9ff0eaf

Fix logic bug in finding retry slot in tally. · 904dbb4a
Jim Laskey authored Nov 05, 2005
```
llvm-svn: 24188
```
904dbb4a

Nov 04, 2005
- Fix a warning · ded4759d
  Jim Laskey authored Nov 04, 2005
```
llvm-svn: 24187
```
  ded4759d
- oops, forgot to load GP for indirect calls, though the old code now commented · 31071b74
  Duraid Madina authored Nov 04, 2005
```
out failed (e.g. methcall) - now the code compiles, though it's not quite
right just yet (tm) ;)

would fix this but it's 3am! :O

llvm-svn: 24186
```
  31071b74
- kill redundant SP/GP/RP save/restores across calls · d3260128
  Duraid Madina authored Nov 04, 2005
```
llvm-svn: 24183
```
  d3260128
- add support for loading bools · fc1d1b24
  Duraid Madina authored Nov 04, 2005
```
llvm-svn: 24182
```
  fc1d1b24
- Scheduling now uses itinerary data. · e682b677
  Jim Laskey authored Nov 04, 2005
```
llvm-svn: 24180
```
  e682b677
- fun with predicates! (add TRUNC i64->i1, AND i1 i1, fix XOR i1 i1) · 7ac646ef
  Duraid Madina authored Nov 04, 2005
```
llvm-svn: 24175
```
  7ac646ef
Nov 03, 2005
- add pattern to load constant 0 into a predicate reg · f0f22a55
  Duraid Madina authored Nov 03, 2005
```
llvm-svn: 24164
```
  f0f22a55
- Fix a bug that prevented this pattern from matching · 674660ff
  Chris Lattner authored Nov 03, 2005
```
llvm-svn: 24161
```
  674660ff
Nov 02, 2005
- Fix a crash that Andrew noticed, and add a pair of braces to unfconfuse · ee065281
  Nate Begeman authored Nov 02, 2005
```
XCode's indenting.

llvm-svn: 24159
```
  ee065281
- make this 64 bit clean, fixed test30 of /Regression/Transforms/InstCombine/add.ll · 66229558
  Andrew Lenharth authored Nov 02, 2005
```
llvm-svn: 24158
```
  66229558
- Fix a QOI issue noticed by Markus F.X.J. Oberhumer. · 9b9a8396
  Chris Lattner authored Nov 02, 2005
```
This fixes PR641

llvm-svn: 24154
```
  9b9a8396
- "fix" support for FP constants (this code asserts in the scheduler, · 955ffafd
  Duraid Madina authored Nov 02, 2005
```
though)

llvm-svn: 24152
```
  955ffafd
- add F0 and F1 to the FP register class · 4480dcdc
  Duraid Madina authored Nov 02, 2005
```
llvm-svn: 24151
```
  4480dcdc
- This works now · b5310bdb
  Chris Lattner authored Nov 02, 2005
```
llvm-svn: 24150
```
  b5310bdb
- add support for SELECT to TargetSelectionDAG.td, add support for · 17decbb2
  Duraid Madina authored Nov 02, 2005
```
selecting ints to IA64, and a few other ia64 bits and pieces

llvm-svn: 24147
```
  17decbb2
- add support for loading FP constants +0.0 and +1.0 to the dag isel, · 9abf1650
  Duraid Madina authored Nov 02, 2005
```
stop pretending -0.0 and -1.0 are machine constants

llvm-svn: 24146
```
  9abf1650
- Fix a source of undefined behavior when dealing with 64-bit types. This · 17df6087
  Chris Lattner authored Nov 02, 2005
```
may fix PR652.  Thanks to Andrew for tracking down the problem.

llvm-svn: 24145
```
  17df6087
Nov 01, 2005
- Allow itineraries to be passed through the Target Machine. · 802748cd
  Jim Laskey authored Nov 01, 2005
```
llvm-svn: 24139
```
  802748cd
- heh, scheduling was easy? · 5a087ff8
  Duraid Madina authored Nov 01, 2005
```
need to send chris, jim and sampo a box of fish each

llvm-svn: 24135
```
  5a087ff8
- FORTRAN!!! :( and other similarly unfortunate things mean that on ia64 · 9b61d3c1
  Duraid Madina authored Nov 01, 2005
```
one sometimes needs to pass FP args in both FP *and* integer registers.

llvm-svn: 24134
```
  9b61d3c1
- so tablegen was thinking I might want to convert FPs to predicates. · b81b6133
  Duraid Madina authored Nov 01, 2005
```
clever little tablegen!

llvm-svn: 24133
```
  b81b6133
- add support for int->FP and FP->int ops, and add ia64 patterns for these · 6c912bff
  Duraid Madina authored Nov 01, 2005
```
llvm-svn: 24132
```
  6c912bff
- add zeroextend predicate->integer · a284b663
  Duraid Madina authored Nov 01, 2005
```
llvm-svn: 24131
```
  a284b663
- Add a flag to enable a darwin linker optimization · 7432ceef
  Chris Lattner authored Nov 01, 2005
```
llvm-svn: 24130
```
  7432ceef
Oct 31, 2005
- Make constant pool entries use private labels. This is important when you're · 6b63e0c6
  Chris Lattner authored Oct 31, 2005
```
not compiling a whole program at a time :)

llvm-svn: 24129
```
  6b63e0c6
- Fix an iterator invalidation problem in code used by the -strip pass · 71d73eb4
  Chris Lattner authored Oct 31, 2005
```
llvm-svn: 24124
```
  71d73eb4
- Limit the search depth of MaskedValueIsZero to 6 instructions, to avoid · 09efd4e5
  Chris Lattner authored Oct 31, 2005
```
bad cases.  This fixes Markus's second testcase in PR639, and should
seal it for good.

llvm-svn: 24123
```
  09efd4e5
- · 5ce05382
  Jim Laskey authored Oct 31, 2005
```
1. Embed and not inherit vector for NodeGroup.

2. Iterate operands and not uses (performance.)

3. Some long pending comment changes.

llvm-svn: 24119
```
  5ce05382
- add FP compares and implicit register defs to the dag isel · 88fc69f6
  Duraid Madina authored Oct 31, 2005
```
llvm-svn: 24118
```
  88fc69f6
Oct 30, 2005

Significantly simplify this code and make it more aggressive. Instead of having · 6871b23d

Chris Lattner authored Oct 30, 2005

a special case hack for X86, make the hack more general: if an incoming argument
register is not used in any block other than the entry block, don't copy it to
a vreg.  This helps us compile code like this:

%struct.foo = type { int, int, [0 x ubyte] }
int %test(%struct.foo* %X) {
        %tmp1 = getelementptr %struct.foo* %X, int 0, uint 2, int 100
        %tmp = load ubyte* %tmp1                ; <ubyte> [#uses=1]
        %tmp2 = cast ubyte %tmp to int          ; <int> [#uses=1]
        ret int %tmp2
}

to:

_test:
        lbz r3, 108(r3)
        blr

instead of:

_test:
        lbz r2, 108(r3)
        or r3, r2, r2
        blr

The (dead) copy emitted to copy r3 into a vreg for extra-block uses was
increasing the live range of r3 past the load, preventing the coallescing.

This implements CodeGen/PowerPC/reg-coallesce-simple.ll

llvm-svn: 24115

6871b23d

Reduce the number of copies emitted as machine instructions by · dd5663df

Chris Lattner authored Oct 30, 2005

generating results in vregs that will need them.  In the case of something
like this:  CopyToReg((add X, Y), reg1024), we no longer emit code like
this:

   reg1025 = add X, Y
   reg1024 = reg 1025

Instead, we emit:

   reg1024 = add X, Y

Whoa! :)

llvm-svn: 24111

dd5663df

If the module has no t-t and the host is an alpha, default to using the Alpha BE · 5c7d7318
Chris Lattner authored Oct 30, 2005
```
llvm-svn: 24110
```
5c7d7318
fix some broken comparisons, this affected the Pattern isel too. · 57b7ee9d
Duraid Madina authored Oct 30, 2005
```
llvm-svn: 24109
```
57b7ee9d
This is implemented · e507a151
Chris Lattner authored Oct 30, 2005
```
llvm-svn: 24107
```
e507a151

Codegen mul by negative power of two with a shift and negate. · a70878d4

Chris Lattner authored Oct 30, 2005

This implements test/Regression/CodeGen/PowerPC/mul-neg-power-2.ll,
producing:

_foo:
        slwi r2, r3, 1
        subfic r3, r2, 63
        blr

instead of:

_foo:
        mulli r2, r3, -2
        addi r3, r2, 63
        blr

llvm-svn: 24106

a70878d4

Fix a problem that Nate noticed with LSR: · f0b77f9a

Chris Lattner authored Oct 30, 2005

When inserting code for an addrec expression with a non-unit stride, be
more careful where we insert the multiply. In particular, insert the multiply
in the outermost loop we can, instead of the requested insertion point.

This allows LSR to notice the mul in the right loop, reducing it when it gets
to it. This allows it to reduce the multiply, where before it missed it.

This happens quite a bit in the test suite, for example, eliminating 2
multiplies in art, 3 in ammp, 4 in apsi, reducing from 1050 multiplies to
910 muls in galgel (!), from 877 to 859 in applu, and 36 to 30 in bzip2.

This speeds up galgel from 16.45s to 16.01s, applu from 14.21 to 13.94s and
fourinarow from 66.67s to 63.48s.

This implements Transforms/LoopStrengthReduce/nested-reduce.ll

llvm-svn: 24102

f0b77f9a

Oct 29, 2005
- Make -time-passes output prettier · 85b184b2
  Chris Lattner authored Oct 29, 2005
```
llvm-svn: 24096
```
  85b184b2