Commits · cdc0cbbcd091fa70ebe22c2bb0a0474d6a40bbd5 · Roger Ferrer / llvm-epi-0.8

Aug 24, 2005

Simplify this code by using higher-level LiveVariables methods · 77415823
Chris Lattner authored Aug 23, 2005
```
llvm-svn: 22989
```
77415823

Keep track of which registers are related to which other registers. · 22e91cc3

Chris Lattner authored Aug 23, 2005

Use this information to avoid doing expensive interval intersections for
registers that could not possible be interesting.  This speeds up linscan
on ia64 compiling kc++ in release mode from taking 7.82s to 4.8s(!), total
itanium llc time on this program is 27.3s now.  This marginally speeds up
PPC and X86, but they appear to be limited by other parts of linscan, not
this code.

On this program, on itanium, live intervals now takes 41% of llc time.

llvm-svn: 22986

22e91cc3

Aug 23, 2005
- Teach the SelectionDAG how to transform select_cc eq, X, 0, 1, 0 into · bf8c3939
  Nate Begeman authored Aug 23, 2005
```
either seteq X, 0 or srl (ctlz X), size(X-1), depending on what's legal
for the target.

llvm-svn: 22978
```
  bf8c3939
- Teach Legalize how to turn setcc into select_cc · 987121a6
  Nate Begeman authored Aug 23, 2005
```
llvm-svn: 22977
```
  987121a6
Aug 22, 2005

Try to avoid scanning the fixed list. On architectures with a non-stupid · 834a2316

Chris Lattner authored Aug 22, 2005

number of regs (e.g. most riscs), many functions won't need to use callee
clobbered registers.  Do a speculative check to see if we can get a free
register without processing the fixed list (which has all of these).  This
saves a lot of time on machines with lots of callee clobbered regs (e.g.
ppc and itanium, also x86).

This reduces ppc llc compile time from 184s -> 172s on kc++.  This is probably
worth FAR FAR more on itanium though.

llvm-svn: 22972

834a2316

Move some code in the register assignment case that only needs to happen if · 95a157ae

Chris Lattner authored Aug 22, 2005

we spill out of the fast path.  The scan of active_ and the calls to
updateSpillWeights don't need to happen unless a spill occurs.  This reduces
debug llc time of kc++ with ppc from 187.3s to 183.2s.

llvm-svn: 22971

95a157ae

Fix a problem where constant expr shifts would not have their shift amount · 7f9e078d
Chris Lattner authored Aug 22, 2005
```
promoted to the right type.  This fixes: IA64/2005-08-22-LegalizerCrash.ll

llvm-svn: 22969
```
7f9e078d
Speed up this loop a bit, based on some observations that Nate made, and · 83b821b5
Chris Lattner authored Aug 22, 2005
```
add some comments.  This loop really needs to be reevaluated!

llvm-svn: 22966
```
83b821b5

Add a fast-path for register values. Add support for constant pool entries, · 92626b9b

Chris Lattner authored Aug 22, 2005

allowing us to compile this:

float %test2(float* %P) {
        %Q = load float* %P
        %R = add float %Q, 10.1
        ret float %R
}

to this:

_test2:
        lfs r2, 0(r3)
        lis r3, ha16(.CPI_test2_0)
        lfs r3, lo16(.CPI_test2_0)(r3)
        fadds f1, r2, r3
        blr

llvm-svn: 22962

92626b9b

add anew method · 466fecee
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22957
```
466fecee

Aug 21, 2005

Add support for frame index nodes · 48663569
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22956
```
48663569
add a method · 0548f505
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22955
```
0548f505
add a method · 707b39fb
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22949
```
707b39fb
Add support for basic blocks, fix a bug in result # computation · 154b2bc5
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22948
```
154b2bc5

When legalizing brcond ->brcc or select -> selectcc, make sure to truncate · 539c3fa8

Chris Lattner authored Aug 21, 2005

the old condition to a one bit value.  The incoming value must have been
promoted, and the top bits are undefined.  This causes us to generate:

_test:
        rlwinm r2, r3, 0, 31, 31
        li r3, 17
        cmpwi cr0, r2, 0
        bne .LBB_test_2 ;
.LBB_test_1:    ;
        li r3, 1
.LBB_test_2:    ;
        blr

instead of:

_test:
        rlwinm r2, r3, 0, 31, 31
        li r2, 17
        cmpwi cr0, r3, 0
        bne .LBB_test_2 ;
.LBB_test_1:    ;
        li r2, 1
.LBB_test_2:    ;
        or r3, r2, r2
        blr

for:

int %test(bool %c) {
        %retval = select bool %c, int 17, int 1
        ret int %retval
}

llvm-svn: 22947

539c3fa8

Aug 20, 2005
- fix bogus warning · 4b08ba26
  Chris Lattner authored Aug 20, 2005
```
llvm-svn: 22943
```
  4b08ba26
- Add support for global address nodes · 319e6569
  Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22940
```
  319e6569
- Add support for TargetGlobalAddress nodes · 1be7edde
  Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22938
```
  1be7edde
Aug 19, 2005

Implement CopyFromReg, TokenFactor, and fix a bug in CopyToReg. This allows · 6d7f814b

Chris Lattner authored Aug 19, 2005

us to compile stuff like this:

double %test(double %A, double %B, double %C, double %E) {
        %F = mul double %A, %A
        %G = add double %F, %B
        %H = sub double -0.0, %G
        %I = mul double %H, %C
        %J = add double %I, %E
        ret double %J
}

to:

_test:
        fnmadd f0, f1, f1, f2
        fmadd f1, f0, f3, f4
        blr

woot!

llvm-svn: 22937

6d7f814b

Fix a bug in previous commit · 0875d1ab
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22936
```
0875d1ab
Print physreg register nodes with target names (e.g. F1) instead of numbers · 4990335e
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22934
```
4990335e

Before implementing copyfromreg, we'll implement copytoreg correctly. · 78b200eb

Chris Lattner authored Aug 19, 2005

This gets us this for the previous testcase:

_test:
        lis r2, 0
        ori r3, r2, 65535
        blr

Note that we actually write to r3 (the return reg) correctly now :)

llvm-svn: 22933

78b200eb

Now that we have operand info for machine instructions, use it to create · cc3035e9

Chris Lattner authored Aug 19, 2005

temporary registers for things that define a register.  This allows dag->dag
isel to compile this:

int %test() { ret int 65535 }

into:

_test:
        lis r2, 0
        ori r2, r2, 65535
        blr

Next up, getting CopyFromReg to work, allowing arguments and cross-bb values.

llvm-svn: 22932

cc3035e9

Fix VC++ constant truncation warning. · 486e36cf
Jeff Cohen authored Aug 19, 2005
```
llvm-svn: 22907
```
486e36cf
Fix VC++ precedence warning. · d1f22b12
Jeff Cohen authored Aug 19, 2005
```
llvm-svn: 22902
```
d1f22b12
Fix computation of # operands, add a temporary hack for CopyToReg · d18beab9
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22896
```
d18beab9

Aug 18, 2005
- add a new -view-sched-dags option to view dags as they are sent to the scheduler. · 0c8c2c10
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22878
```
  0c8c2c10
- Implement the first chunk of a code emitter. This is sophisticated enough to · d342de9a
  Chris Lattner authored Aug 18, 2005
```
codegen:

_empty:
.LBB_empty_0:   ;
        blr

but can't do anything more (yet). :)

llvm-svn: 22876
```
  d342de9a
- new file, obviously just a stub · 1b4727de
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22868
```
  1b4727de
- Enable critical edge splitting by default · 1a908c89
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22863
```
  1a908c89
- Add support for target DAG nodes that take 4 operands, such as PowerPC's · 19a271a6
  Nate Begeman authored Aug 18, 2005
```
rlwinm.

llvm-svn: 22856
```
  19a271a6
- Fix printing of VTSDNodes · 802080d8
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22853
```
  802080d8
Aug 17, 2005

Move the code dependency for MathExtras.h from SelectionDAGNodes.h. · d66e6165
Jim Laskey authored Aug 17, 2005
```
Added some class dividers in SelectionDAG.cpp.

llvm-svn: 22841
```
d66e6165
Culling out use of unions for converting FP to bits and vice versa. · b74c6661
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22838
```
b74c6661
Fix a bug in RemoveDeadNodes where it would crash when its "optional" · ab0de9d7
Chris Lattner authored Aug 17, 2005
```
argument is not specified.

Implement ReplaceAllUsesWith.

llvm-svn: 22834
```
ab0de9d7
Switched to using BitsToDouble for int_to_float to avoid aliasing problem. · 686d6a1c
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22831
```
686d6a1c
Change hex float constants for the sake of VC++. · 898ba557
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22828
```
898ba557

Add a new beta option for critical edge splitting, to avoid a problem that · c9950c11

Chris Lattner authored Aug 17, 2005

Nate noticed in yacr2 (and I know occurs in other places as well).

This is still rough, as the critical edge blocks are not intelligently placed
but is added to get some idea to see if this improves performance.

llvm-svn: 22825

c9950c11

Fix a regression on X86, where FP values can be promoted too. · ba28c273
Chris Lattner authored Aug 17, 2005
```
llvm-svn: 22822
```
ba28c273

· f2516a91

Jim Laskey authored Aug 17, 2005

Added generic code expansion for [signed|unsigned] i32 to [f32|f64] casts in the
legalizer.  PowerPC now uses this expansion instead of ISel version.

Example:

// signed integer to double conversion
double f1(signed x) {
  return (double)x;
}

// unsigned integer to double conversion
double f2(unsigned x) {
  return (double)x;
}

// signed integer to float conversion
float f3(signed x) {
  return (float)x;
}

// unsigned integer to float conversion
float f4(unsigned x) {
  return (float)x;
}


Byte Code:

internal fastcc double %_Z2f1i(int %x) {
entry:
        %tmp.1 = cast int %x to double          ; <double> [#uses=1]
        ret double %tmp.1
}

internal fastcc double %_Z2f2j(uint %x) {
entry:
        %tmp.1 = cast uint %x to double         ; <double> [#uses=1]
        ret double %tmp.1
}

internal fastcc float %_Z2f3i(int %x) {
entry:
        %tmp.1 = cast int %x to float           ; <float> [#uses=1]
        ret float %tmp.1
}

internal fastcc float %_Z2f4j(uint %x) {
entry:
        %tmp.1 = cast uint %x to float          ; <float> [#uses=1]
        ret float %tmp.1
}

internal fastcc double %_Z2g1i(int %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.2 = cast int %x to uint            ; <uint> [#uses=1]
        %tmp.3 = xor uint %tmp.2, 2147483648            ; <uint> [#uses=1]
        %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %tmp.3, uint* %tmp.5
        %tmp.9 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.10 = load double* %tmp.9           ; <double> [#uses=1]
        %tmp.13 = load double* cast (long* %signed_bias to double*)             ; <double> [#uses=1]
        %tmp.14 = sub double %tmp.10, %tmp.13           ; <double> [#uses=1]
        ret double %tmp.14
}

internal fastcc double %_Z2g2j(uint %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %x, uint* %tmp.1
        %tmp.4 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.5 = load double* %tmp.4            ; <double> [#uses=1]
        %tmp.8 = load double* cast (long* %unsigned_bias to double*)            ; <double> [#uses=1]
        %tmp.9 = sub double %tmp.5, %tmp.8              ; <double> [#uses=1]
        ret double %tmp.9
}

internal fastcc float %_Z2g3i(int %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.2 = cast int %x to uint            ; <uint> [#uses=1]
        %tmp.3 = xor uint %tmp.2, 2147483648            ; <uint> [#uses=1]
        %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %tmp.3, uint* %tmp.5
        %tmp.9 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.10 = load double* %tmp.9           ; <double> [#uses=1]
        %tmp.13 = load double* cast (long* %signed_bias to double*)             ; <double> [#uses=1]
        %tmp.14 = sub double %tmp.10, %tmp.13           ; <double> [#uses=1]
        %tmp.16 = cast double %tmp.14 to float          ; <float> [#uses=1]
        ret float %tmp.16
}

internal fastcc float %_Z2g4j(uint %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %x, uint* %tmp.1
        %tmp.4 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.5 = load double* %tmp.4            ; <double> [#uses=1]
        %tmp.8 = load double* cast (long* %unsigned_bias to double*)            ; <double> [#uses=1]
        %tmp.9 = sub double %tmp.5, %tmp.8              ; <double> [#uses=1]
        %tmp.11 = cast double %tmp.9 to float           ; <float> [#uses=1]
        ret float %tmp.11
}


PowerPC Code:

        .machine ppc970


        .const
        .align  2
.CPIl1__Z2f1i_0:                                        ; float 0x4330000080000000
        .long   1501560836      ; float 4.5036e+15
        .text
        .align  2
        .globl  l1__Z2f1i
l1__Z2f1i:
.LBBl1__Z2f1i_0:        ; entry
        xoris r2, r3, 32768
        stw r2, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl1__Z2f1i_0)
        lfs f1, lo16(.CPIl1__Z2f1i_0)(r2)
        fsub f1, f0, f1
        blr


        .const
        .align  2
.CPIl2__Z2f2j_0:                                        ; float 0x4330000000000000
        .long   1501560832      ; float 4.5036e+15
        .text
        .align  2
        .globl  l2__Z2f2j
l2__Z2f2j:
.LBBl2__Z2f2j_0:        ; entry
        stw r3, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl2__Z2f2j_0)
        lfs f1, lo16(.CPIl2__Z2f2j_0)(r2)
        fsub f1, f0, f1
        blr


        .const
        .align  2
.CPIl3__Z2f3i_0:                                        ; float 0x4330000080000000
        .long   1501560836      ; float 4.5036e+15
        .text
        .align  2
        .globl  l3__Z2f3i
l3__Z2f3i:
.LBBl3__Z2f3i_0:        ; entry
        xoris r2, r3, 32768
        stw r2, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl3__Z2f3i_0)
        lfs f1, lo16(.CPIl3__Z2f3i_0)(r2)
        fsub f0, f0, f1
        frsp f1, f0
        blr


        .const
        .align  2
.CPIl4__Z2f4j_0:                                        ; float 0x4330000000000000
        .long   1501560832      ; float 4.5036e+15
        .text
        .align  2
        .globl  l4__Z2f4j
l4__Z2f4j:
.LBBl4__Z2f4j_0:        ; entry
        stw r3, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl4__Z2f4j_0)
        lfs f1, lo16(.CPIl4__Z2f4j_0)(r2)
        fsub f0, f0, f1
        frsp f1, f0
        blr

llvm-svn: 22814

f2516a91