Commits · 83b821b584e0a4017b76337354ca3306824db8e2 · Roger Ferrer / llvm-epi-0.8

Aug 22, 2005

Speed up this loop a bit, based on some observations that Nate made, and · 83b821b5
Chris Lattner authored Aug 22, 2005
```
add some comments.  This loop really needs to be reevaluated!

llvm-svn: 22966
```
83b821b5

Add a fast-path for register values. Add support for constant pool entries, · 92626b9b

Chris Lattner authored Aug 22, 2005

allowing us to compile this:

float %test2(float* %P) {
        %Q = load float* %P
        %R = add float %Q, 10.1
        ret float %R
}

to this:

_test2:
        lfs r2, 0(r3)
        lis r3, ha16(.CPI_test2_0)
        lfs r3, lo16(.CPI_test2_0)(r3)
        fadds f1, r2, r3
        blr

llvm-svn: 22962

92626b9b

add anew method · 466fecee
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22957
```
466fecee

Aug 21, 2005

Add support for frame index nodes · 48663569
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22956
```
48663569
add a method · 0548f505
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22955
```
0548f505
add a method · 707b39fb
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22949
```
707b39fb
Add support for basic blocks, fix a bug in result # computation · 154b2bc5
Chris Lattner authored Aug 21, 2005
```
llvm-svn: 22948
```
154b2bc5

When legalizing brcond ->brcc or select -> selectcc, make sure to truncate · 539c3fa8

Chris Lattner authored Aug 21, 2005

the old condition to a one bit value.  The incoming value must have been
promoted, and the top bits are undefined.  This causes us to generate:

_test:
        rlwinm r2, r3, 0, 31, 31
        li r3, 17
        cmpwi cr0, r2, 0
        bne .LBB_test_2 ;
.LBB_test_1:    ;
        li r3, 1
.LBB_test_2:    ;
        blr

instead of:

_test:
        rlwinm r2, r3, 0, 31, 31
        li r2, 17
        cmpwi cr0, r3, 0
        bne .LBB_test_2 ;
.LBB_test_1:    ;
        li r2, 1
.LBB_test_2:    ;
        or r3, r2, r2
        blr

for:

int %test(bool %c) {
        %retval = select bool %c, int 17, int 1
        ret int %retval
}

llvm-svn: 22947

539c3fa8

Aug 20, 2005
- fix bogus warning · 4b08ba26
  Chris Lattner authored Aug 20, 2005
```
llvm-svn: 22943
```
  4b08ba26
- Add support for global address nodes · 319e6569
  Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22940
```
  319e6569
- Add support for TargetGlobalAddress nodes · 1be7edde
  Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22938
```
  1be7edde
Aug 19, 2005

Implement CopyFromReg, TokenFactor, and fix a bug in CopyToReg. This allows · 6d7f814b

Chris Lattner authored Aug 19, 2005

us to compile stuff like this:

double %test(double %A, double %B, double %C, double %E) {
        %F = mul double %A, %A
        %G = add double %F, %B
        %H = sub double -0.0, %G
        %I = mul double %H, %C
        %J = add double %I, %E
        ret double %J
}

to:

_test:
        fnmadd f0, f1, f1, f2
        fmadd f1, f0, f3, f4
        blr

woot!

llvm-svn: 22937

6d7f814b

Fix a bug in previous commit · 0875d1ab
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22936
```
0875d1ab
Print physreg register nodes with target names (e.g. F1) instead of numbers · 4990335e
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22934
```
4990335e

Before implementing copyfromreg, we'll implement copytoreg correctly. · 78b200eb

Chris Lattner authored Aug 19, 2005

This gets us this for the previous testcase:

_test:
        lis r2, 0
        ori r3, r2, 65535
        blr

Note that we actually write to r3 (the return reg) correctly now :)

llvm-svn: 22933

78b200eb

Now that we have operand info for machine instructions, use it to create · cc3035e9

Chris Lattner authored Aug 19, 2005

temporary registers for things that define a register.  This allows dag->dag
isel to compile this:

int %test() { ret int 65535 }

into:

_test:
        lis r2, 0
        ori r2, r2, 65535
        blr

Next up, getting CopyFromReg to work, allowing arguments and cross-bb values.

llvm-svn: 22932

cc3035e9

Fix VC++ constant truncation warning. · 486e36cf
Jeff Cohen authored Aug 19, 2005
```
llvm-svn: 22907
```
486e36cf
Fix VC++ precedence warning. · d1f22b12
Jeff Cohen authored Aug 19, 2005
```
llvm-svn: 22902
```
d1f22b12
Fix computation of # operands, add a temporary hack for CopyToReg · d18beab9
Chris Lattner authored Aug 19, 2005
```
llvm-svn: 22896
```
d18beab9

Aug 18, 2005
- add a new -view-sched-dags option to view dags as they are sent to the scheduler. · 0c8c2c10
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22878
```
  0c8c2c10
- Implement the first chunk of a code emitter. This is sophisticated enough to · d342de9a
  Chris Lattner authored Aug 18, 2005
```
codegen:

_empty:
.LBB_empty_0:   ;
        blr

but can't do anything more (yet). :)

llvm-svn: 22876
```
  d342de9a
- new file, obviously just a stub · 1b4727de
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22868
```
  1b4727de
- Enable critical edge splitting by default · 1a908c89
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22863
```
  1a908c89
- Add support for target DAG nodes that take 4 operands, such as PowerPC's · 19a271a6
  Nate Begeman authored Aug 18, 2005
```
rlwinm.

llvm-svn: 22856
```
  19a271a6
- Fix printing of VTSDNodes · 802080d8
  Chris Lattner authored Aug 18, 2005
```
llvm-svn: 22853
```
  802080d8
Aug 17, 2005

Move the code dependency for MathExtras.h from SelectionDAGNodes.h. · d66e6165
Jim Laskey authored Aug 17, 2005
```
Added some class dividers in SelectionDAG.cpp.

llvm-svn: 22841
```
d66e6165
Culling out use of unions for converting FP to bits and vice versa. · b74c6661
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22838
```
b74c6661
Fix a bug in RemoveDeadNodes where it would crash when its "optional" · ab0de9d7
Chris Lattner authored Aug 17, 2005
```
argument is not specified.

Implement ReplaceAllUsesWith.

llvm-svn: 22834
```
ab0de9d7
Switched to using BitsToDouble for int_to_float to avoid aliasing problem. · 686d6a1c
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22831
```
686d6a1c
Change hex float constants for the sake of VC++. · 898ba557
Jim Laskey authored Aug 17, 2005
```
llvm-svn: 22828
```
898ba557

Add a new beta option for critical edge splitting, to avoid a problem that · c9950c11

Chris Lattner authored Aug 17, 2005

Nate noticed in yacr2 (and I know occurs in other places as well).

This is still rough, as the critical edge blocks are not intelligently placed
but is added to get some idea to see if this improves performance.

llvm-svn: 22825

c9950c11

Fix a regression on X86, where FP values can be promoted too. · ba28c273
Chris Lattner authored Aug 17, 2005
```
llvm-svn: 22822
```
ba28c273

· f2516a91

Jim Laskey authored Aug 17, 2005

Added generic code expansion for [signed|unsigned] i32 to [f32|f64] casts in the
legalizer.  PowerPC now uses this expansion instead of ISel version.

Example:

// signed integer to double conversion
double f1(signed x) {
  return (double)x;
}

// unsigned integer to double conversion
double f2(unsigned x) {
  return (double)x;
}

// signed integer to float conversion
float f3(signed x) {
  return (float)x;
}

// unsigned integer to float conversion
float f4(unsigned x) {
  return (float)x;
}


Byte Code:

internal fastcc double %_Z2f1i(int %x) {
entry:
        %tmp.1 = cast int %x to double          ; <double> [#uses=1]
        ret double %tmp.1
}

internal fastcc double %_Z2f2j(uint %x) {
entry:
        %tmp.1 = cast uint %x to double         ; <double> [#uses=1]
        ret double %tmp.1
}

internal fastcc float %_Z2f3i(int %x) {
entry:
        %tmp.1 = cast int %x to float           ; <float> [#uses=1]
        ret float %tmp.1
}

internal fastcc float %_Z2f4j(uint %x) {
entry:
        %tmp.1 = cast uint %x to float          ; <float> [#uses=1]
        ret float %tmp.1
}

internal fastcc double %_Z2g1i(int %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.2 = cast int %x to uint            ; <uint> [#uses=1]
        %tmp.3 = xor uint %tmp.2, 2147483648            ; <uint> [#uses=1]
        %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %tmp.3, uint* %tmp.5
        %tmp.9 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.10 = load double* %tmp.9           ; <double> [#uses=1]
        %tmp.13 = load double* cast (long* %signed_bias to double*)             ; <double> [#uses=1]
        %tmp.14 = sub double %tmp.10, %tmp.13           ; <double> [#uses=1]
        ret double %tmp.14
}

internal fastcc double %_Z2g2j(uint %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %x, uint* %tmp.1
        %tmp.4 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.5 = load double* %tmp.4            ; <double> [#uses=1]
        %tmp.8 = load double* cast (long* %unsigned_bias to double*)            ; <double> [#uses=1]
        %tmp.9 = sub double %tmp.5, %tmp.8              ; <double> [#uses=1]
        ret double %tmp.9
}

internal fastcc float %_Z2g3i(int %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.2 = cast int %x to uint            ; <uint> [#uses=1]
        %tmp.3 = xor uint %tmp.2, 2147483648            ; <uint> [#uses=1]
        %tmp.5 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %tmp.3, uint* %tmp.5
        %tmp.9 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.10 = load double* %tmp.9           ; <double> [#uses=1]
        %tmp.13 = load double* cast (long* %signed_bias to double*)             ; <double> [#uses=1]
        %tmp.14 = sub double %tmp.10, %tmp.13           ; <double> [#uses=1]
        %tmp.16 = cast double %tmp.14 to float          ; <float> [#uses=1]
        ret float %tmp.16
}

internal fastcc float %_Z2g4j(uint %x) {
entry:
        %buffer = alloca [2 x uint]             ; <[2 x uint]*> [#uses=3]
        %tmp.0 = getelementptr [2 x uint]* %buffer, int 0, int 0                ; <uint*> [#uses=1]
        store uint 1127219200, uint* %tmp.0
        %tmp.1 = getelementptr [2 x uint]* %buffer, int 0, int 1                ; <uint*> [#uses=1]
        store uint %x, uint* %tmp.1
        %tmp.4 = cast [2 x uint]* %buffer to double*            ; <double*> [#uses=1]
        %tmp.5 = load double* %tmp.4            ; <double> [#uses=1]
        %tmp.8 = load double* cast (long* %unsigned_bias to double*)            ; <double> [#uses=1]
        %tmp.9 = sub double %tmp.5, %tmp.8              ; <double> [#uses=1]
        %tmp.11 = cast double %tmp.9 to float           ; <float> [#uses=1]
        ret float %tmp.11
}


PowerPC Code:

        .machine ppc970


        .const
        .align  2
.CPIl1__Z2f1i_0:                                        ; float 0x4330000080000000
        .long   1501560836      ; float 4.5036e+15
        .text
        .align  2
        .globl  l1__Z2f1i
l1__Z2f1i:
.LBBl1__Z2f1i_0:        ; entry
        xoris r2, r3, 32768
        stw r2, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl1__Z2f1i_0)
        lfs f1, lo16(.CPIl1__Z2f1i_0)(r2)
        fsub f1, f0, f1
        blr


        .const
        .align  2
.CPIl2__Z2f2j_0:                                        ; float 0x4330000000000000
        .long   1501560832      ; float 4.5036e+15
        .text
        .align  2
        .globl  l2__Z2f2j
l2__Z2f2j:
.LBBl2__Z2f2j_0:        ; entry
        stw r3, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl2__Z2f2j_0)
        lfs f1, lo16(.CPIl2__Z2f2j_0)(r2)
        fsub f1, f0, f1
        blr


        .const
        .align  2
.CPIl3__Z2f3i_0:                                        ; float 0x4330000080000000
        .long   1501560836      ; float 4.5036e+15
        .text
        .align  2
        .globl  l3__Z2f3i
l3__Z2f3i:
.LBBl3__Z2f3i_0:        ; entry
        xoris r2, r3, 32768
        stw r2, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl3__Z2f3i_0)
        lfs f1, lo16(.CPIl3__Z2f3i_0)(r2)
        fsub f0, f0, f1
        frsp f1, f0
        blr


        .const
        .align  2
.CPIl4__Z2f4j_0:                                        ; float 0x4330000000000000
        .long   1501560832      ; float 4.5036e+15
        .text
        .align  2
        .globl  l4__Z2f4j
l4__Z2f4j:
.LBBl4__Z2f4j_0:        ; entry
        stw r3, -4(r1)
        lis r2, 17200
        stw r2, -8(r1)
        lfd f0, -8(r1)
        lis r2, ha16(.CPIl4__Z2f4j_0)
        lfs f1, lo16(.CPIl4__Z2f4j_0)(r2)
        fsub f0, f0, f1
        frsp f1, f0
        blr

llvm-svn: 22814

f2516a91

add a new TargetConstant node · 0d2456e1
Chris Lattner authored Aug 17, 2005
```
llvm-svn: 22813
```
0d2456e1

Aug 16, 2005

Eliminate the RegSDNode class, which 3 nodes (CopyFromReg/CopyToReg/ImplicitDef) · 33182325

Chris Lattner authored Aug 16, 2005

used to tack a register number onto the node.

Instead of doing this, make a new node, RegisterSDNode, which is a leaf
containing a register number.  These three operations just become normal
DAG nodes now, instead of requiring special handling.

Note that with this change, it is no longer correct to make illegal
CopyFromReg/CopyToReg nodes.  The legalizer will not touch them, and this
is bad, so don't do it. :)

llvm-svn: 22806

33182325

Implement BR_CC and BRTWOWAY_CC. This allows the removal of a rather nasty · 371e4951
Nate Begeman authored Aug 16, 2005
```
fixme from the PowerPC backend.  Emit slightly better code for legalizing
select_cc.

llvm-svn: 22805
```
371e4951

Allow passing a dag into dump and getOperationName. If one is available · bc892265

Chris Lattner authored Aug 16, 2005

when printing a node, use it to render target operations with their
target instruction name instead of "<<unknown>>".

llvm-svn: 22804

bc892265

Use a extant helper to do this. · 7e57d18b
Chris Lattner authored Aug 16, 2005
```
llvm-svn: 22802
```
7e57d18b
Add some methods for dag->dag isel. · 1973278b
Chris Lattner authored Aug 16, 2005
```
Split RemoveNodeFromCSEMaps out of DeleteNodesIfDead to do it.

llvm-svn: 22801
```
1973278b

Aug 14, 2005

Fix last night's PPC32 regressions by · d5e739dc

Nate Begeman authored Aug 14, 2005

1. Not selecting the false value of a select_cc in the false arm, which
   isn't legal for nested selects.
2. Actually returning the node we created and Legalized in the FP_TO_UINT
   Expander.

llvm-svn: 22789

d5e739dc