Apr 18, 2006
    • These are correctly encoded by the JIT. I checked :) · 34c901b5
      Chris Lattner authored
      llvm-svn: 27810
    • add a note · 197d7622
      Chris Lattner authored
      llvm-svn: 27809
    • Fix a crash on: · 518834c6
      Chris Lattner authored
      void foo2(vector float *A, vector float *B) {
        vector float C = (vector float)vec_cmpeq(*A, *B);
        if (!vec_any_eq(*A, *B))
          *B = (vector float){0,0,0,0};
        *A = C;
      }
      
      llvm-svn: 27808
    • pretty print node name · 1e174c87
      Chris Lattner authored
      llvm-svn: 27806
    • Implement an important entry from README_ALTIVEC: · 9754d142
      Chris Lattner authored
      If an altivec predicate compare is used immediately by a branch, don't
      use a (serializing) MFCR instruction to read the CR6 register into a GPR,
      which then requires a compare to get the value back into a CR.  Instead,
      just branch on CR6 directly. :)
      
      For example, for:
      void foo2(vector float *A, vector float *B) {
        if (!vec_any_eq(*A, *B))
          *B = (vector float){0,0,0,0};
      }
      
      We now generate:
      
      _foo2:
              mfspr r2, 256
              oris r5, r2, 12288
              mtspr 256, r5
              lvx v2, 0, r4
              lvx v3, 0, r3
              vcmpeqfp. v2, v3, v2
              bne cr6, LBB1_2 ; UnifiedReturnBlock
      LBB1_1: ; cond_true
              vxor v2, v2, v2
              stvx v2, 0, r4
              mtspr 256, r2
              blr
      LBB1_2: ; UnifiedReturnBlock
              mtspr 256, r2
              blr
      
      instead of:
      
      _foo2:
              mfspr r2, 256
              oris r5, r2, 12288
              mtspr 256, r5
              lvx v2, 0, r4
              lvx v3, 0, r3
              vcmpeqfp. v2, v3, v2
              mfcr r3, 2
              rlwinm r3, r3, 27, 31, 31
              cmpwi cr0, r3, 0
              beq cr0, LBB1_2 ; UnifiedReturnBlock
      LBB1_1: ; cond_true
              vxor v2, v2, v2
              stvx v2, 0, r4
              mtspr 256, r2
              blr
      LBB1_2: ; UnifiedReturnBlock
              mtspr 256, r2
              blr
      
      This implements CodeGen/PowerPC/vec_br_cmp.ll.
      
      llvm-svn: 27804
    • move some stuff around, clean things up · 68c16a20
      Chris Lattner authored
      llvm-svn: 27802
    • Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing · 96d50487
      Chris Lattner authored
      even/odd halves.  Thanks to Nate for telling me what's what.
      
      llvm-svn: 27793
    • Implement v16i8 multiply with this code: · d6d82aa8
      Chris Lattner authored
              vmuloub v5, v3, v2
              vmuleub v2, v3, v2
              vperm v2, v2, v5, v4
      
      This implements CodeGen/PowerPC/vec_mul.ll.  With this, v16i8 multiplies are
      6.79x faster than before.
      
      Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with
      GCC.
      
      Remove the 'integer multiplies' todo from the README file.
      
      llvm-svn: 27792
    • Lower v8i16 multiply into this code: · 7e439874
      Chris Lattner authored
              li r5, lo16(LCPI1_0)
              lis r6, ha16(LCPI1_0)
              lvx v4, r6, r5
              vmulouh v5, v3, v2
              vmuleuh v2, v3, v2
              vperm v2, v2, v5, v4
      
      where v4 is:
      LCPI1_0:                                        ;  <16 x ubyte>
              .byte   2
              .byte   3
              .byte   18
              .byte   19
              .byte   6
              .byte   7
              .byte   22
              .byte   23
              .byte   10
              .byte   11
              .byte   26
              .byte   27
              .byte   14
              .byte   15
              .byte   30
              .byte   31
      
      This is 5.07x faster on the G5 (measured) than lowering to scalar code +
      loads/stores.
      
      llvm-svn: 27789
    • Custom lower v4i32 multiplies into a cute sequence, instead of having legalize · a2cae1bb
      Chris Lattner authored
      scalarize it into 4 mullw's and a bunch of load/store traffic.
      
      This speeds up v4i32 multiplies 4.1x (measured) on a G5.  This implements
      PowerPC/vec_mul.ll
      
      llvm-svn: 27788