Commits · 4302e8fb67b9a705b75aa7971f61bf6038a64bc6 · Roger Ferrer / llvm-epi-0.8

May 16, 2006

Switch the PPC backend over to using FORMAL_ARGUMENTS for formal argument · 4302e8fb

Chris Lattner authored May 16, 2006

handling.  This makes the lower argument code significantly simpler (we
only need to handle legal argument types).

Incidentally, this also implements support for vector argument registers,
so long as they are not on the stack.

llvm-svn: 28331

4302e8fb

Apr 18, 2006

Implement an important entry from README_ALTIVEC: · 9754d142

Chris Lattner authored Apr 18, 2006

If an altivec predicate compare is used immediately by a branch, don't
use a (serializing) MFCR instruction to read the CR6 register, which requires
a compare to get it back to CR's.  Instead, just branch on CR6 directly. :)

For example, for:
void foo2(vector float *A, vector float *B) {
  if (!vec_any_eq(*A, *B))
    *B = (vector float){0,0,0,0};
}

We now generate:

_foo2:
        mfspr r2, 256
        oris r5, r2, 12288
        mtspr 256, r5
        lvx v2, 0, r4
        lvx v3, 0, r3
        vcmpeqfp. v2, v3, v2
        bne cr6, LBB1_2 ; UnifiedReturnBlock
LBB1_1: ; cond_true
        vxor v2, v2, v2
        stvx v2, 0, r4
        mtspr 256, r2
        blr
LBB1_2: ; UnifiedReturnBlock
        mtspr 256, r2
        blr

instead of:

_foo2:
        mfspr r2, 256
        oris r5, r2, 12288
        mtspr 256, r5
        lvx v2, 0, r4
        lvx v3, 0, r3
        vcmpeqfp. v2, v3, v2
        mfcr r3, 2
        rlwinm r3, r3, 27, 31, 31
        cmpwi cr0, r3, 0
        beq cr0, LBB1_2 ; UnifiedReturnBlock
LBB1_1: ; cond_true
        vxor v2, v2, v2
        stvx v2, 0, r4
        mtspr 256, r2
        blr
LBB1_2: ; UnifiedReturnBlock
        mtspr 256, r2
        blr

This implements CodeGen/PowerPC/vec_br_cmp.ll.

llvm-svn: 27804

9754d142

Apr 12, 2006

Rename get_VSPLI_elt -> get_VSPLTI_elt · 74cf9ff7

Chris Lattner authored Apr 12, 2006

Canonicalize BUILD_VECTOR's that match VSPLTI's into a single type for each
form, eliminating a bunch of Pat patterns in the .td file and allowing us to
CSE stuff more aggressively.  This implements
PowerPC/buildvec_canonicalize.ll:VSPLTI

llvm-svn: 27614

74cf9ff7

Apr 08, 2006
- Change the interface to the predicate that determines if vsplti* can be used. · d71a1f94
  Chris Lattner authored Apr 08, 2006
```
No functionality changes.

llvm-svn: 27536
```
  d71a1f94
Apr 07, 2006
- Match vpku[hw]um(x,x). · a4bbfaed
  Chris Lattner authored Apr 06, 2006
```
Convert vsldoi(x,x) to work the same way other (x,x) cases work.

llvm-svn: 27467
```
  a4bbfaed
- Add support for matching vmrg(x,x) patterns · f38e0332
  Chris Lattner authored Apr 06, 2006
```
llvm-svn: 27463
```
  f38e0332
Apr 06, 2006
- Pattern match vmrg* instructions, which are now lowered by the CFE into shuffles. · d1dcb520
  Chris Lattner authored Apr 06, 2006
```
llvm-svn: 27457
```
  d1dcb520
- Support pattern matching vsldoi(x,y) and vsldoi(x,x), which allows the f.e. to · 1d338191
  Chris Lattner authored Apr 06, 2006
```
lower it and LLVM to have one fewer intrinsic.  This implements
CodeGen/PowerPC/vec_shuffle.ll

llvm-svn: 27450
```
  1d338191
- Compile the vpkuhum/vpkuwum intrinsics into vpkuhum/vpkuwum instead of into · e8b83b42
  Chris Lattner authored Apr 06, 2006
```
vperm with a perm mask lvx'd from the constant pool.

llvm-svn: 27448
```
  e8b83b42
Apr 04, 2006

Ask legalize to promote all vector shuffles to be v16i8 instead of having to · 95c7adc7

Chris Lattner authored Apr 04, 2006

handle all 4 PPC vector types.   This simplifies the matching code and allows
us to eliminate a bunch of patterns.  This also adds cases we were missing,
such as CodeGen/PowerPC/vec_splat.ll:splat_h.

llvm-svn: 27400

95c7adc7

Apr 02, 2006
- Inform the dag combiner that the predicate compares only return a low bit. · c5287c0e
  Chris Lattner authored Apr 02, 2006
```
llvm-svn: 27359
```
  c5287c0e
Mar 31, 2006
- Lower vector compares to VCMP nodes, just like we lower vector comparison · d7495ae7
  Chris Lattner authored Mar 31, 2006
```
predicates to VCMPo nodes.

llvm-svn: 27285
```
  d7495ae7
Mar 28, 2006
- Use normal lvx for scalar_to_vector instead of lve*x. They do the exact · cb5ec07c
  Chris Lattner authored Mar 28, 2006
```
same thing and we have a dag node for the former.

llvm-svn: 27205
```
  cb5ec07c
Mar 26, 2006
- Codegen vector predicate compares. · 6961fc76
  Chris Lattner authored Mar 26, 2006
```
llvm-svn: 27151
```
  6961fc76
- Remove PPC:isZeroVector, use ISD::isBuildVectorAllZeros instead · b1ddc988
  Evan Cheng authored Mar 26, 2006
```
llvm-svn: 27149
```
  b1ddc988
Mar 25, 2006

Codegen things like: · 2771e2c9

Chris Lattner authored Mar 25, 2006

 <int -1, int -1, int -1, int -1>
and
 <int 65537, int 65537, int 65537, int 65537>

Using things like:
  vspltisb v0, -1
and:
  vspltish v0, 1

instead of using constant pool loads.

This implements CodeGen/PowerPC/vec_splat.ll:splat_imm_i{32|16}.

llvm-svn: 27106

2771e2c9

Mar 24, 2006
- add support for using vxor to build zero vectors. This implements · ab882abc
  Chris Lattner authored Mar 24, 2006
```
Regression/CodeGen/PowerPC/vec_zero.ll

llvm-svn: 27059
```
  ab882abc
Mar 22, 2006

When possible, custom lower 32-bit SINT_TO_FP to this: · 4a66d694

Chris Lattner authored Mar 22, 2006

_foo2:
        extsw r2, r3
        std r2, -8(r1)
        lfd f0, -8(r1)
        fcfid f0, f0
        frsp f1, f0
        blr

instead of this:

_foo2:
        lis r2, ha16(LCPI2_0)
        lis r4, 17200
        xoris r3, r3, 32768
        stw r3, -4(r1)
        stw r4, -8(r1)
        lfs f0, lo16(LCPI2_0)(r2)
        lfd f1, -8(r1)
        fsub f0, f1, f0
        frsp f1, f0
        blr

This speeds up Misc/pi from 2.44s->2.09s with LLC and from 3.01->2.18s
with llcbeta (16.7% and 38.1% respectively).

llvm-svn: 26943

4a66d694

Mar 20, 2006
- fix duplicate definition errors · ffc47568
  Chris Lattner authored Mar 20, 2006
```
llvm-svn: 26896
```
  ffc47568
- Check in some intermediate code that adds a skeleton for matching vsplt* · 382f356b
  Chris Lattner authored Mar 20, 2006
```
instructions

llvm-svn: 26894
```
  382f356b
- Custom lower arbitrary VECTOR_SHUFFLE's to VPERM. · a8713b1e
  Chris Lattner authored Mar 20, 2006
```
TODO: leave specific ones as VECTOR_SHUFFLE's and turn them into specialized
operations like vsplt*

llvm-svn: 26887
```
  a8713b1e
Mar 19, 2006
- Custom lower SCALAR_TO_VECTOR into lve*x. · 7e9440a4
  Chris Lattner authored Mar 19, 2006
```
llvm-svn: 26868
```
  7e9440a4
Mar 14, 2006
- Added getTargetLowering() to TargetMachine. Refactored targets to support this. · 2dd2c652
  Evan Cheng authored Mar 13, 2006
```
llvm-svn: 26742
```
  2dd2c652
Mar 01, 2006

Compile this: · 27f5345b

Chris Lattner authored Mar 01, 2006

void foo(float a, int *b) { *b = a; }

to this:

_foo:
        fctiwz f0, f1
        stfiwx f0, 0, r4
        blr

instead of this:

_foo:
        fctiwz f0, f1
        stfd f0, -8(r1)
        lwz r2, -4(r1)
        stw r2, 0(r4)
        blr

This implements CodeGen/PowerPC/stfiwx.ll, and also incidentally does the
right thing for GCC bugzilla 26505.

llvm-svn: 26447

27f5345b

Use a target-specific dag-combine to implement CodeGen/PowerPC/fp-int-fp.ll. · f4184358
Chris Lattner authored Mar 01, 2006
```
llvm-svn: 26445
```
f4184358

Feb 22, 2006
- split register class handling from explicit physreg handling. · 7ad77dfc
  Chris Lattner authored Feb 22, 2006
```
llvm-svn: 26308
```
  7ad77dfc
- Updates to match change of getRegForInlineAsmConstraint prototype · 7bb4696d
  Chris Lattner authored Feb 21, 2006
```
llvm-svn: 26305
```
  7bb4696d
Feb 07, 2006
- Implement getConstraintType for PPC. · 203b2f12
  Chris Lattner authored Feb 07, 2006
```
llvm-svn: 26042
```
  203b2f12
- Add the simple PPC integer constraints · 15a6c4c4
  Chris Lattner authored Feb 07, 2006
```
llvm-svn: 26027
```
  15a6c4c4
Jan 31, 2006
- add info about the inline asm register constraints for PPC · 0151361d
  Chris Lattner authored Jan 31, 2006
```
llvm-svn: 25853
```
  0151361d
Jan 28, 2006
- Use PPCISD::CALL instead of ISD::CALL · f424a665
  Chris Lattner authored Jan 27, 2006
```
llvm-svn: 25717
```
  f424a665
Jan 27, 2006
- Make llvm.frame/returnaddr not crash on ppc · 4d967a4c
  Chris Lattner authored Jan 27, 2006
```
llvm-svn: 25710
```
  4d967a4c
- Remove TLI.LowerReturnTo, and just let targets custom lower ISD::RET for · 8c47c3a3
  Nate Begeman authored Jan 27, 2006
```
the same functionality.  This addresses another piece of bug 680.  Next,
on to fixing Alpha VAARG, which I broke last time.

llvm-svn: 25696
```
  8c47c3a3
Jan 25, 2006

First part of bug 680: · e74795cd

Nate Begeman authored Jan 25, 2006

Remove TLI.LowerVA* and replace it with SDNodes that are lowered the same
way as everything else.

llvm-svn: 25606

e74795cd

Jan 10, 2006
- Give PPCISD:: nodes legible names in dumps. · 347ed8a5
  Chris Lattner authored Jan 09, 2006
```
llvm-svn: 25166
```
  347ed8a5
Dec 20, 2005
- Pattern-match return. Includes gross hack! · b11b8e44
  Nate Begeman authored Dec 20, 2005
```
llvm-svn: 24874
```
  b11b8e44
Dec 13, 2005
- Prepare support for AltiVec multiply, divide, and sqrt. · 69caef2b
  Nate Begeman authored Dec 13, 2005
```
llvm-svn: 24700
```
  69caef2b
Dec 06, 2005

Use new PPC-specific nodes to represent shifts which require the 6-bit · fea33f7e

Chris Lattner authored Dec 06, 2005

amount handling that PPC provides.  These are generated by the lowering code
and prevents the dag combiner from assuming (rightfully) that the shifts
don't only look at 5 bits.  This fixes a miscompilation of crafty with
the new front-end.

llvm-svn: 24615

fea33f7e

Nov 17, 2005

Add an initial hack at legalizing GlobalAddress into the appropriate nodes · 595088aa

Chris Lattner authored Nov 17, 2005

on Darwin to remove smarts from the isel.  This is currently disabled by
default (uncomment setOperationAction(ISD::GlobalAddress to enable it).
tblgen needs to become smarter about tglobaladdr nodes and bigger patterns
needed to be added to the .td file.  However, we can currently emit stuff like
this:  :)

        li r2, lo16(L_x$non_lazy_ptr)
        lis r3, ha16(L_x$non_lazy_ptr)
        lwzx r2, r3, r2

The obvious improvements will follow.

llvm-svn: 24390

595088aa

Oct 19, 2005

Add the ability to lower return instructions to TargetLowering. This · 78afac2d

Nate Begeman authored Oct 18, 2005

allows us to lower legal return types to something else, to meet ABI
requirements (such as that i64 be returned in two i32 regs on Darwin/ppc).

llvm-svn: 23802

78afac2d