Apr 20, 2006

- Chris Lattner (llvm-svn: 27900)
- Chris Lattner (llvm-svn: 27895)
- Chris Lattner (llvm-svn: 27885)
- Evan Cheng: ...to a vector shuffle. VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of a vector shuffle with a zero (or any splat) vector. (llvm-svn: 27875)
- Chris Lattner: ...CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll (llvm-svn: 27868)
- Evan Cheng: ...but i64 is not. If possible, change an i64 op to an f64 (e.g. load, constant) and then cast it back. (llvm-svn: 27849)
- Evan Cheng (llvm-svn: 27847)
- Evan Cheng: ...instructions. Fixed a commuted vector_shuffle bug. (llvm-svn: 27845)
Apr 19, 2006

- Evan Cheng (llvm-svn: 27844)
- Evan Cheng (llvm-svn: 27843)
- Evan Cheng: ...Added more movhlps and movlhps patterns. (llvm-svn: 27842)
- Evan Cheng (llvm-svn: 27840)
- Evan Cheng (llvm-svn: 27836)
- Evan Cheng: ...Increase the cost (complexity) of patterns which match mov{h|l}ps ops. These are preferred over shufps in most cases. (llvm-svn: 27835)
- Evan Cheng (llvm-svn: 27834)
- Chris Lattner (llvm-svn: 27832)
- Chris Lattner (llvm-svn: 27828)
- Chris Lattner (llvm-svn: 27827)
Apr 18, 2006

- Evan Cheng: ...Fixed a PINSRWrmi encoding bug. (llvm-svn: 27818)
- Evan Cheng (llvm-svn: 27817)
- Evan Cheng (llvm-svn: 27816)
- Evan Cheng (llvm-svn: 27815)
- Evan Cheng (llvm-svn: 27814)
- Evan Cheng (llvm-svn: 27813)
- Chris Lattner (llvm-svn: 27810)
- Chris Lattner (llvm-svn: 27809)
- Chris Lattner: ...

      void foo2(vector float *A, vector float *B) {
        vector float C = (vector float)vec_cmpeq(*A, *B);
        if (!vec_any_eq(*A, *B))
          *B = (vector float){0,0,0,0};
        *A = C;
      }

  (llvm-svn: 27808)
- Evan Cheng (llvm-svn: 27807)
- Chris Lattner (llvm-svn: 27806)
- Chris Lattner: If an AltiVec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which then requires a compare to get the value back into a condition register. Instead, just branch on CR6 directly. :)

  For example, for:

      void foo2(vector float *A, vector float *B) {
        if (!vec_any_eq(*A, *B))
          *B = (vector float){0,0,0,0};
      }

  we now generate:

      _foo2:
              mfspr r2, 256
              oris r5, r2, 12288
              mtspr 256, r5
              lvx v2, 0, r4
              lvx v3, 0, r3
              vcmpeqfp. v2, v3, v2
              bne cr6, LBB1_2 ; UnifiedReturnBlock
      LBB1_1: ; cond_true
              vxor v2, v2, v2
              stvx v2, 0, r4
              mtspr 256, r2
              blr
      LBB1_2: ; UnifiedReturnBlock
              mtspr 256, r2
              blr

  instead of:

      _foo2:
              mfspr r2, 256
              oris r5, r2, 12288
              mtspr 256, r5
              lvx v2, 0, r4
              lvx v3, 0, r3
              vcmpeqfp. v2, v3, v2
              mfcr r3, 2
              rlwinm r3, r3, 27, 31, 31
              cmpwi cr0, r3, 0
              beq cr0, LBB1_2 ; UnifiedReturnBlock
      LBB1_1: ; cond_true
              vxor v2, v2, v2
              stvx v2, 0, r4
              mtspr 256, r2
              blr
      LBB1_2: ; UnifiedReturnBlock
              mtspr 256, r2
              blr

  This implements CodeGen/PowerPC/vec_br_cmp.ll. (llvm-svn: 27804)
- Chris Lattner (llvm-svn: 27802)
- Chris Lattner: ...to optimize cases where it has to spill a lot. (llvm-svn: 27801)
- Chris Lattner: ...even/odd halves. Thanks to Nate for telling me what's what. (llvm-svn: 27793)
- Chris Lattner: ...

      vmuloub v5, v3, v2
      vmuleub v2, v3, v2
      vperm v2, v2, v5, v4

  This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. (llvm-svn: 27792)
- Evan Cheng (llvm-svn: 27790)
- Chris Lattner: ...

      li r5, lo16(LCPI1_0)
      lis r6, ha16(LCPI1_0)
      lvx v4, r6, r5
      vmulouh v5, v3, v2
      vmuleuh v2, v3, v2
      vperm v2, v2, v5, v4

  where v4 is:

      LCPI1_0: ; <16 x ubyte>
              .byte 2
              .byte 3
              .byte 18
              .byte 19
              .byte 6
              .byte 7
              .byte 22
              .byte 23
              .byte 10
              .byte 11
              .byte 26
              .byte 27
              .byte 14
              .byte 15
              .byte 30
              .byte 31

  This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. (llvm-svn: 27789)
- Chris Lattner: ...scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll. (llvm-svn: 27788)
- Evan Cheng (llvm-svn: 27786)
- Evan Cheng (llvm-svn: 27784)
- Evan Cheng (llvm-svn: 27782)