- Apr 22, 2006

Evan Cheng authored
Don't do all the lowering stuff for 2-wide build_vectors. Also, a minor optimization for shuffles of undef. llvm-svn: 27946
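
For context, a minimal sketch (not part of the commit) of the kind of 2-wide build_vector this refers to; the function name and intrinsic choice are illustrative assumptions:

  #include <emmintrin.h>

  /* Hypothetical example: constructing a 2 x double vector from two
     scalars shows up as a 2-wide build_vector in the SelectionDAG. */
  __m128d make_pair(double a, double b) {
      return _mm_set_pd(b, a);   /* element 0 = a, element 1 = b */
  }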

Evan Cheng authored
Fix a performance regression. Use {p}shuf* when there are only two distinct elements in a build_vector. llvm-svn: 27945
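
As a hedged illustration (not from the commit) of a build_vector with only two distinct elements, where a single pshufd-style shuffle can replicate the pair; the function name is made up:

  #include <emmintrin.h>

  /* Only two distinct scalars, a and b, appear in this 4-wide vector,
     so it can be built by placing the pair once and shuffling it. */
  __m128i repeat_pair(int a, int b) {
      return _mm_set_epi32(b, a, b, a);   /* elements: a, b, a, b */
  }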

Chris Lattner authored
llvm-svn: 27943

Evan Cheng authored
movd always clears the top 96 bits, and movss does so when it is loading the value from memory. The net result is that codegen for 4-wide shuffles is much improved. It is near optimal if one or more elements are zero, e.g.

  __m128i test(int a, int b) { return _mm_set_epi32(0, 0, b, a); }

compiles to:

  _test:
        movd 8(%esp), %xmm1
        movd 4(%esp), %xmm0
        punpckldq %xmm1, %xmm0
        ret

Compare to gcc:

  _test:
        subl $12, %esp
        movd 20(%esp), %xmm0
        movd 16(%esp), %xmm1
        punpckldq %xmm0, %xmm1
        movq %xmm1, %xmm0
        movhps LC0, %xmm0
        addl $12, %esp
        ret

or icc:

  _test:
        movd 4(%esp), %xmm0      #5.10
        movd 8(%esp), %xmm3      #5.10
        xorl %eax, %eax          #5.10
        movd %eax, %xmm1         #5.10
        punpckldq %xmm1, %xmm0   #5.10
        movd %eax, %xmm2         #5.10
        punpckldq %xmm2, %xmm3   #5.10
        punpckldq %xmm3, %xmm0   #5.10
        ret                      #5.10

There is still room for improvement, for example the FP variant of the above example:

  __m128 test(float a, float b) { return _mm_set_ps(0.0, 0.0, b, a); }

  _test:
        movss 8(%esp), %xmm1
        movss 4(%esp), %xmm0
        unpcklps %xmm1, %xmm0
        xorps %xmm1, %xmm1
        movlhps %xmm1, %xmm0
        ret

The xorps and movlhps are unnecessary. This will require post-legalizer optimization to handle.
llvm-svn: 27939
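
For reference, a self-contained, compilable version of the two snippets quoted above (headers added and the functions renamed here to avoid a symbol clash; the bodies are as in the message):

  #include <xmmintrin.h>   /* SSE:  __m128,  _mm_set_ps    */
  #include <emmintrin.h>   /* SSE2: __m128i, _mm_set_epi32 */

  /* Integer variant: only the two low elements are non-zero. */
  __m128i test_int(int a, int b) {
      return _mm_set_epi32(0, 0, b, a);
  }

  /* FP variant discussed at the end of the message. */
  __m128 test_fp(float a, float b) {
      return _mm_set_ps(0.0, 0.0, b, a);
  }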

Nate Begeman authored
llvm-svn: 27938

Nate Begeman authored
llvm-svn: 27937

- Apr 21, 2006

Chris Lattner authored
llvm-svn: 27935

Chris Lattner authored
llvm-svn: 27934

Evan Cheng authored
scalar value. e.g.

  _mm_set_epi32(0, a, 0, 0);
  ==>
        movd 4(%esp), %xmm0
        pshufd $69, %xmm0, %xmm0

  _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0);
  ==>
        movzbw 4(%esp), %ax
        movzwl %ax, %eax
        pxor %xmm0, %xmm0
        pinsrw $5, %eax, %xmm0

llvm-svn: 27923
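
A compilable restatement of the two intrinsic calls above (the wrapper names and header are added here and are not part of the commit):

  #include <emmintrin.h>

  /* Single non-zero scalar in a 4 x i32 vector: element 2 is a. */
  __m128i set_one_i32(int a) {
      return _mm_set_epi32(0, a, 0, 0);
  }

  /* Single non-zero scalar in a 16 x i8 vector: element 10 is a. */
  __m128i set_one_i8(char a) {
      return _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0,
                          0, 0, 0, 0, 0, 0, 0, 0);
  }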

- Apr 20, 2006

Chris Lattner authored
llvm-svn: 27908

Chris Lattner authored
llvm-svn: 27907

Chris Lattner authored
llvm-svn: 27900

Chris Lattner authored
llvm-svn: 27895

Chris Lattner authored
llvm-svn: 27885

Evan Cheng authored
to a vector shuffle.
- VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffles with a zero (or any splat) vector.
llvm-svn: 27875
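
A hedged sketch of the kind of shuffle against a zero vector this prepares for; the intrinsic-level form and function name below are assumptions, not taken from the commit:

  #include <xmmintrin.h>

  /* Shuffle v against an all-zero vector: keep the two low floats of v
     and fill the two high lanes with zeros. */
  __m128 zero_upper_half(__m128 v) {
      return _mm_shuffle_ps(v, _mm_setzero_ps(), _MM_SHUFFLE(0, 0, 1, 0));
  }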

Chris Lattner authored
CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868

Evan Cheng authored
but i64 is not. If possible, change an i64 op to an f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849

Evan Cheng authored
llvm-svn: 27847

Evan Cheng authored
instructions.
- Fixed a commute vector_shuffle bug.
llvm-svn: 27845

- Apr 19, 2006

Evan Cheng authored
llvm-svn: 27844

Evan Cheng authored
llvm-svn: 27843

Evan Cheng authored
- Added more movhlps and movlhps patterns. llvm-svn: 27842
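
As a hedged illustration of what movhlps and movlhps match at the intrinsic level (the wrapper names are made up for this example):

  #include <xmmintrin.h>

  /* movlhps: the low two floats of b become the high half of the result. */
  __m128 low_to_high(__m128 a, __m128 b) {
      return _mm_movelh_ps(a, b);
  }

  /* movhlps: the high two floats of b become the low half of the result. */
  __m128 high_to_low(__m128 a, __m128 b) {
      return _mm_movehl_ps(a, b);
  }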

Evan Cheng authored
llvm-svn: 27840

Evan Cheng authored
llvm-svn: 27836

Evan Cheng authored
- Increase cost (complexity) of patterns which match mov{h|l}ps ops. These are preferred over shufps in most cases. llvm-svn: 27835
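
A hedged example of a source pattern that movlps/movhps can match directly instead of a separate load plus shufps; the helper names here are illustrative only:

  #include <xmmintrin.h>

  /* movlps: replace the low two floats of v with two floats from memory. */
  __m128 load_low(__m128 v, const __m64 *p) {
      return _mm_loadl_pi(v, p);
  }

  /* movhps: replace the high two floats of v with two floats from memory. */
  __m128 load_high(__m128 v, const __m64 *p) {
      return _mm_loadh_pi(v, p);
  }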

Evan Cheng authored
llvm-svn: 27834

Chris Lattner authored
llvm-svn: 27832

Chris Lattner authored
llvm-svn: 27828

Chris Lattner authored
llvm-svn: 27827

- Apr 18, 2006

Evan Cheng authored
- PINSRWrmi encoding bug. llvm-svn: 27818
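
For context, a hedged sketch of source code that can select the register/memory/immediate (rmi) form of pinsrw; the function name is made up for illustration:

  #include <emmintrin.h>

  /* Insert a 16-bit value loaded from memory into word 3 of v. When the
     load is folded into the instruction, this uses the PINSRWrmi form. */
  __m128i insert_word3(__m128i v, const short *p) {
      return _mm_insert_epi16(v, *p, 3);
  }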

Evan Cheng authored
llvm-svn: 27817

Evan Cheng authored
llvm-svn: 27816

Evan Cheng authored
llvm-svn: 27815

Evan Cheng authored
llvm-svn: 27814

Evan Cheng authored
llvm-svn: 27813

Chris Lattner authored
llvm-svn: 27810

Chris Lattner authored
llvm-svn: 27809

Chris Lattner authored
void foo2(vector float *A, vector float *B) {
  vector float C = (vector float)vec_cmpeq(*A, *B);
  if (!vec_any_eq(*A, *B))
    *B = (vector float){0,0,0,0};
  *A = C;
}
llvm-svn: 27808

Evan Cheng authored
llvm-svn: 27807

Chris Lattner authored
llvm-svn: 27806