- Apr 22, 2006
-
-
Evan Cheng authored
Don't do all the lowering stuff for 2-wide build_vectors. Also, minor optimization for shuffle of undef. llvm-svn: 27946
-
Evan Cheng authored
Fix a performance regression. Use {p}shuf* when there are only two distinct elements in a build_vector. llvm-svn: 27945
-
Chris Lattner authored
llvm-svn: 27943
-
Chris Lattner authored
This allows Prolangs-C++/city and probably a bunch of other stuff to work well with the new front-end llvm-svn: 27941
-
Evan Cheng authored
movd always clears the top 96 bits, and movss does so when it's loading the value from memory. The net result is that codegen for 4-wide shuffles is much improved. It is near optimal if one or more elements is a zero. e.g.

    __m128i test(int a, int b) {
        return _mm_set_epi32(0, 0, b, a);
    }

compiles to:

    _test:
        movd 8(%esp), %xmm1
        movd 4(%esp), %xmm0
        punpckldq %xmm1, %xmm0
        ret

compare to gcc:

    _test:
        subl $12, %esp
        movd 20(%esp), %xmm0
        movd 16(%esp), %xmm1
        punpckldq %xmm0, %xmm1
        movq %xmm1, %xmm0
        movhps LC0, %xmm0
        addl $12, %esp
        ret

or icc:

    _test:
        movd 4(%esp), %xmm0       #5.10
        movd 8(%esp), %xmm3       #5.10
        xorl %eax, %eax           #5.10
        movd %eax, %xmm1          #5.10
        punpckldq %xmm1, %xmm0    #5.10
        movd %eax, %xmm2          #5.10
        punpckldq %xmm2, %xmm3    #5.10
        punpckldq %xmm3, %xmm0    #5.10
        ret                       #5.10

There is still room for improvement, for example the FP variant of the above example:

    __m128 test(float a, float b) {
        return _mm_set_ps(0.0, 0.0, b, a);
    }

    _test:
        movss 8(%esp), %xmm1
        movss 4(%esp), %xmm0
        unpcklps %xmm1, %xmm0
        xorps %xmm1, %xmm1
        movlhps %xmm1, %xmm0
        ret

The xorps and movlhps are unnecessary. This will require post-legalizer optimization to handle. llvm-svn: 27939
-
Nate Begeman authored
llvm-svn: 27938
-
Nate Begeman authored
llvm-svn: 27937
-
- Apr 21, 2006
-
-
Chris Lattner authored
llvm-svn: 27935
-
Chris Lattner authored
llvm-svn: 27934
-
Chris Lattner authored
miscompares). Switch RISC targets to use the list-td scheduler, which isn't. llvm-svn: 27933
-
Chris Lattner authored
llvm-svn: 27931
-
Chris Lattner authored
llvm-svn: 27930
-
Evan Cheng authored
scalar value. e.g.

    _mm_set_epi32(0, a, 0, 0);
    ==>
        movd 4(%esp), %xmm0
        pshufd $69, %xmm0, %xmm0

    _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0);
    ==>
        movzbw 4(%esp), %ax
        movzwl %ax, %eax
        pxor %xmm0, %xmm0
        pinsrw $5, %eax, %xmm0

llvm-svn: 27923
-
Chris Lattner authored
llvm-gcc4 bootstrap. Whenever a node is deleted by the dag combiner, it *must* be returned by the visit function, or the dag combiner will not know that the node has been processed (and will, e.g., send it to the target dag combine xforms). llvm-svn: 27922
-
- Apr 20, 2006
-
-
Chris Lattner authored
llvm-svn: 27912
-
Chris Lattner authored
llvm-svn: 27908
-
Chris Lattner authored
llvm-svn: 27907
-
Chris Lattner authored
llvm-svn: 27900
-
Chris Lattner authored
llvm-svn: 27899
-
Chris Lattner authored
llvm-svn: 27895
-
Chris Lattner authored
llvm-svn: 27893
-
Chris Lattner authored
llvm-svn: 27885
-
Andrew Lenharth authored
llvm-svn: 27881
-
Andrew Lenharth authored
can be converted to losslessly, we can continue the conversion to a direct call. llvm-svn: 27880
-
Evan Cheng authored
to a vector shuffle.
- VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector.
llvm-svn: 27875
-
Evan Cheng authored
The DAG combiner can turn a VAND V, <-1, 0, -1, -1> (i.e. clearing vector elements) into a vector shuffle with a zero vector. It only does so when TLI tells it the xform is profitable. llvm-svn: 27874
-
Chris Lattner authored
CodeGen/PowerPC/2006-04-19-vmaddfp-crash.ll llvm-svn: 27868
-
Chris Lattner authored
llvm-svn: 27863
-
Evan Cheng authored
but i64 is not. If possible, change an i64 op to an f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849
-
Evan Cheng authored
llvm-svn: 27847
-
Chris Lattner authored
llvm-svn: 27846
-
Evan Cheng authored
instructions.
- Fixed a commute vector_shuffle bug.
llvm-svn: 27845
-
- Apr 19, 2006
-
-
Evan Cheng authored
llvm-svn: 27844
-
Evan Cheng authored
llvm-svn: 27843
-
Evan Cheng authored
- Added more movhlps and movlhps patterns. llvm-svn: 27842
-
Evan Cheng authored
llvm-svn: 27840
-
Evan Cheng authored
llvm-svn: 27836
-
Evan Cheng authored
- Increase cost (complexity) of patterns which match mov{h|l}ps ops. These are preferred over shufps in most cases. llvm-svn: 27835
-
Evan Cheng authored
llvm-svn: 27834
-
Chris Lattner authored
llvm-svn: 27832
-