Commits · f92bd8cf22890f2070c87dc4a38d989a317f94fa · Roger Ferrer / llvm-epi-0.8

Sep 03, 2010
- Reintroduce a simple function refactoring done in r112934, also without any functionality changes · fe8717c5
  Bruno Cardoso Lopes authored Sep 03, 2010
```
llvm-svn: 113008
```
  fe8717c5
- Reapply pieces of r112942 and r112934 which don't do · 48e589b1
  Bruno Cardoso Lopes authored Sep 03, 2010
```
functional changes

llvm-svn: 113007
```
  48e589b1
- Reapply Fix comment · 6979cf08
  Bruno Cardoso Lopes authored Sep 03, 2010
```
llvm-svn: 113006
```
  6979cf08
- Revert r112934, "- Use specific nodes to match unpckl masks.", which introduced · 6f3da24d
  Daniel Dunbar authored Sep 03, 2010
```
some infinite loop and select failures.
 - Apologies for eager reverting, but it's branch day.

llvm-svn: 113000
```
  6f3da24d
- Revert r112938 "Fix comment", which depends on r112934, which introduced some · f1aacd55
  Daniel Dunbar authored Sep 03, 2010
```
infinite loop and select failures.

llvm-svn: 112999
```
  f1aacd55
- Revert r112942, "Use punpckh and unpckh family of nodes instead of using unpckh · 0ffe4db4
  Daniel Dunbar authored Sep 03, 2010
```
mask pattern fragment", which depends on r112934, which introduced some infinite
loop and select failures.

llvm-svn: 112998
```
  0ffe4db4
- Use punpckh and unpckh family of nodes instead of using unpckh mask pattern fragment · a85ec104
  Bruno Cardoso Lopes authored Sep 03, 2010
```
llvm-svn: 112942
```
  a85ec104
- Fix comment · adc6bca2
  Bruno Cardoso Lopes authored Sep 03, 2010
```
llvm-svn: 112938
```
  adc6bca2
- - Use specific nodes to match unpckl masks. · cce44678
  Bruno Cardoso Lopes authored Sep 03, 2010
```
- Teach getShuffleScalarElt how to handle more target
specific nodes, so the DAGCombine can make use of it.
- Add another hack to avoid the node update problem
during legalization. More description in the comments.

llvm-svn: 112934
```
  cce44678
- Revert win64 changes. They seem to be incomplete · a689c5b2
  Anton Korobeynikov authored Sep 02, 2010
```
llvm-svn: 112885
```
  a689c5b2
- Properly allocate win64 shadow reg area. · 56291f7e
  Anton Korobeynikov authored Sep 02, 2010
```
Patch by Jan Sjodin!

llvm-svn: 112875
```
  56291f7e
Sep 02, 2010
- Replace unpckl_undef and unpckh_undef matching with target specific opcodes · 489613f1
  Bruno Cardoso Lopes authored Sep 02, 2010
```
llvm-svn: 112806
```
  489613f1
- Move condition out to prepare for more matching · e4e4be38
  Bruno Cardoso Lopes authored Sep 02, 2010
```
llvm-svn: 112805
```
  e4e4be38
- Remove checking for isUNPCKL_v_undef_Mask, the specific node is already emitted for it · bf7fd146
  Bruno Cardoso Lopes authored Sep 02, 2010
```
llvm-svn: 112804
```
  bf7fd146
- become more strict about when it's safe to use X86ISD::MOVLPS · 6a7f6344
  Bruno Cardoso Lopes authored Sep 02, 2010
```
llvm-svn: 112799
```
  6a7f6344
- Revert r112689; avoid those kinds of checks because they interfere with MMX · 04c25c15
  Bruno Cardoso Lopes authored Sep 01, 2010
```
llvm-svn: 112760
```
  04c25c15
Sep 01, 2010
- Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching... · b3825216
  Bruno Cardoso Lopes authored Sep 01, 2010
```
Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching with movlp pattern fragment

llvm-svn: 112694
```
  b3825216
- minor change, simplify some logic · 6aaebe87
  Bruno Cardoso Lopes authored Sep 01, 2010
```
llvm-svn: 112689
```
  6aaebe87
- Move some functions around so they can be used by other functions to come · 2b025707
  Bruno Cardoso Lopes authored Sep 01, 2010
```
llvm-svn: 112687
```
  2b025707
- Use x86 specific MOVSLDUP node, add more patterns to match it and remove useless load nodes · 4b56d872
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112661
```
  4b56d872
- Use x86 specific MOVSHDUP node and add more patterns to match it · 61996ef8
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112657
```
  61996ef8
Aug 31, 2010
- Use MOVHLPS node instead of matching using movhlps and movhlps_undef pattern fragments · 5de15ce4
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112644
```
  5de15ce4
- Use MOVLHPS and MOVHLPS x86 nodes whenever possible. Also remove some useless nodes · 03e4c353
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112642
```
  03e4c353
- Use X86ISD::MOVSS and MOVSD to represent the movl mask pattern, also fix the... · dfd9dd5d
  Bruno Cardoso Lopes authored Aug 31, 2010
```
Use X86ISD::MOVSS and MOVSD to represent the movl mask pattern, also fix the handling of those nodes when searching for scalars inside vector shuffles

llvm-svn: 112570
```
  dfd9dd5d
Aug 28, 2010

- fix the buildvector->insertp[sd] logic to not always create a redundant · 94656b1c
  Chris Lattner authored Aug 28, 2010

insertp[sd] $0, which is a noop.  Before:

_f32:                                   ## @f32
	pshufd	$1, %xmm1, %xmm2
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm2, %xmm3
	addss	%xmm1, %xmm0
                                        ## kill: XMM0<def> XMM0<kill> XMM0<def>
	insertps	$0, %xmm0, %xmm0
	insertps	$16, %xmm3, %xmm0
	ret

after:

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm1, %xmm3
	movdqa	%xmm2, %xmm0
	insertps	$16, %xmm3, %xmm0
	ret

The extra movs are due to a random (poor) scheduling decision.

llvm-svn: 112379

94656b1c
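
To see why `insertps $0` with the same register as source and destination is a no-op, here is a minimal C sketch of the register-source form of `insertps`. The `vec4` type and the `model_insertps` and `insertps_demo` names are hypothetical, introduced only for illustration; they are not part of LLVM. The model assumes the usual immediate layout: bits 7:6 select the source lane, bits 5:4 the destination lane, and bits 3:0 form a zero mask.

```c
#include <assert.h>

/* Hypothetical 4-lane model of an XMM register holding floats. */
typedef struct { float lane[4]; } vec4;

/* Models insertps with a register source (AT&T operand order is
   insertps $imm, src, dst):
     imm[7:6] = source lane, imm[5:4] = destination lane,
     imm[3:0] = mask of result lanes to force to zero. */
static vec4 model_insertps(vec4 dst, vec4 src, unsigned imm) {
    unsigned src_lane = (imm >> 6) & 3u;
    unsigned dst_lane = (imm >> 4) & 3u;
    dst.lane[dst_lane] = src.lane[src_lane];
    for (unsigned i = 0; i < 4; ++i)
        if (imm & (1u << i))
            dst.lane[i] = 0.0f;
    return dst;
}

/* Illustrative check: $0 on identical registers changes nothing,
   while $16 writes lane 0 of the source into lane 1 of the destination. */
static void insertps_demo(void) {
    vec4 x = {{ 1.0f, 2.0f, 3.0f, 4.0f }};
    vec4 same = model_insertps(x, x, 0);
    for (unsigned i = 0; i < 4; ++i)
        assert(same.lane[i] == x.lane[i]);

    vec4 hi = {{ 9.0f, 0.0f, 0.0f, 0.0f }};
    vec4 merged = model_insertps(x, hi, 16);
    assert(merged.lane[1] == 9.0f);
}
```

Under this model, the `insertps $0, %xmm0, %xmm0` in the "Before" listing copies lane 0 of `%xmm0` onto itself, which is why dropping it does not change the result.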

- fix the BuildVector -> unpcklps logic to not do pointless shuffles · bcb6090a
  Chris Lattner authored Aug 28, 2010

when the top elements of a vector are undefined.  This happens all
the time for X86-64 ABI stuff because only the low 2 elements of
a 4 element vector are defined.  For example, on:

_Complex float f32(_Complex float A, _Complex float B) {
  return A+B;
}

We used to produce (with SSE2, SSE4.1+ uses insertps):

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$16, %xmm2, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm0
	addss	%xmm1, %xmm0
	pshufd	$16, %xmm0, %xmm1
	movdqa	%xmm2, %xmm0
	unpcklps	%xmm1, %xmm0
	ret

We now produce:

_f32:                                   ## @f32
	movdqa	%xmm0, %xmm2
	addss	%xmm1, %xmm2
	pshufd	$1, %xmm1, %xmm1
	pshufd	$1, %xmm0, %xmm3
	addss	%xmm1, %xmm3
	movaps	%xmm2, %xmm0
	unpcklps	%xmm3, %xmm0
	ret

This implements rdar://8368414

llvm-svn: 112378

bcb6090a
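
The effect of dropping the two `pshufd $16` instructions can be checked with a small lane-level model in C. This is only a sketch: `vec4`, `model_pshufd`, `model_unpcklps`, `pack_old`, `pack_new` and `unpcklps_demo` are hypothetical names for illustration, not LLVM code, and the values in the upper lanes stand in for "undefined" contents.

```c
#include <assert.h>

/* Hypothetical 4-lane model of an XMM register holding floats. */
typedef struct { float lane[4]; } vec4;

/* Models pshufd: result lane i takes the source lane selected by
   bits [2i+1:2i] of the immediate. */
static vec4 model_pshufd(vec4 src, unsigned imm) {
    vec4 r;
    for (unsigned i = 0; i < 4; ++i)
        r.lane[i] = src.lane[(imm >> (2 * i)) & 3u];
    return r;
}

/* Models unpcklps: interleave the low two lanes of a and b. */
static vec4 model_unpcklps(vec4 a, vec4 b) {
    vec4 r = {{ a.lane[0], b.lane[0], a.lane[1], b.lane[1] }};
    return r;
}

/* Old sequence: shuffle each operand with pshufd $16, then unpack. */
static vec4 pack_old(vec4 re, vec4 im) {
    return model_unpcklps(model_pshufd(re, 16), model_pshufd(im, 16));
}

/* New sequence: unpack directly; the upper lanes may hold anything. */
static vec4 pack_new(vec4 re, vec4 im) {
    return model_unpcklps(re, im);
}

/* Illustrative check: both sequences agree on the two defined lanes. */
static void unpcklps_demo(void) {
    vec4 re = {{ 4.0f, -1.0f, -1.0f, -1.0f }};  /* lane 0 = real sum */
    vec4 im = {{ 6.0f, -2.0f, -2.0f, -2.0f }};  /* lane 0 = imaginary sum */
    vec4 o = pack_old(re, im);
    vec4 n = pack_new(re, im);
    assert(o.lane[0] == n.lane[0]);
    assert(o.lane[1] == n.lane[1]);
}
```

In this model the two sequences differ only in lanes 2 and 3, which the commit message describes as undefined for a `_Complex float` return value, so the extra shuffles buy nothing.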

- improve comments in the unpcklps generating logic, introduce · 96db6e66
  Chris Lattner authored Aug 28, 2010

a new EltStride variable instead of reusing NumElems variable
for a non-obvious purpose.  No functionality change.

llvm-svn: 112377

96db6e66

- Clean up the logic of vector shuffles -> vector shifts. · a982aa24
  Bruno Cardoso Lopes authored Aug 28, 2010

Also teach this logic how to handle target specific shuffles if
needed, this is necessary while searching recursively for zeroed
scalar elements in vector shuffle operands.

llvm-svn: 112348

a982aa24

Aug 27, 2010
- Properly handle passing of FP stuff to varargs function on Win64: · c0b36921
  Anton Korobeynikov authored Aug 27, 2010
```
value should be copied to the corresponding shadow reg as well.
Patch by Cameron Esfahani!

llvm-svn: 112262
```
  c0b36921
Aug 26, 2010
- zap the now unused MVT::getIntVectorWithNumElements · e25ba0c7
  Bruno Cardoso Lopes authored Aug 26, 2010
```
llvm-svn: 112218
```
  e25ba0c7
- implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. · eb2cc0ce
  Chris Lattner authored Aug 26, 2010
```
llvm-svn: 112171
```
  eb2cc0ce
- fix sse1 only codegen in x86-64 mode, which is something we · cc60609c
  Chris Lattner authored Aug 26, 2010
```
apparently try to support.

llvm-svn: 112168
```
  cc60609c
Aug 25, 2010
- Revert this for now, PUNPCKLDQ doesn't operate on v4f32 · d4085f6e
  Bruno Cardoso Lopes authored Aug 25, 2010
```
llvm-svn: 112090
```
  d4085f6e
- Fix nasty mingw32 bug, which e.g. prevented llvm-gcc bootstrap there. · b3b53eca
  Anton Korobeynikov authored Aug 25, 2010
```
Mark _alloca call as clobbering EFLAGS, otherwise some DCE might remove
other flags-clobbering stuff (e.g. cmp instructions) occurring after
_alloca call.

llvm-svn: 112034
```
  b3b53eca
- PUNPCKLDQ should also be used for v4f32 · 0770d257
  Bruno Cardoso Lopes authored Aug 25, 2010
```
llvm-svn: 112020
```
  0770d257
- teach lowering to get target specific nodes for pshufd, emulating the same... · 2e45d522
  Bruno Cardoso Lopes authored Aug 25, 2010
```
teach lowering to get target specific nodes for pshufd, emulating the same isel behavior for now, so we can pass all vector shuffle tests

llvm-svn: 112017
```
  2e45d522
Aug 24, 2010
- Fix X86's isLegalAddressingMode to recognize that static addresses · c88fda47
  Dan Gohman authored Aug 24, 2010
```
need not be RIP-relative in small mode.

llvm-svn: 111917
```
  c88fda47
- Use pshufhw and pshuflw in more cases and fix getTargetShuffleNode number of arguments · 758d7b1f
  Bruno Cardoso Lopes authored Aug 24, 2010
```
llvm-svn: 111890
```
  758d7b1f
Aug 23, 2010
- Start using target specific nodes for shuffles: pshufhw and pshuflw · 264d90ff
  Bruno Cardoso Lopes authored Aug 23, 2010
```
llvm-svn: 111837
```
  264d90ff
- Revert invalid r111792. Jump tables are not broken on x86-64 / coff, · cbbe4501
  Anton Korobeynikov authored Aug 23, 2010
```
it's the COFF emitter that does not support differences of two symbols
(and needs to be fixed). GAS is fine with the code produced.

llvm-svn: 111801
```
  cbbe4501