Commits · eadaeaab93302456465f8bbcf9476a4801096102 · Roger Ferrer / llvm-epi-0.8

Oct 01, 2010

Dale Johannesen authored Sep 30, 2010

The x86_mmx type is used for MMX intrinsics, parameters and
return values where these use MMX registers, and is also
supported in load, store, and bitcast.

Only the above operations generate MMX instructions, and optimizations
do not operate on or produce MMX intrinsics. 

MMX-sized vectors <2 x i32> etc. are lowered to XMM or split into
smaller pieces.  Optimizations may occur on these forms and the
result cast back to x86_mmx, provided the result feeds into a
previously existing x86_mmx operation.

The point of all this is to prevent optimizations from introducing
MMX operations, which is unsafe due to the EMMS problem.

llvm-svn: 115243

dd224d23

Sep 22, 2010
- reimplement elf TLS support in terms of addressing modes, eliminating SegmentBaseAddress. · 8a236b63
  Chris Lattner authored Sep 22, 2010
```
llvm-svn: 114529
```
  8a236b63
- convert the last 4 X86ISD nodes that should have memoperands to have them. · a5156c30
  Chris Lattner authored Sep 22, 2010
```
llvm-svn: 114523
```
  a5156c30
- give X86ISD::FNSTCW16m a memoperand, since it touches memory. It only · ed85da56
  Chris Lattner authored Sep 22, 2010
```
can access the stack due to how it is generated though.

llvm-svn: 114522
```
  ed85da56
- give FP_TO_INT16_IN_MEM and friends a memoperand. They are only · 78f518b7
  Chris Lattner authored Sep 22, 2010
```
used with stack slots, but hey, let's be safe.

llvm-svn: 114521
```
  78f518b7
- give VZEXT_LOAD a memory operand, it now works with segment registers. · 54e53295
  Chris Lattner authored Sep 22, 2010
```
llvm-svn: 114515
```
  54e53295
- give LCMPXCHG_DAG[8] a memory operand, allowing it to work with addrspace 256/257 · e479e964
  Chris Lattner authored Sep 21, 2010
```
llvm-svn: 114508
```
  e479e964
Sep 21, 2010

Reimplement r114460 in target-independent DAGCombine rather than target-dependent, by using · 5e65dfbb

Owen Anderson authored Sep 21, 2010

the predicate to discover the number of sign bits.  Enhance X86's target lowering to provide
a useful response to this query.

llvm-svn: 114473

5e65dfbb
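
The "number of sign bits" query this commit relies on can be illustrated outside LLVM. The following is a minimal Python sketch, not LLVM's code: `num_sign_bits` and `sext_inreg_is_redundant` are hypothetical helper names, and the redundancy rule is one common generic use of such a count, assumed here for illustration rather than taken from r114460.

```python
def num_sign_bits(value: int, width: int) -> int:
    """Count how many high-order bits of a width-bit value equal its sign bit.

    The result is always at least 1 (the sign bit itself) and equals
    `width` for all-zeros and all-ones values.
    """
    v = value & ((1 << width) - 1)          # view the value as a width-bit pattern
    sign = (v >> (width - 1)) & 1
    count = 0
    for bit in range(width - 1, -1, -1):    # walk down from the top bit
        if ((v >> bit) & 1) != sign:
            break
        count += 1
    return count


def sext_inreg_is_redundant(sign_bits: int, width: int, from_bits: int) -> bool:
    """Assumed rule: sign-extending in place from `from_bits` changes nothing
    when the value already has enough copies of its sign bit."""
    return sign_bits >= width - from_bits + 1
```

A value known to be either 0 or -1 reports `width` sign bits, so any in-register sign extension of it is redundant under this rule. That is the kind of answer a target can supply for its own nodes.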

Sep 13, 2010
- Added skeleton for inline asm multiple alternative constraint support. · 1094c802
  John Thompson authored Sep 13, 2010
```
llvm-svn: 113766
```
  1094c802
Sep 01, 2010
- Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching... · b3825216
  Bruno Cardoso Lopes authored Sep 01, 2010
```
Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching with movlp pattern fragment

llvm-svn: 112694
```
  b3825216
Aug 31, 2010
- Use MOVLHPS and MOVHLPS x86 nodes whenever possible. Also remove some useless nodes · 03e4c353
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112642
```
  03e4c353
Aug 21, 2010

Prepare LowerVECTOR_SHUFFLEv8i16 to use x86 target specific nodes directly · 9f20e7a1
Bruno Cardoso Lopes authored Aug 21, 2010
```
llvm-svn: 111704
```
9f20e7a1

This is the first step towards refactoring the x86 vector shuffle code. The · 6f3b38a8

Bruno Cardoso Lopes authored Aug 20, 2010

general idea here is to have a group of x86 target specific nodes which are
going to be selected during lowering and then directly matched in isel.

The commit includes the addition of those specific nodes and a *bunch* of
patterns, and incrementally we're going to switch between them and what we
have right now. Both the patterns and target specific nodes can change as
we move forward with this work.

llvm-svn: 111691

6f3b38a8

Aug 11, 2010

Add AVX matching patterns to Packed Bit Test intrinsics. · 91d61df3

Bruno Cardoso Lopes authored Aug 10, 2010

Apply the same approach as for the SSE4.1 ptest intrinsics, but
create a new x86 node "testp", since AVX introduces
vtest{ps}{pd} instructions which set ZF and CF depending
on the sign-bit AND and ANDN of packed floating-point sources.

This is slightly different from what "ptest" does.
Tests will come with the other 256-bit intrinsics tests.

llvm-svn: 110744

91d61df3
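
The difference between the two flag computations can be modeled with plain integers. This is a rough Python sketch under stated assumptions: registers are modeled as Python ints, `ptest_flags` and `vtestps_flags` are hypothetical names, and the flag rules follow the usual description of the instructions (ZF from `src AND dest`, CF from `src AND NOT dest`), with the `vtestps` form looking only at the sign bit of each 32-bit lane.

```python
def ptest_flags(dest: int, src: int, width: int = 128):
    """Model of ptest: every bit of the operands takes part in the test."""
    mask = (1 << width) - 1
    zf = int(((src & dest) & mask) == 0)
    cf = int(((src & ~dest) & mask) == 0)
    return zf, cf


def vtestps_flags(dest: int, src: int, width: int = 128):
    """Model of vtestps: only the sign bit of each 32-bit lane takes part."""
    sign_mask = sum(1 << (lane * 32 + 31) for lane in range(width // 32))
    zf = int(((src & dest) & sign_mask) == 0)
    cf = int(((src & ~dest) & sign_mask) == 0)
    return zf, cf
```

With both operands equal to 1 (only the lowest mantissa bit set), the `ptest` model clears ZF because the AND is non-zero, while the `vtestps` model sets ZF because no sign bit is set. This is why a separate node is needed.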

Jul 28, 2010

~40% faster vector shl <4 x i32> on SSE 4.1 Larger improvements for smaller... · 269a6da0

Nate Begeman authored Jul 27, 2010

~40% faster vector shl <4 x i32> on SSE 4.1. Larger improvements for smaller types coming in future patches.

For:

```
define <2 x i64> @shl(<4 x i32> %r, <4 x i32> %a) nounwind readnone ssp {
entry:
  %shl = shl <4 x i32> %r, %a                     ; <<4 x i32>> [#uses=1]
  %tmp2 = bitcast <4 x i32> %shl to <2 x i64>     ; <<2 x i64>> [#uses=1]
  ret <2 x i64> %tmp2
}
```

We get:

```
_shl:                                   ## @shl
	pslld	$23, %xmm1
	paddd	LCPI0_0, %xmm1
	cvttps2dq	%xmm1, %xmm1
	pmulld	%xmm1, %xmm0
	ret
```

Instead of:

```
_shl:                                   ## @shl
	pshufd	$3, %xmm0, %xmm2
	movd	%xmm2, %eax
	pshufd	$3, %xmm1, %xmm2
	movd	%xmm2, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	pshufd	$1, %xmm0, %xmm3
	movd	%xmm3, %eax
	pshufd	$1, %xmm1, %xmm3
	movd	%xmm3, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm3
	punpckldq	%xmm2, %xmm3
	movd	%xmm0, %eax
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm2
	movhlps	%xmm0, %xmm0
	movd	%xmm0, %eax
	movhlps	%xmm1, %xmm1
	movd	%xmm1, %ecx
	shll	%cl, %eax
	movd	%eax, %xmm0
	punpckldq	%xmm0, %xmm2
	movdqa	%xmm2, %xmm0
	punpckldq	%xmm3, %xmm0
	ret
```

llvm-svn: 109549

269a6da0
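
The short four-instruction sequence in this commit works by building 2^a in each lane through the float exponent field and then multiplying. The Python sketch below mimics one lane and then four. It assumes that the constant `LCPI0_0` holds the bit pattern of `1.0f` (`0x3F800000`) in every lane; the commit message does not show the constant, so that value and the helper names are assumptions made for illustration.

```python
import struct

ONE_F32_BITS = 0x3F800000  # assumed contents of each LCPI0_0 lane: 1.0f


def pow2_via_float(a: int) -> int:
    """Return 2**a for 0 <= a <= 31 by writing `a` into a float32 exponent."""
    bits = ((a << 23) + ONE_F32_BITS) & 0xFFFFFFFF               # pslld $23 ; paddd
    as_float = struct.unpack("<f", struct.pack("<I", bits))[0]   # reinterpret bits as float32
    return int(as_float)                                         # cvttps2dq (truncating convert)


def shl_lane(r: int, a: int) -> int:
    """Shift one 32-bit lane left by multiplying with 2**a (pmulld keeps the low 32 bits)."""
    return (r * pow2_via_float(a)) & 0xFFFFFFFF


def shl_v4i32(r, a):
    """Apply the per-lane shift to four lanes, like the <4 x i32> shl above."""
    return [shl_lane(x, n) for x, n in zip(r, a)]
```

Shifting by a variable per-lane amount becomes a single vector multiply because multiplying by 2^a is the same as shifting left by a. This removes the need to extract each lane into a general-purpose register.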

Jul 26, 2010
- On x86, f32 / f64 nodes share the same registers as 128-bit vector values. · d4218b87
  Evan Cheng authored Jul 26, 2010
```
llvm-svn: 109450
```
  d4218b87
Jul 24, 2010

Add an ILP scheduler. This is a register pressure aware scheduler that's · 37b740c4

Evan Cheng authored Jul 24, 2010

appropriate for targets without detailed instruction itineraries.
The scheduler schedules for increased instruction level parallelism in
low register pressure situations; it schedules to reduce register pressure
when the register pressure becomes high.

On x86_64, this is a win for all tests in CFP2000. It also sped up 256.bzip2
by 16%.

llvm-svn: 109300

37b740c4
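
The two-mode policy described in this commit can be sketched as a selection function over ready instructions. This is a toy Python model, not the scheduler's actual heuristic: the `Candidate` fields, the single `reg_limit` threshold, and the tie-breaking order are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    height: int          # hypothetical: length of the dependence chain through this node
    pressure_delta: int  # hypothetical: change in live registers if scheduled now


def pick_next(ready, live_regs: int, reg_limit: int) -> Candidate:
    """Pick the next instruction from `ready` under a two-mode policy."""
    if live_regs >= reg_limit:
        # High pressure: prefer whatever frees the most registers.
        return min(ready, key=lambda c: (c.pressure_delta, -c.height))
    # Low pressure: prefer the longest chain to expose parallelism.
    return max(ready, key=lambda c: (c.height, -c.pressure_delta))
```

For example, given a long-latency load that adds a live register and a store that frees two, this model picks the load while registers are plentiful and the store once the limit is reached.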

Jul 22, 2010

Custom lower the memory barrier instructions and add support · 9a773826

Eric Christopher authored Jul 22, 2010

for lowering without sse2.  Add a couple of new testcases.

Fixes a few libgomp tests and latent bugs.  Remove a few todos.

llvm-svn: 109078

9a773826

Jul 21, 2010
- Pulling out previous patch, must've run the tests in · d27913e5
  Eric Christopher authored Jul 21, 2010
```
the wrong directory.

llvm-svn: 109005
```
  d27913e5
- Lower MEMBARRIER on x86 and support processors without SSE2. · b2d10670
  Eric Christopher authored Jul 21, 2010
```
Fixes a pile of libgomp failures in the llvm-gcc testsuite due
to the libcall not existing.

llvm-svn: 109004
```
  b2d10670
Jul 15, 2010
- Use TargetOpcode::COPY instead of X86-native register copy instructions when · 9b449d5a
  Jakob Stoklund Olesen authored Jul 14, 2010
```
lowering atomics. This will allow those copies to still be coalesced after
TII::isMoveInstr is removed.

llvm-svn: 108385
```
  9b449d5a
Jul 10, 2010

Reapply bottom-up fast-isel, with several fixes for x86-32: · d7b5ce33

Dan Gohman authored Jul 10, 2010

 - Check getBytesToPopOnReturn().
 - Eschew ST0 and ST1 for return values.
 - Fix the PIC base register initialization so that it doesn't ever
   fail to end up at the top of the entry block.

llvm-svn: 108039

d7b5ce33

Jul 09, 2010

--- Reverse-merging r107947 into '.': · 6586e9b2

Bob Wilson authored Jul 09, 2010

U    utils/TableGen/FastISelEmitter.cpp
--- Reverse-merging r107943 into '.':
U    test/CodeGen/X86/fast-isel.ll
U    test/CodeGen/X86/fast-isel-loads.ll
U    include/llvm/Target/TargetLowering.h
U    include/llvm/Support/PassNameParser.h
U    include/llvm/CodeGen/FunctionLoweringInfo.h
U    include/llvm/CodeGen/CallingConvLower.h
U    include/llvm/CodeGen/FastISel.h
U    include/llvm/CodeGen/SelectionDAGISel.h
U    lib/CodeGen/LLVMTargetMachine.cpp
U    lib/CodeGen/CallingConvLower.cpp
U    lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
U    lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp
U    lib/CodeGen/SelectionDAG/FastISel.cpp
U    lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
U    lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp
U    lib/CodeGen/SelectionDAG/InstrEmitter.cpp
U    lib/CodeGen/SelectionDAG/TargetLowering.cpp
U    lib/Target/XCore/XCoreISelLowering.cpp
U    lib/Target/XCore/XCoreISelLowering.h
U    lib/Target/X86/X86ISelLowering.cpp
U    lib/Target/X86/X86FastISel.cpp
U    lib/Target/X86/X86ISelLowering.h

llvm-svn: 107987

6586e9b2

Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting · 0b5aa1cd
Dan Gohman authored Jul 09, 2010
```
a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL.

llvm-svn: 107943
```
0b5aa1cd

Jul 08, 2010
- Revert 107840 107839 107813 107804 107800 107797 107791. · e7570436
  Dan Gohman authored Jul 08, 2010
```
Debug info intrinsics win for now.

llvm-svn: 107850
```
  e7570436
Jul 07, 2010
- Add X86FastISel support for return statements. This entails refactoring · 2d4d01d0
  Dan Gohman authored Jul 07, 2010
```
a bunch of stuff, to allow the target-independent calling convention
logic to be employed.

llvm-svn: 107800
```
  2d4d01d0
- Simplify FastISel's constructor by giving it a FunctionLoweringInfo · 87fb4e8f
  Dan Gohman authored Jul 07, 2010
```
instance, rather than pointers to all of FunctionLoweringInfo's
members.

This eliminates an NDEBUG ABI sensitivity.

llvm-svn: 107789
```
  87fb4e8f
- Split the SDValue out of OutputArg so that SelectionDAG-independent · fe7532a3
  Dan Gohman authored Jul 07, 2010
```
code can do calling-convention queries. This obviates OutputArgReg.

llvm-svn: 107786
```
  fe7532a3
- CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. · ee0cb703
  Dan Gohman authored Jul 06, 2010
```
SelectBasicBlock doesn't need its BasicBlock argument.

llvm-svn: 107712
```
  ee0cb703
Jul 06, 2010
- Fix up -fstack-protector on linux to use the segment · 2ad0c779
  Eric Christopher authored Jul 06, 2010
```
registers.  Split out testcases per architecture and os
now.

Patch from Nelson Elhage.

llvm-svn: 107640
```
  2ad0c779
Jun 25, 2010

The hasMemory argument is irrelevant to how the argument · ce97d55a

Dale Johannesen authored Jun 25, 2010

for an "i" constraint should get lowered; PR 6309.  While
this argument was passed around a lot, this is the only
place it was used, so it goes away from a lot of other
places.

llvm-svn: 106893

ce97d55a

Jun 03, 2010
- Add first pass at darwin tls compiler support. · b0e1a458
  Eric Christopher authored Jun 03, 2010
```
llvm-svn: 105381
```
  b0e1a458
May 21, 2010
- Fix i64->f64 conversion, x86-64, -no-sse. A bit · b3b9c8ac
  Dale Johannesen authored May 21, 2010
```
tricky since there's a 3rd 64-bit type, MMX vectors.
PR 7135.

llvm-svn: 104308
```
  b3b9c8ac
May 11, 2010

Implement a bunch more TargetSelectionDAGInfo infrastructure. · bb919dfb

Dan Gohman authored May 11, 2010

Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and
EmitTargetCodeForMemmove out of TargetLowering and into
SelectionDAGInfo to exercise this.

llvm-svn: 103481

bb919dfb

Remove the TargetLowering::getSubtarget() virtual function, which · 4df9d9ce
Dan Gohman authored May 11, 2010
```
was unused. TargetMachine::getSubtarget() is used instead.

llvm-svn: 103474
```
4df9d9ce

May 01, 2010
- Get rid of the EdgeMapping map. Instead, just check for BasicBlock · 25c16537
  Dan Gohman authored May 01, 2010
```
changes before doing phi lowering for switches.

llvm-svn: 102809
```
  25c16537
Apr 26, 2010
- Promoting 16-bit cmp / test isn't free. Don't do it. · 6e45f1d1
  Evan Cheng authored Apr 26, 2010
```
llvm-svn: 102366
```
  6e45f1d1
- - Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and... · ed69b382
  Evan Cheng authored Apr 26, 2010
```
- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue.
- Teach spiller to modify DBG_VALUE instructions to reference spill slots.

llvm-svn: 102323
```
  ed69b382
Apr 25, 2010

Stop abusing EmitInstrWithCustomInserter for target-dependent · 582565e9

Dale Johannesen authored Apr 25, 2010

form of DEBUG_VALUE, as it doesn't have reasonable default
behavior for unsupported targets.  Add a new hook instead.
No functional change.

llvm-svn: 102320

582565e9

Apr 23, 2010
- Fix X86ISD::CMP i16 to i32 promotion. · 03675597
  Evan Cheng authored Apr 23, 2010
```
llvm-svn: 102192
```
  03675597