Commits · eadaeaab93302456465f8bbcf9476a4801096102 · Roger Ferrer / llvm-epi-0.8

Oct 03, 2010

Implement support for the bizarre 3DNow! encoding (which is unlike anything · 45270db9

Chris Lattner authored Oct 03, 2010

else in X86), and add support for pavgusb.  This is apparently the
only instruction (other than movsx) that is preventing ffmpeg from building
with clang.

If someone else is interested in banging out the rest of the 3DNow! 
instructions, it should be quite easy now.

llvm-svn: 115466
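
To illustrate why the encoding is unusual: 3DNow! instructions share the two-byte prefix 0F 0F, and the byte that selects the operation comes after the ModRM byte, where an 8-bit immediate would normally sit. The sketch below hand-encodes only the register-register form. The 0xBF suffix for pavgusb is taken from AMD's 3DNow! documentation; `encode3DNowRR` and `PAVGUSB_SUFFIX` are hypothetical names for illustration, not the encoder code added by this commit.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Opcode suffix byte for pavgusb (it trails the ModRM byte).
const uint8_t PAVGUSB_SUFFIX = 0xBF;

// Hypothetical helper: encode a register-register 3DNow! instruction.
// dst and src are MMX register numbers (0-7).
std::vector<uint8_t> encode3DNowRR(unsigned dst, unsigned src,
                                   uint8_t opcodeSuffix) {
  assert(dst < 8 && src < 8 && "MMX registers are mm0-mm7");
  // ModRM byte: mod=11 (register direct), reg=dst, rm=src.
  uint8_t modrm = static_cast<uint8_t>(0xC0 | (dst << 3) | src);
  // Layout: 0F 0F, ModRM, then the opcode suffix in the imm8 position.
  return {0x0F, 0x0F, modrm, opcodeSuffix};
}
```

With these assumptions, `encode3DNowRR(1, 2, PAVGUSB_SUFFIX)` corresponds to `pavgusb %mm2, %mm1` in AT&T syntax.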

Sep 17, 2010
- fix rdar://8444631 - encoder crash on 'enter' · cea0a8d7
  Chris Lattner authored Sep 17, 2010
```
What a weird instruction.

llvm-svn: 114190
```
Sep 05, 2010

implement rdar://6653118 - fastisel should fold loads where possible. · eeba0c73

Chris Lattner authored Sep 05, 2010

Since mem2reg isn't run at -O0, we get a ton of reloads from the stack.
For example, this code:

int foo(int x, int y, int z) {
  return x+y+z;
}

used to compile into:

_foo:                                   ## @foo
	subq	$12, %rsp
	movl	%edi, 8(%rsp)
	movl	%esi, 4(%rsp)
	movl	%edx, (%rsp)
	movl	8(%rsp), %edx
	movl	4(%rsp), %esi
	addl	%edx, %esi
	movl	(%rsp), %edx
	addl	%esi, %edx
	movl	%edx, %eax
	addq	$12, %rsp
	ret

Now we produce:

_foo:                                   ## @foo
	subq	$12, %rsp
	movl	%edi, 8(%rsp)
	movl	%esi, 4(%rsp)
	movl	%edx, (%rsp)
	movl	8(%rsp), %edx
	addl	4(%rsp), %edx    ## Folded load
	addl	(%rsp), %edx     ## Folded load
	movl	%edx, %eax
	addq	$12, %rsp
	ret

Fewer instructions and less register use = faster compiles.

llvm-svn: 113102
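
The transformation above can be modelled on a toy IR. This is only a sketch: the `Inst` layout, `foldLoads`, and `exampleFoo` are hypothetical, and the conditions FastISel actually checks are not described in this log. Here a load is folded only when it sits immediately before the add that consumes it and has no other use.

```cpp
#include <cassert>
#include <map>
#include <vector>

enum class Op { Load, Add, AddMem };

// Toy three-address instruction over virtual registers and stack slots.
struct Inst {
  Op op;
  int def;  // virtual register defined
  int a;    // Load: stack slot; Add/AddMem: left-hand virtual register
  int b;    // Add: right-hand vreg; AddMem: stack slot; Load: unused
};

// Fold a single-use load into the add that immediately follows it.
std::vector<Inst> foldLoads(const std::vector<Inst> &in) {
  std::map<int, int> uses;  // number of uses of each virtual register
  for (const Inst &I : in) {
    if (I.op == Op::Add) { ++uses[I.a]; ++uses[I.b]; }
    else if (I.op == Op::AddMem) { ++uses[I.a]; }
  }
  std::vector<Inst> out;
  for (const Inst &I : in) {
    if (I.op == Op::Add && !out.empty() && out.back().op == Op::Load) {
      const Inst load = out.back();
      bool feedsAdd = (I.a == load.def || I.b == load.def);
      if (feedsAdd && uses[load.def] == 1) {
        // Add is commutative, so the loaded value may be either operand.
        int other = (I.a == load.def) ? I.b : I.a;
        out.pop_back();  // drop the separate load
        out.push_back(Inst{Op::AddMem, I.def, other, load.a});
        continue;
      }
    }
    out.push_back(I);
  }
  assert(out.size() <= in.size());  // folding never adds instructions
  return out;
}

// x+y+z with the three arguments already spilled to slots 8, 4 and 0.
std::vector<Inst> exampleFoo() {
  return {
    {Op::Load, 0, 8, 0}, {Op::Load, 1, 4, 0}, {Op::Add, 2, 0, 1},
    {Op::Load, 3, 0, 0}, {Op::Add, 4, 2, 3},
  };
}
```

Running `foldLoads(exampleFoo())` leaves one load and two memory-operand adds, mirroring the shape of the "Now we produce" listing.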

Aug 26, 2010
- Fix PR7748 without using microsoft extensions · 184eaea8
  Bruno Cardoso Lopes authored Aug 26, 2010
```
llvm-svn: 112128
```
Aug 19, 2010
- fix PR7465, mishandling of lcall and ljmp: intersegment long · f547740d
  Chris Lattner authored Aug 19, 2010
```
calls and jumps.

llvm-svn: 111496
```
Jul 22, 2010

remove the JIT "NeedsExactSize" feature and supporting logic. · 8f3adc90
Chris Lattner authored Jul 22, 2010
```
llvm-svn: 109167
```

instead of migrating it to the MC instruction encoder, just · 083be4d3

Chris Lattner authored Jul 22, 2010

rip out the implementation of X86InstrInfo::GetInstSizeInBytes.
The code being ripped out just implemented a copy and hacked up
version of the (old) instruction encoder, and is buggy and 
terrible in other ways.  Since "GetInstSizeInBytes" is really 
only there to support the JIT's "NeedsExactSize" hook (which
no one is using), just rip out the code.  I will rip out the
NeedsExactSize hook next.

This resolves rdar://7617809 - switch X86InstrInfo::GetInstSizeInBytes to use X86MCCodeEmitter

llvm-svn: 109149

Jul 17, 2010
- Remove the isMoveInstr() hook. · 8289f785
  Jakob Stoklund Olesen authored Jul 16, 2010
```
llvm-svn: 108567
```
Jul 13, 2010
- AVX 256-bit conversion instructions · fd8bfcd6
  Bruno Cardoso Lopes authored Jul 13, 2010
```
Add the x86 VEX_L form to handle special cases where VEX_L must be set.

llvm-svn: 108274
```
Jul 11, 2010
- X86InstrInfo::copyRegToReg is dead. Long live copyPhysReg! · e46f3eb0
  Jakob Stoklund Olesen authored Jul 11, 2010
```
llvm-svn: 108076
```
Jul 09, 2010
- Merge VEX enums with other x86 enum forms. Also fix all checks of which VEX · 992d25da
  Bruno Cardoso Lopes authored Jul 09, 2010
```
fields to use. 

llvm-svn: 107952
```
- add some long-overdue enums to refer to the parts of the 5-operand · ec536276
  Chris Lattner authored Jul 08, 2010
```
X86 memory operand.

llvm-svn: 107925
```
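
For context, an X86 memory reference is spread over five consecutive operands: base register, scale, index register, displacement, and segment register. A minimal sketch of naming those positions instead of using magic numbers follows. The enumerator names, the `Operands` type, and `getDisp` are illustrative assumptions and may not match what this commit actually added.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Illustrative offsets of the five sub-operands, relative to the first one.
enum AddrOperand {
  AddrBaseReg = 0,
  AddrScaleAmt = 1,
  AddrIndexReg = 2,
  AddrDisp = 3,
  AddrSegmentReg = 4,
  AddrNumOperands = 5
};

// Hypothetical flat operand list standing in for an instruction's operands.
using Operands = std::array<int64_t, 8>;

// Read the displacement of the memory operand that starts at index memOpNo.
int64_t getDisp(const Operands &ops, unsigned memOpNo) {
  assert(memOpNo + AddrNumOperands <= ops.size() && "memory operand truncated");
  return ops[memOpNo + AddrDisp];
}
```

The point of the named offsets is that `ops[memOpNo + AddrDisp]` documents itself, where `ops[memOpNo + 3]` does not.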
- introduce a new X86II::getMemoryOperandNo method, which · 1dd82c7d
  Chris Lattner authored Jul 08, 2010
```
returns the start of the memory operand for an instruction.

Introduce a new "X86AddrSegment" enum to reduce # magic numbers
referring to X86 memory operand layout.

llvm-svn: 107916
```
Jul 08, 2010
- Implement X86InstrInfo::copyPhysReg · 930f8082
  Jakob Stoklund Olesen authored Jul 08, 2010
```
llvm-svn: 107898
```
- Implement the major chunk of PR7195: support for 'callw' · ac588129
  Chris Lattner authored Jul 07, 2010
```
in the integrated assembler.  Still some discussion to be
done.

llvm-svn: 107825
```
Jul 07, 2010
- Add AVX vblendvpd, vblendvps and vpblendvb instructions · e2bd058d
  Bruno Cardoso Lopes authored Jul 06, 2010
```
Update VEX encoding to support those new instructions

llvm-svn: 107715
```
Jul 01, 2010

· 05166740

Bruno Cardoso Lopes authored Jul 01, 2010

- Add AVX SSE2 Move doubleword and quadword instructions.
- Add encode bits for VEX_W
- All 128-bit SSE1 & SSE2 instructions that are described
  in the .td file now have an AVX-encoded form already working.

llvm-svn: 107365

Jun 23, 2010
- Add AVX MOV{SS,SD}{rr,rm} instructions · 1a890f9d
  Bruno Cardoso Lopes authored Jun 22, 2010
```
llvm-svn: 106588
```
Jun 18, 2010

Add a DebugLoc parameter to TargetInstrInfo::InsertBranch(). This · 0125b641

Stuart Hastings authored Jun 17, 2010

addresses a longstanding deficiency noted in many FIXMEs scattered
across all the targets.

This effectively moves the problem up one level, replacing eleven
FIXMEs in the targets with eight FIXMEs in CodeGen, plus one path
through FastISel where we actually supply a DebugLoc, fixing Radar
7421831.

llvm-svn: 106243

Jun 09, 2010
- Reapply r105521, this time appending "LLU" to 64 bit · c2f87b7b
  Bruno Cardoso Lopes authored Jun 08, 2010
```
immediates to avoid breaking the build.

llvm-svn: 105652
```
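
A minimal sketch of the kind of fix involved, assuming the generated table held 64-bit constants with no suffix. The constant below is made up for illustration; the real generated values are not shown in this log.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 64-bit flag word with a bit set above bit 31 (2^32 + 1).
// Written without a suffix, an older compiler with a 32-bit 'long' may
// reject it with "integer constant is too large for 'long' type".
const uint64_t ExampleFlags = 4294967297LLU;

// Sanity check: the constant really needs more than 32 bits.
void checkExampleFlags() {
  assert((ExampleFlags >> 32) == 1);
  assert((ExampleFlags & 0xFFFFFFFFu) == 1);
}
```

The suffix forces the literal to `unsigned long long`, so its type no longer depends on the width of `long` on the build host.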
Jun 05, 2010

revert r105521, which is breaking the buildbots with stuff like this: · fdd26143

Chris Lattner authored Jun 05, 2010

In file included from X86InstrInfo.cpp:16:
X86GenInstrInfo.inc:2789: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2790: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2792: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2793: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2808: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2809: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2816: error: integer constant is too large for 'long' type
X86GenInstrInfo.inc:2817: error: integer constant is too large for 'long' type

llvm-svn: 105524

Initial AVX support for some instructions. No patterns matched · 594fa263
Bruno Cardoso Lopes authored Jun 05, 2010
```
yet, only assembly encoding support.

llvm-svn: 105521
```

Jun 03, 2010

Add first pass at darwin tls compiler support. · b0e1a458
Eric Christopher authored Jun 03, 2010
```
llvm-svn: 105381
```

Slightly change the meaning of the reMaterialize target hook when the original · a8ad9774

Jakob Stoklund Olesen authored Jun 02, 2010

instruction defines subregisters.

Any existing subreg indices on the original instruction are preserved or
composed with the new subreg index.

Also substitute multiple operands mentioning the original register by using the
new MachineInstr::substituteRegister() function. This is necessary because there
will soon be <imp-def> operands added to non read-modify-write partial
definitions. This instruction:

  %reg1234:foo = FLAP %reg1234<imp-def>

will reMaterialize(%reg3333, bar) like this:

  %reg3333:bar-foo = FLAP %reg3333:bar<imp-def>

Finally, replace the TargetRegisterInfo pointer argument with a reference to
indicate that it cannot be NULL.

llvm-svn: 105358

May 22, 2010
- Implement @llvm.returnaddress. rdar://8015977. · 168ced94
  Evan Cheng authored May 22, 2010
```
llvm-svn: 104421
```
May 06, 2010
- Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it · 779c69bb
  Dan Gohman authored May 06, 2010
```
doesn't have to guess.

llvm-svn: 103194
```
- Add argument TargetRegisterInfo to loadRegFromStackSlot and storeRegToStackSlot. · efb126a6
  Evan Cheng authored May 06, 2010
```
llvm-svn: 103193
```
Apr 29, 2010
- Frame index can be negative. · 250e917e
  Evan Cheng authored Apr 29, 2010
```
llvm-svn: 102577
```
Apr 27, 2010

on darwin empty functions need to codegen into something of non-zero length, · 6a5e706e

Chris Lattner authored Apr 26, 2010

otherwise labels get incorrectly merged.  We handled this by emitting a 
".byte 0", but this isn't correct on thumb/arm targets where the text segment
needs to be a multiple of 2/4 bytes.  Handle this by emitting a noop.  This
is more gross than it should be because arm/ppc are not fully mc'ized yet.

This fixes rdar://7908505

llvm-svn: 102400

Apr 26, 2010

- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and... · ed69b382

Evan Cheng authored Apr 26, 2010

- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue.
- Teach spiller to modify DBG_VALUE instructions to reference spill slots.

llvm-svn: 102323

Mar 31, 2010

Renumber SSE execution domains for better code size. · dbff4e81

Jakob Stoklund Olesen authored Mar 30, 2010

SSEDomainFix will collapse to the domain with the lower number when it has a
choice. The SSEPackedSingle domain often has smaller instructions, so prefer
that.

llvm-svn: 99952
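
A minimal sketch of the "collapse to the lower number" rule, assuming each instruction's legal domains are given as a bitmask with bit N set when domain N is available. The numbering (PackedSingle lowest) and the name `pickDomain` are assumptions for illustration, not the pass's actual code.

```cpp
#include <cassert>

// Assumed numbering after the renumbering: PackedSingle gets the lowest value.
enum Domain { PackedSingle = 1, PackedDouble = 2, PackedInt = 3 };

// Return the lowest-numbered domain whose bit is set in availableMask.
unsigned pickDomain(unsigned availableMask) {
  assert(availableMask != 0 && "need at least one legal domain");
  unsigned d = 0;
  while (((availableMask >> d) & 1u) == 0)
    ++d;
  return d;
}
```

Under this numbering, an instruction that is legal in both the PackedSingle and PackedInt domains collapses to PackedSingle, which is the preference the message describes.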

Mar 30, 2010
- Basic implementation of SSEDomainFix pass. · b551aa4d
  Jakob Stoklund Olesen authored Mar 29, 2010
```
Cross-block inference is primitive and wrong, but the pass is working otherwise.

llvm-svn: 99848
```
Mar 25, 2010

Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 49e121d5

Jakob Stoklund Olesen authored Mar 25, 2010

On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
in a different domain than where it was defined. Some instructions have
equivalents for different domains, like por/orps/orpd.

The SSEDomainFix pass tries to minimize the number of domain crossings by
changing between equivalent opcodes where possible.

This is a work in progress; in particular, the pass doesn't do anything yet. SSE
instructions are tagged with their execution domain in TableGen using the last
two bits of TSFlags. Note that not all instructions are tagged correctly. Life
just isn't that simple.

The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
issue handled by NEONMoveFixPass. This pass may become target independent to
handle both.

llvm-svn: 99524
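
To make the idea concrete, here is a small sketch of the two pieces described above: reading a domain tag from the top two bits of a flags word, and looking up the equivalent opcode for a given domain. The 32-bit flag width, the domain numbering, and the string table are assumptions for illustration; only the por/orps/orpd equivalence comes from the message itself.

```cpp
#include <cassert>
#include <cstdint>

// Assumed numbering; the values used at this revision are not shown here.
enum Domain { Generic = 0, PackedSingle = 1, PackedDouble = 2, PackedInt = 3 };

// Extract the domain tag, assuming it lives in bits 30-31 of a 32-bit word.
Domain domainOf(uint32_t tsFlags) {
  return static_cast<Domain>(tsFlags >> 30);
}

// Spelling of the bitwise-OR instruction in each domain ("" for Generic).
const char *orOpcodeFor(Domain d) {
  static const char *const row[] = {"", "orps", "orpd", "por"};
  assert(d >= Generic && d <= PackedInt);
  return row[d];
}
```

A pass built on this would swap an instruction for its table entry in the domain of its operands, avoiding the crossing penalty.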

Mar 24, 2010
- Revert "Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings." · a86ccbfe
  Jakob Stoklund Olesen authored Mar 23, 2010
```
This reverts commit 99345. It was breaking buildbots.

llvm-svn: 99352
```
- Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 31da45b7
  Jakob Stoklund Olesen authored Mar 23, 2010
```
This is work in progress. So far, SSE execution domain tables are added to
X86InstrInfo, and a skeleton pass is enabled with -sse-domain-fix.

llvm-svn: 99345
```
Feb 13, 2010
- add encoder support and tests for rdtscp · f83726f6
  Chris Lattner authored Feb 13, 2010
```
llvm-svn: 96076
```
- remove special cases for vmlaunch, vmresume, vmxoff, and swapgs · 140caa72
  Chris Lattner authored Feb 13, 2010
```
fix swapgs to be spelled right.

llvm-svn: 96058
```
- implement infrastructure to support fixups for rip-rel · 4ad96055
  Chris Lattner authored Feb 12, 2010
```
addressing.  This isn't complete because I need an MCContext
to generate new MCExprs.

llvm-svn: 96036
```
Feb 12, 2010
- enhance the immediate field encoding to know whether the immediate · 12455ca0
  Chris Lattner authored Feb 12, 2010
```
is pc relative or not, mark call and branches as pcrel.

llvm-svn: 96026
```
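
For reference, a pc-relative immediate on x86 is measured from the end of the instruction, which is why the encoder has to know which immediates are pcrel. A minimal sketch with a hypothetical helper name; it does not check that the result fits the 32-bit field.

```cpp
#include <cassert>
#include <cstdint>

// Displacement to store in a pc-relative imm32 field.
// instAddr is the address of the first byte of the call or branch.
int32_t pcRelImm32(uint64_t target, uint64_t instAddr, unsigned instSize) {
  assert(instSize >= 1 && instSize <= 15 && "x86 instructions are 1-15 bytes");
  // The CPU adds the immediate to the address of the *next* instruction.
  return static_cast<int32_t>(target - (instAddr + instSize));
}
```

For a 5-byte `call` at 0x1000 targeting 0x1010, this gives 0x1010 - 0x1005.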
- add a bunch of mod/rm encoding types for fixed mod/rm bytes. · f7477e59
  Chris Lattner authored Feb 12, 2010
```
This will work better for the disassembler for modeling things
like lfence/monitor/vmcall etc.

llvm-svn: 95960
```