Commits · 168ced94d84929f7cdb5d61c67ed79e7275f0ec6 · Roger Ferrer / llvm-epi-0.8

May 22, 2010
- Implement @llvm.returnaddress. rdar://8015977. · 168ced94
  Evan Cheng authored May 22, 2010
```
llvm-svn: 104421
```
  168ced94
May 06, 2010
- Add a DebugLoc argument to TargetInstrInfo::copyRegToReg, so that it · 779c69bb
  Dan Gohman authored May 06, 2010
```
doesn't have to guess.

llvm-svn: 103194
```
  779c69bb
- Add argument TargetRegisterInfo to loadRegFromStackSlot and storeRegToStackSlot. · efb126a6
  Evan Cheng authored May 06, 2010
```
llvm-svn: 103193
```
  efb126a6
Apr 29, 2010
- Frame index can be negative. · 250e917e
  Evan Cheng authored Apr 29, 2010
```
llvm-svn: 102577
```
  250e917e
Apr 27, 2010

on darwin empty functions need to codegen into something of non-zero length, · 6a5e706e

Chris Lattner authored Apr 26, 2010

otherwise labels get incorrectly merged.  We handled this by emitting a 
".byte 0", but this isn't correct on thumb/arm targets where the text segment
needs to be a multiple of 2/4 bytes.  Handle this by emitting a noop.  This
is more gross than it should be because arm/ppc are not fully mc'ized yet.

This fixes rdar://7908505

llvm-svn: 102400

6a5e706e

Apr 26, 2010

- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and... · ed69b382

Evan Cheng authored Apr 26, 2010

- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue.
- Teach spiller to modify DBG_VALUE instructions to reference spill slots.

llvm-svn: 102323

ed69b382

Mar 31, 2010

Renumber SSE execution domains for better code size. · dbff4e81

Jakob Stoklund Olesen authored Mar 30, 2010

SSEDomainFix will collapse to the domain with the lower number when it has a
choice. The SSEPackedSingle domain often has smaller instructions, so prefer
that.

llvm-svn: 99952

dbff4e81

Mar 30, 2010
- Basic implementation of SSEDomainFix pass. · b551aa4d
  Jakob Stoklund Olesen authored Mar 29, 2010
```
Cross-block inference is primitive and wrong, but the pass is working otherwise.

llvm-svn: 99848
```
  b551aa4d
Mar 25, 2010

Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 49e121d5

Jakob Stoklund Olesen authored Mar 25, 2010

On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register
in a different domain than where it was defined. Some instructions have
equvivalents for different domains, like por/orps/orpd.

The SSEDomainFix pass tries to minimize the number of domain crossings by
changing between equvivalent opcodes where possible.

This is a work in progress, in particular the pass doesn't do anything yet. SSE
instructions are tagged with their execution domain in TableGen using the last
two bits of TSFlags. Note that not all instructions are tagged correctly. Life
just isn't that simple.

The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline
issue handled by NEONMoveFixPass. This pass may become target independent to
handle both.

llvm-svn: 99524

49e121d5

Mar 24, 2010
- Revert "Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings." · a86ccbfe
  Jakob Stoklund Olesen authored Mar 23, 2010
```
This reverts commit 99345. It was breaking buildbots.

llvm-svn: 99352
```
  a86ccbfe
- Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. · 31da45b7
  Jakob Stoklund Olesen authored Mar 23, 2010
```
This is work in progress. So far, SSE execution domain tables are added to
X86InstrInfo, and a skeleton pass is enabled with -sse-domain-fix.

llvm-svn: 99345
```
  31da45b7
Feb 13, 2010
- add encoder support and tests for rdtscp · f83726f6
  Chris Lattner authored Feb 13, 2010
```
llvm-svn: 96076
```
  f83726f6
- remove special cases for vmlaunch, vmresume, vmxoff, and swapgs · 140caa72
  Chris Lattner authored Feb 13, 2010
```
fix swapgs to be spelled right.

llvm-svn: 96058
```
  140caa72
- implement infrastructure to support fixups for rip-rel · 4ad96055
  Chris Lattner authored Feb 12, 2010
```
addressing.  This isn't complete because I need an MCContext
to generate new MCExprs.

llvm-svn: 96036
```
  4ad96055
Feb 12, 2010
- enhance the immediate field encoding to know whether the immediate · 12455ca0
  Chris Lattner authored Feb 12, 2010
```
is pc relative or not, mark call and branches as pcrel.

llvm-svn: 96026
```
  12455ca0
- add a bunch of mod/rm encoding types for fixed mod/rm bytes. · f7477e59
  Chris Lattner authored Feb 12, 2010
```
This will work better for the disassembler for modeling things
like lfence/monitor/vmcall etc.

llvm-svn: 95960
```
  f7477e59
- revert r95949, it turns out that adding new prefixes is not a · 44ac89f5
  Chris Lattner authored Feb 12, 2010
```
great solution for the disassembler, we'll go with "plan b".

llvm-svn: 95957
```
  44ac89f5
- add another bit of space for new kinds of instruction prefixes. · 336f9abb
  Chris Lattner authored Feb 12, 2010
```
llvm-svn: 95949
```
  336f9abb
Feb 05, 2010
- port X86InstrInfo::determineREX over to the new encoder. · 58827ff9
  Chris Lattner authored Feb 05, 2010
```
llvm-svn: 95440
```
  58827ff9
- move functions for decoding X86II values into the X86II namespace. · 50324355
  Chris Lattner authored Feb 05, 2010
```
llvm-svn: 95410
```
  50324355
- constant propagate a method away. · 342762fd
  Chris Lattner authored Feb 05, 2010
```
llvm-svn: 95408
```
  342762fd
- change getSizeOfImm and getBaseOpcodeFor to just take · b8d375fd
  Chris Lattner authored Feb 05, 2010
```
TSFlags directly instead of a TargetInstrDesc.

llvm-svn: 95405
```
  b8d375fd
Feb 03, 2010

enhance new encoder to support prefixes + RawFrm · 223084d3

Chris Lattner authored Feb 03, 2010

instructions with no operands.  It can now handle

define void @test2() nounwind { ret void }

llvm-svn: 95261

223084d3

Jan 22, 2010
- Add two target hooks to determine whether two loads are near and should be scheduled together. · 4f026f37
  Evan Cheng authored Jan 22, 2010
```
llvm-svn: 94147
```
  4f026f37
Jan 13, 2010

Add a quick pass to optimize sign / zero extension instructions. For targets... · 30bebff4

Evan Cheng authored Jan 13, 2010

Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg.

For now, this pass is fairly conservative. It only perform the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used.

llvm-svn: 93278

30bebff4

Jan 12, 2010

Add TargetInstrInfo::isCoalescableInstr. It returns true if the specified · 4216615f

Evan Cheng authored Jan 12, 2010

instruction is copy like where the source and destination registers can
overlap. This is to be used by the coalescable to coalesce the source and
destination registers of instructions like X86::MOVSX64rr32. Apparently
some crazy people believe the coalescer is too simple.

llvm-svn: 93210

4216615f

Dec 11, 2009
- Add support to 3-addressify 16-bit instructions. · 766a73fb
  Evan Cheng authored Dec 11, 2009
```
llvm-svn: 91104
```
  766a73fb
Dec 05, 2009
- Remove the target hook TargetInstrInfo::BlockHasNoFallThrough in favor of · 047a767d
  Dan Gohman authored Dec 05, 2009
```
MachineBasicBlock::canFallThrough(), which is target-independent and more
thorough.

llvm-svn: 90634
```
  047a767d
Dec 04, 2009

· 0508e435

David Greene authored Dec 04, 2009

Have hasLoad/StoreFrom/ToStackSlot return the relevant MachineMemOperand.

llvm-svn: 90608

0508e435

Nov 30, 2009

Remove isProfitableToDuplicateIndirectBranch target hook. It is profitable · 505ddaa4

Bob Wilson authored Nov 30, 2009

for all the processors where I have tried it, and even when it might not help
performance, the cost is quite low.  The opportunities for duplicating
indirect branches are limited by other factors so code size does not change
much due to tail duplicating indirect branches aggressively.

llvm-svn: 90144

505ddaa4

Nov 25, 2009

Based on the testcase for pr3120, running on my MacPro with Xeon processors, · 120f729e

Bob Wilson authored Nov 25, 2009

it is definitely profitable to tail duplicate indirect branches for x86.
This is likely to be true to various degrees for all modern x86 processors.

llvm-svn: 89865

120f729e

Nov 14, 2009

- Change TargetInstrInfo::reMaterialize to pass in TargetRegisterInfo. · 6ad7da96

Evan Cheng authored Nov 14, 2009

- If destination is a physical register and it has a subreg index, use the
  sub-register instead.
This fixes PR5423.

llvm-svn: 88745

6ad7da96

Nov 13, 2009

· 2f4c3742

David Greene authored Nov 13, 2009

Fix a bootstrap failure.

Provide special isLoadFromStackSlotPostFE and isStoreToStackSlotPostFE
interfaces to explicitly request checking for post-frame ptr elimination
operands.  This uses a heuristic so it isn't reliable for correctness.

llvm-svn: 87047

2f4c3742

Nov 12, 2009

· 70fdd57d

David Greene authored Nov 12, 2009

Add hasLoadFromStackSlot and hasStoreToStackSlot to return whether a
machine instruction loads or stores from/to a stack slot.  Unlike
isLoadFromStackSlot and isStoreFromStackSlot, the instruction may be
something other than a pure load/store (e.g. it may be an arithmetic
operation with a memory operand).  This helps AsmPrinter determine when
to print a spill/reload comment.

This is only a hint since we may not be able to figure this out in all
cases.  As such, it should not be relied upon for correctness.

Implement for X86.  Return false by default for other architectures.

llvm-svn: 87026

70fdd57d

Oct 30, 2009

Fix MachineLICM to use the correct virtual register class when · 49fa51d9

Dan Gohman authored Oct 30, 2009

unfolding loads for hoisting.  getOpcodeAfterMemoryUnfold returns the
opcode of the original operation without the load, not the load
itself, MachineLICM needs to know the operand index in order to get
the correct register class. Extend getOpcodeAfterMemoryUnfold to
return this information.

llvm-svn: 85622

49fa51d9

Oct 10, 2009
- Replace X86's CanRematLoadWithDispOperand by calling the target-independent · e919de5a
  Dan Gohman authored Oct 10, 2009
```
MachineInstr::isInvariantLoad instead, which has the benefit of being
more complete.

llvm-svn: 83696
```
  e919de5a
Oct 09, 2009
- Add basic infrastructure and x86 support for preserving MachineMemOperand · dd76bb23
  Dan Gohman authored Oct 09, 2009
```
information when unfolding memory references.

llvm-svn: 83656
```
  dd76bb23
Oct 07, 2009

Replace TargetInstrInfo::isInvariantLoad and its target-specific · be8137b0

Dan Gohman authored Oct 07, 2009

implementations with a new MachineInstr::isInvariantLoad, which uses
MachineMemOperands and is target-independent. This brings MachineLICM
and other functionality to targets which previously lacked an
isInvariantLoad implementation.

llvm-svn: 83475

be8137b0

Oct 05, 2009
- Remove explicit enum integer values. They don't appear to be needed, and · 2728569a
  Dan Gohman authored Oct 05, 2009
```
they make it less convenient to add new entries.

llvm-svn: 83308
```
  2728569a
Sep 11, 2009

It's not legal to fold a load from a narrower stack slot into a wider... · 3cad6283

Evan Cheng authored Sep 11, 2009

It's not legal to fold a load from a narrower stack slot into a wider instruction. If done, the instruction does a 64-bit load and that's not
safe. This can happen we a subreg_to_reg 0 has been coalesced. One
exception is when the instruction that folds the load is a move, then we
can simply turn it into a 32-bit load from the stack slot.

rdar://7170444

llvm-svn: 81494

3cad6283