Commits · 4d0916788d95e848c0fa2743d09644c78285a905 · Roger Ferrer / llvm-epi-0.8

Jul 12, 2012
- Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and... · 4d091678
  Benjamin Kramer authored Jul 12, 2012
```
Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it.

I already had the necessary things in place for IR-level passes but missed the machine passes.

llvm-svn: 160137
```
  4d091678
- Add intrinsics for Ivy Bridge's rdrand instruction. · 0ab2794e
  Benjamin Kramer authored Jul 12, 2012
```
The rdrand/cmov sequence is the same one that both
GCC and ICC emit.

Fixes PR13284.

llvm-svn: 160117
```
  0ab2794e
- Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. · f7755df7
  Craig Topper authored Jul 12, 2012
```
llvm-svn: 160110
```
  f7755df7
Jul 11, 2012
- [x86 fast-isel] Per discussion with Eric, add all cases to switch with verbose · 8446ede0
  Chad Rosier authored Jul 11, 2012
```
comments.

llvm-svn: 160069
```
  8446ede0
- X86: Update to peephole optimization to move Movr0 before (Sub, Cmp) pair. · 1553ce0e
  Manman Ren authored Jul 11, 2012
```
When Movr0 is between sub and cmp, we move Movr0 before sub if it enables
removal of Cmp.

llvm-svn: 160066
```
  1553ce0e
- [x86 fast-isel] Rather than call llvm_unreachable() have fast-isel fall back · 43218c59
  Chad Rosier authored Jul 11, 2012
```
to Selection DAG isel.  Patch by Andrew Kaylor <andrew.kaylor@intel.com>.

llvm-svn: 160055
```
  43218c59
- · d2bdcebb
  Nadav Rotem authored Jul 11, 2012
```
When ext-loading and trunc-storing vectors to memory, on x86 32bit systems, allow loads/stores of 64bit values from xmm registers.

llvm-svn: 160044
```
  d2bdcebb
Jul 10, 2012

Move [get|set]BasePtrStackAdjustment() from MachineFrameInfo to · 97c22142

Chad Rosier authored Jul 10, 2012

X86MachineFunctionInfo as this is currently only used by X86. If this ever
becomes an issue on another arch (e.g., ARM) then we can hoist it back out.

llvm-svn: 160009

97c22142

Add support for dynamic stack realignment in the presence of dynamic allocas on · bdb08ac5

Chad Rosier authored Jul 10, 2012

X86.  Basically, this is a reapplication of r158087 with a few fixes.

Specifically, (1) the stack pointer is restored from the base pointer before
popping callee-saved registers and (2) in obscure cases (see comments in patch)
we must cache the value of the original stack adjustment in the prologue and
apply it in the epilogue.

rdar://11496434

llvm-svn: 160002

bdb08ac5

· d908ddc1

Nadav Rotem authored Jul 10, 2012

Improve the loading of load-anyext vectors by allowing the codegen to load
multiple scalars and insert them into a vector. Next, we shuffle the elements
into the correct places, as before.
Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the
migration of bitcasts happened too late in the SelectionDAG process.

llvm-svn: 159991

d908ddc1

Reverse assembler/disassembler operand order for gather instructions. · be41e2da
Craig Topper authored Jul 10, 2012
```
llvm-svn: 159983
```
be41e2da

Jul 09, 2012

X86: implement functions to analyze & synthesize CMOV|SET|Jcc · 5f6fa428

Manman Ren authored Jul 09, 2012

getCondFromSETOpc, getCondFromCMovOpc, getSETFromCond, getCMovFromCond

No functional change intended.
If we want to update the condition code of CMOV|SET|Jcc, we first analyze the
opcode to get the condition code, then update the condition code, finally
synthesize the new opcode from the new condition code.

llvm-svn: 159955

5f6fa428
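
The analyze/update/synthesize flow described in the commit above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the `Cond` and `Opcode` enums and every helper name here are hypothetical, heavily reduced stand-ins for the real X86 tables.

```cpp
#include <cassert>

// Hypothetical, heavily reduced stand-ins for the X86 condition codes and
// opcodes; the real tables are much larger.
enum class Cond { E, NE, L, GE, Invalid };
enum class Opcode { SETE, SETNE, SETL, SETGE,
                    CMOVE, CMOVNE, CMOVL, CMOVGE, ADD };

// Analyze: recover the condition code encoded in a SETcc opcode.
Cond condFromSetOpc(Opcode Opc) {
  switch (Opc) {
  case Opcode::SETE:  return Cond::E;
  case Opcode::SETNE: return Cond::NE;
  case Opcode::SETL:  return Cond::L;
  case Opcode::SETGE: return Cond::GE;
  default:            return Cond::Invalid;
  }
}

// Analyze: recover the condition code encoded in a CMOVcc opcode.
Cond condFromCMovOpc(Opcode Opc) {
  switch (Opc) {
  case Opcode::CMOVE:  return Cond::E;
  case Opcode::CMOVNE: return Cond::NE;
  case Opcode::CMOVL:  return Cond::L;
  case Opcode::CMOVGE: return Cond::GE;
  default:             return Cond::Invalid;
  }
}

// Synthesize: build the SETcc opcode for a condition code.
Opcode setFromCond(Cond CC) {
  switch (CC) {
  case Cond::E:       return Opcode::SETE;
  case Cond::NE:      return Opcode::SETNE;
  case Cond::L:       return Opcode::SETL;
  case Cond::GE:      return Opcode::SETGE;
  case Cond::Invalid: break;
  }
  assert(false && "no SETcc for an invalid condition");
  return Opcode::ADD;
}

// Synthesize: build the CMOVcc opcode for a condition code.
Opcode cmovFromCond(Cond CC) {
  switch (CC) {
  case Cond::E:       return Opcode::CMOVE;
  case Cond::NE:      return Opcode::CMOVNE;
  case Cond::L:       return Opcode::CMOVL;
  case Cond::GE:      return Opcode::CMOVGE;
  case Cond::Invalid: break;
  }
  assert(false && "no CMOVcc for an invalid condition");
  return Opcode::ADD;
}

// Update: one example of a condition-code rewrite (logical inversion).
Cond invert(Cond CC) {
  switch (CC) {
  case Cond::E:       return Cond::NE;
  case Cond::NE:      return Cond::E;
  case Cond::L:       return Cond::GE;
  case Cond::GE:      return Cond::L;
  case Cond::Invalid: break;
  }
  return Cond::Invalid;
}

// Analyze the opcode, update the condition, synthesize the new opcode.
Opcode invertCMov(Opcode Opc) {
  Cond CC = condFromCMovOpc(Opc);
  assert(CC != Cond::Invalid && "expected a CMOVcc opcode");
  return cmovFromCond(invert(CC));
}
```

Keeping analysis and synthesis as separate table lookups means a new condition-code transformation only needs one function over `Cond`, instead of one opcode-to-opcode table per instruction family.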

Jul 07, 2012

I'm introducing a new machine model to simultaneously allow simple · 87255e34

Andrew Trick authored Jul 07, 2012

subtarget CPU descriptions and support new features of
MachineScheduler.

MachineModel has three categories of data:
1) Basic properties for coarse-grained instruction cost model.
2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD).
3) Instruction itineraries for detailed per-cycle reservation tables.

These will all live side-by-side. Any subtarget can use any
combination of them. Instruction itineraries will not change in the
near term. In the long run, I expect them to only be relevant for
in-order VLIW machines that have complex constraints and require a
precise scheduling/bundling model. Once itineraries are only actively
used by VLIW-ish targets, they could be replaced by something more
appropriate for those targets.

This tablegen backend rewrite sets things up for introducing
MachineModel type #2: per opcode/operand cost model.

llvm-svn: 159891

87255e34

X86: Fix optimizeCompare to correctly check safe condition. · bb360740

Manman Ren authored Jul 07, 2012

It is safe if EFLAGS is killed or re-defined.
When we are done with the basic block, check whether EFLAGS is live-out.
Do not optimize away cmp if EFLAGS is live-out.

llvm-svn: 159888

bb360740
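
The safety rule in the commit above can be sketched as a forward scan over the instructions that follow the candidate cmp. This is only an illustration under simplifying assumptions: `FlagInfo` and `isSafeToRemoveCmp` are hypothetical names, and the checks on the individual EFLAGS readers are left out.

```cpp
#include <cassert>
#include <vector>

// Hypothetical per-instruction summary of how EFLAGS is touched.
struct FlagInfo {
  bool Kills;     // last use of the current EFLAGS value
  bool Redefines; // writes a new EFLAGS value
};

// Rest holds the instructions that follow the candidate cmp in its block.
// FlagsLiveOut says whether EFLAGS is live-out of the block.
bool isSafeToRemoveCmp(const std::vector<FlagInfo> &Rest, bool FlagsLiveOut) {
  for (const FlagInfo &MI : Rest)
    if (MI.Kills || MI.Redefines)
      return true; // nothing later can observe the cmp's flags
  // Reached the end of the block: safe only if no successor reads EFLAGS.
  return !FlagsLiveOut;
}

// Small usage example.
void demoFlagSafety() {
  std::vector<FlagInfo> Rest = {{false, false}, {false, true}};
  assert(isSafeToRemoveCmp(Rest, /*FlagsLiveOut=*/true));
  assert(!isSafeToRemoveCmp({}, /*FlagsLiveOut=*/true));
}
```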

Jul 06, 2012

X86: peephole optimization to remove cmp instruction · c9656737

Manman Ren authored Jul 06, 2012

For each Cmp, we check whether there is an earlier Sub which makes Cmp
redundant. We handle the case where SUB operates on the same source operands as
Cmp, including the case where the two source operands are swapped.

llvm-svn: 159838

c9656737
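
A toy model of the search described in the commit above might look like the following. It is a sketch, not the actual peephole: `Inst`, `Match`, and `findRedundantCmp` are hypothetical, and the model assumes virtual registers, so a source register is never overwritten between the Sub and the Cmp.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

enum class Kind { Sub, Cmp, Other };

// Hypothetical toy instruction over virtual registers.
struct Inst {
  Kind K;
  int SrcA;         // first source register
  int SrcB;         // second source register
  bool WritesFlags; // true if the instruction writes EFLAGS
};

struct Match {
  bool Found;   // an earlier Sub makes the Cmp redundant
  bool Swapped; // the Sub has the Cmp's operands in reverse order
};

// Walk backwards from the Cmp looking for a Sub on the same two sources.
// Any other EFLAGS writer in between ends the search.
Match findRedundantCmp(const std::vector<Inst> &Block, std::size_t CmpIdx) {
  assert(CmpIdx < Block.size() && Block[CmpIdx].K == Kind::Cmp);
  const Inst &Cmp = Block[CmpIdx];
  for (std::size_t I = CmpIdx; I-- > 0;) {
    const Inst &MI = Block[I];
    if (MI.K == Kind::Sub) {
      if (MI.SrcA == Cmp.SrcA && MI.SrcB == Cmp.SrcB)
        return {true, false};
      if (MI.SrcA == Cmp.SrcB && MI.SrcB == Cmp.SrcA)
        return {true, true};
    }
    if (MI.WritesFlags)
      return {false, false}; // flags were overwritten before the Cmp
  }
  return {false, false};
}

// Small usage example: Sub %1, %2 ... Cmp %2, %1 matches with swapped operands.
void demoRedundantCmp() {
  std::vector<Inst> Block = {{Kind::Sub, 1, 2, true},
                             {Kind::Other, 3, 4, false},
                             {Kind::Cmp, 2, 1, true}};
  Match M = findRedundantCmp(Block, 2);
  assert(M.Found && M.Swapped);
  (void)M;
}
```

In the swapped case the flags produced by the Sub correspond to the reversed comparison, so a real implementation would presumably also have to rewrite the condition codes of the instructions that read EFLAGS.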

Jul 05, 2012

Make X86 call and return instructions non-variadic. · d14101e0

Jakob Stoklund Olesen authored Jul 04, 2012

Function argument and return value registers aren't part of the
encoding, so they should be implicit operands.

llvm-svn: 159728

d14101e0

Jul 04, 2012

Ensure CopyToReg nodes are always glued to the call instruction. · 2dee8124

Jakob Stoklund Olesen authored Jul 04, 2012

The CopyToReg nodes that set up the argument registers before a call
must be glued to the call instruction. Otherwise, the scheduler may emit
the physreg copies long before the call, causing long live ranges for
the fixed registers.

Besides disabling good register allocation, that can also expose
problems when EmitInstrWithCustomInserter() splits a basic block during
the live range of a physreg.

llvm-svn: 159721

2dee8124

Add early if-conversion support to X86. · 49e4d4b3

Jakob Stoklund Olesen authored Jul 04, 2012

Implement the TII hooks needed by EarlyIfConversion to create cmov
instructions and estimate their latency.

Early if-conversion is still not enabled by default.

llvm-svn: 159695

49e4d4b3

Jul 03, 2012
- Remove extra space. · 85c938f4
  Craig Topper authored Jul 03, 2012
```
llvm-svn: 159647
```
  85c938f4
- Change i128mem/i256mem to f128mem/f256mem on some floating point vector instructions. · f067f9aa
  Craig Topper authored Jul 03, 2012
```
llvm-svn: 159646
```
  f067f9aa
- Add aliases for pblendvb, blendvpd, and blendvps instructions with the... · 676dcd8c
  Craig Topper authored Jul 03, 2012
```
Add aliases for pblendvb, blendvpd, and blendvps instructions with the implicit xmm0 operand specified. Fixes PR13252.

llvm-svn: 159644
```
  676dcd8c
Jul 02, 2012

Add all codegen passes to the PassManager via TargetPassConfig. · bbd38dd9

Bob Wilson authored Jul 02, 2012

This is a preliminary step toward having TargetPassConfig be able to
start and stop the compilation at specified passes for unit testing
and debugging.  No functionality change.

llvm-svn: 159567

bbd38dd9

Jul 01, 2012
- Optimization of shuffle node that can fit to the register form of VBROADCAST instruction on AVX2. · 9af899fa
  Elena Demikhovsky authored Jul 01, 2012
```
llvm-svn: 159504
```
  9af899fa
- Reduce code size by using a second switch statement to avoid extra calls to... · 3af251db
  Craig Topper authored Jul 01, 2012
```
Reduce code size by using a second switch statement to avoid extra calls to SelectAtomic64. Also catch cases where SelectAtomic64 fails.

llvm-svn: 159503
```
  3af251db
- Add a break to the end of case statement missed in r159501. · e15e5f7c
  Craig Topper authored Jul 01, 2012
```
llvm-svn: 159502
```
  e15e5f7c
- Fix a crash on release builds if gather intrinsics are passed a non-constant... · fbb954f7
  Craig Topper authored Jul 01, 2012
```
Fix a crash on release builds if gather intrinsics are passed a non-constant value for the last argument.

llvm-svn: 159501
```
  fbb954f7
- Use a second switch statement to reduce number of calls to SelectGather in... · def044b9
  Craig Topper authored Jul 01, 2012
```
Use a second switch statement to reduce number of calls to SelectGather in code. Reduces code size a bit.

llvm-svn: 159500
```
  def044b9
Jun 29, 2012

In the initial exec mode we always do a load to find the address of a variable. · efdfb1e6

Rafael Espindola authored Jun 29, 2012

Before this patch, in 32-bit PIC code we would add the global base register
but not load from that address. This is a really old bug, but before the
introduction of the TLS attributes we would never select initial exec for
PIC code.

llvm-svn: 159409

efdfb1e6

X86: add more GATHER intrinsics in LLVM · 98a5bf24

Manman Ren authored Jun 29, 2012

Corrected type for index of llvm.x86.avx2.gather.d.pd.256
  from 256-bit to 128-bit.
Corrected types for src|dst|mask of llvm.x86.avx2.gather.q.ps.256
  from 256-bit to 128-bit.

Support the following intrinsics:
  llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q
  llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256
  llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d
  llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256

llvm-svn: 159402

98a5bf24

Jun 28, 2012

Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and · e38859dc

Bill Wendling authored Jun 28, 2012

include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h.

The reasoning is that the DebugInfo module is simply an interface to the
debug info MDNodes and has nothing to do with analysis.

llvm-svn: 159312

e38859dc

Whitespace. · 51afe639
Chad Rosier authored Jun 27, 2012
```
llvm-svn: 159300
```
51afe639

Jun 26, 2012

X86: add GATHER intrinsics (AVX2) in LLVM · a0982041

Manman Ren authored Jun 26, 2012

Support the following intrinsics:
llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd
llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256
llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps
llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256

Modified Disassembler to handle VSIB addressing mode.

llvm-svn: 159221

a0982041

There are a number of generic inline asm operand modifiers that · 5e69cffe

Jack Carter authored Jun 26, 2012

up to r158925 were handled as processor specific. Making them 
generic and putting tests for these modifiers in the CodeGen/Generic
directory caused a number of targets to fail. 

This commit addresses that problem by having the targets call 
the generic routine for generic modifiers that they don't currently
have explicit code for.

For now only generic print operands 'c' and 'n' are supported.


Affected files:

    test/CodeGen/Generic/asm-large-immediate.ll
    lib/Target/PowerPC/PPCAsmPrinter.cpp
    lib/Target/NVPTX/NVPTXAsmPrinter.cpp
    lib/Target/ARM/ARMAsmPrinter.cpp
    lib/Target/XCore/XCoreAsmPrinter.cpp
    lib/Target/X86/X86AsmPrinter.cpp
    lib/Target/Hexagon/HexagonAsmPrinter.cpp
    lib/Target/CellSPU/SPUAsmPrinter.cpp
    lib/Target/Sparc/SparcAsmPrinter.cpp
    lib/Target/MBlaze/MBlazeAsmPrinter.cpp
    lib/Target/Mips/MipsAsmPrinter.cpp
    
MSP430 isn't represented because it did not even run with
the long existing 'c' modifier and it was not apparent what
needs to be done to get it inline asm ready.

Contributor: Jack Carter
llvm-svn: 159203

5e69cffe
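
The fallback arrangement described in the commit above can be sketched like this. It is a standalone model, not the AsmPrinter code: both function names and the `'x'` target modifier are made up for illustration, and an empty string stands for "modifier not supported". The behaviour of `'c'` (print a constant with no punctuation) and `'n'` (print it negated) follows the usual GCC-style meaning of those modifiers.

```cpp
#include <cassert>
#include <string>

// Generic modifiers shared by all targets (hypothetical helper).
std::string printGenericModifier(char Mod, long Imm) {
  switch (Mod) {
  case 'c': return std::to_string(Imm);  // constant, no punctuation
  case 'n': return std::to_string(-Imm); // negated constant
  default:  return "";                   // not a generic modifier
  }
}

// A target handles the modifiers it has explicit code for and defers
// everything else to the generic routine.
std::string printTargetModifier(char Mod, long Imm) {
  if (Mod == 'x') // made-up target-specific modifier
    return "$" + std::to_string(Imm);
  return printGenericModifier(Mod, Imm);
}

// Small usage example.
void demoModifiers() {
  assert(printTargetModifier('n', 42) == "-42");
  assert(printTargetModifier('q', 42).empty());
}
```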

Removed unused variable · 863d2d32
Elena Demikhovsky authored Jun 26, 2012
```
llvm-svn: 159197
```
863d2d32
Rename to match other X86_64* names. · 8ed44466
Bill Wendling authored Jun 26, 2012
```
llvm-svn: 159196
```
8ed44466

Shuffle optimization for AVX/AVX2. · 26088d2e

Elena Demikhovsky authored Jun 26, 2012

This patch optimizes frequently used shuffle patterns, shortening the generated instruction sequence.
Before:
      vshufps         $-35, %xmm1, %xmm0, %xmm2 ## xmm2 = xmm0[1,3],xmm1[1,3]
      vpermilps       $-40, %xmm2, %xmm2 ## xmm2 = xmm2[0,2,1,3]
      vextractf128    $1, %ymm1, %xmm1
      vextractf128    $1, %ymm0, %xmm0
      vshufps         $-35, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm0[1,3],xmm1[1,3]
      vpermilps       $-40, %xmm0, %xmm0 ## xmm0 = xmm0[0,2,1,3]
      vinsertf128     $1, %xmm0, %ymm2, %ymm0
After:
      vshufps         $13, %ymm0, %ymm1, %ymm1 ## ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4]
      vshufps         $13, %ymm0, %ymm0, %ymm0 ## ymm0 = ymm0[1,3,0,0,5,7,4,4]
      vunpcklps       %ymm1, %ymm0, %ymm0 ## ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5]

llvm-svn: 159188

26088d2e
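
To see that the two sequences in the commit above compute the same result, the element indices from the `##` comments can be replayed on plain arrays. This is a sketch that models only the lane selection stated in those comments (it does not decode the immediates); `V8`, `shuffleBefore`, and `shuffleAfter` are hypothetical names, with `A` standing for the initial `ymm0` and `B` for the initial `ymm1`.

```cpp
#include <array>
#include <cassert>

using V8 = std::array<int, 8>;

// "Before": per 128-bit half, vshufps picks [1,3] of each input, then
// vpermilps reorders the result as [0,2,1,3]; vinsertf128 joins the halves.
V8 shuffleBefore(const V8 &A, const V8 &B) {
  V8 R = {};
  for (int Half = 0; Half < 2; ++Half) {
    int O = Half * 4;
    int T[4] = {A[O + 1], A[O + 3], B[O + 1], B[O + 3]}; // vshufps
    R[O + 0] = T[0];                                     // vpermilps
    R[O + 1] = T[2];
    R[O + 2] = T[1];
    R[O + 3] = T[3];
  }
  return R;
}

// "After": two 256-bit vshufps followed by one vunpcklps.
V8 shuffleAfter(const V8 &A, const V8 &B) {
  // ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4]
  V8 Y1 = {B[1], B[3], A[0], A[0], B[5], B[7], A[4], A[4]};
  // ymm0 = ymm0[1,3,0,0,5,7,4,4]
  V8 Y0 = {A[1], A[3], A[0], A[0], A[5], A[7], A[4], A[4]};
  // ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5]
  V8 R = {Y0[0], Y1[0], Y0[1], Y1[1], Y0[4], Y1[4], Y0[5], Y1[5]};
  return R;
}

// Small usage example: both sequences interleave the odd elements of A and B.
void demoShuffle() {
  V8 A = {0, 1, 2, 3, 4, 5, 6, 7};
  V8 B = {10, 11, 12, 13, 14, 15, 16, 17};
  assert(shuffleBefore(A, B) == shuffleAfter(A, B));
}
```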

Remove some duplicate instructions that exist only to give different... · 94bf0f38

Craig Topper authored Jun 26, 2012

Remove some duplicate instructions that exist only to give different mnemonics for the assembler. Use InstAlias instead.

llvm-svn: 159184

94bf0f38

Make some ugly hacks for inline asm operands which name a specific register a... · bbcd09cc
Eli Friedman authored Jun 25, 2012
```
Make some ugly hacks for inline asm operands which name a specific register a bit more thorough.  PR13196.

llvm-svn: 159176
```
bbcd09cc

Jun 25, 2012

Add SSE2 predicate to CVTPS2PD instructions. Doesn't matter much because there... · 357de815

Craig Topper authored Jun 25, 2012

Add SSE2 predicate to CVTPS2PD instructions. Doesn't matter much because there are no patterns in the instruction.

llvm-svn: 159127

357de815

Remove codegen only instruction in favor of one that has the same definition.... · b6eb513c

Craig Topper authored Jun 25, 2012

Remove codegen only instruction in favor of one that has the same definition. Make some pattern operands more explicit about types.

llvm-svn: 159126

b6eb513c