Commits · bd847cc5622cabc9abb1a59e950981cdf9418f96 · Roger Ferrer / llvm-epi-0.8

Oct 09, 2012

Separate AVXCC and SSECC printing for cmpps/pd/ss/sd and add masking before... · f1c20160

Craig Topper authored Oct 09, 2012

Separate AVXCC and SSECC printing for cmpps/pd/ss/sd and add masking before the switch statement. This keeps the unreachable default case from being hit if the instruction was created with an intrinsic with too large of an immediate.

llvm-svn: 165483

f1c20160

Sep 26, 2012
- Remove hasNoAVX method. Can just invert hasAVX instead. · 0a928fa3
  Craig Topper authored Sep 26, 2012
```
llvm-svn: 164664
```
  0a928fa3
Sep 21, 2012

Revise td of X86 atomic instructions · c33bebff

Michael Liao authored Sep 21, 2012

- Rewirte most atomic instructions in templates for both better
  maintenance and future extensions, such as HLE in TSX.

llvm-svn: 164357

c33bebff

Sep 13, 2012

Revert r163761 "Don't fold indexed loads into TCRETURNmi64." · 78b9f8fc
Jakob Stoklund Olesen authored Sep 13, 2012
```
The patch caused "Wrong topological sorting" assertions.

llvm-svn: 163810
```
78b9f8fc

Don't fold indexed loads into TCRETURNmi64. · bfacef45

Jakob Stoklund Olesen authored Sep 13, 2012

We don't have enough GR64_TC registers when calling a varargs function
with 6 arguments. Since %al holds the number of vector registers used,
only %r11 is available as a scratch register.

This means that addressing modes using both base and index registers
can't be folded into TCRETURNmi64.

<rdar://problem/12282281>

llvm-svn: 163761

bfacef45

Sep 11, 2012
- Update function names to conform to guidelines. No functional change intended. · 38e05a9e
  Chad Rosier authored Sep 10, 2012
```
llvm-svn: 163561
```
  38e05a9e
Aug 30, 2012

Introduce 'UseSSEx' to force SSE legacy encoding · bbd10792

Michael Liao authored Aug 30, 2012

- Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is
  enabled.

  As the penalty of inter-mixing SSE and AVX instructions, we need
  prevent SSE legacy insn from being generated except explicitly
  specified through some intrinsics. For patterns supported by both
  SSE and AVX, so far, we force AVX insn will be tried first relying on
  AddedComplexity or position in td file. It's error-prone and
  introduces bugs accidentally.

  'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited
  by AVX, we need this predicate to force VEX encoding or SSE legacy
  encoding only.

  For insns not inherited by AVX, we still use the previous predicates,
  i.e. 'HasSSEx'. So far, these insns fall into the following
  categories:
  * SSE insns with MMX operands
  * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH,
    CRC, and etc.)
  * SSE4A insns.
  * MMX insns.
  * x87 insns added by SSE.

2 test cases are modified:

 - test/CodeGen/X86/fast-isel-x86-64.ll
   AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be
   selected by fast-isel due to complicated pattern and fast-isel
   fallback to materialize it from constant pool.

 - test/CodeGen/X86/widen_load-1.ll
   AVX code generation is different from SSE one after fixing SSE/AVX
   inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of
   'vmovaps'.

llvm-svn: 162919

bbd10792

Aug 27, 2012

Add HasAVX1Only predicate and use it for patterns that have an AVX1... · f7828f91

Craig Topper authored Aug 27, 2012

Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity.

llvm-svn: 162654

f7828f91

Aug 24, 2012
- X86MemBarrier has unmodeled side effects. · df1faa05
  Jakob Stoklund Olesen authored Aug 24, 2012
```
llvm-svn: 162514
```
  df1faa05
Jul 18, 2012

Make x86 asm parser to check for xmm vs ymm for index register in gather... · 01deb5f2

Craig Topper authored Jul 18, 2012

Make x86 asm parser to check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas.

llvm-svn: 160420

01deb5f2

Jul 12, 2012

Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and... · 4d091678

Benjamin Kramer authored Jul 12, 2012

Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it.

I already had the necessary things in place for IR-level passes but missed the machine passes.

llvm-svn: 160137

4d091678

Add intrinsics for Ivy Bridge's rdrand instruction. · 0ab2794e

Benjamin Kramer authored Jul 12, 2012

The rdrand/cmov sequence is the same that is emitted by both
GCC and ICC.

Fixes PR13284.

llvm-svn: 160117

0ab2794e

Jun 29, 2012

X86: add more GATHER intrinsics in LLVM · 98a5bf24

Manman Ren authored Jun 29, 2012

Corrected type for index of llvm.x86.avx2.gather.d.pd.256
  from 256-bit to 128-bit.
Corrected types for src|dst|mask of llvm.x86.avx2.gather.q.ps.256
  from 256-bit to 128-bit.

Support the following intrinsics:
  llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q
  llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256
  llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d
  llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256

llvm-svn: 159402

98a5bf24

Jun 26, 2012

X86: add GATHER intrinsics (AVX2) in LLVM · a0982041

Manman Ren authored Jun 26, 2012

Support the following intrinsics:
llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd
llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256
llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps
llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256

Modified Disassembler to handle VSIB addressing mode.

llvm-svn: 159221

a0982041

Jun 03, 2012
- Rename FMA3 feature flag to just FMA to match gcc so it can be added to clang. · 79dbb0c6
  Craig Topper authored Jun 03, 2012
```
llvm-svn: 157903
```
  79dbb0c6
Jun 01, 2012

Implement the local-dynamic TLS model for x86 (PR3985) · 789acfb6

Hans Wennborg authored Jun 01, 2012

This implements codegen support for accesses to thread-local variables
using the local-dynamic model, and adds a clean-up pass so that the base
address for the TLS block can be re-used between local-dynamic access on
an execution path.

llvm-svn: 157818

789acfb6

May 31, 2012

X86: Rename the CLMUL target feature to PCLMUL. · a0396e45

Benjamin Kramer authored May 31, 2012

It was renamed in gcc/gas a while ago and causes all kinds of
confusion because it was named differently in llvm and clang.

llvm-svn: 157745

a0396e45

May 10, 2012
- Added X86 Atom latencies for instructions in X86InstrInfo.td. · 4fe10a5d
  Preston Gurd authored May 10, 2012
```
llvm-svn: 156579
```
  4fe10a5d
May 09, 2012

Use ptr_rc_tailcall instead of GR32_TC. · 7e21d617

Jakob Stoklund Olesen authored May 09, 2012

The getPointerRegClass() hook will return GR32_TC, or whatever is
appropriate for the current function.

Patch by Yiannis Tsiouris!

llvm-svn: 156459

7e21d617

Apr 27, 2012

X86: Don't emit conditional floating point moves on when targeting pre-pentiumpro architectures. · 913da4b2

Benjamin Kramer authored Apr 27, 2012

* Model FPSW (the FPU status word) as a register.
* Add ISel patterns for the FUCOM*, FNSTSW and SAHF instructions.
* During Legalize/Lowering, build a node sequence to transfer the comparison
result from FPSW into EFLAGS. If you're wondering about the right-shift: That's
an implicit sub-register extraction (%ax -> %ah) which is handled later on by
the instruction selector.

Fixes PR6679. Patch by Christoph Erhardt!

llvm-svn: 155704

913da4b2

Apr 03, 2012
- Add support for AVX enhanced comparison predicates. Patch from Kay Tiong Khoo. · 7629d63b
  Craig Topper authored Apr 03, 2012
```
llvm-svn: 153935
```
  7629d63b
Mar 06, 2012
- Fix the operand ordering on aliases for shld and shrd. PR12173, part 2. · de850676
  Eli Friedman authored Mar 06, 2012
```
llvm-svn: 152136
```
  de850676
Mar 05, 2012
- Make aliases for shld and shrd match gas. PR12173. · a5a6d6aa
  Eli Friedman authored Mar 05, 2012
```
llvm-svn: 152014
```
  a5a6d6aa
Feb 27, 2012
- Add q suffix aliases for the fistp and fisttp mnemonics. · a72393a3
  Chad Rosier authored Feb 27, 2012
```
rdar://10921670
PR11935

llvm-svn: 151543
```
  a72393a3
Feb 24, 2012
- Add WIN_FTOL_* psudo-instructions to model the unique calling convention · 248d65e7
  Michael J. Spencer authored Feb 24, 2012
```
used by the Win32 _ftol2 runtime function. Patch by Joe Groff!

llvm-svn: 151382
```
  248d65e7
Feb 18, 2012

Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430,... · b22310fd
Jia Liu authored Feb 18, 2012
```
Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore.

llvm-svn: 150878
```
b22310fd

Add X86 assembler and disassembler support for AMD SVM instructions. Original... · ed7aa463

Craig Topper authored Feb 18, 2012

Add X86 assembler and disassembler support for AMD SVM instructions. Original patch by Kay Tiong Khoo. Few tweaks by me for code density and to reduce replication.

llvm-svn: 150873

ed7aa463

Feb 16, 2012

Use the same CALL instructions for Windows as for everything else. · 97e3115d

Jakob Stoklund Olesen authored Feb 16, 2012

The different calling conventions and call-preserved registers are
represented with regmask operands that are added dynamically.

llvm-svn: 150708

97e3115d

Jan 17, 2012
- Intel syntax: Fix parser match class to check memory operand size. · c9ed5187
  Devang Patel authored Jan 17, 2012
```
llvm-svn: 148338
```
  c9ed5187
Jan 16, 2012
- Get rid of unused codegen-only instruction. · 75e3db4c
  Eli Friedman authored Jan 16, 2012
```
llvm-svn: 148239
```
  75e3db4c
Jan 12, 2012

Add predicate method check match memory operand size, if available. · fc6be102

Devang Patel authored Jan 12, 2012

In att style asm syntax memory operand size is derived from suffix attached with mnemonic.  In intel style asm syntax it is part of memory operand hence predicate method check is required to select appropriate instruction.

llvm-svn: 148006

fc6be102

Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX... · eb8f9e9e

Craig Topper authored Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget.

llvm-svn: 147841

eb8f9e9e

Jan 09, 2012

Don't disable MMX support when AVX is enabled. Fix predicates for MMX... · 744f6311

Craig Topper authored Jan 09, 2012

Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level.

llvm-svn: 147762

744f6311

Jan 01, 2012

Allow CRC32 instructions to be selected when AVX is enabled. · b9109844
Craig Topper authored Jan 01, 2012
```
llvm-svn: 147411
```
b9109844

Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is... · 1c064e0a

Craig Topper authored Jan 01, 2012

Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers.

llvm-svn: 147409

1c064e0a

Dec 12, 2011
- XOP instructions and encoding tests. · 7c0face4
  Jan Sjödin authored Dec 12, 2011
```
llvm-svn: 146407
```
  7c0face4
Dec 09, 2011
- Remove hasSSE1orAVX(). It's the same as hasXMM(). · 557cda7f
  Evan Cheng authored Dec 09, 2011
```
llvm-svn: 146246
```
  557cda7f
Dec 08, 2011

Many of the SSE patterns should not be selected when AVX is available. This... · 4d1a2d44

Evan Cheng authored Dec 08, 2011

Many of the SSE patterns should not be selected when AVX is available. This led to the following code in X86Subtarget.cpp

if (HasAVX)
X86SSELevel = NoMMXSSE;

This is so patterns that are predicated on hasSSE3, etc. would not be selected when avx is available. Instead, the AVX variant is selected.
However, this breaks instructions which do not have AVX variants.

The right way to fix this is for the SSE but not-AVX patterns to predicate on something like hasSSE3() && !hasAVX().
Then we can take out the hack in X86Subtarget.cpp. Patterns which do not have AVX variants do not need to change.

However, we need to audit all the patterns before we make the change. This patch is workaround that fixes one specific case,
the prefetch instructions. rdar://10538297

llvm-svn: 146163

4d1a2d44

Nov 29, 2011

Make X86::FsFLD0SS / FsFLD0SD real pseudo-instructions. · bde32d36

Jakob Stoklund Olesen authored Nov 29, 2011

Like V_SET0, these instructions are expanded by ExpandPostRA to xorps /
vxorps so they can participate in execution domain swizzling.

This also makes the AVX variants redundant.

llvm-svn: 145440

bde32d36

Nov 24, 2011
- X86: alias cqo to cqto. · 651db373
  Benjamin Kramer authored Nov 24, 2011
```
llvm-svn: 145121
```
  651db373