Commits · eb8f9e9e5b49aca1748cff20783974522f7e01b4 · Roger Ferrer / llvm-epi-0.8

Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX... · eb8f9e9e

Craig Topper authored Jan 10, 2012

Instruction selection priority fixes to remove the XMM/XMMInt/orAVX predicates. Another commit will remove orAVX functions from X86SubTarget.

llvm-svn: 147841

eb8f9e9e

Jan 09, 2012

Fix asm string wrt variants. · 29ba4f97
Devang Patel authored Jan 09, 2012
```
llvm-svn: 147805
```
29ba4f97

Split AsmParser into two components - AsmParser and AsmParserVariant · 85d684a4

Devang Patel authored Jan 09, 2012

AsmParser holds info specific to target parser.
AsmParserVariant holds info specific to asm variants supported by the target.

llvm-svn: 147787

85d684a4

Don't rely on the fact that shift values are never very large, and thus · c16622da

Chandler Carruth authored Jan 09, 2012

this substraction will result in small negative numbers at worst which
become very large positive numbers on assignment and are thus caught by
the <=4 check on the next line. The >0 check clearly intended to catch
these as negative numbers.

Spotted by inspection, and impossible to trigger given the shift widths
that can be used.

llvm-svn: 147773

c16622da

Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level.... · f287a450

Craig Topper authored Jan 09, 2012

Remove AVX hack in X86Subtarget. AVX/AVX2 are now treated as an SSE level. Predicate functions have been altered to maintain previous names and behavior.

llvm-svn: 147770

f287a450

Add HasAVX predicate to some of the AVX patterns. · b89805c7
Craig Topper authored Jan 09, 2012
```
llvm-svn: 147769
```
b89805c7

Reorder a bunch of patterns to put the AVX version first thus giving it... · a51f7f75

Craig Topper authored Jan 09, 2012

Reorder a bunch of patterns to put the AVX version first thus giving it priority over the SSE version. Another step towards trying to remove the AVX hack that disables SSE from X86Subtarget.

llvm-svn: 147768

a51f7f75

Clean up patterns for MOVNT*. Not sure why there were floating point types on... · ef7f5bf8

Craig Topper authored Jan 09, 2012

Clean up patterns for MOVNT*. Not sure why there were floating point types on MOVNTPS and MOVNTDQ. And v4i64 was completely missing.

llvm-svn: 147767

ef7f5bf8

Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no... · c1f5622a

Craig Topper authored Jan 09, 2012

Mark MOVNTI as being supported in SSE2 OR AVX mode. This instruction has no AVX equivalent so we should use the SSE version.

llvm-svn: 147766

c1f5622a

Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations... · a081644f

Craig Topper authored Jan 09, 2012

Move SSE2 logical operations PAND/POR/PXOR/PANDN above SSE1 logical operations ANDPS/ORPS/XORPS/ANDNPS. This fixes a pattern ordering issue that meant that the SSE2 instructions could never be directly selected since the SSE1 patterns would always match first. This is largely moot with the ExeDepsFix pass, but I'm trying to audit for all such ordering issues.

llvm-svn: 147765

a081644f

Change some places that were checking for AVX OR SSE1/2 to use... · 210e4f81

Craig Topper authored Jan 09, 2012

Change some places that were checking for AVX OR SSE1/2 to use hasXMM/hasXMMInt instead. Also fix one place that checked SSE3, but accidentally excluded AVX to use hasSSE3orAVX. This is a step towards removing the AVX hack from the X86Subtarget.h

llvm-svn: 147764

210e4f81

Don't disable MMX support when AVX is enabled. Fix predicates for MMX... · 744f6311

Craig Topper authored Jan 09, 2012

Don't disable MMX support when AVX is enabled. Fix predicates for MMX instructions that were added along with SSE instructions to check for AVX in addition to SSE level.

llvm-svn: 147762

744f6311

Enable FISTTP* instructions when AVX is enabled. · c1ab7afe
Craig Topper authored Jan 08, 2012
```
llvm-svn: 147758
```
c1ab7afe

Jan 08, 2012
- Reverted commit #147601 upon Evan's request. · 540651cf
  Victor Umansky authored Jan 08, 2012
```
llvm-svn: 147748
```
  540651cf
Jan 07, 2012
- Fix typo in the X86 backend readme. Patch from Jaeden Amero. · f210619d
  Craig Topper authored Jan 07, 2012
```
llvm-svn: 147739
```
  f210619d
- Remove VectorExtras. This unused helper was written for a type of API that is discouraged now. · 6898db62
  Benjamin Kramer authored Jan 07, 2012
```
llvm-svn: 147738
```
  6898db62
- Remove unnecessary check of hasAVX(). It's already included in hasXMM(). · ca66bba4
  Craig Topper authored Jan 07, 2012
```
llvm-svn: 147734
```
  ca66bba4
- Make the 'x' constraint work for AVX registers as well. · c206d467
  Eric Christopher authored Jan 07, 2012
```
Fixes rdar://10614894

llvm-svn: 147704
```
  c206d467
Jan 05, 2012

Mark scalar FMA4 instructions as ignoring the VEX.L bit. · 29b07374
Craig Topper authored Jan 05, 2012
```
llvm-svn: 147602
```
29b07374

Peephole optimization of ptest-conditioned branch in X86 arch. Performs... · 9255b6d9

Victor Umansky authored Jan 05, 2012

Peephole optimization of ptest-conditioned branch in X86 arch. Performs instruction combining of sequences generated by ptestz/ptestc intrinsics to ptest+jcc pair for SSE and AVX.

Testing: passed 'make check' including LIT tests for all sequences being handled (both SSE and AVX)

Reviewers: Evan Cheng, David Blaikie, Bruno Lopes, Elena Demikhovsky, Chad Rosier, Anton Korobeynikov
llvm-svn: 147601

9255b6d9

Replace the uint64_t -> double convertion algorithm with one that's more efficient. · ac27f0c8

Bill Wendling authored Jan 05, 2012

This small bit of ASM code is sufficient to do what the old algorithm did:

     movq       %rax,  %xmm0
     punpckldq  (c0),  %xmm0  // c0: (uint4){ 0x43300000U, 0x45300000U, 0U, 0U }
     subpd      (c1),  %xmm0  // c1: (double2){ 0x1.0p52, 0x1.0p52 * 0x1.0p32 }
   #ifdef __SSE3__
     haddpd   %xmm0, %xmm0          
   #else
     pshufd   $0x4e, %xmm0, %xmm1 
     addpd    %xmm1, %xmm0
   #endif

It's arguably faster. One caveat, the 'haddpd' instruction isn't very fast on
all processors.
<rdar://problem/7719814>

llvm-svn: 147593

ac27f0c8

Jan 04, 2012

Silence warnings of a mysterious compiler that still defaults to C89. · 9c48f263
Benjamin Kramer authored Jan 04, 2012
```
llvm-svn: 147553
```
9c48f263

For x86, canonicalize max · 104dbb0f

Evan Cheng authored Jan 04, 2012

(x > y) ? x : y
=>
(x >= y) ? x : y

So for something like
(x - y) > 0 : (x - y) ? 0
It will be
(x - y) >= 0 : (x - y) ? 0

This makes is possible to test sign-bit and eliminate a comparison against
zero. e.g.
subl   %esi, %edi
testl  %edi, %edi
movl   $0, %eax
cmovgl %edi, %eax
=>
xorl   %eax, %eax
subl   %esi, $edi
cmovsl %eax, %edi

rdar://10633221

llvm-svn: 147512

104dbb0f

Fix 80-column violations. · 6ca97df9
Chad Rosier authored Jan 03, 2012
```
llvm-svn: 147495
```
6ca97df9

Jan 03, 2012
- Revert 147426 because it caused pr11696. · 6d31bac8
  Nadav Rotem authored Jan 03, 2012
```
llvm-svn: 147485
```
  6d31bac8
- Enhance DAGCombine for transforming 128->256 casts into a vmovaps, rather · 493c1b31
  Chad Rosier authored Jan 03, 2012
```
then a vxorps + vinsertf128 pair if the original vector came from a load.
rdar://10594409

llvm-svn: 147481
```
  493c1b31
- Intel style asm variant does not need '%' prefix. · c1215324
  Devang Patel authored Jan 03, 2012
```
llvm-svn: 147453
```
  c1215324
Jan 02, 2012

Miscellaneous shuffle lowering cleanup. No functional changes. Primarily... · 5bacb7e9

Craig Topper authored Jan 02, 2012

Miscellaneous shuffle lowering cleanup. No functional changes. Primarily converting the indexing loops to unsigned to be consistent across functions.

llvm-svn: 147430

5bacb7e9

Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also... · 53d55964

Craig Topper authored Jan 02, 2012

Make CanXFormVExtractWithShuffleIntoLoad reject loads with multiple uses. Also make it return false if there's not even a load at all. This makes the code better match the code in DAGCombiner that it tries to match. These two changes prevent some cases where vector_shuffles were making it to instruction selection and causing the older shuffle selection code to be triggered. Also needed to fix a bad pattern that this change exposed. This is the first step towards getting rid of the old shuffle selection support. No test cases yet because there's no way to tell whether a shuffle was handled in the legalize stage or at instruction selection.

llvm-svn: 147428

53d55964

· 6c7a0e6c

Nadav Rotem authored Jan 02, 2012

Optimize the sequence blend(sign_extend(x)) to blend(shl(x)) since SSE blend instructions only look at the highest bit.

llvm-svn: 147426

6c7a0e6c

Jan 01, 2012
- Allow CRC32 instructions to be selected when AVX is enabled. · b9109844
  Craig Topper authored Jan 01, 2012
```
llvm-svn: 147411
```
  b9109844
- Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is... · 1c064e0a
  Craig Topper authored Jan 01, 2012
```
Fix sfence, lfence, mfence, and clflush to be able to be selected when AVX is enabled. Fix monitor and mwait to require SSE3 or AVX, previously they worked even if SSE3 was disabled. Make prefetch instructions not set the execution domain since they don't use XMM registers.

llvm-svn: 147409
```
  1c064e0a
- X86Disassembler: Fix undefined behavior found by GCC 4.6 · 47aecca5
  Benjamin Kramer authored Jan 01, 2012
```
llvm-svn: 147404
```
  47aecca5
- Merge X86 SHUFPS and SHUFPD node types. · 6e54ba7e
  Craig Topper authored Dec 31, 2011
```
llvm-svn: 147394
```
  6e54ba7e
- Add patterns for integer forms of SHUFPD/VSHUFPD with a memory load. · d51092d9
  Craig Topper authored Dec 31, 2011
```
llvm-svn: 147393
```
  d51092d9
- Fix typo in a SHUFPD and VSHUFPD pattern that prevented SHUFPD/VSHUFPD with a... · 0e796fee
  Craig Topper authored Dec 31, 2011
```
Fix typo in a SHUFPD and VSHUFPD pattern that prevented SHUFPD/VSHUFPD with a load from being selected.

llvm-svn: 147392
```
  0e796fee
Dec 30, 2011

Make FMA4 imply AVX so that YMM registers would be available. Necessitates... · a5d1fc2c

Craig Topper authored Dec 30, 2011

Make FMA4 imply AVX so that YMM registers would be available. Necessitates removing from Bulldozer CPU types since it would enable AVX code generation implicitly. Also make SSE4A imply SSE3. Without some level of SSE implied, XMM registers wouldn't be legal.

llvm-svn: 147369

a5d1fc2c

Add disassembler support for VPERMIL2PD and VPERMIL2PS. · 2ba766ae
Craig Topper authored Dec 30, 2011
```
llvm-svn: 147368
```
2ba766ae
Add FMA4 instructions to disassembler. · 03a0beda
Craig Topper authored Dec 30, 2011
```
llvm-svn: 147367
```
03a0beda

Separate the concept of having memory access in operand 4 from the concept of... · cd93de93

Craig Topper authored Dec 30, 2011

Separate the concept of having memory access in operand 4 from the concept of having the W bit set for XOP instructons. Removes ORing W-bits in the encoder and will similarly simplify the disassembler implementation.

llvm-svn: 147366

cd93de93