Commits · 72f51c3986c81d03f80852cd58eac959d328429f · Roger Ferrer / llvm-epi-0.8

Aug 28, 2012
- Convert V_SETALLONES/AVX_SETALLONES/AVX2_SETALLONES to Post-RA pseudos. · 72f51c39
  Craig Topper authored Aug 28, 2012
```
llvm-svn: 162740
```
  72f51c39
- Merge AVX_SET0PSY/AVX_SET0PDY/AVX2_SET0 into a single post-RA pseudo. · bd509eea
  Craig Topper authored Aug 28, 2012
```
llvm-svn: 162738
```
  bd509eea
- More missing mayLoad flags on AVX multiclasses. · 89d6b29d
  Jakob Stoklund Olesen authored Aug 28, 2012
```
llvm-svn: 162714
```
  89d6b29d
Aug 27, 2012

Don't allow vextractf128 to be folded with unaligned stores. We don't fold... · 5af2fed5

Craig Topper authored Aug 27, 2012

Don't allow vextractf128 to be folded with unaligned stores. We don't fold unaligned loads so shouldn't fold unaligned stores as it can cause an alignment fault to occur.

llvm-svn: 162658

5af2fed5

Fold some patterns into instruction definitons so tablegen can infer flags... · 6d44554c

Craig Topper authored Aug 27, 2012

Fold some patterns into instruction definitons so tablegen can infer flags removing the need for an explicit 'neverHasSideEffects = 1'

llvm-svn: 162656

6d44554c

Add HasAVX1Only predicate and use it for patterns that have an AVX1... · f7828f91

Craig Topper authored Aug 27, 2012

Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity.

llvm-svn: 162654

f7828f91

Aug 25, 2012
- Add missing mayLoad flags to a large class of AVX *_Int instructions. · 3d91b43a
  Jakob Stoklund Olesen authored Aug 24, 2012
```
llvm-svn: 162622
```
  3d91b43a
Aug 24, 2012

Remove some spurious mayLoad = 0 flags. · d3511235

Jakob Stoklund Olesen authored Aug 24, 2012

They were inserted to silence TableGen's warning about
redundant properties. That warning is now gone.

llvm-svn: 162517

d3511235

Aug 19, 2012

When unsafe math is used, we can use commutative FMAX and FMIN. In some cases · 178250ad

Nadav Rotem authored Aug 19, 2012

this allows for better code generation.

Added a new DAGCombine transformation to convert FMAX and FMIN to FMANC and
FMINC, which are commutative.

For example:

  movaps  %xmm0, %xmm1
  movsd LC(%rip), %xmm0
  minsd %xmm1, %xmm0

becomes:

  minsd LC(%rip), %xmm0

llvm-svn: 162187

178250ad

Aug 14, 2012

fix PR11334 · 34107b91

Michael Liao authored Aug 14, 2012

- FP_EXTEND only support extending from vectors with matching elements.
  This results in the scalarization of extending to v2f64 from v2f32,
  which will be legalized to v4f32 not matching with v2f64.
- add X86-specific VFPEXT supproting extending from v4f32 to v2f64.
- add BUILD_VECTOR lowering helper to recover back the original
  extending from v4f32 to v2f64.
- test case is enhanced to include different vector width.

llvm-svn: 161894

34107b91

Aug 06, 2012

Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom... · ab47fe4e

Craig Topper authored Aug 06, 2012

Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom handling in DAGISelToDAG due to limitations in TableGen's implicit def handling. Fixes PR11305.

llvm-svn: 161318

ab47fe4e

Aug 05, 2012
- Remove custom inserter for MWAIT. It doesn't do anything that couldn't be represented in a pattern. · 6d0408d3
  Craig Topper authored Aug 05, 2012
```
llvm-svn: 161306
```
  6d0408d3
Aug 02, 2012
- X86: mark GATHER instructios as mayLoad · 40591453
  Manman Ren authored Aug 01, 2012
```
llvm-svn: 161143
```
  40591453
Jul 30, 2012
- Give VCVTTPD2DQ priority over CVTTPD2DQ. · 14eac5dd
  Craig Topper authored Jul 30, 2012
```
llvm-svn: 160942
```
  14eac5dd
- Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. · f881d385
  Craig Topper authored Jul 30, 2012
```
llvm-svn: 160941
```
  f881d385
- Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form.... · 415b3586
  Craig Topper authored Jul 30, 2012
```
Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form. Add an OptForSpeed to explicitly pair up with an OptForSize that was already on another pattern.

llvm-svn: 160939
```
  415b3586
- Fix load types on intrinsic forms of SS2SD and SD2SS AVX/SSE convert instruction patterns. · 28402efc
  Craig Topper authored Jul 29, 2012
```
llvm-svn: 160938
```
  28402efc
- Move more SSE/AVX convert instruction patterns into their definitions. · b6767f3a
  Craig Topper authored Jul 29, 2012
```
llvm-svn: 160937
```
  b6767f3a
Jul 28, 2012
- Fold patterns for some of the SSE/AVX convert instructions into their instruction definitions. · fc93281c
  Craig Topper authored Jul 28, 2012
```
llvm-svn: 160922
```
  fc93281c
- Mark some of the SSE/AVX convert instructions as mayLoad/neverHasSideEffects. · 024797b9
  Craig Topper authored Jul 28, 2012
```
llvm-svn: 160921
```
  024797b9
- Make CVTSS2SI instruction definition consistent with CVTSD2SI. · 44f9b534
  Craig Topper authored Jul 28, 2012
```
llvm-svn: 160914
```
  44f9b534
- Fix up memory load types for SSE scalar convert intrinsic patterns. · 1c1aef07
  Craig Topper authored Jul 28, 2012
```
llvm-svn: 160913
```
  1c1aef07
Jul 27, 2012

Remove the last mentions of sub_ss and sub_sd from patterns. · 77cd55b4
Jakob Stoklund Olesen authored Jul 26, 2012
```
I'll remove these two sub-register indexes shortly.

llvm-svn: 160831
```
77cd55b4

Eliminate sub_ss, sub_sd from broadcast patterns. · b96d0b4e

Jakob Stoklund Olesen authored Jul 26, 2012

The (COPY_TO_REGCLASS GR32:$src, VR128) pattern looks odd, but
copyPhysReg does the right thing with it. (The old pattern would
eventually produce the same cross-class copy).

llvm-svn: 160830

b96d0b4e

Eliminate more sub_ss / sub_sd patterns. · 206b825f

Jakob Stoklund Olesen authored Jul 26, 2012

This gets rid of some more INSERT_SUBREG - IMPLICIT_DEF patterns,
simplifying the emitted code a bit.

llvm-svn: 160820

206b825f

Eliminate some SUBREG_TO_REG patterns with sub_ss and sub_sd. · 75d17b05

Jakob Stoklund Olesen authored Jul 26, 2012

The SUBREG_TO_REG instruction has magic semantics asserting that the
source value was defined by an instruction that cleared the high half of
the register. Those semantics are never actually exploited for xmm
registers.

llvm-svn: 160818

75d17b05

Jul 26, 2012

Eliminate a batch of uses of sub_ss and sub_sd in the X86 target. · ceee4a9d

Jakob Stoklund Olesen authored Jul 26, 2012

These idempotent sub-register indices don't do anything --- They simply
map XMM registers to themselves. They no longer affect register classes
either since the SubRegClasses field has been removed from Target.td.

This patch replaces XMM->XMM EXTRACT_SUBREG and INSERT_SUBREG patterns
with COPY_TO_REGCLASS patterns which simply become COPY instructions.

The number of IMPLICIT_DEF instructions before register allocation is
reduced, and that is the cause of the test case changes.

llvm-svn: 160816

ceee4a9d

Make l/q suffixes on AVX forms of scalar convert instructions consistent with their non-AVX forms. · c7690ac7
Craig Topper authored Jul 26, 2012
```
llvm-svn: 160775
```
c7690ac7

Jul 18, 2012

The vbroadcast family of instructions has 'fallback patterns' in case where the · 4c12245b

Nadav Rotem authored Jul 18, 2012

load source operand is used by multiple nodes. The v2i64 broadcast was emulated
by shuffling the two lower i32 elements to the upper two.
We had a bug in the immediate used for the broadcast.
Replacing 0 to 0x44.
0x44 means [01|00|01|00] which corresponds to the correct lane.

Patch by Michael Kuperstein.

llvm-svn: 160430

4c12245b

Make x86 asm parser to check for xmm vs ymm for index register in gather... · 01deb5f2

Craig Topper authored Jul 18, 2012

Make x86 asm parser to check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas.

llvm-svn: 160420

01deb5f2

Jul 15, 2012

Rename VBROADCASTSDrm into VBROADCASTSDYrm to match the naming convention. · ee3552f8

Nadav Rotem authored Jul 15, 2012

Allow the folding of vbroadcastRR to vbroadcastRM, where the memory operand is a spill slot.

PR12782.

Together with Michael Kuperstein <michael.m.kuperstein@intel.com>

llvm-svn: 160230

ee3552f8

Jul 13, 2012
- Mark VINSERTI128rm as MayLoad=1. Fixes PR13348. · b3bac490
  Craig Topper authored Jul 13, 2012
```
llvm-svn: 160162
```
  b3bac490
Jul 12, 2012
- Update GATHER instructions to support 2 read-write operands. Patch from myself and Manman Ren. · f7755df7
  Craig Topper authored Jul 12, 2012
```
llvm-svn: 160110
```
  f7755df7
Jul 10, 2012
- Reverse assembler/disassembler operand order for gather instructions. · be41e2da
  Craig Topper authored Jul 10, 2012
```
llvm-svn: 159983
```
  be41e2da
Jul 03, 2012
- Remove extra space. · 85c938f4
  Craig Topper authored Jul 03, 2012
```
llvm-svn: 159647
```
  85c938f4
- Change i128mem/i256mem to f128mem/f256mem on some floating point vector instructions. · f067f9aa
  Craig Topper authored Jul 03, 2012
```
llvm-svn: 159646
```
  f067f9aa
- Add aliases for pblendvb, blendvpd, and blendvps instructions with the... · 676dcd8c
  Craig Topper authored Jul 03, 2012
```
Add aliases for pblendvb, blendvpd, and blendvps instructions with the implicit xmm0 operand specified. Fixes PR13252.

llvm-svn: 159644
```
  676dcd8c
Jul 01, 2012
- Optimization of shuffle node that can fit to the register form of VBROADCAST instruction on AVX2. · 9af899fa
  Elena Demikhovsky authored Jul 01, 2012
```
llvm-svn: 159504
```
  9af899fa
Jun 29, 2012

X86: add more GATHER intrinsics in LLVM · 98a5bf24

Manman Ren authored Jun 29, 2012

Corrected type for index of llvm.x86.avx2.gather.d.pd.256
  from 256-bit to 128-bit.
Corrected types for src|dst|mask of llvm.x86.avx2.gather.q.ps.256
  from 256-bit to 128-bit.

Support the following intrinsics:
  llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q
  llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256
  llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d
  llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256

llvm-svn: 159402

98a5bf24

Jun 26, 2012

X86: add GATHER intrinsics (AVX2) in LLVM · a0982041

Manman Ren authored Jun 26, 2012

Support the following intrinsics:
llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd
llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256
llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps
llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256

Modified Disassembler to handle VSIB addressing mode.

llvm-svn: 159221

a0982041