Commits · eadaeaab93302456465f8bbcf9476a4801096102 · Roger Ferrer / llvm-epi-0.8

Oct 06, 2010
- remove the !nameconcat tblgen feature. It "shorthand" and only used in 4 places · 94026336
  Chris Lattner authored Oct 06, 2010
```
where !cast is just as short.

llvm-svn: 115722
```
  94026336
- allow !strconcat to take more than two operands to eliminate · 61ea00b4
  Chris Lattner authored Oct 05, 2010
```
!strconcat(!strconcat(!strconcat(!strconcat

Simplify some x86 td files to use it.

llvm-svn: 115719
```
  61ea00b4
Oct 05, 2010
- distribute the rest of the contents of X86Instr64bit.td out to · ab85ef9e
  Chris Lattner authored Oct 05, 2010
```
the right places.  X86Instr64bit.td now dies, long live x86-64!

llvm-svn: 115669
```
  ab85ef9e
- move CMOV_FR32 and friends to InstrCompiler, since they are · da8c94ef
  Chris Lattner authored Oct 05, 2010
```
pseudo instructions.

Move POPCNT to InstrSSE since they are SSE4 instructions.

llvm-svn: 115603
```
  da8c94ef
Sep 29, 2010
- fix rdar://8490728 - llvm-mc rejects gpr64 form of 'movmskpd' · 8f7851d2
  Chris Lattner authored Sep 29, 2010
```
llvm-svn: 115029
```
  8f7851d2
- add assembler support for the cvtsd2sil/cvtsd2siq mnemonics, rdar://8456382 · 52e60208
  Chris Lattner authored Sep 29, 2010
```
llvm-svn: 115027
```
  52e60208
- add basic avx support to the disassembler, also teach it about ssmem/sdmem · f60062fd
  Chris Lattner authored Sep 29, 2010
```
operands.

With this done, we can remove the _Int suffixes from the round instructions
without the disassembler blowing up.  This allows the assembler to support
them, implementing rdar://8456376 - llvm-mc rejects 'roundss'

llvm-svn: 115019
```
  f60062fd
- add asmparser support for cvttpd2dq by removing some Int_ prefixes. · ff3a3930
  Chris Lattner authored Sep 29, 2010
```
Clean up cvttps2dq by removing some redundant implementations of the
same instruction.  rdar://8456382

llvm-svn: 115018
```
  ff3a3930
- implement rdar://8456382 - cvtsd2si support, by removing some Int_ prefixes. · ef1c2fc3
  Chris Lattner authored Sep 29, 2010
```
llvm-svn: 115017
```
  ef1c2fc3
Sep 13, 2010
- Fix typos. 128-bit PSHUFB takes 128-bit memory op. · 1eea3519
  Dale Johannesen authored Sep 13, 2010
```
v8i16 is not an MMX type; put it where it belongs.

llvm-svn: 113785
```
  1eea3519
Sep 09, 2010
- Add one more pattern to fallback movddup · e8501a46
  Bruno Cardoso Lopes authored Sep 09, 2010
```
llvm-svn: 113522
```
  e8501a46
- Move remaining MMX instructions from SSE to MMX. · 0ec303b9
  Dale Johannesen authored Sep 09, 2010
```
llvm-svn: 113501
```
  0ec303b9
- Move most MMX instructions (defined as anything that · 5f4a6f29
  Dale Johannesen authored Sep 09, 2010
```
uses MMX, even if it also uses other things) from InstrSSE
into InstrMMX.  No (intended) functional change.

llvm-svn: 113462
```
  5f4a6f29
Sep 08, 2010

x86 vector shuffle lowering now relies only on target specific · f7fee1c1

Bruno Cardoso Lopes authored Sep 08, 2010

nodes to emit shuffles and don't do isel mask matching anymore.
- Add the selection of the remaining shuffle opcode (movddup)
- Introduce two new functions to "recognize" where we may get
potential folds and add several comments to them explaining why
they are not yet in the desidered shape.
- Add more patterns to fallback the case where we select
a specific shuffle opcode as if it could fold a load, but it
can't, so remap to a valid instruction.
- Add a couple of FIXMEs to address in the following days once
there's a good solution to the current folding problem.

llvm-svn: 113369

f7fee1c1

Sep 07, 2010
- Add patterns for MMX that use the new intrinsics. · 605acfe5
  Dale Johannesen authored Sep 07, 2010
```
Enable palignr intrinsic.
These may need adjustment for a new VT in due course.

llvm-svn: 113233
```
  605acfe5
- Remove unused target specific node · f0ea2222
  Bruno Cardoso Lopes authored Sep 07, 2010
```
llvm-svn: 113224
```
  f0ea2222
Sep 03, 2010
- Remove the rest of the nonexistent 64-bit AVX instructions. · 367afb5a
  Dale Johannesen authored Sep 03, 2010
```
Bruno, please review.

llvm-svn: 113014
```
  367afb5a
- Reapply last harmless part of r112934, the pattern fragment to match X86Unpcklpd · a750d994
  Bruno Cardoso Lopes authored Sep 03, 2010
```
llvm-svn: 113009
```
  a750d994
- Revert r112934, "- Use specific nodes to match unpckl masks.", which introduced · 6f3da24d
  Daniel Dunbar authored Sep 03, 2010
```
some infinite loop and select failures.
 - Apologies for eager reverting, but its branch day.

llvm-svn: 113000
```
  6f3da24d
- AVX doesn't support mm operations neither its instrinsics. · d6634a5b
  Bruno Cardoso Lopes authored Sep 03, 2010
```
The AVX versions of PALIGN and PABS* should only exist for
128-bit. Remove the unnecessary stuff.

llvm-svn: 112944
```
  d6634a5b
- - Use specific nodes to match unpckl masks. · cce44678
  Bruno Cardoso Lopes authored Sep 03, 2010
```
- Teach getShuffleScalarElt how to handle more target
specific nodes, so the DAGCombine can make use of it.
- Add another hack to avoid the node update problem
during legalization. More description on the comments

llvm-svn: 112934
```
  cce44678
Sep 02, 2010

become more strict about when it's safe to use X86ISD::MOVLPS · 6a7f6344
Bruno Cardoso Lopes authored Sep 02, 2010
```
llvm-svn: 112799
```
6a7f6344

Using target specific nodes for shuffle nodes makes the mask · fea81b48

Bruno Cardoso Lopes authored Sep 01, 2010

check more strict, breaking some cases not checked in the
testsuite, but also exposes some foldings not done before,
as this example:

  movaps  (%rdi), %xmm0
  movaps  (%rax), %xmm1
  movaps  %xmm0, %xmm2
  movss %xmm1, %xmm2
  shufps  $36, %xmm2, %xmm0

now is generated as:

  movaps  (%rdi), %xmm0
  movaps  %xmm0, %xmm1
  movlps  (%rax), %xmm1
  shufps  $36, %xmm1, %xmm0

llvm-svn: 112753

fea81b48

Sep 01, 2010
- Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching... · b3825216
  Bruno Cardoso Lopes authored Sep 01, 2010
```
Use movlps, movlpd, movss and movsd specific nodes instead of pattern matching with movlp pattern fragment

llvm-svn: 112694
```
  b3825216
- Use x86 specific MOVSLDUP node, add more patterns to match it and remove useless load nodes · 4b56d872
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112661
```
  4b56d872
- Use x86 specific MOVSHDUP node and add more patterns to match it · 61996ef8
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112657
```
  61996ef8
Aug 31, 2010
- Use MOVLHPS and MOVHLPS x86 nodes whenever possible. Also remove some useless nodes · 03e4c353
  Bruno Cardoso Lopes authored Aug 31, 2010
```
llvm-svn: 112642
```
  03e4c353
Aug 24, 2010
- Use pshufhw and pshuflw in more cases and fix getTargetShuffleNode number of arguments · 758d7b1f
  Bruno Cardoso Lopes authored Aug 24, 2010
```
llvm-svn: 111890
```
  758d7b1f
Aug 21, 2010

This is the first step towards refactoring the x86 vector shuffle code. The · 6f3b38a8

Bruno Cardoso Lopes authored Aug 20, 2010

general idea here is to have a group of x86 target specific nodes which are
going to be selected during lowering and then directly matched in isel.

The commit includes the addition of those specific nodes and a *bunch* of
patterns, and incrementally we're going to switch between them and what we
have right now. Both the patterns and target specific nodes can change as
we move forward with this work.

llvm-svn: 111691

6f3b38a8

Aug 13, 2010
- Revert 110491. While not wrong, it was based on a · 8d3c89e7
  Dale Johannesen authored Aug 13, 2010
```
misanalysis and is undesirable.

llvm-svn: 111028
```
  8d3c89e7
- Improve comment to make explicit why not to touch this could before JIT goes MC · 1187e3f0
  Bruno Cardoso Lopes authored Aug 13, 2010
```
llvm-svn: 111021
```
  1187e3f0
- Revert last patch and r110954 as I meant to. · 6e5b67cc
  Eric Christopher authored Aug 13, 2010
```
llvm-svn: 111001
```
  6e5b67cc
Aug 12, 2010

Some small clean-up: use of pseudo instructions · cc20fe59
Bruno Cardoso Lopes authored Aug 12, 2010
```
llvm-svn: 110954
```
cc20fe59

- Teach SSEDomainFix to switch between different levels of AVX instructions.... · 7f704b31

Bruno Cardoso Lopes authored Aug 12, 2010

- Teach SSEDomainFix to switch between different levels of AVX instructions. Here we guess that AVX will have domain issues, so just implement them for consistency  and in the future we remove if it's unnecessary.
- Make foldMemoryOperandImpl aware of 256-bit zero vectors folding and support the 128-bit counterparts of AVX too.
- Make sure MOV[AU]PS instructions are only selected when SSE1 is enabled, and duplicate the patterns to match AVX.
- Add a testcase for a simple 128-bit zero vector creation.

llvm-svn: 110946

7f704b31

Define AVX 128-bit pattern versions of SET0PS/PD. · 7e1a30c0
Bruno Cardoso Lopes authored Aug 12, 2010
```
llvm-svn: 110937
```
7e1a30c0

Begin to support some vector operations for AVX 256-bit intructions. The long · 7306c868

Bruno Cardoso Lopes authored Aug 12, 2010

term goal here is to be able to match enough of vector_shuffle and build_vector
so all avx intrinsics which aren't mapped to their own built-ins but to
shufflevector calls can be codegen'd. This is the first (baby) step, support
building zeroed vectors.

llvm-svn: 110897

7306c868

Aug 11, 2010

Add AVX matching patterns to Packed Bit Test intrinsics. · 91d61df3

Bruno Cardoso Lopes authored Aug 10, 2010

Apply the same approach of SSE4.1 ptest intrinsics but
create a new x86 node "testp" since AVX introduces
vtest{ps}{pd} instructions which set ZF and CF depending
on sign bit AND and ANDN of packed floating-point sources.

This is slightly different from what the "ptest" does.
Tests comming with the other 256 intrinsics tests.

llvm-svn: 110744

91d61df3

Aug 10, 2010
- Add AVX movnt{pd,ps,dq} 256-bit intrinsics · 39f215bd
  Bruno Cardoso Lopes authored Aug 10, 2010
```
llvm-svn: 110650
```
  39f215bd
- Add AVX movmsk 256-bit intrinsics · cedf23df
  Bruno Cardoso Lopes authored Aug 10, 2010
```
llvm-svn: 110648
```
  cedf23df
- Support AVX 256-bit load and store intrinsics · 85da72a8
  Bruno Cardoso Lopes authored Aug 10, 2010
```
llvm-svn: 110645
```
  85da72a8