Commits · c5573439569bea69bbaf3e41ca1b1e4f79bd5d5c · Roger Ferrer / llvm-epi-0.8

Dec 27, 2012
- Fix operands and encoding form for ARPL instruction. Register form had and ... · c5573439
  Craig Topper authored Dec 26, 2012
```
Fix operands and encoding form for ARPL instruction. Register form had  and  reversed. Memory form writes memory, but was marked as MRMSrcMem.

llvm-svn: 171123
```
  c5573439
- Add hasSideEffects=0 to some atomic instructions. · d47a70de
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171122
```
  d47a70de
Dec 26, 2012
- Mark the AL/AX/EAX forms of the basic arithmetic operations has never having side effects. · af237208
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171121
```
  af237208
- Mark all the _REV instructions as not having side effects. They aren't really... · 1b8c0750
  Craig Topper authored Dec 26, 2012
```
Mark all the _REV instructions as not having side effects. They aren't really emitted by the backend, but it reduces the number of instructions in the output files with unmodelled side effects to make auditing easier.

llvm-svn: 171118
```
  1b8c0750
- Remove a special conditional setting of neverHasSideEffects if the instruction... · 18f2675e
  Craig Topper authored Dec 26, 2012
```
Remove a special conditional setting of neverHasSideEffects if the instruction didn't have a pattern. This was leftover from when tablegen used to complain if things were already inferred from patterns.

llvm-svn: 171117
```
  18f2675e
- Merge still more SSE/AVX instruction definitions. · 24f316e4
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171103
```
  24f316e4
- Merge more SSE/AVX instruction definitions. · af629e27
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171102
```
  af629e27
- Fix 80 column violation. · 65fe3045
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171097
```
  65fe3045
- Fix class name in comment. · f4d0fe8f
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171096
```
  f4d0fe8f
- Merge SSE/AVX PCMPEQ/PCMPGT instruction definitions. · 59747c4d
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171095
```
  59747c4d
- Remove 'v' from mnemonic to fix asm matching failures. · 8a486775
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171093
```
  8a486775
- Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction... · b4ef0fa3
  Craig Topper authored Dec 26, 2012
```
Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for a bunch of SSE2 integer arithmetic instructions.

llvm-svn: 171092
```
  b4ef0fa3
- Reformat the docs. · 5267bb71
  Nadav Rotem authored Dec 26, 2012
```
llvm-svn: 171091
```
  5267bb71
- Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction... · a2594dd5
  Craig Topper authored Dec 26, 2012
```
Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for PAND/POR/PXOR/PANDN

llvm-svn: 171087
```
  a2594dd5
- Merge an AVX/SSE 256-bit and 128-bit multiclass. · 97730a0d
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171086
```
  97730a0d
- Mark VANDNPD/VANDNPDS as not commutable. · 8b597463
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171085
```
  8b597463
- Remove alignment from a bunch more VEX encoded operations in the folding tables. · 81d1e596
  Craig Topper authored Dec 26, 2012
```
llvm-svn: 171082
```
  81d1e596
- Remove alignment from folding table for VMOVUPD as an unaligned instruction it... · b2922164
  Craig Topper authored Dec 26, 2012
```
Remove alignment from folding table for VMOVUPD as an unaligned instruction it shouldn't require alignment...

llvm-svn: 171081
```
  b2922164
- Remove alignment requirements from (V)EXTRACTPS. This instruction does 32-bit... · d09a9af9
  Craig Topper authored Dec 26, 2012
```
Remove alignment requirements from (V)EXTRACTPS. This instruction does 32-bit stores which aren't required to be aligned on SSE or AVX.

llvm-svn: 171080
```
  d09a9af9
- Remove alignment requirement from VCVTSS2SD in folding tables. Reverting... · caef1c5d
  Craig Topper authored Dec 26, 2012
```
Remove alignment requirement from VCVTSS2SD in folding tables. Reverting r171049. This instruction doesn't require alignment.

llvm-svn: 171078
```
  caef1c5d
Dec 25, 2012
- Expand PPC64 atomic load and store · 1b5ff08d
  Hal Finkel authored Dec 25, 2012
```
Use of store or load with the atomic specifier on 64-bit types would
cause instruction-selection failures. As with the 32-bit case, these
can use the default expansion in terms of cmp-and-swap.

llvm-svn: 171072
```
  1b5ff08d
- X86: Shave off one shuffle from the pcmpeqq sequence for SSE2 by making use of and commutativity. · 81b5a8fd
  Benjamin Kramer authored Dec 25, 2012
```
llvm-svn: 171064
```
  81b5a8fd
- X86: Custom lower <2 x i64> eq and ne when SSE41 is not available. · df4af41b
  Benjamin Kramer authored Dec 25, 2012
```
pcmpeqd, pshufd, pshufd, pand is cheaper than unpack + cmpq, sbbq, cmpq, sbbq + pack.
Small speedup on loop-vectorized viterbi (-march=core2).

llvm-svn: 171063
```
  df4af41b
- VCVTSS2SD requires a strict alignment. Thanks Elena. · 00410ae6
  Nadav Rotem authored Dec 25, 2012
```
llvm-svn: 171049
```
  00410ae6
Dec 24, 2012

Quiet gcc's -Wparenthesis warning. No functionality change. · 521e0d59
Nick Lewycky authored Dec 24, 2012
```
llvm-svn: 171044
```
521e0d59

Use a std::string rather than a dynamically allocated char* buffer. · 9d46110f

Benjamin Kramer authored Dec 24, 2012

This affords us to use std::string's allocation routines and use the destructor
for the memory management. Switching to that also means that we can use
operator==(const std::string&, const char *) to perform the string comparison
rather than resorting to libc functionality (i.e. strcmp).

Patch by Saleem Abdulrasool!

Differential Revision: http://llvm-reviews.chandlerc.com/D230

llvm-svn: 171042

9d46110f

CostModel: We have API for checking the costs of known shuffles. This patch adds · 3ee6b10d
Nadav Rotem authored Dec 24, 2012
```
support for the insert-subvector and extract-subvector kinds.

llvm-svn: 171027
```
3ee6b10d

Some x86 instructions can load/store one of the operands to memory. On SSE,... · dc0ad92b

Nadav Rotem authored Dec 24, 2012

Some x86 instructions can load/store one of the operands to memory. On SSE, this memory needs to be aligned.
When these instructions are encoded in VEX (on AVX) there is no such requirement. This changes the folding
tables and removes the alignment restrictions from VEX-encoded instructions.

llvm-svn: 171024

dc0ad92b

Change the codegen Cost Model API for shuffeles. This patch removes the API... · 7e1599e1

Nadav Rotem authored Dec 24, 2012

Change the codegen Cost Model API for shuffeles. This patch removes the API for broadcast and adds a more general API that accepts an enum of known shuffles.

llvm-svn: 171022

7e1599e1

Dec 23, 2012
- CostModel: Change the default target-independent implementation for finding · cf9999d9
  Nadav Rotem authored Dec 23, 2012
```
the cost of arithmetic functions. We now assume that the cost of arithmetic
operations that are marked as Legal or Promote is low, but ops that are
marked as custom are higher.

llvm-svn: 171002
```
  cf9999d9
- whitespace · b15c69a7
  Nadav Rotem authored Dec 23, 2012
```
llvm-svn: 170997
```
  b15c69a7
- Rename a function. · 1bef5a05
  Nadav Rotem authored Dec 23, 2012
```
llvm-svn: 170996
```
  1bef5a05
- Loop Vectorizer: Update the cost model of scatter/gather operations and make · 2cade680
  Nadav Rotem authored Dec 23, 2012
```
them more expensive.

llvm-svn: 170995
```
  2cade680
Dec 22, 2012
- X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available. · 76268ac6
  Benjamin Kramer authored Dec 22, 2012
```
pmuludq is slow, but it turns out that all the unpacking and packing of the
scalarized mul is even slower. 10% speedup on loop-vectorized paq8p.

llvm-svn: 170985
```
  76268ac6
- X86: Emit vector sext as shuffle + sra if vpmovsx is not available. · b2f0a2bd
  Benjamin Kramer authored Dec 22, 2012
```
Also loosen the SSSE3 dependency a bit, expanded pshufb + psra is still better
than scalarized loads. Fixes PR14590.

llvm-svn: 170984
```
  b2f0a2bd
- In some cases, due to scheduling constraints we copy the EFLAGS. · d5aae980
  Nadav Rotem authored Dec 21, 2012
```
The only way to read the eflags is using push and pop. If we don't
adjust the stack then we run over the first frame index. This is
not something that we want to do, so we have to make sure that
our machine function does not copy the flags. If it does then
we have to emit the prolog that adjusts the stack.

rdar://12896831

llvm-svn: 170961
```
  d5aae980
- [mips] Refactor subword-swap, EXT/INS, load-effective-address and read-hardware · 6ac2fc49
  Akira Hatanaka authored Dec 21, 2012
```
instructions.

llvm-svn: 170956
```
  6ac2fc49
- [mips] Refactor SYNC and multiply/divide instructions. · beea8a34
  Akira Hatanaka authored Dec 21, 2012
```
llvm-svn: 170955
```
  beea8a34
- [mips] Refactor BAL instructions. · 31ddec58
  Akira Hatanaka authored Dec 21, 2012
```
llvm-svn: 170954
```
  31ddec58
- [mips] Fix encoding of BAL instruction. Also, fix assembler test case which · d6b694f0
  Akira Hatanaka authored Dec 21, 2012
```
was not catching the error.

llvm-svn: 170953
```
  d6b694f0