  1. Feb 16, 2012
    • Remove the YMM_HI_6_15 hack. · bc6ba479
      Jakob Stoklund Olesen authored
      Call clobbers are now represented with register mask operands.  The
      regmask can easily represent the fact that xmm6 is call-preserved while
      ymm6 isn't.  This is automatically computed by TableGen from the
      CalleeSavedRegs containing xmm6.
      
      llvm-svn: 150709
    • Use the same CALL instructions for Windows as for everything else. · 97e3115d
      Jakob Stoklund Olesen authored
      The different calling conventions and call-preserved registers are
      represented with regmask operands that are added dynamically.
      
      llvm-svn: 150708
    • Enable register mask operands for x86 calls. · 8a450cb2
      Jakob Stoklund Olesen authored
      Call instructions no longer have a list of 43 call-clobbered registers.
      Instead, they get a single register mask operand with a bit vector of
      call-preserved registers.
      
      This saves a lot of memory, 42 x 32 bytes = 1344 bytes per call
      instruction, and it speeds up building call instructions because those
      43 imp-def operands no longer need to be added to use-def lists (and
      removed, shifted, and re-added for every explicit call operand).
      
      Passes like LiveVariables, LiveIntervals, RAGreedy, PEI, and
      BranchFolding are significantly faster because they can deal with call
      clobbers in bulk.
      
      Overall, clang -O2 is between 0% and 8% faster, uniformly distributed
      depending on call density in the compiled code.  Debug builds using
      clang -O0 are 0% - 3% faster.
      
      I have verified that this patch doesn't change the assembly generated
      for the LLVM nightly test suite when building with -disable-copyprop
      and -disable-branch-fold.
      
      Branch folding behaves slightly differently in a few cases because call
      instructions have different hash values now.
      
      Copy propagation flushes its data structures when it crosses a register
      mask operand. This causes it to leave a few dead copies behind, on the
      order of 20 instructions across the entire nightly test suite, including
      SPEC. Fixing this properly would require the pass to use different data
      structures.
      
      llvm-svn: 150638
  2. Feb 08, 2012
    • Fixed a bug in printing "cmp" pseudo ops. · 1adc1d53
      Elena Demikhovsky authored
      > This IR code
      > %res = call <8 x float> @llvm.x86.avx.cmp.ps.256(<8 x float> %a0, <8 x float> %a1, i8 14)
      > fails with assertion:
      >
      > llc: X86ATTInstPrinter.cpp:62: void llvm::X86ATTInstPrinter::printSSECC(const llvm::MCInst*, unsigned int, llvm::raw_ostream&): Assertion `0 && "Invalid ssecc argument!"' failed.
      > 0  llc             0x0000000001355803
      > 1  llc             0x0000000001355dc9
      > 2  libpthread.so.0 0x00007f79a30575d0
      > 3  libc.so.6       0x00007f79a23a1945 gsignal + 53
      > 4  libc.so.6       0x00007f79a23a2f21 abort + 385
      > 5  libc.so.6       0x00007f79a239a810 __assert_fail + 240
      > 6  llc             0x00000000011858d5 llvm::X86ATTInstPrinter::printSSECC(llvm::MCInst const*, unsigned int, llvm::raw_ostream&) + 119
      
      I added full testing for all possible cmp pseudo-ops.
      I extended X86AsmPrinter.cpp and X86IntelInstPrinter.cpp.
      
      You'll also see line-alignment changes (unrelated to this fix) in X86ISelLowering.cpp from my previous check-in.
      
      llvm-svn: 150068
    • Remove a couple unneeded intrinsic patterns · 172b9243
      Craig Topper authored
      llvm-svn: 150067
    • Remove GCC builtins for vpermilp* intrinsics as clang no longer needs them.... · 5405571f
      Craig Topper authored
      Remove GCC builtins for vpermilp* intrinsics as clang no longer needs them. Custom lower the intrinsics to the vpermilp target specific node and remove intrinsic patterns.
      
      llvm-svn: 150060