Commits · 8a450cb2fa9c06a9971fb9a7416ba7c58b41db02 · Roger Ferrer / llvm-epi-0.8

Feb 16, 2012

Enable register mask operands for x86 calls. · 8a450cb2

Jakob Stoklund Olesen authored Feb 16, 2012

Call instructions no longer have a list of 43 call-clobbered registers.
Instead, they get a single register mask operand with a bit vector of
call-preserved registers.

This saves a lot of memory, 42 x 32 bytes = 1344 bytes per call
instruction, and it speeds up building call instructions because those
43 imp-def operands no longer need to be added to use-def lists. (And
removed and shifted and re-added for every explicit call operand).

Passes like LiveVariables, LiveIntervals, RAGreedy, PEI, and
BranchFolding are significantly faster because they can deal with call
clobbers in bulk.
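
To illustrate the idea, here is a minimal standalone sketch of a register mask, not the code from this patch. It assumes a hypothetical target with 64 physical registers, and the names `RegMask`, `setPreserved`, `isClobbered`, and `killClobbered` are invented for the example. A set bit means the register is preserved across the call, so a pass can drop every clobbered register from a live set with one AND per 32-bit word instead of visiting one imp-def operand per register.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical register count for this sketch; real targets differ.
constexpr unsigned kNumRegs = 64;
constexpr unsigned kMaskWords = (kNumRegs + 31) / 32;

// One bit per physical register. A set bit means "preserved across the call".
using RegMask = std::array<std::uint32_t, kMaskWords>;

// Mark a register as call-preserved in the mask.
inline void setPreserved(RegMask &mask, unsigned reg) {
  assert(reg < kNumRegs && "register number out of range");
  mask[reg / 32] |= (1u << (reg % 32));
}

// A call clobbers every register whose bit is clear in the mask.
inline bool isClobbered(const RegMask &mask, unsigned reg) {
  assert(reg < kNumRegs && "register number out of range");
  return (mask[reg / 32] & (1u << (reg % 32))) == 0;
}

// Bulk update: remove all clobbered registers from a live set,
// one AND per word rather than one step per register.
inline void killClobbered(RegMask &live, const RegMask &mask) {
  for (unsigned i = 0; i < kMaskWords; ++i)
    live[i] &= mask[i];
}
```

In this sketch the live set reuses the same bit layout as the mask, which is what makes the bulk AND possible.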

Overall, clang -O2 is between 0% and 8% faster, uniformly distributed
depending on call density in the compiled code.  Debug builds using
clang -O0 are 0% - 3% faster.

I have verified that this patch doesn't change the assembly generated
for the LLVM nightly test suite when building with -disable-copyprop
and -disable-branch-fold.

Branch folding behaves slightly differently in a few cases because call
instructions have different hash values now.

Copy propagation flushes its data structures when it crosses a register
mask operand. This causes it to leave a few dead copies behind, on the
order of 20 instructions across the entire nightly test suite, including
SPEC. Fixing this properly would require the pass to use different data
structures.
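
The effect of flushing can be sketched with a toy model. This is an assumption-laden simplification, not the actual copy propagation pass: `CopyTracker` and its members are hypothetical names, and the model only tracks copies whose destination has not been read yet. Overwriting such a destination proves the earlier copy dead, but once the tracker is flushed at a register mask operand that knowledge is gone and the dead copy stays.

```cpp
#include <cassert>
#include <set>

// Hypothetical, simplified model of dead-copy tracking.
struct CopyTracker {
  std::set<unsigned> maybeDead; // destinations of copies not read yet
  unsigned removed = 0;         // copies proven dead and deleted

  // Record a copy into dst. If dst already holds an unread copy,
  // the earlier copy is dead and counts as removed.
  void recordCopy(unsigned dst) {
    assert(dst != 0 && "register 0 is treated as 'no register' in this sketch");
    if (!maybeDead.insert(dst).second)
      ++removed;
  }

  // A read of reg means the copy into it is live.
  void recordUse(unsigned reg) { maybeDead.erase(reg); }

  // Conservative handling of a register mask operand: forget everything.
  void crossRegMask() { maybeDead.clear(); }
};
```

With two copies into the same register and no flush in between, `removed` becomes 1. With a `crossRegMask()` call between them it stays 0, which corresponds to a dead copy being left behind.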

llvm-svn: 150638

8a450cb2

Feb 15, 2012
- Optimize redundant sign extends and negation of predicates. · 30804c24
  Sirish Pande authored Feb 15, 2012
  
  llvm-svn: 150606
  30804c24
- Revert "Replacing HexagonOptimizeSZExtends with HexagonPeephole." · 53da633f
  Eric Christopher authored Feb 15, 2012
  
  This reverts commit 1656806a944bbd23e98c6e578810fe02495ab741. llvm-svn: 150605
  53da633f
- Revert "Optimize redundant sign extends and negation of predicates" · d9811eb7
  Eric Christopher authored Feb 15, 2012
  
  as it's breaking the build. This reverts commit 11241abca5e2a313412fed594bb9d9fa2a2057fb. llvm-svn: 150604
  d9811eb7
- Replacing HexagonOptimizeSZExtends with HexagonPeephole. · 99571325
  Sirish Pande authored Feb 15, 2012
  
  llvm-svn: 150603
  99571325
- Optimize redundant sign extends and negation of predicates · 4736aee8
  Sirish Pande authored Feb 15, 2012
  
  llvm-svn: 150601
  4736aee8
- Add braces to if clause to make symmetric with associated else clause. · 0bc51324
  Chad Rosier authored Feb 15, 2012
  
  llvm-svn: 150591
  0bc51324
- Strip the pointer casts from the constants here. · dfb45f4d
  Bill Wendling authored Feb 15, 2012
  
  The c'tor list is stored as a list of 'void ()*'s, so all of the functions are bitcast to that. However, the dyn_cast doesn't automagically look through bitcasts. Do that for it. <rdar://problem/10813350> llvm-svn: 150572
  dfb45f4d
- Added TargetPassConfig::disablePass/substitutePass as a general mechanism to... · c9ce9d23
  Andrew Trick authored Feb 15, 2012
  
  Added TargetPassConfig::disablePass/substitutePass as a general mechanism to override specific passes. llvm-svn: 150562
  c9ce9d23
- Use a temporary variable, rather than a series of redundant calls. · f0687634
  Chad Rosier authored Feb 15, 2012
  
  llvm-svn: 150538
  f0687634
- Stop custom lowering for x86 DEC64m from happening if the load in the lowered... · c21ebf5c
  Pete Cooper authored Feb 15, 2012
  
  Stop custom lowering for x86 DEC64m from happening if the load in the lowered sequence has more than 1 user llvm-svn: 150537
  c21ebf5c
- Use a temporary variable, rather than a series of redundant calls. · dccc4794
  Chad Rosier authored Feb 15, 2012
  
  llvm-svn: 150536
  dccc4794
Feb 14, 2012
- Remove unnecessary assignment to temporary, ResultReg. · 5b9c3974
  Chad Rosier authored Feb 14, 2012
  
  llvm-svn: 150520
  5b9c3974
- Move old movl vector_shuffle patterns. Not needed anymore since... · cfad98f7
  Craig Topper authored Feb 14, 2012
  
  Move old movl vector_shuffle patterns. Not needed anymore since vector_shuffles shouldn't reach isel. llvm-svn: 150462
  cfad98f7
- Third time's the charm...? · 876f24f7
  Lang Hames authored Feb 14, 2012
  
  llvm-svn: 150447
  876f24f7
- Unswap swap operands, partially reducing confusion. · 185455df
  Lang Hames authored Feb 14, 2012
  
  llvm-svn: 150444
  185455df
- Don't reserve the R0 and R1 registers here. We don't use these registers, and · 05d6f2ff
  Bill Wendling authored Feb 13, 2012
  
  marking them as "live-in" into a BB ruins some invariants that the back-end tries to maintain. llvm-svn: 150437
  05d6f2ff
- Make operands for VSWP read-modify-write. · aef4ca78
  Lang Hames authored Feb 13, 2012
  
  llvm-svn: 150433
  aef4ca78
Feb 13, 2012

Still more vector_shuffle pattern removal. · 8b19d788
Craig Topper authored Feb 13, 2012
```
llvm-svn: 150365
```
8b19d788

Fix various issues (or do cleanups) found by enabling certain MSVC warnings. · 32e983e4

Ahmed Charles authored Feb 13, 2012

- Use unsigned literals when the desired result is unsigned. This mostly allows unsigned/signed mismatch warnings to be less noisy even if they aren't on by default.
- Remove misplaced llvm_unreachable.
- Add static to a declaration of a function on MSVC x86 only.
- Change some instances of calling a static function through a variable to simply calling that function while removing the unused variable.

llvm-svn: 150364

32e983e4

Remove more vector_shuffle patterns for unpack. These should be target... · 74650add

Craig Topper authored Feb 13, 2012

Remove more vector_shuffle patterns for unpack. These should be target specific nodes when they get to isel.

llvm-svn: 150363

74650add

Recommit r150328. Previous test failures should be fixed by r150360. · 6d471c9e
Craig Topper authored Feb 13, 2012
```
llvm-svn: 150362
```
6d471c9e

Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only... · 87119fa3

Craig Topper authored Feb 13, 2012

Update CanXFormVExtractWithShuffleIntoLoad to ensure bitcasts of loads only have one use. Matches DAGCombiner and prevents vector_shuffles from reaching isel.

llvm-svn: 150360

87119fa3

Revert r150328, "Remove more vector_shuffle patterns." · 0826c17d
NAKAMURA Takumi authored Feb 13, 2012
```
It caused 3 failures on pre-penryn and non-x86(generic) hosts.

llvm-svn: 150357
```
0826c17d

Fixed bug when custom lowering DEC64m on x86. · 71be57bb

Pete Cooper authored Feb 13, 2012

If the DEC node had more than one user, it was doing this lowering but
leaving the original DEC node around and so decrementing twice.

Fixes PR11964.

llvm-svn: 150356

71be57bb

Feb 12, 2012
- Remove more vector_shuffle patterns. · e24c94af
  Craig Topper authored Feb 12, 2012
  
  llvm-svn: 150328
  e24c94af
- Remove redundant getAnalysis<> calls in GlobalOpt. Add a few Itanium ABI calls · 4b273cb7
  Nick Lewycky authored Feb 12, 2012
  
  to TargetLibraryInfo and use one of them in GlobalOpt. llvm-svn: 150323
  4b273cb7
- Remove more vector_shuffle patterns. · d40d9eb2
  Craig Topper authored Feb 12, 2012
  
  llvm-svn: 150321
  d40d9eb2
- Remove more vector_shuffle patterns. · 330ca977
  Craig Topper authored Feb 11, 2012
  
  llvm-svn: 150314
  330ca977
Feb 11, 2012

Add support for implicit TLS model used with MS VC runtime. · c6b4017c
Anton Korobeynikov authored Feb 11, 2012
```
Patch by Kai Nacke!

llvm-svn: 150307
```
c6b4017c
Don't mix declarations and code. · 915e3d95
Benjamin Kramer authored Feb 11, 2012
```
llvm-svn: 150305
```
915e3d95
Make the EDis tables const. · 428704eb
Benjamin Kramer authored Feb 11, 2012
```
llvm-svn: 150304
```
428704eb

Reuse the enum names from X86Desc in the X86Disassembler. · 478e8de8

Benjamin Kramer authored Feb 11, 2012

This requires some gymnastics to make it available for C code. Remove the names
from the disassembler tables, making them relocation free.

llvm-svn: 150303

478e8de8

Remove some patterns for matching vector_shuffle instructions since... · 981c6cf7

Craig Topper authored Feb 11, 2012

Remove some patterns for matching vector_shuffle instructions since vector_shuffles should be custom lowered before isel.

llvm-svn: 150299

981c6cf7

Fix shuffle lowering code to stop creating temporary DAG nodes to do shuffle... · 11826a6e

Craig Topper authored Feb 11, 2012

Fix shuffle lowering code to stop creating temporary DAG nodes to do shuffle mask checks on. This seemed to be confusing things such that vector_shuffle ops got through to iselection. This is another step towards removing the vector_shuffle handling patterns from isel.

llvm-svn: 150296

11826a6e

Feb 10, 2012

Revert r150222, as the clang driver now handles this properly. · 1c9dd297

Jim Grosbach authored Feb 10, 2012

Now that the clang driver passes the CPU and feature information to
the backend when processing assembly files (150273), this isn't necessary.

llvm-svn: 150274

1c9dd297

Make valgrind happy. · c7f48417
Jason W Kim authored Feb 10, 2012
```
llvm-svn: 150251
```
c7f48417
unnecessary include · f08915ca
Andrew Trick authored Feb 10, 2012
```
llvm-svn: 150228
```
f08915ca
PTX no longer needs to provide its own backend. · f4ff2343
Andrew Trick authored Feb 10, 2012
```
llvm-svn: 150227
```
f4ff2343

RegAlloc superpass: includes phi elimination, coalescing, and scheduling. · d3f8fe81

Andrew Trick authored Feb 10, 2012

Creates a configurable regalloc pipeline.

Ensure specific llc options do what they say and nothing more: -regalloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows, for example, testing coalescing and scheduling under -O0 or vice versa.

When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>.

CodeGen transformation passes are never "required" as an analysis.

ProcessImplicitDefs does not require LiveVariables.

We have a plan to massively simplify some of the early passes within the regalloc superpass.

llvm-svn: 150226

d3f8fe81