- Jan 14, 2012
- Andrew Trick authored (llvm-svn: 148171)
- Andrew Trick authored (llvm-svn: 148170)
- Evan Cheng authored (llvm-svn: 148168):
  live across BBs before register allocation. This miscompiled 197.parser when a cmp + b pair was optimized to a cbnz instruction even though the CPSR def is live-in to a successor:

      cbnz r6, LBB89_12
      ...
    LBB89_12:
      ble LBB89_1

  The fix consists of two parts. 1) Teach LiveVariables that some unallocatable registers might be live-outs, so don't mark their last use as a kill if they are. 2) The ARM constant-pool island pass shouldn't form cbz / cbnz if the conditional branch does not kill CPSR.
  rdar://10676853
- Rafael Espindola authored (llvm-svn: 148156)
- Jan 13, 2012
- Rafael Espindola authored (llvm-svn: 148150)
- Andrew Trick authored (llvm-svn: 148143)
- Andrew Trick authored (llvm-svn: 148105)
- Andrew Trick authored (llvm-svn: 148103)
- Andrew Trick authored (llvm-svn: 148102)
- Evan Cheng authored (llvm-svn: 148086):
  overly conservative. It was concerned about cases where it would prohibit folding simple [r, c] addressing modes, e.g.

      ldr r0, [r2]
      ldr r1, [r2, #4]
      =>
      ldr r0, [r2], #4
      ldr r1, [r2]

  Change the logic to look for such cases, which allows it to form indexed memory ops more aggressively.
  rdar://10674430
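A hedged sketch of the kind of source that produces this pattern (the function name is hypothetical; the comments map each load to the ARM sequence from the message above):

```cpp
// Two adjacent word loads: folding the second load's #4 offset into a
// post-indexed form of the first saves a separate address computation.
unsigned sum_adjacent(const unsigned *p) {
  unsigned a = p[0]; // ldr r0, [r2]      =>  ldr r0, [r2], #4
  unsigned b = p[1]; // ldr r1, [r2, #4]  =>  ldr r1, [r2]
  return a + b;
}
```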
- Bill Wendling authored (llvm-svn: 148065)
- Bill Wendling authored (llvm-svn: 148064):
  The registers are placed into the saved-registers list in reverse order, which is why the original loop was written to iterate backwards.
- Jan 12, 2012
- Pete Cooper authored (llvm-svn: 148050):
  Added FPOW, FEXP, FLOG to PromoteNode so that custom actions can be set to Promote for those operations. Sorry, no test case yet.
- Evan Cheng authored (llvm-svn: 148043):
  killed registers are needed below the insertion point, then unset the kill marker. Sorry, I'm not able to find a reduced test case.
  rdar://10660944
- Evan Cheng authored (llvm-svn: 148033)
- Jakob Stoklund Olesen authored (llvm-svn: 148031)
- Jakob Stoklund Olesen authored (llvm-svn: 147979)
- Jan 11, 2012
- Jakob Stoklund Olesen authored (llvm-svn: 147976): This helper method is too simplistic for RAGreedy.
- Jakob Stoklund Olesen authored (llvm-svn: 147975)
- Jakob Stoklund Olesen authored (llvm-svn: 147972): No functional change.
- Nadav Rotem authored (llvm-svn: 147964):
  When we load the v12i32 type, the GenWidenVectorLoads method generates two loads, v8i32 and v4i32, and attempts to use CONCAT_VECTORS to join them. In this fix I concatenate undef values to widen the smaller value. The test "widen_load-2.ll" also exposes this bug on AVX.
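A rough illustration of the widening step in plain C++, under the stated assumption that zero-filled lanes stand in for the DAG's undef values (none of these names are LLVM API):

```cpp
#include <array>
#include <cstddef>
#include <cstdio>

// Pad a narrow chunk up to the wider element count so that a subsequent
// concatenation sees two same-typed pieces (mirroring how the fix widens
// the v4i32 half before CONCAT_VECTORS).
template <std::size_t Wide, std::size_t Narrow>
std::array<int, Wide> widenWithPadding(const std::array<int, Narrow> &V) {
  static_assert(Narrow <= Wide, "can only widen");
  std::array<int, Wide> Out{}; // zero here stands in for 'undef' lanes
  for (std::size_t I = 0; I < Narrow; ++I)
    Out[I] = V[I];
  return Out;
}

int main() {
  std::array<int, 4> Lo{1, 2, 3, 4};
  auto WideHalf = widenWithPadding<8>(Lo); // v4i32 -> v8i32, upper lanes undef
  std::printf("%d %d\n", WideHalf[0], WideHalf[3]);
  return 0;
}
```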
- Chandler Carruth authored (llvm-svn: 147936):
  detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows:

      unsigned x = my_accelerator_table[input >> 11];

  Here we have some lookup table that we index using the high bits of 'input'. Each entry in the table is 4 bytes, which means this implicitly gets turned into (once lowered out of a GEP):

      *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2));

  The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right, which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured.

  In order for all of this to work, one part of the DAG needs to be canonicalized *still further* than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern.
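The canonicalization at the heart of this is a plain arithmetic identity, easy to check exhaustively over a small range; a self-contained C++ sketch (illustrative, not LLVM code; the constants mirror the example above):

```cpp
#include <cassert>

int main() {
  for (unsigned input = 0; input < (1u << 16); ++input) {
    unsigned scaled = (input >> 11) << 2; // table index scaled by 4 bytes
    unsigned masked = (input >> 9) & ~3u; // canonical shift+mask form
    assert(scaled == masked); // same value; the mask form hides the shl
  }
  return 0;
}
```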
- Jakob Stoklund Olesen authored (llvm-svn: 147912):
  Consider this code:

      int h() {
        int x;
        try {
          x = f();
          g();
        } catch (...) {
          return x+1;
        }
        return x;
      }

  The variable x is undefined on the first edge to the landing pad, but it has the f() return value on the second edge to the landing pad. SplitAnalysis::getLastSplitPoint() would assume that the return value from f() was live into the landing pad when f() throws, which is of course impossible.

  Detect these cases, and treat them as if the landing pad wasn't there. This allows spill code to be inserted after the function call to f().
  <rdar://problem/10664933>
- Jakob Stoklund Olesen authored (llvm-svn: 147911): Delete the alternative implementation in LiveIntervalAnalysis. These functions computed the same thing, but SplitAnalysis caches the result.
- Evan Cheng authored (llvm-svn: 147902): the physical registers are not allocatable.
- Jan 10, 2012
- Evan Cheng authored (llvm-svn: 147884)
- Chandler Carruth authored (llvm-svn: 147861):
  of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, and we should assume as much.
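A minimal sketch of that idiom, under the assumption it matches the usual LLVM pattern; `__builtin_unreachable()` (GCC/Clang) stands in for LLVM's `llvm_unreachable` so the snippet is self-contained:

```cpp
#include <cstdio>

enum class Kind { Scalar, Vector, Pointer };

// No 'default:' case: if a new enumerator is added, -Wswitch flags this
// switch, while the unreachable marker still tells the optimizer that every
// reachable path returns.
static const char *name(Kind K) {
  switch (K) {
  case Kind::Scalar:  return "scalar";
  case Kind::Vector:  return "vector";
  case Kind::Pointer: return "pointer";
  }
  __builtin_unreachable(); // stand-in for llvm_unreachable("invalid Kind")
}

int main() {
  std::printf("%s\n", name(Kind::Vector));
  return 0;
}
```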
- David Blaikie authored (llvm-svn: 147855)
- Nadav Rotem authored (llvm-svn: 147851): Fix a bug in the legalization of shuffle vectors. When we emulate shuffles using BUILD_VECTOR nodes we may be using a BV of a different type. Make sure to cast it back.
- Evan Cheng authored (llvm-svn: 147827):
  define physical registers. It's currently very restrictive, only catching cases where the CE is in an immediate (and only) predecessor. But it catches a surprisingly large number of cases.
  rdar://10660865
- Rafael Espindola authored (llvm-svn: 147820)
- Jan 08, 2012
- Evan Cheng authored (llvm-svn: 147751): safely proven not to have been clobbered. No small test case possible.
- Jan 07, 2012
- Craig Topper authored (llvm-svn: 147733)
- Craig Topper authored (llvm-svn: 147728):
  Add some DAG combines for SUBC/SUBE:
  - If nothing uses the carry/borrow out of subc, turn it into a sub.
  - Turn (subc x, x) into 0 with no borrow.
  - Turn (subc x, 0) into x with no borrow.
  - Turn (subc -1, x) into (xor x, -1) with no borrow.
  - Turn sube with no borrow in into subc.
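Several of these combines are simple algebraic identities; a quick self-contained check in C++ (unsigned arithmetic standing in for the DAG nodes; not LLVM code):

```cpp
#include <cassert>
#include <limits>

int main() {
  const unsigned AllOnes = std::numeric_limits<unsigned>::max(); // i.e. -1
  for (unsigned x = 0; x < (1u << 16); ++x) {
    assert(x - x == 0u);                  // (subc x, x)  -> 0, no borrow
    assert(x - 0u == x);                  // (subc x, 0)  -> x, no borrow
    assert(AllOnes - x == (x ^ AllOnes)); // (subc -1, x) -> (xor x, -1)
  }
  return 0;
}
```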
- Jakob Stoklund Olesen authored (llvm-svn: 147726):
  Reserved registers don't have proper live ranges; their LiveInterval simply has a snippet of liveness for each def. Virtual registers with a single value that is a copy of a reserved register (typically %esp) can be coalesced with the reserved register if the live range doesn't overlap any reserved register defs.

  When coalescing with a reserved register, don't modify the reserved register's live range. Just leave it as a bunch of dead defs. This eliminates quadratic coalescer behavior in i386 functions with many function calls.
  PR11699
- Jakob Stoklund Olesen authored (llvm-svn: 147725)
- Evan Cheng authored (llvm-svn: 147719):
  up so the branch folding pass can't use the scavenger. :-( This doesn't break anything currently. It just means targets which do not carefully update kill markers cannot run the post-RA scheduler (not new; it has always been the case). We should fix this at some point since it's really hacky.
- Evan Cheng authored (llvm-svn: 147716):
  opportunities that only present themselves after late optimizations such as tail duplication, e.g.

      ## BB#1:
        movl %eax, %ecx
        movl %ecx, %eax
        ret

  The register allocator also leaves some of them around (due to false dependencies between copies from phi-elimination, etc.).

  This required some changes in codegen passes. The post-RA scheduler and the pseudo-instruction expansion passes have been moved after branch folding and tail merging. They previously ran before branch folding because it did not always update block live-ins; that's fixed now. The pass change makes sense independently, since we want to properly schedule instructions after branch folding / tail duplication.
  rdar://10428165
  rdar://10640363
- Andrew Trick authored (llvm-svn: 147703)
- Chad Rosier authored (llvm-svn: 147696)