Commits · 8e1d906734d72a94afa1c2ca35465265ff0df50d · Roger Ferrer / llvm-epi-0.8

Nov 13, 2011

Enhance the assertion mechanisms in place to make it easier to catch · 8e1d9067

Chandler Carruth authored Nov 13, 2011

when we fail to place all the blocks of a loop. Currently this is
happening for unnatural loops, and this logic helps more immediately
point to the problem.

llvm-svn: 144504

8e1d9067

Rename SlotIndexes to match how they are used. · 90b5e565

Jakob Stoklund Olesen authored Nov 13, 2011

The old naming scheme (load/use/def/store) can be traced back to an old
linear scan article, but the names don't match how slots are actually
used.

The load and store slots are not needed after the deferred spill code
insertion framework was deleted.

The use and def slots don't make any sense because we are using
half-open intervals as is customary in C code, but the names suggest
closed intervals.  In reality, these slots were used to distinguish
early-clobber defs from normal defs.

The new naming scheme also has 4 slots, but the names match how the
slots are really used.  This is a purely mechanical renaming, but some
of the code makes a lot more sense now.

llvm-svn: 144503

90b5e565
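
The half-open interval point in 90b5e565 can be illustrated with a small model. This is a sketch under stated assumptions, not LLVM's implementation: the slot names, `slot_index`, and `overlaps` are all hypothetical, since the commit message only says there are four slots per instruction and that the slots distinguish early-clobber defs from normal defs.

```python
from enum import IntEnum


class Slot(IntEnum):
    # Illustrative names only; the commit message does not list the new names.
    BLOCK = 0
    EARLY_CLOBBER = 1
    REGISTER = 2
    DEAD = 3


SLOTS_PER_INSTR = len(Slot)


def slot_index(instr_no, slot):
    """Map (instruction number, slot) to a single linear index."""
    return instr_no * SLOTS_PER_INSTR + slot


def overlaps(a, b):
    """True if the half-open intervals a = [start, end) and b share a point."""
    return a[0] < b[1] and b[0] < a[1]


# Assumed scenario: value x is last read by instruction 5,
# and value y is defined by that same instruction.
x = (slot_index(2, Slot.REGISTER), slot_index(5, Slot.REGISTER))
y_normal = (slot_index(5, Slot.REGISTER), slot_index(8, Slot.REGISTER))
y_early = (slot_index(5, Slot.EARLY_CLOBBER), slot_index(8, Slot.REGISTER))

print(overlaps(x, y_normal))  # False: a normal def may reuse x's register
print(overlaps(x, y_early))   # True: an early-clobber def interferes with x
```

Because the intervals are half-open, a range ending at a slot and a range starting at the same slot do not overlap. Only the earlier early-clobber slot produces interference in this model.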

Add BLSI, BLSMSK, and BLSR to getTargetNodeName. · b8bcb473
Craig Topper authored Nov 13, 2011
```
llvm-svn: 144502
```
b8bcb473

Teach MBP to force-merge layout successors for blocks with unanalyzable · 0bb42c0f

Chandler Carruth authored Nov 13, 2011

branches that also may involve fallthrough. In the case of blocks with
no fallthrough, we can still re-order the blocks profitably. For example
instruction decoding will in some cases continue past an indirect jump,
making laying out its most likely successor there profitable.

Note, no test case. I don't know how to write a test case that exercises
this logic, but it matches the described desired semantics in
discussions with Jakob and others. If anyone has a nice example of IR
that will trigger this, that would be lovely.

Also note, there are still assertion failures in real world code with
this. I'm digging into those next, now that I know this isn't the cause.

llvm-svn: 144499

0bb42c0f

Hoist another gross nested loop into a helper method. · f9213fe7
Chandler Carruth authored Nov 13, 2011
```
llvm-svn: 144498
```
f9213fe7
Add a missing doxygen comment for a helper method. · eb4ec3ae
Chandler Carruth authored Nov 13, 2011
```
llvm-svn: 144497
```
eb4ec3ae
Hoist a nested loop into its own method. · b336172f
Chandler Carruth authored Nov 13, 2011
```
llvm-svn: 144496
```
b336172f

Rewrite #3 of machine block placement. This is based somewhat on the · 8d150789

Chandler Carruth authored Nov 13, 2011

second algorithm, but only loosely. It is more heavily based on the last
discussion I had with Andy. It continues to walk from the inner-most
loop outward, but there is a key difference. With this algorithm we
ensure that as we visit each loop, the entire loop is merged into
a single chain. At the end, the entire function is treated as a "loop",
and merged into a single chain. This chain forms the desired sequence of
blocks within the function. Switching to a single algorithm removes my
biggest problem with the previous approaches -- they had different
behavior depending on which system triggered the layout. Now there is
exactly one algorithm and one basis for the decision making.

The other key difference is how the chain is formed. This is based
heavily on the idea Andy mentioned of keeping a worklist of blocks that
are viable layout successors based on the CFG. Having this set allows us
to consistently select the best layout successor for each block. It is
expensive though.

The code here remains very rough. There is a lot that needs to be done
to clean up the code, and to make the runtime cost of this pass much
lower. Very much WIP, but this was a giant chunk of code and I'd rather
folks see it sooner rather than later. Everything remains behind a flag of
course.

I've added a couple of tests to exercise the issues that this iteration
was motivated by: loop structure preservation. I've also fixed one test
that was exhibiting the broken behavior of the previous version.

llvm-svn: 144495

8d150789
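
The worklist idea in 8d150789 can be sketched roughly as below. This is a heavily simplified illustration and not the pass itself: it handles a single region rather than walking loops from the innermost outward, and `layout_chain`, the `succs` mapping, and the edge weights are all hypothetical names and data.

```python
def layout_chain(entry, succs):
    """Greedily order blocks into one chain.

    succs maps block -> {successor: edge weight}. Blocks that are not
    reachable from entry are left out of the chain.
    """
    chain = [entry]
    placed = {entry}
    # Unplaced blocks reachable from a placed block -> best incoming weight.
    worklist = {}

    def note_successors(block):
        for succ, weight in succs.get(block, {}).items():
            if succ not in placed:
                worklist[succ] = max(weight, worklist.get(succ, 0))

    note_successors(entry)
    while worklist:
        tail = chain[-1]
        # Prefer the hottest unplaced successor of the tail (a fallthrough).
        candidates = {s: w for s, w in succs.get(tail, {}).items()
                      if s not in placed}
        if candidates:
            nxt = max(candidates, key=candidates.get)
        else:
            # No viable fallthrough: take the best block from the worklist.
            nxt = max(worklist, key=worklist.get)
        del worklist[nxt]
        chain.append(nxt)
        placed.add(nxt)
        note_successors(nxt)
    return chain


cfg = {
    "entry": {"header": 1.0},
    "header": {"body": 0.9, "exit": 0.1},
    "body": {"header": 1.0},
    "exit": {},
}
print(layout_chain("entry", cfg))  # ['entry', 'header', 'body', 'exit']
```

In this toy CFG the loop blocks stay contiguous and the cold exit block is placed last. Re-scanning the candidates on every step hints at the cost the commit message calls expensive.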

The order in which the predicate is added differs between Thumb and ARM mode. ... · 1198d894

Chad Rosier authored Nov 13, 2011

The order in which the predicate is added differs between Thumb and ARM mode.  Fix predicate when in ARM mode and restore SelectIntrinsicCall.

llvm-svn: 144494

1198d894

Temporarily disable SelectIntrinsicCall when in ARM mode. This is causing failures. · a476e391
Chad Rosier authored Nov 13, 2011
```
llvm-svn: 144492
```
a476e391
Fix comments. · 5196efdf
Chad Rosier authored Nov 13, 2011
```
llvm-svn: 144490
```
5196efdf

Add support for emitting both signed- and zero-extend loads. Fix · c8cfd3a8

Chad Rosier authored Nov 13, 2011

SimplifyAddress to handle either a 12-bit unsigned offset or the ARM +/-imm8
offsets (addressing mode 3).  This enables a load followed by an integer 
extend to be folded into a single load.

For example:
ldrb r1, [r0]       ldrb r1, [r0]
uxtb r2, r1     =>
mov  r3, r2         mov  r3, r1

llvm-svn: 144488

c8cfd3a8
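
The two offset ranges named in c8cfd3a8 can be written as a small range check. This is a minimal sketch, not the code from the commit: `offset_fits` and its `uses_addrmode3` parameter are hypothetical, and the ranges come straight from the commit message (a 12-bit unsigned offset, or +/-imm8 for addressing mode 3).

```python
def offset_fits(offset, uses_addrmode3):
    """Return True if the offset can be encoded directly in the load.

    uses_addrmode3 selects the +/-imm8 form; otherwise the 12-bit
    unsigned form is assumed.
    """
    if uses_addrmode3:
        return -255 <= offset <= 255
    return 0 <= offset <= 0xFFF


print(offset_fits(4095, uses_addrmode3=False))  # True
print(offset_fits(4096, uses_addrmode3=False))  # False
print(offset_fits(-255, uses_addrmode3=True))   # True
print(offset_fits(256, uses_addrmode3=True))    # False
```

When the check fails, the address presumably has to be computed into a register first. That step is not shown here.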

Prune more RALinScan. RALinScan was also here! · 4784df71
NAKAMURA Takumi authored Nov 13, 2011
```
llvm-svn: 144487
```
4784df71
More dead code elimination in VirtRegMap. · c601d8c7
Jakob Stoklund Olesen authored Nov 13, 2011
```
This thing is looking a lot like a virtual register map now.

llvm-svn: 144486
```
c601d8c7
Stop tracking spill slot uses in VirtRegMap. · 28df7ef8
Jakob Stoklund Olesen authored Nov 13, 2011
```
Nobody cared, StackSlotColoring scans the instructions to find used stack
slots.

llvm-svn: 144485
```
28df7ef8

Remove dead code and data from VirtRegMap. · 92255f27

Jakob Stoklund Olesen authored Nov 13, 2011

Most of this stuff was supporting the old deferred spill code insertion
mechanism.  Modern spillers just edit machine code in place.

llvm-svn: 144484

92255f27

Stop tracking unused registers in VirtRegMap. · 38b3f312
Jakob Stoklund Olesen authored Nov 13, 2011
```
The information was only used by the register allocator in
StackSlotColoring.

llvm-svn: 144482
```
38b3f312

Remove the -color-ss-with-regs option. · 6ddb767f

Jakob Stoklund Olesen authored Nov 13, 2011

It was off by default.

The new register allocators don't have the problems that made it
necessary to reallocate registers during stack slot coloring.

llvm-svn: 144481

6ddb767f

Delete VirtRegRewriter. · 5343da64
Jakob Stoklund Olesen authored Nov 13, 2011
```
And there was much rejoicing.

llvm-svn: 144480
```
5343da64
Switch PBQP to VRM's trivial rewriter. · 03f73ab7
Jakob Stoklund Olesen authored Nov 13, 2011
```
The very complicated VirtRegRewriter is going away.

llvm-svn: 144479
```
03f73ab7
Delete the old spilling framework from LiveIntervalAnalysis. · f61a6fe2
Jakob Stoklund Olesen authored Nov 12, 2011
```
This is dead code, all register allocators use InlineSpiller.

llvm-svn: 144478
```
f61a6fe2
Delete the 'standard' spiller which used the old spilling framework. · 7ef502f6
Jakob Stoklund Olesen authored Nov 12, 2011
```
The current register allocators all use the inline spiller.

llvm-svn: 144477
```
7ef502f6

Switch PBQP to the modern InlineSpiller framework. · 11bb63a7

Jakob Stoklund Olesen authored Nov 12, 2011

It is worth noting that the old spiller would split live ranges around
basic blocks. The new spiller doesn't do that.

PBQP should do its own live range splitting with
SplitEditor::splitSingleBlock() if desired.  See
RAGreedy::tryBlockSplit().

llvm-svn: 144476

11bb63a7

Nov 12, 2011
- Delete the linear scan register allocator. · e7e50e6f
  Jakob Stoklund Olesen authored Nov 12, 2011
```
RegAllocGreedy has been the default for six months now.

Deleting RegAllocLinearScan makes it possible to also delete
VirtRegRewriter and clean up the spiller code.

llvm-svn: 144475
```
  e7e50e6f
- Remove histogram tests. · ce4ef9f8
  Jakob Stoklund Olesen authored Nov 12, 2011
```
Counting the number of occurrences of each opcode is not a useful test.

llvm-svn: 144474
```
  ce4ef9f8
- RAGreedy is better about hinting now. · 0eac531b
  Jakob Stoklund Olesen authored Nov 12, 2011
```
Or maybe we are just getting lucky.

llvm-svn: 144473
```
  0eac531b
- Linear scan is going away. · 8ec1a92a
  Jakob Stoklund Olesen authored Nov 12, 2011
```
llvm-svn: 144472
```
  8ec1a92a
- XFAIL test that depends on linear scan to remove dead code. · 654d6088
  Jakob Stoklund Olesen authored Nov 12, 2011
```
Filed PR11364 to track the problem.  Should the register allocator
eliminate dead code?

llvm-svn: 144471
```
  654d6088
- Remove obsolete test. · fa3a8ee6
  Jakob Stoklund Olesen authored Nov 12, 2011
```
This test was committed with a bugfix to RemoveCopyByCommutingDef, but
that optimization is no longer triggered by this test.

llvm-svn: 144470
```
  fa3a8ee6
- Remove obsolete test. · 80b3d299
  Jakob Stoklund Olesen authored Nov 12, 2011
```
This test is for a very specific LocalRewriter bug.  LocalRewriter is
going away.

llvm-svn: 144469
```
  80b3d299
- Remove obsolete test. · 0c7d9d90
  Jakob Stoklund Olesen authored Nov 12, 2011
```
I don't think this test does what it was supposed to do, and
LocalRewriter is going away anyway.

llvm-svn: 144463
```
  0c7d9d90
- Eliminate more linear scan tests. · 126f9779
  Jakob Stoklund Olesen authored Nov 12, 2011
```
llvm-svn: 144462
```
  126f9779
- Switch a couple -O0 tests to RABasic. · 9d090daa
  Jakob Stoklund Olesen authored Nov 12, 2011
```
llvm-svn: 144461
```
  9d090daa
- Switch a few tests off linearscan. · 4deff7bc
  Jakob Stoklund Olesen authored Nov 12, 2011
```
llvm-svn: 144460
```
  4deff7bc
- Delete old test of a VirtRegRewriter feature. · 6ac6aa78
  Jakob Stoklund Olesen authored Nov 12, 2011
```
This test doesn't expose the issue with RAGreedy.

I filed PR11363 to track the missing InlineSpiller feature.

llvm-svn: 144459
```
  6ac6aa78
- Remove old test that doesn't make sense. · 74d091b3
  Jakob Stoklund Olesen authored Nov 12, 2011
```
The test is checking that the output doesn't contain any 'mov '
strings. It does contain movl, though.

llvm-svn: 144458
```
  74d091b3
- Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns... · 3dc75f9e
  Craig Topper authored Nov 12, 2011
```
Add more AVX2 shift lowering support. Move AVX2 variable shift to use patterns instead of custom lowering code.

llvm-svn: 144457
```
  3dc75f9e
- Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! · d48ab845
  Nick Lewycky authored Nov 12, 2011
```
llvm-svn: 144454
```
  d48ab845
- Fix typo. · 77733535
  Akira Hatanaka authored Nov 12, 2011
```
llvm-svn: 144453
```
  77733535
- Implement Mips64's handling of byval arguments in LowerCall. · 19891f84
  Akira Hatanaka authored Nov 12, 2011
```
llvm-svn: 144452
```
  19891f84
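
The reason given in 74d091b3 for removing the 'mov ' test can be reproduced with a plain substring check. This is an illustrative sketch only: `output_has_plain_mov` and the sample assembly line are hypothetical, not taken from the removed test.

```python
def output_has_plain_mov(asm_text):
    # Substring check as described in the commit message: look for 'mov '.
    return "mov " in asm_text


sample = "movl %eax, %ebx\n"
print(output_has_plain_mov(sample))  # False: 'movl' is followed by 'l', not a space
```

A check like this passes whether or not moves are emitted, because a mnemonic with a size suffix, such as movl, never matches the pattern.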