Commits · bc55bfde033d67329f9d5dd3ec5e1fbf665297ad · Roger Ferrer / llvm-epi-0.8

Aug 11, 2012

Add a proper if-conversion cost model. · bc55bfde

Jakob Stoklund Olesen authored Aug 10, 2012

Detect when there is not enough available ILP, so if-conversion can't
speculate instructions for free.

Compute the lengthening of the critical path when inserting a select
instruction that depends on the condition as well as both sides of the
if.

Reject conversions that would stretch the critical path by more than
half a mispredict penalty.

llvm-svn: 161713

bc55bfde

Give MachineTraceMetrics its own debug tag. · a0042acd
Jakob Stoklund Olesen authored Aug 10, 2012
```
llvm-svn: 161712
```
a0042acd

Add more trace query functions. · 34844209

Jakob Stoklund Olesen authored Aug 10, 2012

Trace::getResourceLength() computes the number of cycles required to
execute the trace when ignoring data dependencies. The number can be
compared to the critical path to estimate the trace ILP.

Trace::getPHIDepth() computes the data dependency depth of a PHI in a
trace successor that isn't necessarily part of the trace.

llvm-svn: 161711

34844209

Aug 10, 2012

Add getTPred() and getFPred() functions. · 0a99062c
Jakob Stoklund Olesen authored Aug 10, 2012
```
They identify the PHI predecessors in both diamonds and triangles.

llvm-svn: 161689
```
0a99062c

Include loop-carried dependencies when computing instr heights. · 0954d419

Jakob Stoklund Olesen authored Aug 10, 2012

When a trace ends with a back-edge, include PHIs in the loop header in
the height computations. This makes the critical path through a loop
more accurate by including the latencies of the last instructions in the
loop.

llvm-svn: 161688

0954d419

Update edge weights correctly in replaceSuccessor(). · 8c28ac9e

Jakob Stoklund Olesen authored Aug 10, 2012

When replacing Old with New, it can happen that New is already a
successor. Add the old and new edge weights instead of creating a
duplicate edge.

llvm-svn: 161653

8c28ac9e

Reapply r161633-161634 "Partition use lists so defs always come before uses."" · d9b66506
Jakob Stoklund Olesen authored Aug 10, 2012
```
No changes to these patches, MRI needed to be notified when changing
uses into defs and vice versa.

llvm-svn: 161644
```
d9b66506
Also update MRI use lists when changing a use to a def and vice versa. · ae7b9711
Jakob Stoklund Olesen authored Aug 10, 2012
```
This was the cause of the buildbot failures.

llvm-svn: 161643
```
ae7b9711
Revert r161633-161634 "Partition use lists so defs always come before uses." · acd27c92
Jakob Stoklund Olesen authored Aug 09, 2012
```
These commits broke a number of buildbots.

llvm-svn: 161640
```
acd27c92

Partition use lists so defs always come before uses. · df01e007

Jakob Stoklund Olesen authored Aug 09, 2012

This makes it possible to speed up def_iterator by stopping at the first
use. This makes def_empty() and getUniqueVRegDef() much faster when
there are many uses.

In a +Asserts build, LiveVariables is 100x faster in one case because
getVRegDef() has an assertion that would scan to the end of a
def_iterator chain.

Spill weight calculation is significantly faster (300x in one case)
because isTriviallyReMaterializable() calls MRI->isConstantPhysReg(%RIP)
which calls def_empty(%RIP).

llvm-svn: 161634

df01e007

Don't use pointer-pointers for the register use lists. · 7d7051ca

Jakob Stoklund Olesen authored Aug 09, 2012

Use a more conventional doubly linked list where the Prev pointers form
a cycle. This means it is no longer necessary to adjust the Prev
pointers when reallocating the VRegInfo array.

The test changes are required because the register allocation hint is
using the use-list order to break ties.

llvm-svn: 161633

7d7051ca

Move use list management into MachineRegisterInfo. · c4102d49

Jakob Stoklund Olesen authored Aug 09, 2012

Register MachineOperands are kept in linked lists accessible via MRI's
reg_iterator interfaces. The linked list management was handled partly
by MachineOperand methods, partly by MRI methods.

Move all of the list management into MRI, delete
MO::AddRegOperandToRegInfo() and MO::RemoveRegOperandFromRegInfo().

Be more explicit about handling the cases where an MRI pointer isn't
available.

llvm-svn: 161632

c4102d49

Fix a future TwoAddressInstructionPass crash. · 420798ca
Jakob Stoklund Olesen authored Aug 09, 2012
```
No test case, the crash only happens when the default use list order is
changed.

llvm-svn: 161627
```
420798ca

Aug 09, 2012

Fix the legalization of ExtLoad on ARM. ExpandUnalignedLoad did not properly · e0f84d31
Nadav Rotem authored Aug 09, 2012
```
handle the cases where the memory value type was illegal. 
PR 13111. 

llvm-svn: 161565
```
e0f84d31
Don't use getNextOperandForReg() in RAFast. · f71bc7b2
Jakob Stoklund Olesen authored Aug 08, 2012
```
That particular optimization was probably premature anyway.

llvm-svn: 161541
```
f71bc7b2

Deal with irreducible control flow when building traces. · bf1ac4bd

Jakob Stoklund Olesen authored Aug 08, 2012

We filter out MachineLoop back-edges during the trace-building PO
traversals, but it is possible to have CFG cycles that aren't natural
loops, and MachineLoopInfo doesn't include such cycles.

Use a standard visited set to detect such CFG cycles, and completely
ignore them when picking traces.

llvm-svn: 161532

bf1ac4bd

Aug 08, 2012

Heed -stress-early-ifcvt. · fa8a26f9
Jakob Stoklund Olesen authored Aug 08, 2012
```
llvm-svn: 161513
```
fa8a26f9
Get the MispredictPenalty from MCSchedModel. · e71b6c6b
Jakob Stoklund Olesen authored Aug 08, 2012
```
Thanks, Andy!

llvm-svn: 161507
```
e71b6c6b
Minor cleanup of defaultDefLatency API · db9b1b5e
Andrew Trick authored Aug 08, 2012
```
llvm-svn: 161470
```
db9b1b5e
Revert "Fix a quadratic algorithm in MachineBranchProbabilityInfo." · 0556be98
Jakob Stoklund Olesen authored Aug 08, 2012
```
It caused an assertion failure when compiling consumer-typeset.

llvm-svn: 161463
```
0556be98

X86: enable CSE between CMP and SUB · 1be131ba

Manman Ren authored Aug 08, 2012

We perform the following:
1> Use SUB instead of CMP for i8,i16,i32 and i64 in ISel lowering.
2> Modify MachineCSE to correctly handle implicit defs.
3> Convert SUB back to CMP if possible at peephole.

Removed pattern matching of (a>b) ? (a-b):0 and like, since they are handled
by peephole now.

rdar://11873276

llvm-svn: 161462

1be131ba

Fix a quadratic algorithm in MachineBranchProbabilityInfo. · c0b61ff9

Jakob Stoklund Olesen authored Aug 08, 2012

The getSumForBlock function was quadratic in the number of successors
because getSuccWeight would perform a linear search for an already known
iterator.

llvm-svn: 161460

c0b61ff9

Skip tied operand pairs that already have the same register. · fbf45dc2
Jakob Stoklund Olesen authored Aug 07, 2012
```
llvm-svn: 161454
```
fbf45dc2

Add SelectionDAG::getTargetIndex. · 505715d8

Jakob Stoklund Olesen authored Aug 07, 2012

This adds support for TargetIndex operands during isel. The meaning of
these (index, offset, flags) operands is entirely defined by the target.

llvm-svn: 161453

505715d8

Aug 07, 2012

For non-Darwin platforms, we want to generate stack protectors only for · 61396b81
Bill Wendling authored Aug 07, 2012
```
character arrays. This is in line with what GCC does.
<rdar://problem/10529227>

llvm-svn: 161446
```
61396b81

Add a new kind of MachineOperand: MO_TargetIndex. · 84689b0d

Jakob Stoklund Olesen authored Aug 07, 2012

A target index operand looks a lot like a constant pool reference, but
it is completely target-defined. It contains the 8-bit TargetFlags, a
32-bit index, and a 64-bit offset. It is preserved by all code generator
passes.

TargetIndex operands can be used to carry target-specific information in
cases where immediate operands won't suffice.

llvm-svn: 161441

84689b0d

Fix a couple of typos. · 296448b2
Jakob Stoklund Olesen authored Aug 07, 2012
```
llvm-svn: 161437
```
296448b2

Add trace accessor methods, implement primitive if-conversion heuristic. · 75d9d515

Jakob Stoklund Olesen authored Aug 07, 2012

Compare the critical paths of the two traces through an if-conversion
candidate. If the difference is larger than the branch brediction
penalty, reject the if-conversion. If would never pay.

llvm-svn: 161433

75d9d515

Add a much more conservative strategy for aligning branch targets. · 881d0a79

Chandler Carruth authored Aug 07, 2012

Previously, MBP essentially aligned every branch target it could. This
bloats code quite a bit, especially non-looping code which has no real
reason to prefer aligned branch targets so heavily.

As Andy said in review, it's still a bit odd to do this without a real
cost model, but this at least has much more plausible heuristics.

Fixes PR13265.

llvm-svn: 161409

881d0a79

MachineCSE: Update the heuristics for isProfitableToCSE. · cb36b8c2

Manman Ren authored Aug 07, 2012

If the result of a common subexpression is used at all uses of the candidate
expression, CSE should not increase the live range of the common subexpression.

rdar://11393714 and rdar://11819721

llvm-svn: 161396

cb36b8c2

Aug 04, 2012

Delete a dead variable. · a9d0b850
Jakob Stoklund Olesen authored Aug 04, 2012
```
TwoAddressInstructionPass doesn't remat any more.

llvm-svn: 161285
```
a9d0b850
TwoAddressInstructionPass refactoring: Extract another method. · a0c72ecf
Jakob Stoklund Olesen authored Aug 03, 2012
```
llvm-svn: 161284
```
a0c72ecf

Refactor and check "onlyReadsMemory" before optimizing builtins. · 874886cd

Bob Wilson authored Aug 03, 2012

This patch is mostly just refactoring a bunch of copy-and-pasted code, but
it also adds a check that the call instructions are readnone or readonly.
That check was already present for sin, cos, sqrt, log2, and exp2 calls, but
it was missing for the rest of the builtins being handled in this code.

llvm-svn: 161282

874886cd

TwoAddressInstructionPass refactoring: Extract a method. · 1162a154

Jakob Stoklund Olesen authored Aug 03, 2012

No functional change intended, except replacing a DenseMap with a
SmallDenseMap which should behave identically.

llvm-svn: 161281

1162a154

Begin adding support for updating LiveIntervals in TwoAddressInstructionPass. · 24bc514c
Jakob Stoklund Olesen authored Aug 03, 2012
```
This is far from complete, and only changes behavior when the
-early-live-intervals flag is passed to llc.

llvm-svn: 161273
```
24bc514c

Add an experimental -early-live-intervals option. · 1c465892

Jakob Stoklund Olesen authored Aug 03, 2012

This option runs LiveIntervals before TwoAddressInstructionPass which
will eventually learn to exploit and update the analysis.

Eventually, LiveIntervals will run before PHIElimination, and we can get
rid of LiveVariables.

llvm-svn: 161270

1c465892

Delete merged physreg copies in joinReservedPhysReg(). · 918999db

Jakob Stoklund Olesen authored Aug 03, 2012

Previously, the identity copy would survive through register allocation
before it was removed by the rewriter.

llvm-svn: 161269

918999db

Aug 03, 2012

Try to reduce the compile time impact of r161232. · 871701c6

Bob Wilson authored Aug 03, 2012

The previous change caused fast isel to not attempt handling any calls to
builtin functions. That included things like "printf" and caused some
noticable regressions in compile time. I wanted to avoid having fast isel
keep a separate list of functions that had to be kept in sync with what the
code in SelectionDAGBuilder.cpp was handling. I've resolved that here by
moving the list into TargetLibraryInfo. This is somewhat redundant in
SelectionDAGBuilder but it will ensure that we keep things consistent.

llvm-svn: 161263

871701c6

Fix memcmp code-gen to honor -fno-builtin. · fa59485b

Bob Wilson authored Aug 03, 2012

I noticed that SelectionDAGBuilder::visitCall was missing a check for memcmp
in TargetLibraryInfo, so that it would use custom code for memcmp calls even
with -fno-builtin.  I also had to add a new -disable-simplify-libcalls option
to llc so that I could write a test for this.

llvm-svn: 161262

fa59485b

Completely eliminate VNInfo flags. · daae19f7

Jakob Stoklund Olesen authored Aug 03, 2012

The 'unused' state of a value number can be represented as an invalid
def SlotIndex. This also exposed code that shouldn't have been looking
at unused value VNInfos.

llvm-svn: 161258

daae19f7