Commits · bc07a8900c2277f2a00776035359d5453d364757 · Roger Ferrer / llvm-epi-0.8

Jun 15, 2013

Machine Model: Add MicroOpBufferSize and resource BufferSize. · de2109eb

Andrew Trick authored Jun 15, 2013

Replace the ill-defined MinLatency and ILPWindow properties with
with straightforward buffer sizes:
MCSchedMode::MicroOpBufferSize
MCProcResourceDesc::BufferSize

These can be used to more precisely model instruction execution if desired.

Disabled some misched tests temporarily. They'll be reenabled in a few commits.

llvm-svn: 184032

de2109eb

Apr 27, 2013

Generalize the MachineTraceMetrics public API. · 85058af6

Andrew Trick authored Apr 27, 2013

Naturally, we should be able to pass in extra instructions, not just
extra blocks.

llvm-svn: 180667

85058af6

Apr 03, 2013
- Allow MachineTraceMetrics to be used when the model has no resources. · aeb69a54
  Jakob Stoklund Olesen authored Apr 02, 2013
```
It it still possible to extract information from itineraries, for
example.

llvm-svn: 178582
```
  aeb69a54
Apr 02, 2013

Count processor resources individually in MachineTraceMetrics. · 3ca14772

Jakob Stoklund Olesen authored Apr 02, 2013

The new instruction scheduling models provide information about the
number of cycles consumed on each processor resource. This makes it
possible to estimate ILP more accurately than simply counting
instructions / issue width.

The functions getResourceDepth() and getResourceLength() now identify
the limiting processor resource, and return a cycle count based on that.

This gives more precise resource information, particularly in traces
that use one resource a lot more than others.

llvm-svn: 178553

3ca14772

Mar 08, 2013

Rename isEarlierInSameTrace to isUsefulDominator. · 299cedc7

Jakob Stoklund Olesen authored Mar 07, 2013

In very rare cases caused by irreducible control flow, the dominating
block can have the same trace head without actually being part of the
trace.

As long as such a dominator still has valid instruction depths, it is OK
to use it for computing instruction depths.

Rename the function to avoid lying, and add a check that instruction
depths are computed for the dominator.

llvm-svn: 176668

299cedc7

Jan 17, 2013
- Move MachineTraceMetrics.h into include/llvm/CodeGen. · 965665bb
  Jakob Stoklund Olesen authored Jan 17, 2013
```
Let targets use it.

llvm-svn: 172688
```
  965665bb
Dec 03, 2012

Use the new script to sort the includes of every file under lib. · ed0881b2

Chandler Carruth authored Dec 03, 2012

Sooooo many of these had incorrect or strange main module includes.
I have manually inspected all of these, and fixed the main module
include to be the nearest plausible thing I could find. If you own or
care about any of these source files, I encourage you to take some time
and check that these edits were sensible. I can't have broken anything
(I strictly added headers, and reordered them, never removed), but they
may not be the headers you'd really like to identify as containing the
API being implemented.

Many forward declarations and missing includes were added to a header
files to allow them to parse cleanly when included first. The main
module rule does in fact have its merits. =]

llvm-svn: 169131

ed0881b2

Oct 11, 2012

Pass an explicit operand number to addLiveIns. · d0d7860f

Jakob Stoklund Olesen authored Oct 11, 2012

Not all instructions define a virtual register in their first operand.
Specifically, INLINEASM has a different format.

<rdar://problem/12472811>

llvm-svn: 165721

d0d7860f

Oct 09, 2012

Don't crash on extra evil irreducible control flow. · 9d1173a8

Jakob Stoklund Olesen authored Oct 08, 2012

When the CFG contains a loop with multiple entry blocks, the traces
computed by MachineTraceMetrics don't always have the same nice
properties. Loop back-edges are normally excluded from traces, but
MachineLoopInfo doesn't recognize loops with multiple entry blocks, so
those back-edges may be included.

Avoid asserting when that happens by adding an isEarlierInSameTrace()
function that accurately determines if a dominating block is part of the
same trace AND is above the currrent block in the trace.

llvm-svn: 165434

9d1173a8

Oct 04, 2012
- Switch MachineTraceMetrics to the new TargetSchedModel interface. · 89822229
  Jakob Stoklund Olesen authored Oct 04, 2012
```
llvm-svn: 165235
```
  89822229
Aug 11, 2012

Give MachineTraceMetrics its own debug tag. · a0042acd
Jakob Stoklund Olesen authored Aug 10, 2012
```
llvm-svn: 161712
```
a0042acd

Add more trace query functions. · 34844209

Jakob Stoklund Olesen authored Aug 10, 2012

Trace::getResourceLength() computes the number of cycles required to
execute the trace when ignoring data dependencies. The number can be
compared to the critical path to estimate the trace ILP.

Trace::getPHIDepth() computes the data dependency depth of a PHI in a
trace successor that isn't necessarily part of the trace.

llvm-svn: 161711

34844209

Aug 10, 2012

Include loop-carried dependencies when computing instr heights. · 0954d419

Jakob Stoklund Olesen authored Aug 10, 2012

When a trace ends with a back-edge, include PHIs in the loop header in
the height computations. This makes the critical path through a loop
more accurate by including the latencies of the last instructions in the
loop.

llvm-svn: 161688

0954d419

Aug 09, 2012

Deal with irreducible control flow when building traces. · bf1ac4bd

Jakob Stoklund Olesen authored Aug 08, 2012

We filter out MachineLoop back-edges during the trace-building PO
traversals, but it is possible to have CFG cycles that aren't natural
loops, and MachineLoopInfo doesn't include such cycles.

Use a standard visited set to detect such CFG cycles, and completely
ignore them when picking traces.

llvm-svn: 161532

bf1ac4bd

Aug 07, 2012

Fix a couple of typos. · 296448b2
Jakob Stoklund Olesen authored Aug 07, 2012
```
llvm-svn: 161437
```
296448b2

Add trace accessor methods, implement primitive if-conversion heuristic. · 75d9d515

Jakob Stoklund Olesen authored Aug 07, 2012

Compare the critical paths of the two traces through an if-conversion
candidate. If the difference is larger than the branch brediction
penalty, reject the if-conversion. If would never pay.

llvm-svn: 161433

75d9d515

Aug 02, 2012

Compute the critical path length through a trace. · 5d30630e

Jakob Stoklund Olesen authored Aug 02, 2012

Whenever both instruction depths and instruction heights are known in a
block, it is possible to compute the length of the critical path as
max(depth+height) over the instructions in the block.

The stored live-in lists make it possible to accurately compute the
length of a critical path that bypasses the current (small) block.

llvm-svn: 161197

5d30630e

Compute instruction heights through a trace. · 2db6b653

Jakob Stoklund Olesen authored Aug 01, 2012

The height on an instruction is the minimum number of cycles from the
instruction is issued to the end of the trace. Heights are computed for
all instructions in and below the trace center block.

The method for computing heights is different from the depth
computation. As we visit instructions in the trace bottom-up, heights of
used instructions are pushed upwards. This way, we avoid scanning long
use lists, looking for uses in the current trace.

At each basic block boundary, a list of live-in registers and their
minimum heights is saved in the trace block info. These live-in lists
are used when restarting depth computations on a trace that
converges with an already computed trace. They will also be used to
accurately compute the critical path length.

llvm-svn: 161138

2db6b653

Aug 01, 2012
- Add DataDep constructors. Explicitly check SSA form. · 5e19d35e
  Jakob Stoklund Olesen authored Aug 01, 2012
```
llvm-svn: 161115
```
  5e19d35e
Jul 31, 2012

Compute instruction depths through the current trace. · 059e647c

Jakob Stoklund Olesen authored Jul 31, 2012

Assuming infinite issue width, compute the earliest each instruction in
the trace can issue, when considering the latency of data dependencies.
The issue cycle is record as a 'depth' from the beginning of the trace.

This is half the computation required to find the length of the critical
path through the trace. Heights are next.

llvm-svn: 161074

059e647c

Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. · 1dfb1018
Jakob Stoklund Olesen authored Jul 31, 2012
```
llvm-svn: 161072
```
1dfb1018
Avoid looking at stale data in verifyAnalysis(). · 68c2cd05
Jakob Stoklund Olesen authored Jul 30, 2012
```
llvm-svn: 161004
```
68c2cd05

Allow traces to enter nested loops. · c14cf57b

Jakob Stoklund Olesen authored Jul 30, 2012

This lets traces include the final iteration of a nested loop above the
center block, and the first iteration of a nested loop below the center
block.

We still don't allow traces to contain backedges, and traces are
truncated where they would leave a loop, as seen from the center block.

llvm-svn: 161003

c14cf57b

Jul 30, 2012

Assert that all trace candidate blocks have been visited by the PO. · f308c128

Jakob Stoklund Olesen authored Jul 30, 2012

When computing a trace, all the candidates for pred/succ must have been
visited. Filter out back-edges first, though. The PO traversal ignores
them.

Thanks to Andy for spotting this in review.

llvm-svn: 160995

f308c128

Hook into PassManager's analysis verification. · a12a7d5f

Jakob Stoklund Olesen authored Jul 30, 2012

By overriding Pass::verifyAnalysis(), the pass contents will be verified
by the pass manager.

llvm-svn: 160994

a12a7d5f

Add MachineInstr::isTransient(). · 7361846f

Jakob Stoklund Olesen authored Jul 30, 2012

This is a cleaned up version of the isFree() function in
MachineTraceMetrics.cpp.

Transient instructions are very unlikely to produce any code in the
final output. Either because they get eliminated by RegisterCoalescing,
or because they are pseudo-instructions like labels and debug values.

llvm-svn: 160977

7361846f

Add MachineTraceMetrics::verify(). · 3df6c46f

Jakob Stoklund Olesen authored Jul 30, 2012

This function verifies the consistency of cached data in the
MachineTraceMetrics analysis.

llvm-svn: 160976

3df6c46f

Verify that the CFG hasn't changed during invalidate(). · eb488fe1

Jakob Stoklund Olesen authored Jul 30, 2012

The MachineTraceMetrics analysis must be invalidated before modifying
the CFG. This will catch some of the violations of that rule.

llvm-svn: 160969

eb488fe1

Jul 28, 2012
- Add more debug output to MachineTraceMetrics. · 05633697
  Jakob Stoklund Olesen authored Jul 27, 2012
```
llvm-svn: 160905
```
  05633697
- Keep track of the head and tail of the trace through each block. · 1152202c
  Jakob Stoklund Olesen authored Jul 27, 2012
```
This makes it possible to quickly detect blocks that are outside the
trace.

llvm-svn: 160904
```
  1152202c
Jul 26, 2012

Use an otherwise unused variable. · 35400b1d
Jakob Stoklund Olesen authored Jul 26, 2012
```
llvm-svn: 160798
```
35400b1d

Start scaffolding for a MachineTraceMetrics analysis pass. · f9029fef

Jakob Stoklund Olesen authored Jul 26, 2012

This is still a work in progress.

Out-of-order CPUs usually execute instructions from multiple basic
blocks simultaneously, so it is necessary to look at longer traces when
estimating the performance effects of code transformations.

The MachineTraceMetrics analysis will pick a typical trace through a
given basic block and provide performance metrics for the trace. Metrics
will include:

- Instruction count through the trace.
- Issue count per functional unit.
- Critical path length, and per-instruction 'slack'.

These metrics can be used to determine the performance limiting factor
when executing the trace, and how it will be affected by a code
transformation.

Initially, this will be used by the early if-conversion pass.

llvm-svn: 160796

f9029fef