Commits · c60fbe6b58b88d7a06772d4a4e7bb542f850920f · Roger Ferrer / llvm-epi-0.8

Jun 20, 2012

Fix two rather subtle internal vs. external linker issues. · c60fbe6b

Chandler Carruth authored Jun 20, 2012

I'll admit I'm not entirely satisfied with this change, but it seemed
the cleanest option. Other suggestions quite welcome

The issue is that the traits specializations have static methods which
return the typedef'ed PHI_iterator type. In both the IR and MI layers
this is typedef'ed to a custom iterator class defined in an anonymous
namespace giving the types and the functions returning them internal
linkage. However, because the traits specialization is defined in the
'llvm' namespace (where it has to be, specialized template lives there),
and is in turn used in the templated implementation of the SSAUpdater.
This led to the linkage conflict that Clang now warns about.

The simplest solution to me was just to define the PHI_iterator as
a nested class inside the trait specialization. That way it still
doesn't get scoped widely, it can't be accidentally reused somewhere,
etc. This is a little gross just because nested class definitions are
a little gross, but the alternatives seem more ad-hoc.

llvm-svn: 158799

c60fbe6b

A new algorithm for computing LoopInfo. Temporarily disabled. · ff2ed7b6

Andrew Trick authored Jun 20, 2012

-stable-loops enables a new algorithm for generating the Loop
forest. It differs from the original algorithm in a few respects:

- Not determined by use-list order.
- Initially guarantees RPO order of block and subloops.
- Linear in the number of CFG edges.
- Nonrecursive.

I didn't want to change the LoopInfo API yet, so the block lists are
still inclusive. This seems strange to me, and it means that building
LoopInfo is not strictly linear, but it may not be a problem in
practice. At least the block lists start out in RPO order now. In the
future we may add an attribute or wrapper analysis that allows other
passes to assume RPO order.

The primary motivation of this work was not to optimize LoopInfo, but
to allow reproducing performance issues by decomposing the compilation
stages. I'm often unable to do this with the current LoopInfo, because
the loop tree order determines Loop pass order. Serializing the IR
tends to invert the order, which reverses the optimization order. This
makes it nearly impossible to debug interdependent loop optimizations
such as LSR.

I also believe this will provide more stable performance results across time.

llvm-svn: 158790

ff2ed7b6

Move the implementation of LoopInfo into LoopInfoImpl.h. · cda51d43

Andrew Trick authored Jun 20, 2012

The implementation only needs inclusion from LoopInfo.cpp and
MachineLoopInfo.cpp. Clients of the interface should only include the
interface. This makes the interface readable and speeds up rebuilds
after modifying the implementation.

llvm-svn: 158787

cda51d43

Add regunit liveness support to LiveIntervals::handleMove(). · 3802bbf3

Jakob Stoklund Olesen authored Jun 19, 2012

When LiveIntervals is tracking fixed interference in regunits, make sure
to update those intervals as well. Currently guarded by -live-regunits.

llvm-svn: 158766

3802bbf3

Tidy up. · 651f9a48
Chad Rosier authored Jun 19, 2012
```
llvm-svn: 158762
```
651f9a48

Add an ensureMaxAlignment() function to MachineFrameInfo (analogous to · 73696927

Chad Rosier authored Jun 19, 2012

ensureAlignment() in MachineFunction).  Also, drop setMaxAlignment() in
favor of this new function.  This creates a main entry point to setting
MaxAlignment, which will be helpful for future work.  No functionality
change intended.

llvm-svn: 158758

73696927

Add DAG-combines for aggressive FMA formation. · 39fb1d08

Lang Hames authored Jun 19, 2012

This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or
FSUB + FMUL. The combines are performed when:
(a) Either
      AllowExcessFPPrecision option (-enable-excess-fp-precision for llc)
        OR
      UnsafeFPMath option (-enable-unsafe-fp-math)
    are set, and
(b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of
    the FADD/FSUB, and
(c) The FMUL only has one user (the FADD/FSUB).

If your target has fast FMA instructions you can make use of these combines by
overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for
types supported by your FMA instruction, and adding patterns to match ISD::FMA
to your FMA instructions.

llvm-svn: 158757

39fb1d08

80 col. · 2db1125b
Jakob Stoklund Olesen authored Jun 19, 2012
```
llvm-svn: 158755
```
2db1125b

Jun 19, 2012

Implement PPCInstrInfo::isCoalescableExtInstr(). · 0f855e42

Jakob Stoklund Olesen authored Jun 19, 2012

The PPC::EXTSW instruction preserves the low 32 bits of its input, just
like some of the x86 instructions. Use it to reduce register pressure
when the low 32 bits have multiple uses.

This requires a small change to PeepholeOptimizer since EXTSW takes a
64-bit input register.

This is related to PR5997.

llvm-svn: 158743

0f855e42

Style: Don't reuse variables for multiple purposes. · 8eb9905a
Jakob Stoklund Olesen authored Jun 19, 2012
```
No functional change.

llvm-svn: 158742
```
8eb9905a

Move the support for using .init_array from ARM to the generic · ca3e0ee8

Rafael Espindola authored Jun 19, 2012

TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM,
on X86 it is not easy to find out if .init_array should be used or not, so
the decision is made via TargetOptions and defaults to off.

Add a command line option to llc that enables it.

llvm-svn: 158692

ca3e0ee8

Jun 18, 2012

Allow up to 64 functional units per processor itinerary. · 8eac0096

Hal Finkel authored Jun 18, 2012

This patch changes the type used to hold the FU bitset from unsigned to uint64_t.
This will be needed for some upcoming PowerPC itineraries.

llvm-svn: 158679

8eac0096

Jun 16, 2012
- Guard private fields that are unused in Release builds with #ifndef NDEBUG. · b9f84bb0
  Benjamin Kramer authored Jun 16, 2012
```
llvm-svn: 158608
```
  b9f84bb0
- Remove final verification in RABasic. · 38a6fbf9
  Jakob Stoklund Olesen authored Jun 15, 2012
```
We now have a proper machine code verifier pass between register
allocation and rewriting.

llvm-svn: 158577
```
  38a6fbf9
- Print out register number in InlineSpiller. · 45c1f997
  Jakob Stoklund Olesen authored Jun 15, 2012
```
llvm-svn: 158575
```
  45c1f997
- Accept null PhysReg arguments to checkRegMaskInterference. · 13dffcb7
  Jakob Stoklund Olesen authored Jun 15, 2012
```
Calling checkRegMaskInterference(VirtReg) checks if VirtReg crosses any
regmask operands, regardless of the registers they clobber.

llvm-svn: 158563
```
  13dffcb7
Jun 15, 2012
- Remove assignments which aren't used afterwards. · 4fd96634
  Bill Wendling authored Jun 15, 2012
```
llvm-svn: 158535
```
  4fd96634
- Use regunit liveness in RegisterCoalescer when it is available. · 5767ad72
  Jakob Stoklund Olesen authored Jun 15, 2012
```
We only do very limited physreg coalescing now, but we still merge
virtual registers into reserved registers.

llvm-svn: 158526
```
  5767ad72
Jun 14, 2012
- Make machine verifier check the first instruction of the last bundle instead of · 1b420ac4
  Akira Hatanaka authored Jun 14, 2012
```
the last instruction of a basic block.

llvm-svn: 158468
```
  1b420ac4
- Make comment slightly more helpful. · a33db65b
  Lang Hames authored Jun 14, 2012
```
llvm-svn: 158467
```
  a33db65b
- misched: disable SSA check pending PR13112. · 45877fa0
  Andrew Trick authored Jun 14, 2012
```
llvm-svn: 158461
```
  45877fa0
Jun 13, 2012

sched: fix latency of memory dependence chain edges for consistency. · 344fb64f

Andrew Trick authored Jun 13, 2012

For store->load dependencies that may alias, we should always use
TrueMemOrderLatency, which may eventually become a subtarget hook. In
effect, we should guarantee at least TrueMemOrderLatency on at least
one DAG path from a store to a may-alias load.

This should fix the standard mode as well as -enable-aa-sched-mi".

llvm-svn: 158380

344fb64f

sched: Avoid trivially redundant DAG edges. Take the one with higher latency. · 5b90645a
Andrew Trick authored Jun 13, 2012
```
llvm-svn: 158379
```
5b90645a

Jun 12, 2012
- misched: When querying RegisterPressureTracker, always save current and max pressure. · 3e465fb2
  Andrew Trick authored Jun 11, 2012
```
llvm-svn: 158340
```
  3e465fb2
- misched: regpressure getMaxPressureDelta, revert accidental checkin. · d054bd83
  Andrew Trick authored Jun 11, 2012
```
llvm-svn: 158339
```
  d054bd83
Jun 09, 2012

Allocate the contents of DwarfDebug's StringMaps in a single big BumpPtrAllocator. · 0748008d
Benjamin Kramer authored Jun 09, 2012
```
llvm-svn: 158265
```
0748008d
Register pressure: added getPressureAfterInstr. · fc8ce08b
Andrew Trick authored Jun 09, 2012
```
llvm-svn: 158256
```
fc8ce08b

Sketch a LiveRegMatrix analysis pass. · c26fbbfb

Jakob Stoklund Olesen authored Jun 09, 2012

The LiveRegMatrix represents the live range of assigned virtual
registers in a Live interval union per register unit. This is not
fundamentally different from the interference tracking in RegAllocBase
that both RABasic and RAGreedy use.

The important differences are:

- LiveRegMatrix tracks interference per register unit instead of per
  physical register. This makes interference checks cheaper and
  assignments slightly more expensive. For example, the ARM D7 reigster
  has 24 aliases, so we would check 24 physregs before assigning to one.
  With unit-based interference, we check 2 units before assigning to 2
  units.

- LiveRegMatrix caches regmask interference checks. That is currently
  duplicated functionality in RABasic and RAGreedy.

- LiveRegMatrix is a pass which makes it possible to insert
  target-dependent passes between register allocation and rewriting.
  Such passes could tweak the register assignments with interference
  checking support from LiveRegMatrix.

Eventually, RABasic and RAGreedy will be switched to LiveRegMatrix.

llvm-svn: 158255

c26fbbfb

Also compute MBB live-in lists in the new rewriter pass. · be336295

Jakob Stoklund Olesen authored Jun 09, 2012

This deduplicates some code from the optimizing register allocators, and
it means that it is now possible to change the register allocators'
solutions simply by editing the VirtRegMap between the register
allocator pass and the rewriter.

llvm-svn: 158249

be336295

Reintroduce VirtRegRewriter. · 1224312f

Jakob Stoklund Olesen authored Jun 08, 2012

OK, not really. We don't want to reintroduce the old rewriter hacks.

This patch extracts virtual register rewriting as a separate pass that
runs after the register allocator. This is possible now that
CodeGen/Passes.cpp can configure the full optimizing register allocator
pipeline.

The rewriter pass uses register assignments in VirtRegMap to rewrite
virtual registers to physical registers, and it inserts kill flags based
on live intervals.

These finalization steps are the same for the optimizing register
allocators: RABasic, RAGreedy, and PBQP.

llvm-svn: 158244

1224312f

Jun 08, 2012

Start implementing pre-ra if-converter: using speculation and selects to eliminate branches. · c5adccab
Evan Cheng authored Jun 08, 2012
```
llvm-svn: 158234
```
c5adccab
TargetInstrInfo hooks implemented in codegen should be declared pure virtual. · 423fa6fa
Andrew Trick authored Jun 08, 2012
```
llvm-svn: 158233
```
423fa6fa

Fix Target->Codegen dependence. · 596af1b0

Andrew Trick authored Jun 08, 2012

Bulk move of TargetInstrInfo implementation into
TargetInstrInfoImpl. This is dirty because the code isn't part of
TargetInstrInfoImpl class, nor should it be, because the methods are
not target hooks. However, it's the current mechanism for keeping
libTarget useful outside the backend. You'll get a not-so-nice link
error if you invoke a TargetInstrInfo method that depends on CodeGen.

The TargetInstrInfoImpl class should probably be removed since it
doesn't really solve this problem.

To really fix this, we probably need separate interfaces for the
CodeGen/nonCodeGen sides of TargetInstrInfo.

llvm-svn: 158212

596af1b0

Jun 07, 2012

Move terminator machine verification to check... · cd72016c

Pete Cooper authored Jun 07, 2012

Move terminator machine verification to check MachineBasicBlock::instr_iterator instead of MBB::iterator

llvm-svn: 158154

cd72016c

Revert r157755. · 9c964181

Manman Ren authored Jun 06, 2012

The commit is intended to fix rdar://11540023.
It is implemented as part of peephole optimization. We can actually implement
this in the SelectionDAG lowering phase.

llvm-svn: 158122

9c964181

Properly verify liveness with bundled machine instructions. · 00e7dffe

Jakob Stoklund Olesen authored Jun 06, 2012

Bundles should be treated as one atomic transaction when checking
liveness. That is how the register allocator (and VLIW targets) treats
bundles.

llvm-svn: 158116

00e7dffe

Jun 06, 2012

Move RegisterClassInfo.h. · 05ff4667

Andrew Trick authored Jun 06, 2012

Allow targets to access this API. It's required for RegisterPressure.

llvm-svn: 158102

05ff4667

Move RegisterPressure.h. · 88517f60
Andrew Trick authored Jun 06, 2012
```
Make it a general utility for use by Targets.

llvm-svn: 158097
```
88517f60

Round 2 of dead private variable removal. · 009b1c1c

Benjamin Kramer authored Jun 06, 2012

LLVM is now -Wunused-private-field clean except for
- lib/MC/MCDisassembler/Disassembler.h. Not sure why it keeps all those unaccessible fields.
- gtest.

llvm-svn: 158096

009b1c1c

Remove unused private fields found by clang's new -Wunused-private-field. · 628a39fa

Benjamin Kramer authored Jun 06, 2012

There are some that I didn't remove this round because they looked like
obvious stubs. There are dead variables in gtest too, they should be
fixed upstream.

llvm-svn: 158090

628a39fa