Commits · fd3a5e33d50dd4ef0b6a2f8ab8cd03554747cc27 · Roger Ferrer / llvm-epi-0.8

Jun 26, 2012

Allow targets to inject passes before the virtual register rewriter. · 59a0d324

Jakob Stoklund Olesen authored Jun 26, 2012

Such passes can be used to tweak the register assignments in a
target-dependent way, for example to avoid write-after-write
dependencies.

llvm-svn: 159209

59a0d324

Update a bunch of stale comments that dated from when this followed the · 9139f44d

Chandler Carruth authored Jun 26, 2012

very first (and worst) placement algorithm. These should now more
accurately reflect the reality of the pass.

llvm-svn: 159185

9139f44d

Enable the new LoopInfo algorithm by default. · fb2ba3e1

Andrew Trick authored Jun 26, 2012

The primary advantage is that loop optimizations will be applied in a
stable order. This helps debugging and unit test creation. It is also
a better overall implementation without pathologically bad performance
on deep functions.

On large functions (llvm-stress --size=200000 | opt -loops)
Before: 0.1263s
After:  0.0225s

On deep functions (after tweaking llvm-stress, thanks Nadav):
Before: 0.2281s
After:  0.0227s

See r158790 for more comments.

The loop tree is now consistently generated in forward order, but loop
passes are applied in reverse order over the program. If we have a
loop optimization that prefers forward order, that can easily be
achieved by adding a different type of LoopPassManager.

llvm-svn: 159183

fb2ba3e1

Make sure type is not extended or untyped before creating a constant of the... · 4c6f917d

Evan Cheng authored Jun 26, 2012

Make sure type is not extended or untyped before creating a constant of the type. No test case. Found by inspection.

llvm-svn: 159179

4c6f917d

Jun 25, 2012

Enforce stricter liveness rules for PHIs. · a57fc12e

Jakob Stoklund Olesen authored Jun 25, 2012

Verify that all paths from the entry block to a virtual register read
pass through a def. Enable this check even when MRI->isSSA() is false.

Verify that the live range of a virtual register is live out of all
predecessor blocks, even for PHI-values.

This requires that PHIElimination sometimes inserts IMPLICIT_DEF
instructions in predecessor blocks.

llvm-svn: 159150

a57fc12e
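
The first rule above (every path from the entry block to a read must pass through a def) can be illustrated with a small reachability check. This is only a sketch on a toy CFG: the `Block` struct and `readsAreCoveredByDefs` are hypothetical names, not part of the machine verifier, and the real check works on live ranges rather than per-block flags.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical per-block summary for ONE virtual register.
struct Block {
  bool ReadsBeforeDef;            // block reads the register before any local def
  bool Defines;                   // block contains a def of the register
  std::vector<std::size_t> Succs; // indices of successor blocks
};

// Returns true if every path from block 0 (the entry) to a read of the
// register passes through a def. The worklist only ever holds blocks that
// can be entered while the register is still undefined.
bool readsAreCoveredByDefs(const std::vector<Block> &CFG) {
  if (CFG.empty())
    return true;
  std::vector<bool> Visited(CFG.size(), false);
  std::vector<std::size_t> Worklist{0};
  Visited[0] = true;
  while (!Worklist.empty()) {
    std::size_t B = Worklist.back();
    Worklist.pop_back();
    if (CFG[B].ReadsBeforeDef)
      return false; // a read is reachable with no def on the way
    if (CFG[B].Defines)
      continue;     // everything past this block sees a def
    for (std::size_t S : CFG[B].Succs)
      if (!Visited[S]) {
        Visited[S] = true;
        Worklist.push_back(S);
      }
  }
  return true;
}
```

In a diamond-shaped CFG where only one arm defines the register and the join block reads it, the check fails. Adding a def to the other arm (which is what an inserted IMPLICIT_DEF provides) makes it pass.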

Run ProcessImplicitDefs on SSA form where it can be much simpler. · eb495664

Jakob Stoklund Olesen authored Jun 25, 2012

Implicitly defined virtual registers can simply have the <undef> bit set
on all uses, and copies can be turned into implicit defs recursively.

Physical registers are a bit trickier. We handle the common case where a
physreg def is used by a nearby instruction in the same basic block. For
more complicated cases, just leave the IMPLICIT_DEF instruction in.

llvm-svn: 159149

eb495664

Teach PHIElimination to handle <undef> operands. · 70ed924e

Jakob Stoklund Olesen authored Jun 25, 2012

When a PHI use is <undef>, don't emit a copy in the predecessor block,
but insert an IMPLICIT_DEF instruction instead. This ensures that
virtual register uses are always jointly dominated by defs, even if some
of them are IMPLICIT_DEF.

llvm-svn: 159121

70ed924e

Handle <undef> operands in TwoAddressInstructionPass. · 6b556f82

Jakob Stoklund Olesen authored Jun 25, 2012

When the source register to a 2-addr instruction is undefined, there is
no need to attempt any transformations - simply replace the source
register with the destination register.

This also comes up when lowering IMPLICIT_DEF instructions - make sure
the <undef> flag is moved to the new partial register def operand:

  %vreg8<def> = INSERT_SUBREG %vreg9<undef>, %vreg0<kill>, sub_16bit
rewrite undef:
  %vreg8<def> = INSERT_SUBREG %vreg8<undef>, %vreg0<kill>, sub_16bit
convert to:
  %vreg8:sub_16bit<def,read-undef> = COPY %vreg0<kill>

llvm-svn: 159120

6b556f82

Jun 24, 2012
- llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. · 704de074
  NAKAMURA Takumi authored Jun 24, 2012
```
llvm-svn: 159112
```
  704de074
- DAG legalisation can now handle illegal fma vector types by scalarisation · fe212e76
  Pete Cooper authored Jun 24, 2012
```
llvm-svn: 159092
```
  fe212e76
Jun 23, 2012
- Teach LiveVariables to handle <undef> operands. · 502e4c6a
  Jakob Stoklund Olesen authored Jun 23, 2012
```
It's simple: Don't treat <undef> operands as uses, and don't assume a
virtual register has a defining instruction unless a real use has been
seen.

llvm-svn: 159061
```
  502e4c6a
- Remove ProcessImplicitDefs.h which was unused. · a127fc78
  Jakob Stoklund Olesen authored Jun 22, 2012
```
The ProcessImplicitDefs class can be local to its implementation file.

llvm-svn: 159041
```
  a127fc78
- Also verify the def index for early clobbers. · b033dede
  Jakob Stoklund Olesen authored Jun 22, 2012
```
llvm-svn: 159039
```
  b033dede
Jun 22, 2012

Delete a boring statistic. · 4fa84ba8
Jakob Stoklund Olesen authored Jun 22, 2012
```
llvm-svn: 159030
```
4fa84ba8
Store live intervals in an IndexedMap. · c61edda0
Jakob Stoklund Olesen authored Jun 22, 2012
```
It is both smaller and faster than DenseMap.

llvm-svn: 159029
```
c61edda0
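
As a rough illustration of why an index-keyed container can beat a hash map for dense keys such as virtual register numbers, here is a minimal sketch. `DenseIndexMap` is a hypothetical stand-in written for this note, not LLVM's IndexedMap: it stores no keys and does no hashing, so a lookup is a single vector access.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical map keyed by small, dense integer indices.
template <typename T>
class DenseIndexMap {
  std::vector<T> Storage; // slot I holds the value for index I
  T Null;                 // value reported for slots never assigned

public:
  explicit DenseIndexMap(T NullValue = T()) : Null(NullValue) {}

  // Make indices [0, N) addressable; new slots get the null value.
  void grow(std::size_t N) {
    if (N > Storage.size())
      Storage.resize(N, Null);
  }

  // Direct slot access; the caller must have grown the map far enough.
  T &operator[](std::size_t Index) {
    assert(Index < Storage.size() && "index out of range, call grow() first");
    return Storage[Index];
  }

  // True if the slot exists and holds something other than the null value.
  bool contains(std::size_t Index) const {
    return Index < Storage.size() && Storage[Index] != Null;
  }
};
```

The trade-off is that storage is proportional to the largest index rather than to the number of entries, which is acceptable when nearly every index is in use.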

Revert r158679 - use case is unclear (and it increases the memory footprint). · 8db55472

Hal Finkel authored Jun 22, 2012

Original commit message:
    Allow up to 64 functional units per processor itinerary.

    This patch changes the type used to hold the FU bitset from unsigned to uint64_t.
    This will be needed for some upcoming PowerPC itineraries.

llvm-svn: 159027

8db55472
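
The trade-off behind this revert can be sketched with a plain bit mask. The names below (`FUMask`, `unitBit`, `conflicts`) are hypothetical and only illustrate the idea: the width of the mask type caps the number of functional units, and widening it from a 32-bit `unsigned` (an assumption about the common targets) to `uint64_t` doubles the storage of every mask.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical functional-unit mask: one bit per unit.
using FUMask = std::uint64_t;
const unsigned MaxUnits = 64; // number of bits available in FUMask

// Mask with only the bit for the given unit set.
inline FUMask unitBit(unsigned Unit) {
  assert(Unit < MaxUnits && "unit does not fit in the mask");
  return FUMask(1) << Unit;
}

// Two masks conflict if they share at least one functional unit.
inline bool conflicts(FUMask Busy, FUMask Needed) {
  return (Busy & Needed) != 0;
}
```

With a 32-bit mask, `unitBit(40)` could not be represented at all. With the 64-bit mask it can, but every stored mask now occupies eight bytes.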

Fix a crash in --debug code. · 48828bb4
Jakob Stoklund Olesen authored Jun 22, 2012
```
Don't try to print out the live range of a physreg.

llvm-svn: 159021
```
48828bb4
Don't depend on live ranges being present. · 48a1647c
Jakob Stoklund Olesen authored Jun 22, 2012
```
DBG_VALUE instructions could be referring to non-existing virtual
registers.

llvm-svn: 159020
```
48a1647c

Simplify handleMove() a bit. · 8a833649

Jakob Stoklund Olesen authored Jun 22, 2012

There is no need to check for physreg live ranges. They don't exist any
more.

llvm-svn: 159019

8a833649

Stop computing physreg live ranges. · 37e797fe
Jakob Stoklund Olesen authored Jun 22, 2012
```
Everyone is using on-demand regunit ranges now.

llvm-svn: 159018
```
37e797fe
Remove some redundant LIS->hasInterval() checks. · bbad269a
Jakob Stoklund Olesen authored Jun 22, 2012
```
These functions only operate on virtual registers now, and they all have
live ranges.

llvm-svn: 159015
```
bbad269a
Use MRI::isConstantPhysReg() to check remat feasibility. · 7809578c
Jakob Stoklund Olesen authored Jun 22, 2012
```
Don't depend on LiveIntervals::hasInterval() to determine if a physreg
is reserved and constant.

llvm-svn: 159013
```
7809578c
Use regunit liveness to guide LiveDebugVariables. · 3244963e
Jakob Stoklund Olesen authored Jun 22, 2012
```
This should produce the same results as using physreg liveness directly.

llvm-svn: 159009
```
3244963e

Remove LiveIntervals::trackingRegUnits(). · b1b3e4aa

Jakob Stoklund Olesen authored Jun 22, 2012

With regunit liveness permanently enabled, this function would always
return true.

Also remove now obsolete code for checking physreg interference.

llvm-svn: 159006

b1b3e4aa

Remove another duplicated variable. We only need one to tell us if the linker · ea591661
Rafael Espindola authored Jun 22, 2012
```
knows dwarf or not.

llvm-svn: 158993
```
ea591661
Fix a FIXME: DwarfRequiresRelocationForSectionOffset is the same as · d7bdaf57
Rafael Espindola authored Jun 22, 2012
```
DwarfUsesRelocationsAcrossSections.

llvm-svn: 158992
```
d7bdaf57
Emit relocations for DW_AT_location entries on systems which need it. This is · 33da3367
Nick Lewycky authored Jun 22, 2012
```
a recommit of r127757. Fixes PR9493. Patch by Paul Robinson!

llvm-svn: 158957
```
33da3367

Rename -allow-excess-fp-precision flag to -fuse-fp-ops, and switch from a · b8650f10

Lang Hames authored Jun 22, 2012

boolean flag to an enum: { Fast, Standard, Strict } (default = Standard).

This option controls the creation by optimizations of fused FP ops that store
intermediate results in higher precision than IEEE allows (e.g. FMAs). The
behavior of this option is intended to match the behavior specified by a
soon-to-be-introduced frontend flag: '-ffuse-fp-ops'.

Fast mode - allows formation of fused FP ops whenever they're profitable.

Standard mode - allow fusion only for 'blessed' FP ops. At present the only
blessed op is the fmuladd intrinsic. In the future more blessed ops may be
added.

Strict mode - allow fusion only if/when it can be proven that the excess
precision won't affect the result.

Note: This option only controls formation of fused ops by the optimizers.  Fused
operations that are explicitly requested (e.g. FMA via the llvm.fma.* intrinsic)
will always be honored, regardless of the value of this option.

Internally TargetOptions::AllowExcessFPPrecision has been replaced by
TargetOptions::AllowFPOpFusion.

llvm-svn: 158956

b8650f10
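
The three modes, and the reason fusion is observable at all, can be sketched as follows. The names `FusionMode` and `mayFuse` are hypothetical and follow the wording of the commit message literally. They are not the actual TargetOptions code, and profitability is assumed to be decided elsewhere. The two helper functions show how a fused multiply-add, which rounds once, can differ from a separate multiply and add, which round twice.

```cpp
#include <cmath>

// Hypothetical mirror of the enum described in the commit message.
enum class FusionMode { Fast, Standard, Strict };

// Sketch of the policy: may an optimizer fuse this operation?
// IsBlessedOp:   the op is one of the 'blessed' ops (e.g. fmuladd).
// ProvablyExact: fusion was proven not to change the result.
inline bool mayFuse(FusionMode Mode, bool IsBlessedOp, bool ProvablyExact) {
  switch (Mode) {
  case FusionMode::Fast:
    return true;
  case FusionMode::Standard:
    return IsBlessedOp;
  case FusionMode::Strict:
    return ProvablyExact;
  }
  return false;
}

// Multiply, round to double, then add: two roundings.
inline double unfusedMulAdd(double A, double B, double C) {
  volatile double Product = A * B; // volatile forces the rounded product
  return Product + C;
}

// Fused multiply-add: the product is not rounded before the add.
inline double fusedMulAdd(double A, double B, double C) {
  return std::fma(A, B, C);
}
```

For example, with A = 1 + 2^-27 and C = -(1 + 2^-26), the exact square of A is 1 + 2^-26 + 2^-54. The unfused version rounds the product to 1 + 2^-26 and returns 0, while the fused version returns 2^-54.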

Jun 21, 2012

The inline asm operand modifier 'n' is supposed · c457f620

Jack Carter authored Jun 21, 2012

to be generic across architectures. It has the
following description in the gnu sources:

    Negate the immediate constant

Several architectures such as x86 have local implementations
of operand modifier 'n' which go beyond the above description
slightly. This won't affect them.

Affected files:

    lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp
        Added 'n' to the switch cases.

    test/CodeGen/Generic/asm-large-immediate.ll
        Generic compiled test (x86 for me)

    test/CodeGen/Mips/asm-large-immediate.ll
        Mips compiled version of the generic one

Contributor: Jack Carter
llvm-svn: 158939

c457f620

Fix potential crash if DAGCombine on stores sees a half type · 5b61422d
Pete Cooper authored Jun 21, 2012
```
llvm-svn: 158927
```
5b61422d

The inline asm operand modifier 'c' is supposed · b2fd5f66

Jack Carter authored Jun 21, 2012

to be generic across architectures. It has the
following description in the gnu sources:

    Substitute immediate value without immediate syntax

Several architectures such as x86 have local implementations
of operand modifier 'c' which go beyond the above description
slightly. To make use of the generic modifiers without overriding
the local implementation, one can call the base class method
AsmPrinter::PrintAsmOperand() in the locally derived method's
"default" case in the switch statement. That way if it is already
defined locally the generic version will never get called.

This change was needed because test/CodeGen/Generic/asm-large-immediate.ll
failed on a native Mips board. The test was assuming a generic
implementation was in place.

Affected files:

    lib/Target/Mips/MipsAsmPrinter.cpp:
        Changed the default case to call the base method.
    lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp
        Added 'c' to the switch cases.
    test/CodeGen/Mips/asm-large-immediate.ll
        Mips compiled version of the generic one

Contributor: Jack Carter
llvm-svn: 158925

b2fd5f66
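
The fallback pattern described above can be sketched with a toy class hierarchy. `GenericPrinter` and `TargetPrinter` are hypothetical stand-ins, not the real AsmPrinter interface: the signature, the return convention (true means the modifier was handled), and the target's local twist on 'c' are all assumptions made for illustration. The 'n' case assumes the negated value is printed without immediate syntax, like 'c'.

```cpp
#include <string>

// Hypothetical generic printer: handles the architecture-neutral modifiers.
struct GenericPrinter {
  virtual ~GenericPrinter() = default;

  // Appends the operand to Out; returns true if the modifier was handled.
  virtual bool printOperand(long long Imm, char Modifier,
                            std::string &Out) const {
    switch (Modifier) {
    case 'c': // immediate value without immediate syntax
      Out += std::to_string(Imm);
      return true;
    case 'n': // negated immediate value
      Out += std::to_string(-Imm);
      return true;
    default:
      return false; // unknown modifier
    }
  }
};

// Hypothetical target printer with its own version of 'c'.
struct TargetPrinter : GenericPrinter {
  bool printOperand(long long Imm, char Modifier,
                    std::string &Out) const override {
    switch (Modifier) {
    case 'c': // local implementation wins; the generic 'c' is never reached
      if (Imm >= 0)
        Out += "+";
      Out += std::to_string(Imm);
      return true;
    default:  // everything else is delegated to the generic code
      return GenericPrinter::printOperand(Imm, Modifier, Out);
    }
  }
};
```

Here `TargetPrinter` prints 5 with modifier 'c' as "+5" through its local case, while modifier 'n' falls through to the base class and prints "-5".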

Emit a single _udivmodsi4 libcall instead of two separate _udivsi3 and · 8c2ad812

Evan Cheng authored Jun 21, 2012

_umodsi3 libcalls if they have the same arguments. This optimization
was apparently broken if one of the nodes was replaced in place.
rdar://11714607

llvm-svn: 158900

8c2ad812
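
The benefit of a combined call is easiest to see on a target without a hardware divider, where division is done in software and the remainder falls out of the same loop. The routine below is a minimal sketch of that idea using restoring division. `udivmod32` and `DivMod` are hypothetical names, and this is not the runtime library's actual implementation.

```cpp
#include <cassert>
#include <cstdint>

struct DivMod {
  std::uint32_t Quot;
  std::uint32_t Rem;
};

// Hypothetical combined unsigned divide/modulo using restoring division.
// One pass over the bits of N yields both the quotient and the remainder.
inline DivMod udivmod32(std::uint32_t N, std::uint32_t D) {
  assert(D != 0 && "division by zero");
  std::uint32_t Q = 0;
  std::uint64_t R = 0; // 64-bit so the shift below cannot overflow
  for (int Bit = 31; Bit >= 0; --Bit) {
    R = (R << 1) | ((N >> Bit) & 1u); // bring down the next bit of N
    if (R >= D) {                     // divisor fits: subtract, set quotient bit
      R -= D;
      Q |= std::uint32_t(1) << Bit;
    }
  }
  return {Q, static_cast<std::uint32_t>(R)};
}
```

Calling `udivmod32(17, 5)` yields quotient 3 and remainder 2 from a single loop. Separate divide and modulo calls would run the same loop twice.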

Update regunits in RegisterCoalescer::reMaterializeTrivialDef. · 58713de5
Jakob Stoklund Olesen authored Jun 21, 2012
```
Old code would only update physreg live intervals.

llvm-svn: 158881
```
58713de5
Remove spurious typedefs. · 37a1338a
Jakob Stoklund Olesen authored Jun 20, 2012
```
llvm-svn: 158878
```
37a1338a

Remove the RenderMachineFunction HTML output pass. · 1911a020

Jakob Stoklund Olesen authored Jun 20, 2012

I don't think anyone has been using this functionality for a while, and
it is getting in the way of refactoring now.

llvm-svn: 158876

1911a020

Remove the -live-regunits command line option. · 51c63e64
Jakob Stoklund Olesen authored Jun 20, 2012
```
Register allocators depend on it being permanently enabled now.

llvm-svn: 158873
```
51c63e64
Fix some more LiveInterval enumerations. · 781e0b9f
Jakob Stoklund Olesen authored Jun 20, 2012
```
Deterministically enumerate the virtual registers instead.

llvm-svn: 158872
```
781e0b9f
Remove LiveIntervalUnions from RegAllocBase. · 2d2dec96
Jakob Stoklund Olesen authored Jun 20, 2012
```
They are living in LiveRegMatrix now.

llvm-svn: 158868
```
2d2dec96

Convert RAGreedy to LiveRegMatrix interference checking. · 96eebf0b

Jakob Stoklund Olesen authored Jun 20, 2012

Stop depending on the LiveIntervalUnions in RegAllocBase, they are about
to be removed.

The changes are mostly replacing register alias iterators with regunit
iterators, and querying LiveRegMatrix instead of RegAllocBase.

InterferenceCache is converted to work with per-regunit
LiveIntervalUnions, and it checks fixed regunit interference separately,
using the fixed live intervals provided by LiveIntervalAnalysis.

The local splitting helper calcGapWeights() is also considering fixed
regunit interference which is kept on the side now.

llvm-svn: 158867

96eebf0b

Convert RABasic to using LiveRegMatrix interference checking. · 03b87d5a
Jakob Stoklund Olesen authored Jun 20, 2012
```
Stop using the LiveIntervalUnions provided by RegAllocBase, they will be
removed soon.

llvm-svn: 158866
```
03b87d5a