Commits · 7f8e563a691bf2dbe11132ad35b5e94a5b1b5325 · Roger Ferrer / llvm-epi-0.8

Dec 07, 2011

Add bundle aware API for querying instruction properties and switch the code · 7f8e563a

Evan Cheng authored Dec 07, 2011

generator to it. For non-bundle instructions, these behave exactly the same
as the MC layer API.

For properties like mayLoad / mayStore, look into the bundle and if any of the
bundled instructions has the property it would return true.
For properties like isPredicable, only return true if *all* of the bundled
instructions have the property.
For properties like canFoldAsLoad, isCompare, conservatively return false for
bundles.

llvm-svn: 146026

7f8e563a

Zap unnecessary isIntDivCheap() check. PR11485. No testcase because this... · f9081a8a
Eli Friedman authored Dec 07, 2011
```
Zap unnecessary isIntDivCheap() check.  PR11485.  No testcase because this doesn't affect any in-tree target.

llvm-svn: 146015
```
f9081a8a
Add missing check. · 6ad68485
Jakob Stoklund Olesen authored Dec 07, 2011
```
llvm-svn: 146004
```
6ad68485
Support vector bitcasts in the AsmPrinter. PR11495. · ed8b3e38
Eli Friedman authored Dec 07, 2011
```
llvm-svn: 146001
```
ed8b3e38

Add MachineOperand IsInternalRead flag. · b0d91abe

Jakob Stoklund Olesen authored Dec 07, 2011

This flag is used when bundling machine instructions.  It indicates
whether the operand reads a value defined inside or outside its bundle.

llvm-svn: 145997

b0d91abe

Fix an optimization involving EXTRACT_SUBVECTOR in DAGCombine so it behaves correctly. PR11494. · 0e58cba2
Eli Friedman authored Dec 07, 2011
```
llvm-svn: 145996
```
0e58cba2
Remove unneeded type. · c007ab85
Jakub Staszak authored Dec 07, 2011
```
llvm-svn: 145995
```
c007ab85
- Remove unneeded #includes. · d4d2b05e
Jakub Staszak authored Dec 06, 2011
```
- Remove unused types/fields.
- Add some constantness.

llvm-svn: 145993
```
d4d2b05e

Dec 06, 2011

First chunk of MachineInstr bundle support. · 2a81dd4a

Evan Cheng authored Dec 06, 2011

1. Added opcode BUNDLE
2. Taught MachineInstr class to deal with bundled MIs
3. Changed MachineBasicBlock iterator to skip over bundled MIs; added an iterator to walk all the MIs
4. Taught MachineBasicBlock methods about bundled MIs

llvm-svn: 145975

2a81dd4a

Pretty-print basic block alignment. · 2a2b37ea
Jakob Stoklund Olesen authored Dec 06, 2011
```
llvm-svn: 145965
```
2a2b37ea
use space star instead of star space · ac35a4d0
Sebastian Pop authored Dec 06, 2011
```
llvm-svn: 145944
```
ac35a4d0
add missing point at the end of sentences · 9aa6137d
Sebastian Pop authored Dec 06, 2011
```
llvm-svn: 145943
```
9aa6137d
Mix some minor misuse of MachineBasicBlock iterator. · c1610bed
Evan Cheng authored Dec 06, 2011
```
llvm-svn: 145903
```
c1610bed

Removed isWinToJoinCrossClass from the register coalescer. · d2971264

Pete Cooper authored Dec 06, 2011

The new register allocator is much more able to split back up ranges too constrained by register classes.

Fixes <rdar://problem/10466609>

llvm-svn: 145899

d2971264

Kill off the LoopSplitter. It's not being used or maintained. · 52f24d7a
Lang Hames authored Dec 06, 2011
```
llvm-svn: 145897
```
52f24d7a
Update PBQP's analysis usage to reflect the requirements of the inline spiller. · b13b6a04
Lang Hames authored Dec 06, 2011
```
llvm-svn: 145893
```
b13b6a04

Use logarithmic units for basic block alignment. · 10e12522

Jakob Stoklund Olesen authored Dec 06, 2011

This was actually a bit of a mess. TLI.setPrefLoopAlignment was clearly
documented as taking log2(bytes) units, but the x86 target would still
set a preferred loop alignment of '16'.

CodePlacementOpt passed this number on to the basic block, and
AsmPrinter interpreted it as bytes.

Now both MachineFunction and MachineBasicBlock use logarithmic
alignments.

Obviously, MachineConstantPool still measures alignments in bytes, so we
can emulate the thrill of using as.

llvm-svn: 145889

10e12522

Dec 05, 2011

· 3924cb02

Nadav Rotem authored Dec 05, 2011

Add support for vectors of pointers.

llvm-svn: 145801

3924cb02

Dec 04, 2011
- Add inline subprogram names to the name lookup table since they may · 8dda5d0f
  Eric Christopher authored Dec 04, 2011
```
not get there any other way.

llvm-svn: 145789
```
  8dda5d0f
- Emit the ctors in the proper order on ARM/EABI. · 965e0c6d
  Anton Korobeynikov authored Dec 03, 2011
```
Maybe some targets should use this as well.

Patch by Evgeniy Stepanov!

llvm-svn: 145781
```
  965e0c6d
Dec 03, 2011
- Simplify code. No functionality change. · 71ba18c1
  Benjamin Kramer authored Dec 03, 2011
```
-3% on ARMDissasembler.cpp.

llvm-svn: 145773
```
  71ba18c1
Dec 02, 2011

Move global variables in TargetMachine into new TargetOptions class. As an API · 50f02cb2

Nick Lewycky authored Dec 02, 2011

change, now you need a TargetOptions object to create a TargetMachine. Clang
patch to follow.

One small functionality change in PTX. PTX had commented out the machine
verifier parts in their copy of printAndVerify. That now calls the version in
LLVMTargetMachine. Users of PTX who need verification disabled should rely on
not passing the command-line flag to enable it.

llvm-svn: 145714

50f02cb2

make sure ScheduleDAGInstrs::EmitSchedule does not crash when the first... · 42018202

Hal Finkel authored Dec 02, 2011

make sure ScheduleDAGInstrs::EmitSchedule does not crash when the first instruction in Sequence is a Noop

llvm-svn: 145677

42018202

Dec 01, 2011
- CodeGen: fix CMake build · c19f0b73
  Dylan Noblesmith authored Dec 01, 2011
```
Missing file from r145629.

llvm-svn: 145634
```
  c19f0b73
- Add a deterministic finite automaton based packetizer for VLIW architectures · 08ebdc1e
  Anshuman Dasgupta authored Dec 01, 2011
```
llvm-svn: 145629
```
  08ebdc1e
Nov 29, 2011
- If fast-isel fails, remove dead instructions generated during the failed · 46addb9e
  Chad Rosier authored Nov 29, 2011
```
attempt.  

llvm-svn: 145425
```
  46addb9e
- build/CMake: Finish removal of add_llvm_library_dependencies. · 539d0a8a
  Daniel Dunbar authored Nov 29, 2011
```
llvm-svn: 145420
```
  539d0a8a
- On MachO, the pointer to the personality function should always be in the · e4cc3327
  Bill Wendling authored Nov 29, 2011
```
non_lazy_symbol_pointers section (__IMPORT,__pointers). Ignore the 'hidden' part
since that will place it in the wrong section.
<rdar://problem/10443720>

llvm-svn: 145356
```
  e4cc3327
Nov 28, 2011

Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of... · e7ab1a2f

Eli Friedman authored Nov 28, 2011

Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals.  Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do.

llvm-svn: 145304

e7ab1a2f

Revert r145273 and fix in SelectionDAG::InferPtrAlignment() instead. · 4a5b2040

Evan Cheng authored Nov 28, 2011

Conservatively returns zero when the GV does not specify an alignment nor is it
initialized. Previously it returns ABI alignment for type of the GV. However, if
the type is a "packed" type, then the under-specified alignments is attached to
the load / store instructions. In that case, the alignment of the type cannot be
trusted.
rdar://10464621

llvm-svn: 145300

4a5b2040

DAG combine should not increase alignment of loads / stores with alignment less · a4b6404c

Evan Cheng authored Nov 28, 2011

than ABI alignment. These are loads / stores from / to "packed" data structures.
Their alignments are intentionally under-specified.

rdar://10301431

llvm-svn: 145273

a4b6404c

80-column. · 61e8d102
Chad Rosier authored Nov 28, 2011
```
llvm-svn: 145267
```
61e8d102
Remove dead llvm.eh.sjlj.dispatchsetup intrinsic. · 5ebc95ff
Bill Wendling authored Nov 28, 2011
```
llvm-svn: 145263
```
5ebc95ff

Nov 27, 2011

Prevent rotating the blocks of a loop (and thus getting a backedge to be · 4f567207

Chandler Carruth authored Nov 27, 2011

fallthrough) in cases where we might fail to rotate an exit to an outer
loop onto the end of the loop chain.

Having *some* rotation, but not performing this rotation, is the primary
fix of thep performance regression with -enable-block-placement for
Olden/em3d (a whopping 30% regression). Still working on reducing the
test case that actually exercises this and the new rotation strategy out
of this code, but I want to check if this regresses other test cases
first as that may indicate it isn't the correct fix.

llvm-svn: 145195

4f567207

Take two on rotating the block ordering of loops. My previous attempt · 03adbd46

Chandler Carruth authored Nov 27, 2011

was centered around the premise of laying out a loop in a chain, and
then rotating that chain. This is good for preserving contiguous layout,
but bad for actually making sane rotations. In order to keep it safe,
I had to essentially make it impossible to rotate deeply nested loops.
The information needed to correctly reason about a deeply nested loop is
actually available -- *before* we layout the loop. We know the inner
loops are already fused into chains, etc. We lose information the moment
we actually lay out the loop.

The solution was the other alternative for this algorithm I discussed
with Benjamin and some others: rather than rotating the loop
after-the-fact, try to pick a profitable starting block for the loop's
layout, and then use our existing layout logic. I was worried about the
complexity of this "pick" step, but it turns out such complexity is
needed to handle all the important cases I keep teasing out of benchmarks.

This is, I'm afraid, a bit of a work-in-progress. It is still
misbehaving on some likely important cases I'm investigating in Olden.
It also isn't really tested. I'm going to try to craft some interesting
nested-loop test cases, but it's likely to be extremely time consuming
and I don't want to go there until I'm sure I'm testing the correct
behavior. Sadly I can't come up with a way of getting simple, fine
grained test cases for this logic. We need complex loop structures to
even trigger much of it.

llvm-svn: 145183

03adbd46

Fix an impressive type-o / spell-o Duncan noticed. · 9e466841
Chandler Carruth authored Nov 27, 2011
```
llvm-svn: 145181
```
9e466841

Rework a bit of the implementation of loop block rotation to not rely so · a0545809

Chandler Carruth authored Nov 27, 2011

heavily on AnalyzeBranch. That routine doesn't behave as we want given
that rotation occurs mid-way through re-ordering the function. Instead
merely check that there are not unanalyzable branching constructs
present, and then reason about the CFG via successor lists. This
actually simplifies my mental model for all of this as well.

The concrete result is that we now will rotate more loop chains. I've
added a test case from Olden highlighting the effect. There is still
a bit more to do here though in order to regain all of the performance
in Olden.

llvm-svn: 145179

a0545809

Introduce a loop block rotation optimization to the new block placement · 9ffb97e6

Chandler Carruth authored Nov 27, 2011

pass. This is designed to achieve one of the important optimizations
that the old code placement pass did, but more simply.

This is a somewhat rough and *very* conservative version of the
transform. We could get a lot fancier here if there are profitable cases
to do so. In particular, this only looks for a single pattern, it
insists that the loop backedge being rotated away is the last backedge
in the chain, and it doesn't provide any means of doing better in-loop
placement due to the rotation. However, it appears that it will handle
the important loops I am finding in the LLVM test suite.

llvm-svn: 145158

9ffb97e6

Move code into anonymous namespaces. · 7ba71be3
Benjamin Kramer authored Nov 26, 2011
```
llvm-svn: 145154
```
7ba71be3

Nov 24, 2011

Fix a silly use-after-free issue. A much earlier version of this code · 7adee1a0

Chandler Carruth authored Nov 24, 2011

need lots of fanciness around retaining a reference to a Chain's slot in
the BlockToChain map, but that's all gone now. We can just go directly
to allocating the new chain (which will update the mapping for us) and
using it.

Somewhat gross mechanically generated test case replicates the issue
Duncan spotted when actually testing this out.

llvm-svn: 145120

7adee1a0