- Apr 15, 2011
-
NAKAMURA Takumi authored
It broke several builds. llvm-svn: 129557
-
- Apr 14, 2011
-
Owen Anderson authored
llvm-svn: 129522
-
Rafael Espindola authored
size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518
-
Andrew Trick authored
This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunities from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down by 10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with conditional jump. Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion. llvm-svn: 129508
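The cmp/jump fusion mentioned above is easiest to see on a small compare-and-branch. The sketch below is illustrative only (the function and the commented codegen are assumptions, not part of this change): hardware macro-op fusion only applies when the compare and the conditional jump end up adjacent, which is what the scheduler change preserves.

```cpp
// Hedged illustration (not from the LLVM tree): Intel/AMD cores can fuse an
// adjacent cmp+jcc pair into one micro-op.  If the scheduler hoists an
// unrelated instruction between them, the fusion opportunity is lost.
int select_larger(int x, int y) {
  if (x < y)        // ideally lowered to something like:  cmpl %esi, %edi
    return y;       //                                     jge  .LBB0_2  <- fused
  return x;         // an intervening mov would break the cmp/jge pairing
}
```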
-
Chris Lattner authored
llvm-svn: 129503
-
Owen Anderson authored
During post-legalization DAG combining, be careful to only create shifts where the RHS is of the legal type for the new operation. llvm-svn: 129484
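As a rough illustration of the constraint (a hedged sketch, not the actual r129484 change; the helper name and structure are assumptions): a combine that runs after type legalization must not introduce a shift whose amount operand still has an illegal type, because no further legalization pass will clean it up.

```cpp
// Hedged sketch only; not the DAGCombiner code itself.
#include "llvm/CodeGen/SelectionDAG.h"
#include "llvm/Target/TargetLowering.h"
using namespace llvm;

static SDValue buildShiftIfLegal(SelectionDAG &DAG, const TargetLowering &TLI,
                                 SDValue Val, SDValue Amt, DebugLoc DL) {
  if (!TLI.isTypeLegal(Amt.getValueType()))
    return SDValue();   // post-legalization: bail out rather than create an illegal node
  return DAG.getNode(ISD::SHL, DL, Val.getValueType(), Val, Amt);
}
```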
-
- Apr 13, 2011
-
Devang Patel authored
Remove extra bytes that were added for gdb. We do not have a good pointer to the actual reason behind this FIXME. Spot checking suggests that newer gdb does not need this. llvm-svn: 129461
-
Jakob Stoklund Olesen authored
llvm-svn: 129442
-
Andrew Trick authored
Additional fixes: Do something reasonable for subtargets with generic itineraries by handling node latency the same as for an empty itinerary. Now nodes default to unit latency unless an itinerary explicitly specifies a zero-cycle stage or it is a TokenFactor chain. Original fixes: UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make the node latency adjustments work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129421
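The default-latency rule described above boils down to a small decision; the sketch below is a hedged paraphrase (not the actual ScheduleDAG code, and the parameter names are assumptions).

```cpp
// Plain nodes get unit latency; a node whose itinerary stage is explicitly
// zero cycles, or a TokenFactor chain node, gets zero latency.
static unsigned defaultLatency(bool IsTokenFactor, bool HasZeroCycleStage) {
  if (IsTokenFactor || HasZeroCycleStage)
    return 0;
  return 1;
}
```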
-
Eric Christopher authored
llvm-svn: 129417
-
Eric Christopher authored
registers for fast allocation. Fixes rdar://9207598 llvm-svn: 129408
-
Devang Patel authored
llvm-svn: 129407
-
Devang Patel authored
llvm-svn: 129406
-
Devang Patel authored
llvm-svn: 129405
-
Devang Patel authored
This mechanical patch moves type handling into CompileUnit from DwarfDebug. In case of multiple compile unit in one object file, each compile unit is responsible for its own set of type entries anyway. This refactoring makes this obvious. llvm-svn: 129402
-
Eric Christopher authored
llvm-svn: 129400
-
- Apr 12, 2011
-
Jakob Stoklund Olesen authored
Use a BitVector instead; we didn't need the smaller memory footprint anyway. This makes the greedy register allocator 10% faster. llvm-svn: 129390
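For context (a hedged usage sketch, not the change itself; the register counts and indices below are made up): llvm::BitVector stores one bit per index in a flat word array, so membership tests are a word load plus a mask, trading a little memory for speed compared with a sparse set.

```cpp
#include "llvm/ADT/BitVector.h"

bool demo() {
  llvm::BitVector LiveRegs(256);   // one bit per physical register
  LiveRegs.set(17);                // mark a register as live
  LiveRegs.reset(42);              // and another as dead
  return LiveRegs.test(17);        // O(1) word load + mask, no searching
}
```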
-
Andrew Trick authored
llvm-svn: 129385
-
Andrew Trick authored
UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make these heuristic adjustments to node latency work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129383
-
Jakob Stoklund Olesen authored
This merges the behavior of splitSingleBlocks into splitAroundRegion, so the RS_Region and RS_Block register stages can be coalesced. That means the leftover intervals after region splitting go directly to spilling instead of a second pass of per-block splitting. llvm-svn: 129379
-
Jakob Stoklund Olesen authored
This makes it possible to target multiple registers in one pass. llvm-svn: 129374
-
Jakob Stoklund Olesen authored
llvm-svn: 129373
-
Devang Patel authored
llvm-svn: 129368
-
Devang Patel authored
llvm-svn: 129367
-
Eric Christopher authored
llvm-svn: 129334
-
Jakob Stoklund Olesen authored
when compiling many small functions. llvm-svn: 129321
-
Nick Lewycky authored
mean that it has to be a ConstantArray of ConstantStruct. We might have ConstantAggregateZero, at either level, so don't crash on that. Also, semi-deprecate the sentinel value. The linker isn't aware of sentinels, so we end up with the two lists appended, each with their "sentinels" on them. Different parts of LLVM treated sentinels differently, so make them all just ignore the single entry and continue on with the rest of the list. llvm-svn: 129307
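A hedged sketch of the defensive pattern this implies for consumers of such lists (illustrative only; the function name and loop body are assumptions, not code from r129307): handle ConstantAggregateZero at either level, and skip rather than trust any sentinel-looking entry.

```cpp
#include "llvm/Constants.h"
#include "llvm/Module.h"
using namespace llvm;

// Walk @llvm.global_ctors without assuming ConstantArray-of-ConstantStruct.
static void visitCtorList(Module &M) {
  GlobalVariable *GV = M.getNamedGlobal("llvm.global_ctors");
  if (!GV || !GV->hasInitializer())
    return;
  ConstantArray *CA = dyn_cast<ConstantArray>(GV->getInitializer());
  if (!CA)                       // e.g. ConstantAggregateZero: nothing to do
    return;
  for (unsigned i = 0, e = CA->getNumOperands(); i != e; ++i) {
    ConstantStruct *CS = dyn_cast<ConstantStruct>(CA->getOperand(i));
    if (!CS)                     // an all-zero element, or a stray sentinel
      continue;                  // ignore it and keep walking the list
    // ... use CS->getOperand(0) (priority) and CS->getOperand(1) (function)
  }
}
```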
-
- Apr 11, 2011
-
Jakob Stoklund Olesen authored
weight limit has been exceeded. llvm-svn: 129305
-
Bill Wendling authored
the 'unwind' instruction. However, later on that instruction was converted into a jump to the basic block it was located in, causing an infinite loop when we get there. It turns out, we get there if the _Unwind_Resume_or_Rethrow call returns (which it's not supposed to do). It returns if it cannot find a place to unwind to. Thus we would get what appears to be a "hang" when in reality it's just that the EH couldn't be propagated further along. Instead of infinitely looping (or calling `unwind', which none of our back-ends support, since it's lowered into nothing), call the @llvm.trap() intrinsic. This may not conform to specific rules of a particular language, but it's rather better than infinitely looping. <rdar://problem/9175843&9233582> llvm-svn: 129302
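A hedged sketch of the replacement described above (not the actual patch; the helper is an assumption): where the self-jump used to be, emit a call to the trap intrinsic and terminate the block with unreachable.

```cpp
#include "llvm/Intrinsics.h"
#include "llvm/Module.h"
#include "llvm/Support/IRBuilder.h"
using namespace llvm;

// Terminate a (not yet terminated) block with a trap instead of letting it
// loop back on itself.
static void insertTrap(BasicBlock *BB) {
  Module *M = BB->getParent()->getParent();
  IRBuilder<> Builder(BB);
  Builder.CreateCall(Intrinsic::getDeclaration(M, Intrinsic::trap));
  Builder.CreateUnreachable();
}
```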
-
Evan Cheng authored
Look past copies when determining whether hoisting would end up inserting more copies. rdar://9266679 llvm-svn: 129297
-
Jakob Stoklund Olesen authored
LiveIntervals::findLiveInMBBs has to do a full binary search for each segment. llvm-svn: 129292
-
Evan Cheng authored
llvm-svn: 129287
-
Jakob Stoklund Olesen authored
Both coalescing and register allocation already check aliases for interference, so these extra segments are only slowing us down. This speeds up both linear scan and the greedy register allocator. llvm-svn: 129283
-
Jakob Stoklund Olesen authored
This particularly helps with the initial transfer of fixed intervals. llvm-svn: 129277
-
Jakob Stoklund Olesen authored
llvm-svn: 129276
-
Jakob Stoklund Olesen authored
In particular, don't repeatedly recompute the PIC base live range after rematerialization. llvm-svn: 129275
-
Jay Foad authored
llvm-svn: 129271
-
- Apr 09, 2011
-
Chris Lattner authored
Switch lowering probably shouldn't be using FP for this. This resolves PR9581. llvm-svn: 129199
-
Jakob Stoklund Olesen authored
It is common for large live ranges to have few basic blocks with register uses and many live-through blocks without any uses. This approach grows the Hopfield network incrementally around the use blocks, completely avoiding interference checks for some of the live-through blocks. llvm-svn: 129188
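The general idea can be illustrated with a generic lazy region-growth loop; this is a hedged sketch under assumed names, not the SpillPlacement code: seed the active set with the blocks that actually use the register and only pull in neighbouring live-through blocks when an already-active block asks to grow, so blocks far from any use are never examined at all.

```cpp
#include <deque>
#include <set>
#include <vector>

std::set<int> growAroundUses(const std::vector<int> &UseBlocks,
                             const std::vector<std::vector<int>> &Neighbors,
                             bool (*wantsToGrow)(int Block)) {
  std::set<int> Active(UseBlocks.begin(), UseBlocks.end());
  std::deque<int> Work(UseBlocks.begin(), UseBlocks.end());
  while (!Work.empty()) {
    int B = Work.front();
    Work.pop_front();
    if (!wantsToGrow(B))                 // the network decided this block stays out
      continue;
    for (size_t i = 0; i < Neighbors[B].size(); ++i)
      if (Active.insert(Neighbors[B][i]).second)  // newly activated neighbour
        Work.push_back(Neighbors[B][i]);
  }
  return Active;                         // blocks whose interference was actually checked
}
```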
-
Jakob Stoklund Olesen authored
This doesn't require seeking in the live interval union, so it is very cheap. llvm-svn: 129187
-