Commits · 27e0a4ab86a9066f19e4027d2a6ead4e51e40d76 · Roger Ferrer / llvm-epi-0.8

Mar 05, 2011

Jakob Stoklund Olesen authored Mar 05, 2011

The coalescer can in very rare cases leave too large live intervals around after
rematerializing cheap-as-a-move instructions.

Linear scan doesn't really care, but live range splitting gets very confused
when a live range is killed by a ghost instruction.

I will fix this properly in the coalescer after 2.9 branches.

llvm-svn: 127096

27e0a4ab

Remove unused conditional negate operations. · 00d09428
Bob Wilson authored Mar 05, 2011
```
llvm-svn: 127090
```
00d09428

InstCombine: We know the number of items initially added to the worklist map,... · 08c913b6

Benjamin Kramer authored Mar 05, 2011

InstCombine: We know the number of items initially added to the worklist map, reserve space early to avoid rehashing.

llvm-svn: 127089

08c913b6

ptx: add basic intrinsic support · 369ea3fd
Che-Liang Chiou authored Mar 05, 2011
```
llvm-svn: 127084
```
369ea3fd
Be explicit with abs(). Visual Studio workaround. · 25cedf3f
Andrew Trick authored Mar 05, 2011
```
llvm-svn: 127075
```
25cedf3f
Fix for -sched-high-latency-cycles in sched=list-ilp mode. · d7f4c216
Andrew Trick authored Mar 05, 2011
```
llvm-svn: 127071
```
d7f4c216
Fix PR9398 - 10% of llc compile time is spent in Value::getNumUses. This reduces · 13c885d1
Cameron Zwarich authored Mar 05, 2011
```
the percentage of time spent in CodeGenPrepare when llcing 403.gcc from 12.6% to
1.8% of total llc time.

llvm-svn: 127069
```
13c885d1
Missing comment. · b8390b7a
Andrew Trick authored Mar 05, 2011
```
llvm-svn: 127068
```
b8390b7a

Increased the register pressure limit on x86_64 from 8 to 12 · 641e2d4f

Andrew Trick authored Mar 05, 2011

regs. This is the only change in this checkin that may affects the
default scheduler. With better register tracking and heuristics, it
doesn't make sense to artificially lower the register limit so much.

Added -sched-high-latency-cycles and X86InstrInfo::isHighLatencyDef to
give the scheduler a way to account for div and sqrt on targets that
don't have an itinerary. It is currently defaults to 10 (the actual
number doesn't matter much), but only takes effect on non-default
schedulers: list-hybrid and list-ilp.

Added several heuristics that can be individually disabled for the
non-default sched=list-ilp mode. This helps us determine how much
better we can do on a given benchmark than the default
scheduler. Certain compute intensive loops run much faster in this
mode with the right set of heuristics, and it doesn't seem to have
much negative impact elsewhere. Not all of the heuristics are needed,
but we still need to experiment to decide which should be disabled by
default for sched=list-ilp.

llvm-svn: 127067

641e2d4f

whitespace · 27c079e1
Andrew Trick authored Mar 05, 2011
```
llvm-svn: 127065
```
27c079e1

Thread comparisons over udiv/sdiv/ashr/lshr exact and lshr nuw/nsw whenever · 9719a719

Nick Lewycky authored Mar 05, 2011

possible. This goes into instcombine and instsimplify because instsimplify
doesn't need to check hasOneUse since it returns (almost exclusively) constants.

This fixes PR9343 #4 #5 and #8!

llvm-svn: 127064

9719a719

Try once again to optimize "icmp (srem X, Y), Y" by turning the comparison into · 25cc338d
Nick Lewycky authored Mar 05, 2011
```
true/false or "icmp slt/sge Y, 0".

llvm-svn: 127063
```
25cc338d

Rework the global split cost calculation. · 1a9b66c7

Jakob Stoklund Olesen authored Mar 05, 2011

The global cost is the sum of block frequencies for spill code that must be
inserted because preferences weren't met.

llvm-svn: 127062

1a9b66c7

Compute the constraints for global live range splitting from an interference pattern. · 4b598e15

Jakob Stoklund Olesen authored Mar 05, 2011

This simplifies the code and makes it faster too.

The interference patterns are saved for each candidate register. It will be
reused for actually executing the split. Work in progress.

llvm-svn: 127054

4b598e15

Teach the register scavenger to take subregs into account when finding a free register. · dc55428d
Jim Grosbach authored Mar 05, 2011
```
llvm-svn: 127049
```
dc55428d
Support unregistering exception frames of functions when they are removed. · f045b7ab
Eric Christopher authored Mar 04, 2011
```
Patch by Johannes Schaub!

Fixes PR8548

llvm-svn: 127047
```
f045b7ab

Mar 04, 2011
- Improve readability with some whitespace! · 40326989
  Eric Christopher authored Mar 04, 2011
```
llvm-svn: 127043
```
  40326989
- Extract a method. No functional change. · 05a2f517
  Jakob Stoklund Olesen authored Mar 04, 2011
```
llvm-svn: 127040
```
  05a2f517
- Initialize variable. · 88842e45
  Bill Wendling authored Mar 04, 2011
```
llvm-svn: 127038
```
  88842e45
- Go back to comparing spill weights when deciding if interference can be evicted. · d7e1bb80
  Jakob Stoklund Olesen authored Mar 04, 2011
```
It gives better results. Sometimes, a live range can be large and still have
high spill weight. Such a range should not be spilled.

llvm-svn: 127036
```
  d7e1bb80
- Improve div/rem node handling on mips. Patch by Akira Hatanaka · 434248a6
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127034
```
  434248a6
- Expands register/immediate pairs when the immediate is too large to fit in... · a744ef3f
  Bruno Cardoso Lopes authored Mar 04, 2011
```
Expands register/immediate pairs when the immediate is too large to fit in 16-bit field. Patch by Akira Hatanaka

llvm-svn: 127032
```
  a744ef3f
- When decling to reuse existing expressions that involve casts, ignore · aa036eed
  Dan Gohman authored Mar 04, 2011
```
bitcasts, which are really no-ops here. This fixes slowdowns on
MultiSource/Applications/aha and others.

llvm-svn: 127031
```
  aa036eed
- Rewrite and simplify o32 vaarg passing, no functional changes. Patch by Sasa Stankovic · 8887d659
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127029
```
  8887d659
- Be nice to Xcore and the XMOS assembler and avoid quoting section names · 62f75979
  Joerg Sonnenberger authored Mar 04, 2011
```
that contain only letters, digits and the characters "_" and ".".

llvm-svn: 127028
```
  62f75979
- Lowers block address. Currently asserts when relocation model is not PIC. Patch by Akira Hatanaka · f8198e43
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127027
```
  f8198e43
- raw_ostream: while it is generally desirable to do larger writes, it can lead to · dfb0ad30
  Benjamin Kramer authored Mar 04, 2011
```
inefficient file system buffering if the writes are not a multiple of the desired
buffer size. Avoid this by limiting the large write to a multiple of the buffer
size and copying the remainder into the buffer.

Thanks to Dan for pointing this out.

llvm-svn: 127026
```
  dfb0ad30
- Renumber slot indexes locally when possible. · b8e6fdc2
  Jakob Stoklund Olesen authored Mar 04, 2011
```
Initially, slot indexes are quad-spaced. There is room for inserting up to 3
new instructions between the original instructions.

When we run out of indexes between two instructions, renumber locally using
double-spaced indexes. The original quad-spacing means that we catch up quickly,
and we only have to renumber a handful of instructions to get a monotonic
sequence. This is much faster than renumbering the whole function as we did
before.

llvm-svn: 127023
```
  b8e6fdc2
- Revert broken srem logic from r126991. · 41c529bd
  Nick Lewycky authored Mar 04, 2011
```
llvm-svn: 127021
```
  41c529bd
- Fix an old copy-n-paste · 328e2ce0
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127020
```
  328e2ce0
- Disable ARMGlobalMerge on darwin. The debugger is not yet able to extract... · a0d73fd6
  Devang Patel authored Mar 04, 2011
```
Disable ARMGlobalMerge on darwin. The debugger is not yet able to extract individual variable's info from merged global.

llvm-svn: 127019
```
  a0d73fd6
- Expands FCOS and FSIN nodes when type is f64. · 22b69db8
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127017
```
  22b69db8
- Number SlotIndexes uniformly without looking at the number of defs on each instruction. · 348d8e8b
  Jakob Stoklund Olesen authored Mar 04, 2011
```
You can't really predict how many indexes will be needed from the number of
defs, so let's keep it simple.

Also remove an extra empty index that was inserted after each basic block. It
was intended for live-out ranges, but it was never used that way.

llvm-svn: 127014
```
  348d8e8b
- raw_ostream: If writing a string that is larger than the buffer, write it... · acf08420
  Benjamin Kramer authored Mar 04, 2011
```
raw_ostream: If writing a string that is larger than the buffer, write it directly instead of doing many buffer-sized writes.

This caps the number of write(2) calls per string to a maximum of 2.

llvm-svn: 127010
```
  acf08420
- Add SlotIndex statistics. · b88f6adf
  Jakob Stoklund Olesen authored Mar 04, 2011
```
llvm-svn: 127007
```
  b88f6adf
- Tweak debug output. No functional changes. · d4f78895
  Jakob Stoklund Olesen authored Mar 04, 2011
```
llvm-svn: 127006
```
  d4f78895
- Fixes addc pattern when immediate cannot be represented with 16-bit. Patch by Akira Hatanaka · db93ddb4
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127005
```
  db93ddb4
- Remove (hopefully) all trailing whitespaces from the mips backend. Patch by Hatanaka, Akira · ed874eff
  Bruno Cardoso Lopes authored Mar 04, 2011
```
llvm-svn: 127003
```
  ed874eff
- Revert commit 126684 "Use the correct shift amount type". It is only the correct · 6bd10442
  Duncan Sands authored Mar 04, 2011
```
type after type legalization has completed.  Before then it may simply not be big
enough to hold the shift amount, particularly on x86 which uses a very small type
for shifts (this issue broke stuff in the past which is why LegalizeTypes carefully
uses a large type for shift amounts).

llvm-svn: 127000
```
  6bd10442
- Allow vector shifts (shl,lshr,ashr) on SPU. · a1d947dd
  Kalle Raiskila authored Mar 04, 2011
```
There was a previous implementation with patterns that would 
have matched e.g. 
	shl <v4i32> <i32>,
but this is not valid LLVM IR so they never were selected.

llvm-svn: 126998
```
  a1d947dd