Commits · 61f67624c3a3fc56f6445e9f533d06fbcfa0ef18 · Roger Ferrer / llvm-epi-0.8

Aug 05, 2008

PR2621: Improvements to the SCEV AddRec binomial expansion. This · 61f67624

Eli Friedman authored Aug 04, 2008

version uses a new algorithm for evaluating the binomial coefficients 
which is significantly more efficient for AddRecs of more than 2 terms 
(see the comments in the code for details on how the algorithm works).  
It also fixes some bugs: it removes the arbitrary length restriction for 
AddRecs, it fixes the silent generation of incorrect code for AddRecs 
which require a wide calculation width, and it fixes an issue where we 
were incorrectly truncating the iteration count too far when evaluating 
an AddRec expression narrower than the induction variable.

There are still a few related issues I know of: I think there's 
still an issue with the SCEVExpander expansion of AddRec in terms of
the width of the induction variable used.  The hack to avoid generating 
too-wide integers shouldn't be necessary; instead, the callers should be 
considering the cost of the expansion before expanding it (in addition 
to not expanding too-wide integers, we might not want to expand 
expressions that are really expensive, especially when optimizing for 
size; calculating a length-17 32-bit AddRec currently generates about 250 
instructions of straight-line code on X86).  Also, for long 32-bit 
AddRecs on X86, CodeGen really sucks at scheduling the code.  I'm planning on 
filing follow-up PRs for these issues.

llvm-svn: 54332
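
For readers unfamiliar with the expansion, here is a minimal sketch of the closed form an AddRec evaluates to, not the code from this commit. It assumes the usual chain-of-recurrences reading, where `{c0,+,c1,+,...,ck}` on iteration `n` equals the sum of `c_i * C(n, i)`. The function names `addrec_at` and `addrec_by_stepping` are hypothetical, and Python's unbounded integers stand in for the "wide calculation width" the message mentions.

```python
from math import comb


def addrec_at(coeffs, n, width):
    """Value of the AddRec {c0,+,c1,+,...,ck} on iteration n, modulo 2**width."""
    # Closed form: sum over i of c_i * C(n, i). Python integers are unbounded,
    # so the division hidden inside C(n, i) is exact; a fixed-width
    # implementation has to compute it in a wider type before truncating.
    total = sum(c * comb(n, i) for i, c in enumerate(coeffs))
    return total % (1 << width)


def addrec_by_stepping(coeffs, n, width):
    """Reference: run the recurrence n times, wrapping at every step."""
    mask = (1 << width) - 1
    state = [c & mask for c in coeffs]
    for _ in range(n):
        # Each term absorbs the old value of the term to its right.
        for i in range(len(state) - 1):
            state[i] = (state[i] + state[i + 1]) & mask
    return state[0]
```

Comparing the two functions over a range of `n` is a cheap way to check a fixed-width implementation against the wrapping recurrence it is meant to replace.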

- Fix SDISel lowering of PHI nodes to use ComputeValueVTs. · 90c724ca
  Dan Gohman authored Aug 04, 2008
```
This allows it to work correctly on aggregate values.
This fixes PR2623.

llvm-svn: 54331
```
- Fix SDISel lowering of zeroinitializer and undef to use ComputeValueVTs. · 6e023e63
  Dan Gohman authored Aug 04, 2008
```
This allows it to work correctly on nested aggregate values.
This fixes PR2625.

llvm-svn: 54330
```
- Add an assert to catch invalid VECTOR_SHUFFLE mask indices. · 8ef79ebd
  Dan Gohman authored Aug 04, 2008
```
llvm-svn: 54329
```

Aug 04, 2008
- Mips ISelLowering cleanup: Removed old LowerCALL and FORMAL_ARGS helpers, they · a01ede2f
  Bruno Cardoso Lopes authored Aug 04, 2008
```
aren't used anyway. They also used to break compilation when fastcc was specified for a
function, but not anymore.

llvm-svn: 54316
```
- Handle i32->f32 bitconvert results. · 2ca70df5
  Bruno Cardoso Lopes authored Aug 04, 2008
```
llvm-svn: 54315
```
Aug 03, 2008
- Add atomic sub for other sizes · 77e3e86e
  Andrew Lenharth authored Aug 03, 2008
```
llvm-svn: 54314
```
- Emit saveri with the correct operand order, patch by Richard Pennington! · 796e9be3
  Chris Lattner authored Aug 03, 2008
```
llvm-svn: 54313
```
- Fix PR2615 · 3e667cfe
  Bruno Cardoso Lopes authored Aug 03, 2008
```
llvm-svn: 54312
```
Aug 02, 2008
- Improved asm inline for hi,lo results · 3d4bdcc1
  Bruno Cardoso Lopes authored Aug 02, 2008
```
Added hi,lo registers as implicit uses and defs. This provides better handling of
instructions which use hi/lo.
Fixes a small BranchAnalysis bug.

llvm-svn: 54274
```
- Apply the same pattern used in 'and' lowering for 'or' · 3397298b
  Bruno Cardoso Lopes authored Aug 02, 2008
```
llvm-svn: 54273
```
Aug 01, 2008
- Fix comment typos. · c1e48b58
  Duncan Sands authored Aug 01, 2008
```
llvm-svn: 54266
```
Jul 31, 2008
- Expand fcopysign · e4798c83
  Bruno Cardoso Lopes authored Jul 31, 2008
```
llvm-svn: 54250
```
- Handle more SELECT corner cases considering legalize types, probably won't work with · 23471047
  Bruno Cardoso Lopes authored Jul 31, 2008
```
the default legalizer.

llvm-svn: 54249
```
- Add a flag to disable jump table generation (all · c31eb205
  Dale Johannesen authored Jul 31, 2008
```
switches use the binary search algorithm) for
environments that don't support it.  PPC64 JIT
is such an environment; turn the flag on for that.

llvm-svn: 54248
```
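
To illustrate the trade-off behind the flag, here is a minimal sketch of the two dispatch strategies. It is not LLVM's lowering code; the function names and the data layout are hypothetical.

```python
from bisect import bisect_left


def dispatch_jump_table(table, low, value, default):
    """table[i] is the target for case value low + i; holes hold default."""
    index = value - low
    if 0 <= index < len(table):
        return table[index]  # one range check, then one indexed lookup
    return default


def dispatch_binary_search(cases, value, default):
    """cases: list of (case_value, target) pairs sorted by case_value."""
    keys = [k for k, _ in cases]
    i = bisect_left(keys, value)  # O(log n) comparisons, no table needed
    if i < len(keys) and keys[i] == value:
        return cases[i][1]
    return default
```

The binary search form needs only compares and direct branches, which is why it suits an environment that cannot emit or relocate a table of code addresses.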
- Improve dagcombining for sext-loads and sext-in-reg nodes. · 345d63cc
  Dan Gohman authored Jul 31, 2008
```
llvm-svn: 54239
```
Jul 30, 2008
- Added pattern for floating point zero immediate (avoiding a constant pool · 2d7ddea2
  Bruno Cardoso Lopes authored Jul 30, 2008
```
access).
Added pattern to match bitconvert node.
Fixed MTC1 asm string bug.

llvm-svn: 54229
```
- Move SelectionDAG::viewGraph() out of line; as an inline function · 88e0df0c
  Dan Gohman authored Jul 30, 2008
```
it isn't always visible to gdb.

llvm-svn: 54228
```
- Don't look for leaf values to store when lowering stores of · 2fe43526
  Dan Gohman authored Jul 30, 2008
```
empty structs. This fixes PR2612.

llvm-svn: 54226
```
- Use existing LiveInterval methods to simplify live interval merging. Thanks... · c818c015
  Owen Anderson authored Jul 30, 2008
```
Use existing LiveInterval methods to simplify live interval merging.  Thanks to Evan for pointing these out.

llvm-svn: 54225
```
- Reapply r54147 with a constraint to only use the 8-bit · 86b06335
  Dan Gohman authored Jul 30, 2008
```
subreg form on x86-64, to avoid the problem with x86-32
having GPRs that don't have 8-bit subregs.

Also, change several 16-bit instructions to use 
equivalent 32-bit instructions. These have a smaller
encoding and avoid partial-register updates.

llvm-svn: 54223
```
- Value numbers whose def index is a special sentinel value should not be remapped. · 7b5f5355
  Owen Anderson authored Jul 30, 2008
```
llvm-svn: 54218
```
- Fixed bug in global address lowering for functions and in Brcond lowering · a9504222
  Bruno Cardoso Lopes authored Jul 30, 2008
```
llvm-svn: 54215
```
- Removed small section flag for mips, the assembler doesn't support this flag · 57e17f0e
  Bruno Cardoso Lopes authored Jul 30, 2008
```
llvm-svn: 54214
```
- Added new features to represent specific instructions groups · f714e25f
  Bruno Cardoso Lopes authored Jul 30, 2008
```
llvm-svn: 54213
```
- Instruction definition cleanup · 89e2b163
  Bruno Cardoso Lopes authored Jul 30, 2008
```
llvm-svn: 54212
```
- Added support for overloading intrinsics (atomics) based on pointers · 2c839d4b
  Mon P Wang authored Jul 30, 2008
```
to different address spaces.  This alters the naming scheme for those
intrinsics, e.g., atomic.load.add.i32 => atomic.load.add.i32.p0i32

llvm-svn: 54195
```
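
The renaming in the example above can be sketched as a small helper. This is only an illustration: `overloaded_intrinsic_name` is a hypothetical function, and reading the suffix as `p<address space><pointee type>` is an assumption generalized from the single example in the message.

```python
def overloaded_intrinsic_name(base, value_type, address_space=0):
    # Hypothetical helper: append the pointer operand's type as a suffix.
    # Assumption: the suffix is "p" + address space number + pointee type,
    # as in the "p0i32" of the commit message's example.
    return f"{base}.{value_type}.p{address_space}{value_type}"


# The one case the commit message spells out:
name = overloaded_intrinsic_name("atomic.load.add", "i32")
```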
- Another SCEV issue from PR2607; essentially the same issue, but this · 4736916a
  Eli Friedman authored Jul 30, 2008
```
time applying to the implicit comparison in smin expressions. The 
correct way to transform an inequality into the opposite 
inequality, either signed or unsigned, is with a not expression.

I looked through the SCEV code, and I don't think there are any more 
occurrences of this issue.

llvm-svn: 54194
```
- More fixes for corner cases when remapping live range indices. · e9a0bae2
  Owen Anderson authored Jul 30, 2008
```
llvm-svn: 54186
```
- When merging live intervals, we also need to merge in any live ranges that are... · 1aebe49a
  Owen Anderson authored Jul 30, 2008
```
When merging live intervals, we also need to merge in any live ranges that are inputs to two-address instructions
that themselves define a range we already care about.

llvm-svn: 54185
```
- Fix for PR2607: SCEV miscomputing the loop count for loops with an · 5ae90441
  Eli Friedman authored Jul 30, 2008
```
SGT exit condition.  Essentially, the correct way to flip an inequality 
in 2's complement is the not operator, not the negation operator.  
That said, the difference only affects cases involving INT_MIN.

Also, enhance the pre-test search logic to be a bit smarter about 
inequalities flipped with a not operator, so it can eliminate the smax 
from the iteration count for simple loops.

llvm-svn: 54184
```
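
The point about not versus negation, which both PR2607 fixes rely on, can be checked exhaustively at a small bit width. The following is a minimal sketch using 8-bit two's complement values; the helper names are hypothetical and nothing here is taken from the SCEV code.

```python
WIDTH = 8
MASK = (1 << WIDTH) - 1
INT_MIN = -(1 << (WIDTH - 1))


def to_signed(x):
    """Interpret the low WIDTH bits of x as a two's complement integer."""
    x &= MASK
    return x - (1 << WIDTH) if x & (1 << (WIDTH - 1)) else x


def bit_not(x):
    return to_signed(~x)


def negate(x):
    return to_signed(-x)


values = range(INT_MIN, -INT_MIN)

# a > b  <=>  ~a < ~b holds for every pair, because ~x == -x - 1 never wraps.
not_failures = [(a, b) for a in values for b in values
                if (a > b) != (bit_not(a) < bit_not(b))]

# a > b  <=>  -a < -b breaks when exactly one side is INT_MIN,
# because -INT_MIN wraps back to INT_MIN.
neg_failures = [(a, b) for a in values for b in values
                if (a > b) != (negate(a) < negate(b))]

# The same order reversal lets a signed min be written with a signed max.
smin_ok = all(min(a, b) == bit_not(max(bit_not(a), bit_not(b)))
              for a in values for b in values)
```

Here `not_failures` comes out empty, while every pair in `neg_failures` contains `INT_MIN`, matching the remark that the difference only affects cases involving INT_MIN.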
Jul 29, 2008
- When merging a PHI operand's live interval into the PHI's live interval, we... · 6b1cc46f
  Owen Anderson authored Jul 29, 2008
```
When merging a PHI operand's live interval into the PHI's live interval, we need to merge over all liveranges in
the operand's interval that share the relevant value number, not just the range that immediately precedes the PHI.

llvm-svn: 54174
```
- Don't decrement the BB remap when we don't need to. · 2532e759
  Owen Anderson authored Jul 29, 2008
```
llvm-svn: 54173
```
- Fix PR2609. If a label is deleted, then it needs · fa412053
  Duncan Sands authored Jul 29, 2008
```
to be marked invalid regardless of whether it is
a debug, an exception handling or (hopefully) a
GC label.

llvm-svn: 54172
```
- Changed the order of some methods. · 2a241573
  Bruno Cardoso Lopes authored Jul 29, 2008
```
llvm-svn: 54169
```
- Fix broken CellSPU lowering, re-instate braces in Legalize · 82f19257
  Nate Begeman authored Jul 29, 2008
```
llvm-svn: 54168
```
- Added floating point lowering for select. · e683bbab
  Bruno Cardoso Lopes authored Jul 29, 2008
```
llvm-svn: 54167
```
- Disable a fix in the previous patch, since it breaks CellSPU. · d63495ff
  Nate Begeman authored Jul 29, 2008
```
The CellSPU codegen is broken, but needs to be fixed before we can
put this back in.

llvm-svn: 54164
```
- Add vector shifts to the IR, patch by Eli Friedman. · fecbc8cf
  Nate Begeman authored Jul 29, 2008
```
CodeGen & Clang work coming next.

llvm-svn: 54161
```
- Add -unroll-allow-partial command line option that enables the loop unroller to · 98b5c16e
  Matthijs Kooijman authored Jul 29, 2008
```
partially unroll a loop when fully unrolling would not fit under the threshold.

Patch by Mikael Lepistö.

llvm-svn: 54160
```
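
A minimal sketch of the decision this option enables, under stated assumptions: the function name and parameters are hypothetical, and the rule of picking the largest count that divides the trip count evenly is one plausible policy, not something the commit message specifies.

```python
def choose_unroll_count(trip_count, loop_size, threshold, allow_partial):
    """Return how many copies of the loop body to emit (1 means no unrolling)."""
    if trip_count * loop_size <= threshold:
        return trip_count  # the fully unrolled loop fits under the threshold
    if not allow_partial:
        return 1  # too big to unroll fully, so leave the loop alone
    # Assumed policy: the largest count under the threshold that divides the
    # trip count evenly, so that no remainder loop is needed.
    count = threshold // loop_size
    while count > 1 and trip_count % count != 0:
        count -= 1
    return max(count, 1)
```

For example, a 12-iteration loop of size 10 with a threshold of 50 cannot be unrolled fully (120 > 50), but with partial unrolling allowed this sketch picks a count of 4.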