- Feb 14, 2010
-
-
Dan Gohman authored
llvm-svn: 96179
-
Dan Gohman authored
llvm-svn: 96178
-
Dan Gohman authored
with multiplication by constants distributed through, occasionally those subexpressions can include both x and -x. For now, if this condition is discovered within LSR, just prune such cases away, as they won't be profitable. This fixes a "zero allocated in a base register" assertion failure. llvm-svn: 96177
-
Dan Gohman authored
the no-TLI case. But it should still default to declining the transformation. llvm-svn: 96152
-
Dan Gohman authored
because profitability can't be sufficiently approximated. llvm-svn: 96148
-
John McCall authored
from opt. llvm-svn: 96135
-
- Feb 13, 2010
-
-
Chris Lattner authored
llvm-svn: 96109
-
Chris Lattner authored
and add a doxygen comment. Cache the phi entry to avoid doing tons of PHINode::getBasicBlockIndex calls in the common case. On my insane testcase from re2c, this speeds up CGP from 617.4s to 7.9s (78x). llvm-svn: 96083
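A minimal sketch of the caching idea, with a hypothetical helper (not the actual CodeGenPrepare code): PHINode::getBasicBlockIndex is a linear scan of the PHI's operands, so computing the index once per predecessor and reusing it avoids quadratic behavior on huge PHIs like the re2c one.

    #include <cassert>
    #include "llvm/IR/Instructions.h"
    using namespace llvm;

    // Hypothetical helper: return the incoming value for Pred, reusing a
    // previously computed operand index when it is still valid.
    static Value *getIncomingValueCached(PHINode *PN, BasicBlock *Pred,
                                         int &CachedIdx) {
      // getBasicBlockIndex rescans the PHI's operand list on every call,
      // so calling it per query on a PHI with thousands of entries is slow.
      if (CachedIdx < 0 ||
          unsigned(CachedIdx) >= PN->getNumIncomingValues() ||
          PN->getIncomingBlock(unsigned(CachedIdx)) != Pred)
        CachedIdx = PN->getBasicBlockIndex(Pred);
      assert(CachedIdx >= 0 && "Pred must be a predecessor of the PHI");
      return PN->getIncomingValue(unsigned(CachedIdx));
    }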
-
Chris Lattner authored
instead of with pred_begin/end. llvm-svn: 96078
-
Dan Gohman authored
deterministically sorted. llvm-svn: 96071
-
- Feb 12, 2010
-
-
Dan Gohman authored
llvm-svn: 96005
-
Dan Gohman authored
offset distributions it doesn't expect. llvm-svn: 96002
-
Daniel Dunbar authored
is breaking llvm-gcc bootstrap. llvm-svn: 95988
-
Dan Gohman authored
doesn't matter, except that ScalarEvolution tends to need less time to fold the results this way. llvm-svn: 95979
-
Dan Gohman authored
bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975
-
- Feb 11, 2010
-
-
Chris Lattner authored
block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852
-
Chris Lattner authored
instead of considering x|undef -> x, which may not be true. llvm-svn: 95850
-
Devang Patel authored
llvm-svn: 95828
-
- Feb 10, 2010
-
-
Dan Gohman authored
llvm-svn: 95781
-
- Feb 09, 2010
-
-
Eric Christopher authored
consuming for a simple optimization. llvm-svn: 95671
-
Eric Christopher authored
llvm-svn: 95641
-
Eric Christopher authored
Initial skeleton and SCEVUnknown lowering implemented; the rest should come relatively quickly. Move the testcase to a new directory. Move the pass to right before SimplifyLibCalls, which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628
-
- Feb 06, 2010
-
-
Jakob Stoklund Olesen authored
llvm-svn: 95454
-
- Feb 05, 2010
-
-
Jakob Stoklund Olesen authored
Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases.

Code like this:

    void f(const char *x) {
      if (!x)
        puts("null");
      else if ((uintptr_t)x == 1)
        puts("one");
      else if (x == (char*)2 || x == (char*)3)
        puts("two");
      else if ((intptr_t)x == 4)
        puts("four");
      else
        puts(x);
    }

Now becomes a switch:

    define void @f(i8* %x) nounwind ssp {
    entry:
      %magicptr23 = ptrtoint i8* %x to i64            ; <i64> [#uses=1]
      switch i64 %magicptr23, label %if.else16 [
        i64 0, label %if.then
        i64 1, label %if.then2
        i64 2, label %if.then9
        i64 3, label %if.then9
        i64 4, label %if.then14
      ]

Note that LLVM's own DenseMap uses magic pointers.
llvm-svn: 95439
-
Dan Gohman authored
container data. This prevents it from holding onto dangling pointers and potentially behaving unpredictably. llvm-svn: 95409
-
Bob Wilson authored
short-circuited conditions to AND/OR expressions, and those expressions are often converted back to a short-circuited form in code gen. The original source order may have been optimized to take advantage of the expected values, and if we reassociate them, we change the order and subvert that optimization. Radar 7497329. llvm-svn: 95333
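A minimal illustration of the hazard, as a hypothetical C++ example (not code from the patch): the source order runs the almost-always-false test first, so the second test is rarely evaluated once codegen re-expands the condition into branches.

    // Hypothetical example: the programmer ordered the operands so the
    // usually-false test guards the other one.
    bool f(unsigned a, unsigned b) {
      // Optimizers may fold this short circuit into a single i1 AND (both
      // operands are safe to speculate), and codegen may expand it back
      // into branches. If Reassociate reorders the AND's operands in
      // between, the re-expanded branches test b first, subverting the
      // ordering the source chose for its expected values.
      return a == 0 && b > 100;
    }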
-
- Feb 03, 2010
-
-
Bob Wilson authored
The SRThreshold value makes perfect sense for checking if an entire aggregate should be promoted to a scalar integer, but it is not so good for splitting an aggregate into its separate elements. A struct may contain a large embedded array along with some scalar fields that would benefit from being split apart by SROA. Even if the total aggregate size is large, it may still be good to perform SROA. Thus, the most important piece of this patch is simply moving the aggregate size comparison vs. SRThreshold so that it guards only the aggregate promotion.

We have also been checking the number of elements to decide if an aggregate should be split up. The limit of "SRThreshold/4" seemed rather arbitrary, and I don't think it's very useful to derive this limit from SRThreshold anyway. I've collected some data showing that the current default limit of 32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct types. One thing suggested by the data is that distinguishing between structs and arrays might be useful. There are (obviously) a lot more large arrays than large structs (as measured by the number of elements and not the total size -- a large array inside a struct still counts as a single element given the way we do SROA right now). Out of 8377 arrays where we successfully performed SROA while compiling a large set of benchmarks, only 16 of them had more than 8 elements. And, for those 16 arrays, it's not at all clear that SROA was actually beneficial. So, to offset the compile time cost of investigating more large structs for SROA, the patch lowers the limit on array elements to 8.

This fixes Apple Radar 7563690.
llvm-svn: 95224
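A minimal sketch of the resulting decision split, with hypothetical names and constants taken from the message above (the real SROA code differs): the byte threshold guards only whole-aggregate promotion, while splitting is guarded by per-kind element limits.

    #include <cstdint>

    static const uint64_t SRThreshold       = 128; // bytes; promotion only
    static const unsigned MaxStructElements = 32;  // existing struct cutoff
    static const unsigned MaxArrayElements  = 8;   // lowered array cutoff

    // Promote a whole aggregate to a scalar integer only if it is small.
    bool shouldPromoteToInteger(uint64_t AggregateBytes) {
      return AggregateBytes <= SRThreshold;
    }

    // Split an aggregate by element count alone; total size is deliberately
    // not checked here, so a struct with a few scalar fields plus a big
    // embedded array can still be split.
    bool shouldSplitAggregate(bool IsArray, unsigned NumElements) {
      return NumElements <= (IsArray ? MaxArrayElements : MaxStructElements);
    }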
-
Evan Cheng authored
llvm-svn: 95198
-
Bob Wilson authored
llvm-svn: 95170
-
Eric Christopher authored
llvm-svn: 95165
-
Eric Christopher authored
llvm-svn: 95154
-
- Feb 02, 2010
-
-
Eric Christopher authored
Passed bootstrap and nightly test run here. llvm-svn: 95145
-
Chris Lattner authored
llvm-svn: 95055
-
Eric Christopher authored
don't use TargetData here. llvm-svn: 95040
-
Eric Christopher authored
llvm-svn: 95036
-
Eric Christopher authored
llvm-svn: 95035
-
Eric Christopher authored
llvm-svn: 95027
-
- Feb 01, 2010
-
-
Bob Wilson authored
disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007
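A minimal sketch of the two-phase shape described above, with hypothetical hooks (not the actual GVN load-PRE code): phase one is purely analytical, and the IR is only modified once every unavailable predecessor has passed the safety check.

    #include "llvm/ADT/SmallVector.h"
    #include "llvm/IR/Instructions.h"
    using namespace llvm;

    // Assumed hooks standing in for the real analysis and transformation.
    bool canHoistLoadInto(BasicBlock *Pred, LoadInst *Load);
    void insertLoadCopy(BasicBlock *Pred, LoadInst *Load);

    bool performLoadPRE(LoadInst *Load,
                        const SmallVectorImpl<BasicBlock *> &UnavailPreds) {
      // Phase 1: check safety only; bail out before touching anything.
      for (BasicBlock *P : UnavailPreds)
        if (!canHoistLoadInto(P, Load))
          return false;
      // Phase 2: safe in every predecessor, so actually move the load.
      for (BasicBlock *P : UnavailPreds)
        insertLoadCopy(P, Load);
      return true;
    }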
-
- Jan 31, 2010
-
-
Evan Cheng authored
Do not mark no-return calls tail calls. It'll screw up special calls like longjmp, and it doesn't make much sense for performance reasons. If my logic is faulty, please let me know. llvm-svn: 94937
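A minimal sketch of the rule, using real CallInst accessors but a hypothetical driver function:

    #include "llvm/IR/Instructions.h"
    using namespace llvm;

    // Hypothetical driver: mark a call as a tail call only if it can return.
    static void maybeMarkTailCall(CallInst *CI) {
      // A no-return call (e.g. longjmp or exit) never reaches a return in
      // this function, so tail-call treatment gains nothing and can break
      // longjmp-style control flow.
      if (CI->doesNotReturn())
        return;
      CI->setTailCall();
    }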
-
- Jan 30, 2010
-
-
Bob Wilson authored
unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875
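A minimal sketch of the combined test, with fully hypothetical parameters (the real code queries the IR for these values): speculate the load only if it stays inside the underlying object and the object is at least as aligned as the load.

    #include <cstdint>

    bool safeToSpeculateLoad(uint64_t Offset, uint64_t LoadSize,
                             uint64_t ObjectSize, unsigned ObjectAlign,
                             unsigned LoadAlign) {
      if (Offset + LoadSize > ObjectSize)
        return false;                    // offset check: load stays in bounds
      return ObjectAlign >= LoadAlign;   // added check: object alignment
    }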
-