Commits · b0ebb65ab0af336d40a2ad6b429df3d7f4a01030 · Lorenzo Albano / LLVM bpEVL

Feb 13, 2010

iterate over preds using PHI information when available instead of · b0ebb65a
Chris Lattner authored Feb 13, 2010
```
using pred_begin/end.  It is much faster.

llvm-svn: 96079
```
b0ebb65a
speed up CGP a bit by scanning predecessors through phi operands · 96b88265
Chris Lattner authored Feb 13, 2010
```
instead of with pred_begin/end.

llvm-svn: 96078
```
96b88265
Fix a pruning heuristic which implicitly assumed that SmallPtrSet is · 5b18f039
Dan Gohman authored Feb 13, 2010
```
deterministically sorted.

llvm-svn: 96071
```
5b18f039

Enable the inlinehint attribute in the Inliner. · 492b8b42

Jakob Stoklund Olesen authored Feb 13, 2010

Functions explicitly marked inline will get an inlining threshold slightly
more aggressive than the default for -O3. This means than -O3 builds are
mostly unaffected while -Os builds will be a bit bigger and faster.

The difference depends entirely on how many 'inline's are sprinkled on the
source.

In the CINT2006 suite, only these tests are significantly affected under -Os:

               Size   Time
471.omnetpp   +1.63% -1.85%
473.astar     +4.01% -6.02%
483.xalancbmk +4.60%  0.00%

Note that 483.xalancbmk runs too quickly to give useful timing results.

llvm-svn: 96066

492b8b42

Feb 12, 2010
- Reapply 95979, a compile-time speedup, now that the bug it exposed is fixed. · 2b75de97
  Dan Gohman authored Feb 12, 2010
```
llvm-svn: 96005
```
  2b75de97
- Fix this code to avoid dereferencing an end() iterator in · 363f847e
  Dan Gohman authored Feb 12, 2010
```
offset distributions it doesn't expect.

llvm-svn: 96002
```
  363f847e
- 1. modernize the constantmerge pass, using densemap/smallvector. · 75879be9
  Chris Lattner authored Feb 12, 2010
```
2. don't bother trying to merge globals in non-default sections,
   doing so is quite dubious at best anyway.
3. fix a bug reported by Arnaud de Grandmaison where we'd try to
   merge two globals in different address spaces.

llvm-svn: 95995
```
  75879be9
- Revert "Reverse the order for collecting the parts of an addrec. The order", it · e0b2c69d
  Daniel Dunbar authored Feb 12, 2010
```
is breaking llvm-gcc bootstrap.

llvm-svn: 95988
```
  e0b2c69d
- Reverse the order for collecting the parts of an addrec. The order · 0194f580
  Dan Gohman authored Feb 12, 2010
```
doesn't matter, except that ScalarEvolution tends to need less time
to fold the results this way.

llvm-svn: 95979
```
  0194f580
- Reapply the new LoopStrengthReduction code, with compile time and · 45774ce0
  Dan Gohman authored Feb 12, 2010
```
bug fixes, and with improved heuristics for analyzing foreign-loop
addrecs.

This change also flattens IVUsers, eliminating the stride-oriented
groupings, which makes it easier to work with.

llvm-svn: 95975
```
  45774ce0
Feb 11, 2010

Make sure that ConstantExpr offsets also aren't off of extern · cccdc136
Eric Christopher authored Feb 11, 2010
```
symbols.

Thanks to Duncan Sands for the testcase!

llvm-svn: 95877
```
cccdc136

Rename ValueRequiresCast to ShouldOptimizeCast, to better reflect · 4e8137d6

Chris Lattner authored Feb 11, 2010

what it does.  Enhance it to return false to optimizing vector
sign extensions from vector comparisions, which is the idiom used
to get a splatted vector for a vector comparison.

Doing this breaks vector-casts.ll, add some compensating 
transformations to handle the important case they cover without
depending on this canonicalization.

This fixes rdar://7434900 a serious pessimization of vector compares.

llvm-svn: 95855

4e8137d6

Make DSE only scan blocks that are reachable from the entry · c053cbbc

Chris Lattner authored Feb 11, 2010

block.  Other blocks may have pointer cycles that will crash
basicaa and other alias analyses.  In any case, there is no
point wasting cycles optimizing dead blocks.  This fixes 
rdar://7635088

llvm-svn: 95852

c053cbbc

Make jump threading honor x|undef -> true and x&undef -> false, · d924f636
Chris Lattner authored Feb 11, 2010
```
instead of considering x|undef -> x, which may not be true.

llvm-svn: 95850
```
d924f636
Add ConstantExpr handling to Intrinsic::objectsize lowering. · 531ea566
Eric Christopher authored Feb 11, 2010
```
Update testcase accordingly now that we can optimize another
section.

llvm-svn: 95846
```
531ea566
Ignore dbg info intrinsics. · 03936a18
Devang Patel authored Feb 11, 2010
```
llvm-svn: 95828
```
03936a18

Feb 10, 2010
- Strip new llvm.dbg.value intrinsic. · 211746a6
  Devang Patel authored Feb 10, 2010
```
llvm-svn: 95807
```
  211746a6
- Fix "the the" and similar typos. · 4a618827
  Dan Gohman authored Feb 10, 2010
```
llvm-svn: 95781
```
  4a618827
Feb 09, 2010
- Move Intrinsic::objectsize lowering back to InstCombineCalls and · 7b7028fd
  Eric Christopher authored Feb 09, 2010
```
enable constant 0 offset lowering.

llvm-svn: 95691
```
  7b7028fd
- Pull these back out, they're a little too aggressive and time · ad1aa862
  Eric Christopher authored Feb 09, 2010
```
consuming for a simple optimization.

llvm-svn: 95671
```
  ad1aa862
- simplify this code, duh. · f4c8d3ce
  Chris Lattner authored Feb 09, 2010
```
llvm-svn: 95643
```
  f4c8d3ce
- fix PR6193, only considering sign extensions *from i1* for this · 9b6a1789
  Chris Lattner authored Feb 09, 2010
```
xform.

llvm-svn: 95642
```
  9b6a1789
- Add file in here too. · be2f0b2b
  Eric Christopher authored Feb 09, 2010
```
llvm-svn: 95641
```
  be2f0b2b
- Add a new pass to do llvm.objsize lowering using SCEV. · 9f85e7eb
  Eric Christopher authored Feb 09, 2010
```
Initial skeleton and SCEVUnknown lowering implemented,
the rest should come relatively quickly.  Move testcase
to new directory.

Move pass to right before SimplifyLibCalls - which is
moved down a bit so we can take advantage of a few opts.

llvm-svn: 95628
```
  9f85e7eb
- fix some problems handling large vectors reported in PR6230 · b22423c8
  Chris Lattner authored Feb 08, 2010
```
llvm-svn: 95616
```
  b22423c8
Feb 06, 2010

Reintroduce the InlineHint function attribute. · 74bb06c0

Jakob Stoklund Olesen authored Feb 06, 2010

This time it's for real! I am going to hook this up in the frontends as well.

The inliner has some experimental heuristics for dealing with the inline hint.
When given a -respect-inlinehint option, functions marked with the inline
keyword are given a threshold just above the default for -O3.

We need some experiments to determine if that is the right thing to do.

llvm-svn: 95466

74bb06c0

Don't unroll loops containing function calls. · 5f9ead27
Jakob Stoklund Olesen authored Feb 05, 2010
```
llvm-svn: 95454
```
5f9ead27

Feb 05, 2010

Teach SimplifyCFG about magic pointer constants. · 916f48a0

Jakob Stoklund Olesen authored Feb 05, 2010

Weird code sometimes uses pointer constants other than null. This patch
teaches SimplifyCFG to build switch instructions in those cases.

Code like this:

void f(const char *x) {
  if (!x)
    puts("null");
  else if ((uintptr_t)x == 1)
    puts("one");
  else if (x == (char*)2 || x == (char*)3)
    puts("two");
  else if ((intptr_t)x == 4)
    puts("four");
  else
    puts(x);
}

Now becomes a switch:

define void @f(i8* %x) nounwind ssp {
entry:
  %magicptr23 = ptrtoint i8* %x to i64            ; <i64> [#uses=1]
  switch i64 %magicptr23, label %if.else16 [
    i64 0, label %if.then
    i64 1, label %if.then2
    i64 2, label %if.then9
    i64 3, label %if.then9
    i64 4, label %if.then14
  ]

Note that LLVM's own DenseMap uses magic pointers.

llvm-svn: 95439

916f48a0

fix logical-select to invoke filecheck right, and fix hte instcombine · 64ffd11d

Chris Lattner authored Feb 05, 2010

xform it is checking to actually pass.  There is no need to match
m_SelectCst<0, -1> since instcombine canonicalizes that into not(sext).

Add matches for sext(not(x)) in addition to not(sext(x)).

llvm-svn: 95420

64ffd11d

Implement releaseMemory in CodeGenPrepare and free the BackEdges · 4739e41c

Dan Gohman authored Feb 05, 2010

container data. This prevents it from holding onto dangling
pointers and potentially behaving unpredictably.

llvm-svn: 95409

4739e41c

Use a SmallSetVector instead of a SetVector; this code showed up as a · 8abb67df
Dan Gohman authored Feb 05, 2010
```
malloc caller in a profile.

llvm-svn: 95407
```
8abb67df
Remove this code for now. I have a better idea and will rewrite with · 04371b4f
Eric Christopher authored Feb 05, 2010
```
that in mind.

llvm-svn: 95402
```
04371b4f

Do not reassociate expressions with i1 type. SimplifyCFG converts some · 27dfb1e1

Bob Wilson authored Feb 04, 2010

short-circuited conditions to AND/OR expressions, and those expressions
are often converted back to a short-circuited form in code gen.  The
original source order may have been optimized to take advantage of the
expected values, and if we reassociate them, we change the order and
subvert that optimization.  Radar 7497329.

llvm-svn: 95333

27dfb1e1

Feb 04, 2010

Increase inliner thresholds by 25. · 113fb54b

Jakob Stoklund Olesen authored Feb 04, 2010

This makes the inliner about as agressive as it was before my changes to the
inliner cost calculations. These levels give the same performance and slightly
smaller code than before.

llvm-svn: 95320

113fb54b

Temporarily revert this since it appears to have caused a build · 107a1fbf
Eric Christopher authored Feb 04, 2010
```
failure.

llvm-svn: 95294
```
107a1fbf

Rework constant expr and array handling for objectsize instcombining. · 42fa84a8

Eric Christopher authored Feb 04, 2010

Fix bugs where we would compute out of bounds as in bounds, and where
we couldn't know that the linker could override the size of an array.

Add a few new testcases, change existing testcase to use a private
global array instead of extern.

llvm-svn: 95283

42fa84a8

If we're dealing with a zero-length array, don't lower to any · f12e18db
Eric Christopher authored Feb 03, 2010
```
particular size, we just don't know what the length is yet.

llvm-svn: 95266
```
f12e18db

Feb 03, 2010

Adjust the heuristics used to decide when SROA is likely to be profitable. · 04365c5f

Bob Wilson authored Feb 03, 2010

The SRThreshold value makes perfect sense for checking if an entire aggregate
should be promoted to a scalar integer, but it is not so good for splitting
an aggregate into its separate elements. A struct may contain a large embedded
array along with some scalar fields that would benefit from being split apart
by SROA. Even if the total aggregate size is large, it may still be good to
perform SROA. Thus, the most important piece of this patch is simply moving
the aggregate size comparison vs. SRThreshold so that it guards only the
aggregate promotion.

We have also been checking the number of elements to decide if an aggregate
should be split up. The limit of "SRThreshold/4" seemed rather arbitrary,
and I don't think it's very useful to derive this limit from SRThreshold
anyway. I've collected some data showing that the current default limit of
32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct
types. One thing suggested by the data is that distinguishing between structs
and arrays might be useful. There are (obviously) a lot more large arrays
than large structs (as measured by the number of elements and not the total
size -- a large array inside a struct still counts as a single element given
the way we do SROA right now). Out of 8377 arrays where we successfully
performed SROA while compiling a large set of benchmarks, only 16 of them had
more than 8 elements. And, for those 16 arrays, it's not at all clear that
SROA was actually beneficial. So, to offset the compile time cost of
investigating more large structs for SROA, the patch lowers the limit on array
elements to 8.

This fixes Apple Radar 7563690.

llvm-svn: 95224

04365c5f

Revert 94937 and move the noreturn check to codegen. · 27a41d54
Evan Cheng authored Feb 03, 2010
```
llvm-svn: 95198
```
27a41d54
Fix some comment typos. · 76e8c595
Bob Wilson authored Feb 03, 2010
```
llvm-svn: 95170
```
76e8c595