  1. Mar 10, 2010
      Try to keep the cached inliner costs around for a bit longer for big functions. · b495cad7
      Jakob Stoklund Olesen authored
      The caller's cost info would be reset every time a callee was inlined. If the
      caller has lots of calls and there is some mutual recursion going on, the
      caller's cost info could be recalculated many times.
      
      This patch reduces inliner runtime from 240s to 0.5s for a function with 20000
      small function calls.
      
      This is a more conservative version of r98089 that doesn't break the clang
      test CodeGenCXX/temp-order.cpp. That test relies on rather extreme inlining
      for constant folding.
      
      llvm-svn: 98099
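
      To make the caching idea above concrete, here is a minimal C++ sketch with
      hypothetical types and names (Function, InlineCostCache, growAfterInlining),
      not the actual LLVM sources of the time: instead of dropping the caller's
      cached cost after every inline, the cache grows it incrementally and only
      recomputes from scratch when an entry is missing.

      #include <map>

      // Illustrative sketch only -- hypothetical types, not the LLVM 2.7 code.
      struct Function { unsigned NumInsts = 0; };

      struct RegionInfo {
        int Cost = 0;          // accumulated inline cost for this function
        bool Valid = false;
      };

      class InlineCostCache {
        std::map<const Function *, RegionInfo> Cache;

        static int computeFromScratch(const Function *F) {
          return static_cast<int>(F->NumInsts) * 5;   // placeholder metric
        }

      public:
        int getCost(const Function *F) {
          RegionInfo &RI = Cache[F];
          if (!RI.Valid) {                            // expensive: O(size of F)
            RI.Cost = computeFromScratch(F);
            RI.Valid = true;
          }
          return RI.Cost;
        }

        // After inlining Callee into Caller, grow the caller's cached cost by
        // the callee's cost instead of invalidating it, so a caller with
        // thousands of calls is not re-analyzed from scratch after each one.
        void growAfterInlining(const Function *Caller, const Function *Callee) {
          int CalleeCost = getCost(Callee);
          RegionInfo &RI = Cache[Caller];
          if (RI.Valid)
            RI.Cost += CalleeCost;
        }

        // Entries are still dropped when a function is actually deleted.
        void reset(const Function *F) { Cache.erase(F); }
      };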
  2. Feb 13, 2010
      Enable the inlinehint attribute in the Inliner. · 492b8b42
      Jakob Stoklund Olesen authored
      Functions explicitly marked inline will get an inlining threshold slightly
      more aggressive than the default for -O3. This means that -O3 builds are
      mostly unaffected while -Os builds will be a bit bigger and faster.
      
      The difference depends entirely on how many 'inline's are sprinkled through
      the source.
      
      In the CINT2006 suite, only these tests are significantly affected under -Os:
      
                     Size   Time
      471.omnetpp   +1.63% -1.85%
      473.astar     +4.01% -6.02%
      483.xalancbmk +4.60%  0.00%
      
      Note that 483.xalancbmk runs too quickly to give useful timing results.
      
      llvm-svn: 96066
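
      A small C++ sketch of the threshold selection this describes; the names and
      constants are illustrative assumptions, not the values LLVM used at the time:

      struct CallSiteInfo {
        bool CalleeHasInlineHint;   // callee was marked 'inline' in the source
        bool OptimizeForSize;       // building with -Os
      };

      int selectInlineThreshold(const CallSiteInfo &CS, int O3Threshold) {
        // -Os normally uses a smaller threshold than -O3 ...
        int Threshold = CS.OptimizeForSize ? O3Threshold / 2 : O3Threshold;
        // ... but an explicit 'inline' keyword bumps the callee slightly above
        // the plain -O3 default, so -O3 builds are mostly unchanged while -Os
        // builds inline hinted functions more aggressively.
        if (CS.CalleeHasInlineHint)
          Threshold = O3Threshold + O3Threshold / 4;
        return Threshold;
      }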
  3. Feb 06, 2010
      Reintroduce the InlineHint function attribute. · 74bb06c0
      Jakob Stoklund Olesen authored
      This time it's for real! I am going to hook this up in the frontends as well.
      
      The inliner has some experimental heuristics for dealing with the inline hint.
      When given a -respect-inlinehint option, functions marked with the inline
      keyword are given a threshold just above the default for -O3.
      
      We need some experiments to determine if that is the right thing to do.
      
      llvm-svn: 95466
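
      A sketch of how such an experimental flag could gate the hint, using LLVM's
      cl::opt command-line facility; the wiring and the +25 bump are assumptions
      for illustration, and only the -respect-inlinehint name comes from the
      commit text:

      #include "llvm/Support/CommandLine.h"
      using namespace llvm;

      static cl::opt<bool> RespectInlineHint(
          "respect-inlinehint", cl::init(false),
          cl::desc("Give 'inline'-marked functions a higher inlining threshold"));

      static int thresholdFor(bool CalleeHasHint, int DefaultThreshold) {
        if (RespectInlineHint && CalleeHasHint)
          return DefaultThreshold + 25;   // "just above the default for -O3"
        return DefaultThreshold;
      }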
  4. Feb 04, 2010
      Increase inliner thresholds by 25. · 113fb54b
      Jakob Stoklund Olesen authored
      This makes the inliner about as aggressive as it was before my changes to the
      inliner cost calculations. These levels give the same performance and slightly
      smaller code than before.
      
      llvm-svn: 95320
  5. Nov 12, 2009
      use isInstructionTriviallyDead, as pointed out by Duncan · 5c89f4b4
      Chris Lattner authored
      llvm-svn: 87035
      implement a nice little efficiency hack in the inliner. Since we're now · eb9acbfb
      Chris Lattner authored
      running IPSCCP early, and we run functionattrs interlaced with the inliner,
      we often (particularly for small or noop functions) completely propagate
      all of the information about a call to its call site in IPSCCP (making a call
      dead), and functionattrs is smart enough to realize that the function is
      readonly (because it is interlaced with the inliner).
      
      To improve compile time and make the inliner threshold more accurate, realize
      that we don't have to inline dead readonly function calls.  Instead, just 
      delete the call.  This happens all the time for C++ code; here are some
      counters from opt/llvm-ld counting the number of times calls were deleted vs.
      inlined on various apps:
      
      Tramp3d opt:
        5033 inline                - Number of call sites deleted, not inlined
       24596 inline                - Number of functions inlined
      llvm-ld:
        667 inline           - Number of functions deleted because all callers found
        699 inline           - Number of functions inlined
      
      483.xalancbmk opt:
        8096 inline                - Number of call sites deleted, not inlined
       62528 inline                - Number of functions inlined
      llvm-ld:
         217 inline           - Number of allocas merged together
        2158 inline           - Number of functions inlined
      
      471.omnetpp:
        331 inline                - Number of call sites deleted, not inlined
       8981 inline                - Number of functions inlined
      llvm-ld:
        171 inline           - Number of functions deleted because all callers found
        629 inline           - Number of functions inlined
      
      
      Deleting a call is much faster than inlining it, and is insensitive to the
      size of the callee. :)
      
      llvm-svn: 86975
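
      A sketch of the dead-call check against the modern LLVM C++ API (the helper
      name is hypothetical and the exact predicates differ from the 2009 sources):
      if a call's result is unused and the callee has no side effects and cannot
      unwind, the call can simply be erased instead of paying to inline it.

      #include "llvm/IR/Function.h"
      #include "llvm/IR/Instructions.h"
      using namespace llvm;

      static bool deleteIfTriviallyDeadCall(CallInst *CI) {
        Function *Callee = CI->getCalledFunction();
        if (!Callee)
          return false;                     // indirect call, nothing to prove
        bool NoSideEffects =
            Callee->doesNotAccessMemory() || Callee->onlyReadsMemory();
        if (CI->use_empty() && NoSideEffects && Callee->doesNotThrow()) {
          CI->eraseFromParent();            // far cheaper than inlining it
          return true;
        }
        return false;
      }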
  6. Aug 31, 2009
      comment and simplify some code. · 9e507479
      Chris Lattner authored
      llvm-svn: 80540
      Fix PR4834, a tricky case where the inliner would resolve an · 081375bb
      Chris Lattner authored
      indirect function pointer, inline it, then go to delete the body.
      The problem is that the callgraph had other references to the function,
      though the inliner had no way to know it, so we got a dangling pointer
      and an invalid iterator out of the deal.
      
      The fix to this is pretty simple: stop the inliner from deleting the
      function by knowing that there are references to it.  Do this by making
      CallGraphNodes contain a refcount.  This requires moving deletion of 
      available_externally functions to the module-level cleanup sweep where
      it belongs.
      
      llvm-svn: 80533
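
      A simplified C++ sketch of the refcounting idea; the types here are
      hypothetical stand-ins, not the actual CallGraph classes:

      #include <cassert>

      struct Function { void deleteBody() {} };

      class CallGraphNode {
        Function *F;
        unsigned NumReferences = 0;   // how many edges/users still point at us
      public:
        explicit CallGraphNode(Function *Fn) : F(Fn) {}
        void addRef() { ++NumReferences; }
        void dropRef() { assert(NumReferences && "ref underflow"); --NumReferences; }
        unsigned getNumReferences() const { return NumReferences; }
        Function *getFunction() const { return F; }
      };

      // The inliner may only delete a now-dead callee when nothing else in the
      // call graph refers to it; otherwise the removal is deferred to the
      // module-level cleanup sweep that runs after all SCCs are processed.
      bool tryDeleteAfterInlining(CallGraphNode *CGN) {
        if (CGN->getNumReferences() != 0)
          return false;               // someone still points here: defer
        CGN->getFunction()->deleteBody();
        return true;
      }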
      Fix some nasty callgraph dangling pointer problems in · 305b115a
      Chris Lattner authored
      argpromotion and structretpromote.  Basically, when replacing
      a function, they used the 'changeFunction' API, which changes
      the entry in the function map (and steals/reuses the callgraph
      node).
      
      This has some interesting effects: first, the problem is that it doesn't
      update the "callee" edges in any callees of the function in the call graph.
      Second, this covers for a major problem in all the CGSCC pass stuff, which 
      is that it is completely broken when functions are deleted if they *don't*
      reuse a CGN.  (there is a cute little fixme about this though :).
      
      This patch changes the protocol that CGSCC passes must obey: now the CGSCC 
      pass manager copies the SCC and preincrements its iterator to avoid passes
      invalidating it.  This allows CGSCC passes to mutate the current SCC.  However
      multiple passes may be run on that SCC, so if passes do this, they are now
      required to *update* the SCC to be current when they return.
      
      Other less interesting parts of this patch are that it makes passes update
      the CG more directly, eliminates changeFunction, and requires clients of
      replaceCallSite to specify the new callee CGN if they are changing it.
      
      llvm-svn: 80527
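
      A minimal sketch of the iteration protocol described here, with simplified
      stand-in types; the real logic lives in the CGSCC pass manager:

      #include <list>
      #include <vector>

      struct CallGraphNode {};
      using SCC = std::vector<CallGraphNode *>;

      struct CGSCCPass {
        // A pass may mutate CurSCC, but must leave it describing the current SCC
        // when it returns, since later passes run on the same (updated) copy.
        virtual bool runOnSCC(SCC &CurSCC) = 0;
        virtual ~CGSCCPass() = default;
      };

      // The pass manager copies each SCC out of the iterator and advances the
      // iterator *before* running any passes, so a pass that deletes or replaces
      // nodes cannot invalidate the iterator it is standing on.
      void runAllSCCs(std::list<SCC> &SCCs, std::vector<CGSCCPass *> &Passes) {
        for (auto It = SCCs.begin(); It != SCCs.end(); /* advanced below */) {
          SCC CurSCC = *It;   // passes edit this copy, not the iterator's element
          ++It;               // preincrement: safe against graph mutation
          for (CGSCCPass *P : Passes)
            P->runOnSCC(CurSCC);
        }
      }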
  7. Aug 27, 2009
      Implement a new optimization in the inliner: if inlining multiple · d3374e8d
      Chris Lattner authored
      calls into a function and if the calls bring in arrays, try to merge
      them together to reduce stack size.  For example, in the testcase
      we'd previously end up with 4 allocas, now we end up with 2 allocas.
      
      As described in the comments, this is not really the ideal solution
      to this problem, but it is surprisingly effective.  For example, on
      176.gcc, we end up eliminating 67 arrays at "gccas" time and another
      24 at "llvm-ld" time.
      
      One concern that I didn't look into: at -O0 -g with
      forced inlining this will almost certainly result in worse debug
      info.  I think this is acceptable though given that this is a case
      of "debugging optimized code", and we don't want debug info to
      prevent the optimizer from doing things anyway.
      
      llvm-svn: 80215
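
      A heavily simplified C++ sketch of the array-merging idea, with hypothetical
      IR types; the point is only that arrays exposed by one inlined call site can
      be reused by the next, because two call sites in the same caller never run
      at the same time:

      #include <map>
      #include <vector>

      struct Type { bool IsArray = false; };
      struct AllocaInst { const Type *Ty; };

      class InlinedArrayAllocas {
        // Stack slots made available by previously inlined bodies, keyed by type.
        std::map<const Type *, std::vector<AllocaInst *>> Available;

      public:
        AllocaInst *getOrCreate(const Type *Ty) {
          if (!Ty->IsArray)
            return new AllocaInst{Ty};      // only arrays are worth merging here
          auto &Bucket = Available[Ty];
          if (!Bucket.empty()) {
            AllocaInst *Reused = Bucket.back();
            Bucket.pop_back();              // reuse an existing stack slot
            return Reused;
          }
          return new AllocaInst{Ty};
        }

        // After one inlined body is processed, its arrays become candidates for
        // the next call site inlined into the same caller.
        void release(AllocaInst *AI) { Available[AI->Ty].push_back(AI); }
      };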
      reduce header #include'age · b9d0a961
      Chris Lattner authored
      llvm-svn: 80204
      reduce indentation, factor some stuff out to a static helper function, · 5eef6ad6
      Chris Lattner authored
      and other code cleanups.  No functionality change.
      
      llvm-svn: 80199
  8. Jul 25, 2009
      More migration to raw_ostream, the water has dried up around the iostream hole. · 0dd5e1ed
      Daniel Dunbar authored
       - Some clients which used DOUT have moved to DEBUG. We are deprecating the
         "magic" DOUT behavior which avoided calling printing functions when the
         statement was disabled. In addition to being unnecessary magic, it had the
         downside of leaving code in -Asserts builds, and of hiding potentially
         unnecessary computations.
      
      llvm-svn: 77019
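
      A short C++ sketch of why wrapping the whole statement in a macro beats a
      "magic" stream object; the macro and function names are illustrative, not
      the exact LLVM macros of the time:

      #include <iostream>

      #ifndef NDEBUG
      #define MY_DEBUG(X) do { X; } while (false)  // statement kept in +Asserts
      #else
      #define MY_DEBUG(X) do { } while (false)     // whole statement compiled out
      #endif

      static int expensiveSummary() { return 42; } // stands in for real analysis

      int main() {
        // With a DOUT-style no-op stream, the operands -- including the call to
        // expensiveSummary() -- would still be evaluated in release builds.
        // Wrapping the entire statement makes the call vanish from -Asserts
        // (NDEBUG) builds along with the printing itself.
        MY_DEBUG(std::cerr << "summary: " << expensiveSummary() << "\n");
        return 0;
      }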
  9. Mar 19, 2009
      Clear the cached cost when removing a function in · 2050968d
      Dale Johannesen authored
      the inliner; prevents nondeterministic behavior
      when the same address is reallocated.
      Don't build call graph nodes for debug intrinsic calls;
      they're useless, and there were typically a lot of them.
      
      llvm-svn: 67311
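
      A minimal C++ sketch of why the cached cost must be erased when a function
      is removed; the types and names here are hypothetical, not the 2009 code:

      #include <map>

      struct Function {};
      struct FunctionInfo { int Cost = 0; bool Populated = false; };

      class InlineCostAnalyzer {
        // Keyed by pointer: if an entry outlives its Function, a *new* function
        // later allocated at the same address would inherit the stale cost,
        // making inlining decisions depend on heap allocation order.
        std::map<const Function *, FunctionInfo> CachedFunctionInfo;

      public:
        FunctionInfo &get(const Function *F) { return CachedFunctionInfo[F]; }

        // Must be called whenever a function is deleted (e.g. after the inliner
        // removes a dead callee) so a recycled address starts from scratch.
        void resetCachedCostInfo(const Function *F) {
          CachedFunctionInfo.erase(F);
        }
      };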