Commits · 525ce253fed71917c612d62ae9f161718e023a2d · Lorenzo Albano / LLVM bpEVL

Aug 10, 2016

Changed sign of LastCallToStaticBonus · d89875ca

Piotr Padlewski authored Aug 10, 2016

Summary:
I think it is much better this way.
When I first saw the line:
  Cost += InlineConstants::LastCallToStaticBonus;
I thought it was a bug, because everywhere else the cost is reduced
using -=.
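
The readability point can be sketched as follows. This is a minimal illustration, not the LLVM source: the namespace name, the helper function, and the value 15000 are assumptions chosen for the example.

```cpp
// Hypothetical stand-in for the constant discussed above. The value is
// illustrative only; what matters is that it is stored as a positive magnitude.
namespace InlineConstantsSketch {
constexpr int LastCallToStaticBonus = 15000;
} // namespace InlineConstantsSketch

// Applies the bonus the same way every other cost reduction is written:
// with -= and a positive constant, so the intent is visible at the use site.
int applyLastCallToStaticBonus(int Cost) {
  Cost -= InlineConstantsSketch::LastCallToStaticBonus;
  return Cost;
}
```

With a negative constant and `+=`, the arithmetic is identical, but a reader has to look up the sign of the constant to see that the cost goes down.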

Reviewers: eraman, tejohnson, mehdi_amini

Subscribers: llvm-commits, mehdi_amini

Differential Revision: https://reviews.llvm.org/D23222

llvm-svn: 278290

d89875ca

[Inliner,OptDiag] Add hotness attribute to opt diagnostics · 896c09bd

Adam Nemet authored Aug 10, 2016

Summary:
The inliner not being a function pass requires the work-around of
generating the OptimizationRemarkEmitter and in turn BFI on demand.
This will go away after the new PM is ready.

BFI is only computed inside ORE if the user has requested hotness
information for optimization diagnostics (-pass-remarks-with-hotness at
the 'opt' level).  Thus there is no additional overhead without the
flag.

Reviewers: hfinkel, davidxl, eraman

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D22694

llvm-svn: 278185

896c09bd

Aug 06, 2016
- [Inliner] Use function_ref for functors which are never taken ownership of. · 41e66dad
  Benjamin Kramer authored Aug 06, 2016
```
llvm-svn: 277922
```
  41e66dad
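
The idea of a non-owning callable reference can be sketched as below. This is a simplified, hypothetical re-creation written for illustration; `FunctionRefSketch` and `countIf` are made-up names, and the real `llvm::function_ref` differs in detail.

```cpp
#include <cstdint>
#include <type_traits>
#include <utility>

template <typename Fn> class FunctionRefSketch;

// A non-owning, cheap-to-copy reference to any callable with a matching
// signature. It stores only a pointer to the callable and a thunk that knows
// the callable's real type, so it never allocates.
template <typename Ret, typename... Params>
class FunctionRefSketch<Ret(Params...)> {
  Ret (*Callback)(std::intptr_t, Params...) = nullptr;
  std::intptr_t Callable = 0;

  template <typename Callee>
  static Ret callbackFn(std::intptr_t C, Params... Ps) {
    return (*reinterpret_cast<Callee *>(C))(std::forward<Params>(Ps)...);
  }

public:
  // Excluded for FunctionRefSketch itself so that plain copies stay copies.
  template <typename Callee,
            typename = std::enable_if_t<!std::is_same<
                std::remove_cv_t<std::remove_reference_t<Callee>>,
                FunctionRefSketch>::value>>
  FunctionRefSketch(Callee &&C)
      : Callback(callbackFn<std::remove_reference_t<Callee>>),
        Callable(reinterpret_cast<std::intptr_t>(&C)) {}

  Ret operator()(Params... Ps) const {
    return Callback(Callable, std::forward<Params>(Ps)...);
  }
};

// The predicate is only used during the call and never stored, which is the
// situation the commit title describes.
int countIf(const int *Begin, const int *End,
            FunctionRefSketch<bool(int)> Pred) {
  int N = 0;
  for (const int *I = Begin; I != End; ++I)
    if (Pred(*I))
      ++N;
  return N;
}
```

Because the wrapper does not own the callable, it must not outlive it; passing a lambda directly as a function argument, as `countIf` expects, is the safe pattern.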
Aug 03, 2016
- [Inliner] clang-format various parts of the inliner prior to changes · 8562d3a5
  Chandler Carruth authored Aug 03, 2016
```
here. NFC.

llvm-svn: 277557
```
  8562d3a5
Jul 29, 2016

Added ThinLTO inlining statistics · 84abc74f

Piotr Padlewski authored Jul 29, 2016

Summary:
copypasta doc of ImportedFunctionsInliningStatistics class
 \brief Calculate and dump ThinLTO specific inliner stats.
 The main statistics are:
 (1) Number of inlined imported functions,
 (2) Number of imported functions inlined into importing module (indirect),
 (3) Number of non imported functions inlined into importing module
 (indirect).
 The difference between the first and the second is that the first stat counts
 all performed inlines of imported functions, while the second counts only the
 functions that were eventually inlined into a function in the importing
 module (by a chain of inlines). Because LLVM uses a bottom-up inliner, it is
 possible to e.g. import functions `A` and `B`, then inline `B` into `A`,
 after which `A` might be too big to be inlined into some other function
 that calls it. The statistic is calculated by building a graph in which
 the nodes are functions and the edges are performed inlines, and then marking
 the edges starting from non-imported functions.

 If `Verbose` is set to true, then it also dumps statistics
 for each inlined function, sorted by the greatest inline count, such as
 - number of performed inlines
 - number of performed inlines to importing module
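
The graph-based calculation described above could look roughly like this. It is a sketch under stated assumptions: `InlineGraphSketch` and its members are hypothetical names, functions are identified by plain strings, and the statistics are returned as sets of function names rather than printed.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

struct InlineGraphSketch {
  // Edge Caller -> Callee means "Callee was inlined into Caller".
  std::map<std::string, std::vector<std::string>> InlinedCallees;
  // Functions that were imported from another module.
  std::set<std::string> Imported;

  void recordInline(const std::string &Caller, const std::string &Callee) {
    InlinedCallees[Caller].push_back(Callee);
  }

  // Stat (1): imported functions that were inlined anywhere at all.
  std::set<std::string> importedInlinedAnywhere() const {
    std::set<std::string> Result;
    for (const auto &Entry : InlinedCallees)
      for (const std::string &Callee : Entry.second)
        if (Imported.count(Callee))
          Result.insert(Callee);
    return Result;
  }

  // Stat (2): imported functions reachable through a chain of inlines that
  // starts at a function which was not imported.
  std::set<std::string> importedInlinedIntoImportingModule() const {
    std::set<std::string> Visited, Result;
    std::vector<std::string> Worklist;
    for (const auto &Entry : InlinedCallees)
      if (!Imported.count(Entry.first))
        Worklist.push_back(Entry.first);
    while (!Worklist.empty()) {
      std::string F = Worklist.back();
      Worklist.pop_back();
      if (!Visited.insert(F).second)
        continue; // already walked
      auto It = InlinedCallees.find(F);
      if (It == InlinedCallees.end())
        continue; // nothing was inlined into F
      for (const std::string &Callee : It->second) {
        if (Imported.count(Callee))
          Result.insert(Callee);
        Worklist.push_back(Callee);
      }
    }
    return Result;
  }
};
```

In the `A`/`B` scenario from the text, recording only the inline of `B` into `A` puts `B` in the first set but not in the second, because no non-imported function reaches it.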

Reviewers: eraman, tejohnson, mehdi_amini

Subscribers: mehdi_amini, llvm-commits

Differential Revision: https://reviews.llvm.org/D22491

llvm-svn: 277089

84abc74f

Jul 23, 2016

Avoid using a raw AssumptionCacheTracker in various inliner functions. · ab6a6837

Sean Silva authored Jul 23, 2016

This unblocks the new PM part of River's patch in
https://reviews.llvm.org/D22706

Conveniently, this same change was needed for D21921 and so these
changes are just spun out from there.

llvm-svn: 276515

ab6a6837

Jun 10, 2016

Use ProfileSummaryInfo in inline cost analysis. · 71069cf6

Easwaran Raman authored Jun 09, 2016

Instead of directly using MaxFunctionCount and function entry count to determine callee hotness, use the isHotFunction/isColdFunction methods provided by ProfileSummaryInfo.

Differential revision: http://reviews.llvm.org/D21045

llvm-svn: 272321

71069cf6

May 23, 2016
- Avoid including AlwaysInliner pass in opt-bisect search. · 9c81d0fd
  Andrew Kaylor authored May 23, 2016
```
Differential Revision: http://reviews.llvm.org/D19640

llvm-svn: 270495
```
  9c81d0fd
Apr 30, 2016
- Reapply r268107 after fixing a bug that breaks the debug build. · 4b2fdcca
  Xinliang David Li authored Apr 29, 2016
```
The new method now sets the data needed by the debug dump.

llvm-svn: 268130
```
  4b2fdcca
Apr 29, 2016
- Revert r268107 -- debug build failure · 0552521b
  Xinliang David Li authored Apr 29, 2016
```
llvm-svn: 268116
```
  0552521b
- [inliner]: Refactor inline deferring logic into its own method /NFC · 1ffa28a3
  Xinliang David Li authored Apr 29, 2016
```
The implemented heuristic has a large body of code, which is more readable
in its own function. This also makes it easier to add more
heuristics in the future.

llvm-svn: 268107
```
  1ffa28a3
Apr 23, 2016

Re-commit optimization bisect support (r267022) without new pass manager support. · aa641a51

Andrew Kaylor authored Apr 22, 2016

The original commit was reverted because of a buildbot problem with LazyCallGraph::SCC handling (not related to the OptBisect handling).

Differential Revision: http://reviews.llvm.org/D19172

llvm-svn: 267231

aa641a51

Apr 22, 2016

Revert "Initial implementation of optimization bisect support." · 6013f45f

Vedant Kumar authored Apr 22, 2016

This reverts commit r267022, due to an ASan failure:

  http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan_check/1549

llvm-svn: 267115

6013f45f

Apr 21, 2016

Initial implementation of optimization bisect support. · f0f27929

Andrew Kaylor authored Apr 21, 2016

This patch implements an optimization bisect feature, which will allow optimizations to be selectively disabled at compile time in order to track down test failures that are caused by incorrect optimizations.

The bisection is enabled using a new command line option (-opt-bisect-limit). Individual passes that may be skipped call the OptBisect object (via an LLVMContext) to see if they should be skipped based on the bisect limit. A finer level of control (disabling individual transformations) can be managed through an additional OptBisect method, but this is not yet used.

The skip checking in this implementation is based on (and replaces) the skipOptnoneFunction check. Where that check was being called, a new call has been inserted in its place which checks the bisect limit and the optnone attribute. A new function call has been added for module and SCC passes that behaves in a similar way.
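
The counting scheme can be sketched as follows. This is a simplified model, not the LLVM class: `OptBisectSketch`, `shouldRun`, and the use of the maximum `int` as "no limit" are assumptions made for the example.

```cpp
#include <limits>

// Every skippable pass invocation asks the bisect object whether it should
// run. Invocations are numbered in order, and those numbered above the limit
// are skipped. The same query also honours the optnone attribute, mirroring
// the combined check described above.
class OptBisectSketch {
  int Limit;
  int LastChecked = 0;

public:
  explicit OptBisectSketch(int Limit = std::numeric_limits<int>::max())
      : Limit(Limit) {}

  bool shouldRun(bool HasOptnone) {
    bool WithinLimit = ++LastChecked <= Limit;
    return WithinLimit && !HasOptnone;
  }
};
```

Lowering the limit step by step and re-running a failing test narrows the failure down to the first pass invocation whose removal makes the test pass.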

Differential Revision: http://reviews.llvm.org/D19172

llvm-svn: 267022

f0f27929

Apr 18, 2016

[NFC] Header cleanup · b550cb17

Mehdi Amini authored Apr 18, 2016

Removed some unused headers, replaced some headers with forward class declarations.

Found using simple scripts like this one:
clear && ack --cpp -l '#include "llvm/ADT/IndexedMap.h"' | xargs grep -L 'IndexedMap[<]' | xargs grep -n --color=auto 'IndexedMap'

Patch by Eugene Kosov <claprix@yandex.ru>

Differential Revision: http://reviews.llvm.org/D19219

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 266595

b550cb17

Mar 08, 2016
- Revert revisions 262636, 262643, 262679, and 262682. · b1bd398c
  Easwaran Raman authored Mar 08, 2016
```
llvm-svn: 262883
```
  b1bd398c
Mar 04, 2016
- Fix a use-after-free bug introduced in r262636 · 3b7a8246
  Easwaran Raman authored Mar 04, 2016
```
llvm-svn: 262679
```
  3b7a8246
Mar 03, 2016

Infrastructure for PGO enhancements in inliner · 3035719c

Easwaran Raman authored Mar 03, 2016

This patch provides the following infrastructure for PGO enhancements in inliner:

- Enable the use of block level profile information in inliner
- Incremental update of block frequency information during inlining
- Update the function entry counts of callees when they get inlined into callers.
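
One plausible reading of the entry-count update is sketched below. The struct, the function name, and the clamping at zero are assumptions for illustration; the patch itself may compute the update differently.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical profile record holding only the function entry count.
struct FunctionProfileSketch {
  std::uint64_t EntryCount;
};

// After a call site with profile count CallSiteCount is inlined, those
// executions no longer enter the out-of-line callee, so its entry count is
// reduced by that amount (never going below zero).
void updateCalleeEntryCount(FunctionProfileSketch &Callee,
                            std::uint64_t CallSiteCount) {
  Callee.EntryCount -= std::min(Callee.EntryCount, CallSiteCount);
}
```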

Differential Revision: http://reviews.llvm.org/D16381

llvm-svn: 262636

3035719c

Mar 02, 2016

[AA] Hoist the logic to reformulate various AA queries in terms of other · 12884f7f

Chandler Carruth authored Mar 02, 2016

parts of the AA interface out of the base class of every single AA
result object.

Because this logic reformulates the query in terms of some other aspect
of the API, it would easily cause O(n^2) query patterns in alias
analysis. These could in turn be magnified further based on the number
of call arguments, and then further based on the number of AA queries
made for a particular call. This ended up causing problems for Rust that
were actually noticeable enough to get a bug (PR26564) and probably other
places as well.

When originally re-working the AA infrastructure, the desire was to
regularize the pattern of refinement without losing any generality.
While I think it was successful, that is clearly proving to be too
costly. And the cost is needless: we gain no actual improvement for this
generality of making a direct query to tbaa actually be able to
re-use some other alias analysis's refinement logic for one of the other
APIs, or some such. In short, this is entirely wasted work.

To the extent possible, delegation to other API surfaces should be done
at the aggregation layer so that we can avoid re-walking the
aggregation. In fact, this significantly simplifies the logic as we no
longer need to smuggle the aggregation layer into each alias analysis
(or the TargetLibraryInfo into each alias analysis just so we can form
argument memory locations!).

However, we also have some delegation logic inside of BasicAA and some
of it even makes sense. When the delegation logic is baking in specific
knowledge of aliasing properties of the LLVM IR, as opposed to simply
reformulating the query to utilize a different alias analysis interface
entry point, it makes a lot of sense to restrict that logic to
a different layer such as BasicAA. So one aspect of the delegation that
was in every AA base class is that when we don't have operand bundles,
we re-use function AA results as a fallback for callsite alias results.
This relies on the IR properties of calls and functions w.r.t. aliasing,
and so seems a better fit to BasicAA. I've lifted the logic up to that
point where it seems to be a natural fit. This still does a bit of
redundant work (we query function attributes twice, once via the
callsite and once via the function AA query) but it is *exactly* twice
here, no more.

The end result is that all of the delegation logic is hoisted out of the
base class and into either the aggregation layer when it is a pure
retargeting to a different API surface, or into BasicAA when it relies
on the IR's aliasing properties. This should fix the quadratic query
pattern reported in PR26564, although I don't have a stand-alone test
case to reproduce it.

It also seems general goodness. Now the numerous AAs that don't need
target library info don't carry it around and depend on it. I think
I can even rip out the general access to the aggregation layer and only
expose that in BasicAA as it is the only place where we re-query in that
manner.

However, this is a non-trivial change to the AA infrastructure so I want
to get some additional eyes on this before it lands. Sadly, it can't
wait long because we should really cherry pick this into 3.8 if we're
going to go this route.

Differential Revision: http://reviews.llvm.org/D17329

llvm-svn: 262490

12884f7f

Feb 09, 2016

Add an "addUsedAAAnalyses" helper function · 1c481f50

Sanjoy Das authored Feb 09, 2016

Summary:
Passes that call `getAnalysisIfAvailable<T>` also need to call
`addUsedIfAvailable<T>` in `getAnalysisUsage` to indicate to the
legacy pass manager that it uses `T`.  This contract was being
violated by passes that used `createLegacyPMAAResults`.  This change
fixes this by exposing a helper in AliasAnalysis.h,
`addUsedAAAnalyses`, that is complementary to createLegacyPMAAResults
and does the right thing when called from `getAnalysisUsage`.

Reviewers: chandlerc

Subscribers: mcrosier, llvm-commits

Differential Revision: http://reviews.llvm.org/D17010

llvm-svn: 260183

1c481f50

Jan 15, 2016
- Refactor threshold computation for inline cost analysis · f4bb2f0d
  Easwaran Raman authored Jan 14, 2016
```
Differential Revision: http://reviews.llvm.org/D15401

llvm-svn: 257832
```
  f4bb2f0d
Dec 28, 2015

Refactor inline costs analysis by removing the InlineCostAnalysis class · b9f7120e

Easwaran Raman authored Dec 28, 2015

InlineCostAnalysis is an analysis pass without any need for it to be one.
Once it stops being an analysis pass, it doesn't maintain any useful state
and the member functions inside can be made free functions. NFC.

Differential Revision: http://reviews.llvm.org/D15701

llvm-svn: 256521

b9f7120e

Dec 23, 2015

Provide a way to specify inliner's attribute compatibility and merging. · 1cb242eb

Akira Hatanaka authored Dec 22, 2015

This reapplies r256277 with two changes:

- In emitFnAttrCompatCheck, change FuncName's type to std::string to fix
  a use-after-free bug.
- Remove an unnecessary install-local target in lib/IR/Makefile. 

Original commit message for r252949:

Provide a way to specify inliner's attribute compatibility and merging
rules using table-gen. NFC.

This commit adds new classes CompatRule and MergeRule to Attributes.td,
which are used to generate code to check attribute compatibility and
merge attributes of the caller and callee.

rdar://problem/19836465

llvm-svn: 256304

1cb242eb

Dec 22, 2015

Revert r256277 and r256279. · 9c05cc56
Akira Hatanaka authored Dec 22, 2015
```
Some of the bots failed again.

llvm-svn: 256280
```
9c05cc56

Provide a way to specify inliner's attribute compatibility and merging. · a61deb24

Akira Hatanaka authored Dec 22, 2015

This reapplies r252990 and r252949. I've added member function getKind
to the Attr classes which returns the enum or string of the attribute.

Original commit message for r252949:

Provide a way to specify inliner's attribute compatibility and merging
rules using table-gen. NFC.

This commit adds new classes CompatRule and MergeRule to Attributes.td,
which are used to generate code to check attribute compatibility and
merge attributes of the caller and callee.

rdar://problem/19836465

llvm-svn: 256277

a61deb24

Determine callee's hotness and adjust threshold based on that. NFC. · bdb6f1dc

Easwaran Raman authored Dec 22, 2015

This uses the same criteria used in CFE's CodeGenPGO to identify hot and cold
callees and uses values of inlinehint-threshold and inlinecold-threshold
respectively as the thresholds for such callees.
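
A minimal sketch of hotness-based threshold selection follows. The cutoff fractions and threshold values are caller-supplied parameters here, and every name is hypothetical; the actual criteria and defaults are not taken from the sources.

```cpp
#include <cstdint>

enum class Hotness { Cold, Normal, Hot };

// Classifies a callee by comparing its entry count with fractions of the
// maximum function count. HotFraction and ColdFraction are assumptions
// supplied by the caller.
Hotness classifyCallee(std::uint64_t EntryCount, std::uint64_t MaxCount,
                       double HotFraction, double ColdFraction) {
  if (EntryCount >= static_cast<std::uint64_t>(HotFraction * MaxCount))
    return Hotness::Hot;
  if (EntryCount <= static_cast<std::uint64_t>(ColdFraction * MaxCount))
    return Hotness::Cold;
  return Hotness::Normal;
}

// Picks the inline threshold that corresponds to the callee's hotness.
int thresholdFor(Hotness H, int DefaultThreshold, int InlineHintThreshold,
                 int InlineColdThreshold) {
  switch (H) {
  case Hotness::Hot:
    return InlineHintThreshold;
  case Hotness::Cold:
    return InlineColdThreshold;
  case Hotness::Normal:
    break;
  }
  return DefaultThreshold;
}
```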

Differential Revision: http://reviews.llvm.org/D15245

llvm-svn: 256222

bdb6f1dc

Nov 13, 2015

Revert r252990. · 5af7ace4
Akira Hatanaka authored Nov 13, 2015
```
Some of the buildbots are still failing.

llvm-svn: 252999
```
5af7ace4

Provide a way to specify inliner's attribute compatibility and merging. · c7dfb76f

Akira Hatanaka authored Nov 13, 2015

This reapplies r252949. I've changed the type of FuncName to be
std::string instead of StringRef in emitFnAttrCompatCheck.
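
The class of bug behind this change can be sketched as below. The helpers `makeCheckName` and `emitCheckDeclaration` are hypothetical and do not reproduce the body of emitFnAttrCompatCheck; they only show why an owning string is the safe choice when the name comes from a temporary.

```cpp
#include <string>

// Returns a freshly built string by value, i.e. a temporary at the call site.
std::string makeCheckName(const std::string &Base) { return Base + "Compat"; }

std::string emitCheckDeclaration(const std::string &Base) {
  // Unsafe variant, shown only as a comment: binding a non-owning string
  // reference (such as a StringRef) to the temporary returned by
  // makeCheckName() leaves it pointing at freed memory once the temporary is
  // destroyed at the end of the statement.
  //
  // Safe variant: std::string owns its characters for the rest of the scope.
  std::string FuncName = makeCheckName(Base);
  return "bool " + FuncName + "();";
}
```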

Original commit message for r252949:

Provide a way to specify inliner's attribute compatibility and merging
rules using table-gen. NFC.

This commit adds new classes CompatRule and MergeRule to Attributes.td,
which are used to generate code to check attribute compatibility and
merge attributes of the caller and callee.

rdar://problem/19836465

llvm-svn: 252990

c7dfb76f

Nov 12, 2015

Revert r252949. · f3aa82f6

Akira Hatanaka authored Nov 12, 2015

It broke some of the bots including clang-x64-ninja-win7.

llvm-svn: 252951

f3aa82f6

Provide a way to specify inliner's attribute compatibility and merging · 61b81a56

Akira Hatanaka authored Nov 12, 2015

rules using table-gen. NFC.

This commit adds new classes CompatRule and MergeRule to Attributes.td,
which are used to generate code to check attribute compatibility and
merge attributes of the caller and callee.

rdar://problem/19836465

llvm-svn: 252949

61b81a56

Sep 29, 2015

Move dbg.declare intrinsics when merging and replacing allocas. · d8b86f7c

Evgeniy Stepanov authored Sep 29, 2015

Place new and update dbg.declare calls immediately after the
corresponding alloca.

Current code in replaceDbgDeclareForAlloca puts the new dbg.declare
at the end of the basic block. LLVM codegen has problems emitting
debug info in a situation when dbg.declare appears after all uses of
the variable. This usually kinda works for inlining and ASan (two
users of this function) but not for SafeStack (see the pending change
in http://reviews.llvm.org/D13178).

llvm-svn: 248769

d8b86f7c

Sep 09, 2015

[PM/AA] Rebuild LLVM's alias analysis infrastructure in a way compatible · 7b560d40

Chandler Carruth authored Sep 09, 2015

with the new pass manager, and no longer relying on analysis groups.

This builds essentially a ground-up new AA infrastructure stack for
LLVM. The core ideas are the same that are used throughout the new pass
manager: type erased polymorphism and direct composition. The design is
as follows:

- FunctionAAResults is a type-erasing alias analysis results aggregation
  interface to walk a single query across a range of results from
  different alias analyses. Currently this is function-specific as we
  always assume that aliasing queries are *within* a function.

- AAResultBase is a CRTP utility providing stub implementations of
  various parts of the alias analysis result concept, notably in several
  cases in terms of other more general parts of the interface. This can
  be used to implement only a narrow part of the interface rather than
  the entire interface. This isn't really ideal, this logic should be
  hoisted into FunctionAAResults as currently it will cause
  a significant amount of redundant work, but it faithfully models the
  behavior of the prior infrastructure.

- All the alias analysis passes are ported to be wrapper passes for the
  legacy PM and new-style analysis passes for the new PM with a shared
  result object. In some cases (most notably CFL), this is an extremely
  naive approach that we should revisit when we can specialize for the
  new pass manager.

- BasicAA has been restructured to reflect that it is much more
  fundamentally a function analysis because it uses dominator trees and
  loop info that need to be constructed for each function.
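
The combination of type-erased polymorphism and direct composition listed above can be sketched like this. It is a toy model, not the LLVM interface: all names carry a `Sketch` suffix or are invented, memory locations are plain integers, and the two analyses implement deliberately trivial rules.

```cpp
#include <memory>
#include <vector>

enum class AliasResultSketch { NoAlias, MayAlias, MustAlias };

// Aggregation layer: holds any number of analysis results behind a
// type-erased interface and walks a single query across them.
class AAResultsSketch {
  struct Concept {
    virtual ~Concept() = default;
    virtual AliasResultSketch alias(int LocA, int LocB) = 0;
  };

  // Wraps a concrete result type; the analyses need no common base class.
  template <typename AAResultT> struct Model final : Concept {
    AAResultT &Result;
    explicit Model(AAResultT &R) : Result(R) {}
    AliasResultSketch alias(int LocA, int LocB) override {
      return Result.alias(LocA, LocB);
    }
  };

  std::vector<std::unique_ptr<Concept>> AAs;

public:
  template <typename AAResultT> void addAAResult(AAResultT &R) {
    AAs.push_back(std::make_unique<Model<AAResultT>>(R));
  }

  // The first analysis with a definite answer wins; otherwise MayAlias.
  AliasResultSketch alias(int LocA, int LocB) {
    for (auto &AA : AAs) {
      AliasResultSketch R = AA->alias(LocA, LocB);
      if (R != AliasResultSketch::MayAlias)
        return R;
    }
    return AliasResultSketch::MayAlias;
  }
};

// Toy analysis: identical locations must alias.
struct SameLocationAA {
  AliasResultSketch alias(int LocA, int LocB) {
    return LocA == LocB ? AliasResultSketch::MustAlias
                        : AliasResultSketch::MayAlias;
  }
};

// Toy analysis: pretends locations of different parity never alias.
struct ParityAA {
  AliasResultSketch alias(int LocA, int LocB) {
    return (LocA % 2) != (LocB % 2) ? AliasResultSketch::NoAlias
                                    : AliasResultSketch::MayAlias;
  }
};
```

The aggregation object stores references, so each analysis result has to outlive it, which matches the idea of a wrapper pass collecting the results that happen to be available.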

All of the references to getting alias analysis results have been
updated to use the new aggregation interface. All the preservation and
other pass management code has been updated accordingly.

The way the FunctionAAResultsWrapperPass works is to detect the
available alias analyses when run, and add them to the results object.
This means that we should be able to continue to respect when various
passes are added to the pipeline, for example adding CFL or adding TBAA
passes should just cause their results to be available and to get folded
into this. The exception to this rule is BasicAA which really needs to
be a function pass due to using dominator trees and loop info. As
a consequence, the FunctionAAResultsWrapperPass directly depends on
BasicAA and always includes it in the aggregation.

This has significant implications for preserving analyses. Generally,
most passes shouldn't bother preserving FunctionAAResultsWrapperPass
because rebuilding the results just updates the set of known AA passes.
The exception to this rule are LoopPass instances which need to preserve
all the function analyses that the loop pass manager will end up
needing. This means preserving both BasicAAWrapperPass and the
aggregating FunctionAAResultsWrapperPass.

Now, when preserving an alias analysis, you do so by directly preserving
that analysis. This is only necessary for non-immutable-pass-provided
alias analyses though, and there are only three of interest: BasicAA,
GlobalsAA (formerly GlobalsModRef), and SCEVAA. Usually BasicAA is
preserved when needed because it (like DominatorTree and LoopInfo) is
marked as a CFG-only pass. I've expanded GlobalsAA into the preserved
set everywhere we previously were preserving all of AliasAnalysis, and
I've added SCEVAA in the intersection of that with where we preserve
SCEV itself.

One significant challenge to all of this is that the CGSCC passes were
actually using the alias analysis implementations by taking advantage of
a pretty amazing set of loopholes in the old pass manager's analysis
management code which allowed analysis groups to slide through in many
cases. Moving away from analysis groups makes this problem much more
obvious. To fix it, I've leveraged the flexibility the design of the new
PM components provides to just directly construct the relevant alias
analyses for the relevant functions in the IPO passes that need them.
This is a bit hacky, but should go away with the new pass manager, and
is already in many ways cleaner than the prior state.

Another significant challenge is that various facilities of the old
alias analysis infrastructure just don't fit any more. The most
significant of these is the alias analysis 'counter' pass. That pass
relied on the ability to snoop on AA queries at different points in the
analysis group chain. Instead, I'm planning to build printing
functionality directly into the aggregation layer. I've not included
that in this patch merely to keep it smaller.

Note that all of this needs a nearly complete rewrite of the AA
documentation. I'm planning to do that, but I'd like to make sure the
new design settles, and to flesh out a bit more of what it looks like in
the new pass manager first.

Differential Revision: http://reviews.llvm.org/D12080

llvm-svn: 247167

7b560d40

Aug 11, 2015
- Variable names should start with an upper case letter; NFC · 278004be
  Sanjay Patel authored Aug 11, 2015
```
llvm-svn: 244618
```
  278004be
Aug 05, 2015

-Wdeprecated cleanup: Make CallGraph movable by default by using unique_ptr... · a5d7de9f

David Blaikie authored Aug 05, 2015

-Wdeprecated cleanup: Make CallGraph movable by default by using unique_ptr members rather than raw pointers.

The only place that tries to return a CallGraph by value
(CallGraphAnalysis::run) doesn't seem to be used right now, but it's a
reasonable bit of cleanup anyway.
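
The effect of owning members through `unique_ptr` can be sketched as follows. `GraphSketch` and `NodeSketch` are hypothetical stand-ins, not the CallGraph classes.

```cpp
#include <cstddef>
#include <map>
#include <memory>
#include <string>

struct NodeSketch {
  std::string Name;
};

// Nodes are owned through unique_ptr, so no hand-written destructor is
// needed and the compiler-generated move operations do the right thing.
class GraphSketch {
  std::map<std::string, std::unique_ptr<NodeSketch>> Nodes;

public:
  NodeSketch *getOrInsert(const std::string &Name) {
    std::unique_ptr<NodeSketch> &Slot = Nodes[Name];
    if (!Slot)
      Slot = std::make_unique<NodeSketch>(NodeSketch{Name});
    return Slot.get();
  }

  std::size_t size() const { return Nodes.size(); }
};

// Returning by value moves (or elides); nothing is deep-copied.
GraphSketch buildGraph() {
  GraphSketch G;
  G.getOrInsert("main");
  G.getOrInsert("callee");
  return G;
}
```

Moving the graph transfers ownership of the nodes without relocating them, so node pointers handed out earlier stay valid in the moved-to object.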

llvm-svn: 244122

a5d7de9f

Aug 04, 2015

wrap OptSize and MinSize attributes for easier and consistent access (NFCI) · 924879ad

Sanjay Patel authored Aug 04, 2015

Create wrapper methods in the Function class for the OptimizeForSize and MinSize
attributes. We want to hide the logic of "or'ing" them together when optimizing
just for size (-Os).
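
The wrapper idea can be sketched with a small stand-in type. `FunctionAttrsSketch` is hypothetical; it models only the two attributes and the "or" logic described above.

```cpp
// Stand-in for the two function attributes discussed above.
struct FunctionAttrsSketch {
  bool OptimizeForSize = false; // set for -Os
  bool MinSize = false;         // set for -Oz

  // True only when minimizing size.
  bool optForMinSize() const { return MinSize; }

  // True when optimizing for size in either form, so callers do not have to
  // remember to "or" the two attributes themselves.
  bool optForSize() const { return OptimizeForSize || optForMinSize(); }
};
```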

Currently, we are not consistent about this and rely on a front-end to always set
OptimizeForSize (-Os) if MinSize (-Oz) is on. Thus, there are 18 FIXME changes here
that should be added as follow-on patches with regression tests.

This patch is NFC-intended: it just replaces existing direct accesses of the attributes
by the equivalent wrapper call.

Differential Revision: http://reviews.llvm.org/D11734

llvm-svn: 243994

924879ad

Jul 19, 2015
- Narrow Callee scope, suggestion from David Blaikie. · c66c06b8
  Yaron Keren authored Jul 19, 2015
```
llvm-svn: 242644
```
  c66c06b8
- De-duplicate CS.getCalledFunction() expression. · 611f614e
  Yaron Keren authored Jul 19, 2015
```
Not sure if the optimizer will save the call as getCalledFunction()
is not a trivial access function but the code is clearer this way.

llvm-svn: 242641
```
  611f614e
Jul 02, 2015
- Remove whitespace from start of line, NFC. · 6967cbb3
  Yaron Keren authored Jul 02, 2015
```
llvm-svn: 241268
```
  6967cbb3
Jun 25, 2015
- Rangify for loop in Inliner.cpp. NFC. · 62064d6d
  Yaron Keren authored Jun 25, 2015
```
llvm-svn: 240678
```
  62064d6d
Jun 20, 2015
- Rangify for loops in Inliner::runOnSCC(), NFC. · 4c548f2d
  Yaron Keren authored Jun 20, 2015
```
llvm-svn: 240215
```
  4c548f2d