Commits · 8cfcf586bbf5e83daff8aecd193b742002e961c1 · Lorenzo Albano / LLVM bpEVL

May 20, 2016

Functions with differing phis should not be merged. · 9b5fcf65

Mark Lacey authored May 20, 2016

Check that the incoming blocks of phi nodes are identical, and block
function merging if they are not.

rdar://problem/26255167

Differential Revision: http://reviews.llvm.org/D20462

llvm-svn: 270250

9b5fcf65

May 15, 2016
- Rename pass name to prepare to new PM porting /NFC · 72616180
  Xinliang David Li authored May 15, 2016
```
llvm-svn: 269586
```
  72616180
May 12, 2016

[ThinLTO] Don't re-analyze callee at same threshold unnecessarily · 2e03094d

Teresa Johnson authored May 11, 2016

This should just be a compile-time change. Correct the check for whether
we have already analyzed the callee when making summary based decisions.
There is no need to reprocess one at the same threshold as when it was
last processed.

llvm-svn: 269251

2e03094d

May 11, 2016

Delete mayBeOverridden. · f329be83

Rafael Espindola authored May 11, 2016

It is the same as isInterposable which seems to be the preferred name.

llvm-svn: 269150

f329be83

May 10, 2016

Cloning: Clean up the interface to the CloneFunction function. · dba99560

Peter Collingbourne authored May 10, 2016

Remove the ModuleLevelChanges argument, and the ability to create new
subprograms for cloned functions. The latter was added without review in
r203662, but it has no in-tree clients (all non-test callers pass false
for ModuleLevelChanges [1], so it isn't reachable outside of tests). It
also isn't clear that adding a duplicate subprogram to the compile unit is
always the right thing to do when cloning a function within a module. If
this functionality comes back it should be accompanied with a more concrete
use case.

Furthermore, all in-tree clients add the returned function to the module.
Since that's pretty much the only sensible thing you can do with the function,
just do that in CloneFunction.

[1] http://llvm-cs.pcc.me.uk/lib/Transforms/Utils/CloneFunction.cpp/rCloneFunction

Differential Revision: http://reviews.llvm.org/D18628

llvm-svn: 269110

dba99560

Re-apply r269081 and r269082 with a fix for MSVC. · ccdc225c
Peter Collingbourne authored May 10, 2016
```
llvm-svn: 269094
```
ccdc225c
Revert r269081 and r269082 while I try to find the right incantation to fix MSVC build. · 4d41cb6c
Peter Collingbourne authored May 10, 2016
```
llvm-svn: 269091
```
4d41cb6c

WholeProgramDevirt: Move logic for finding devirtualizable call sites to Analysis. · 0df2b085

Peter Collingbourne authored May 10, 2016

The plan is to eventually make this logic simpler, however I expect it to
be a little tricky for the foreseeable future (at least until we're rid of
pointee types), so move it here so that it can be reused to build a summary
index for devirtualization.

Differential Revision: http://reviews.llvm.org/D20005

llvm-svn: 269081

0df2b085

[ThinLTO] Add option to emit imports files for distributed backends · 8570fe47

Teresa Johnson authored May 10, 2016

Summary:
Add support for emission of plaintext lists of the imported files for
each distributed backend compilation. Used for distributed build file
staging.

Invoked with new gold-plugin thinlto-emit-imports-files option, which is
only valid with thinlto-index-only (i.e. for distributed builds), or
from llvm-lto with new -thinlto-action=emitimports value.

Depends on D19556.

Reviewers: joker.eph

Subscribers: llvm-commits, joker.eph

Differential Revision: http://reviews.llvm.org/D19636

llvm-svn: 269067

8570fe47

Restore "[ThinLTO] Emit individual index files for distributed backends" · 84174c37

Teresa Johnson authored May 10, 2016

This restores commit r268627:
    Summary:
    When launching ThinLTO backends in a distributed build (currently
    supported in gold via the thinlto-index-only plugin option), emit
    an individual index file for each backend process as described here:
    http://lists.llvm.org/pipermail/llvm-dev/2016-April/098272.html

    ...

    Differential Revision: http://reviews.llvm.org/D19556

Address msan failures by avoiding std::prev on map.end(), the
theory is that this is causing issues due to some known UB problems
in __tree.

llvm-svn: 269059

84174c37

May 07, 2016
- [PM] code refactoring -- preparation for new PM porting /NFC · d55827f7
  Xinliang David Li authored May 07, 2016
```
llvm-svn: 268851
```
  d55827f7
May 06, 2016

Tweak the ThinLTO pass pipeline · 31407ba0

Mehdi Amini authored May 06, 2016

Summary:
The original ThinLTO pipeline was derived from some
work I did tuning FullLTO on the test suite and SPEC. This
patch reduces the amount of work done in the "linker phase" of
the build, and extend the function simplifications passes
performed during the "compile phase". This helps the build time
by reducing the IR as much as possible during the compile phase
and limiting the work to be performed during the "link phase",
while keeping the performance "on par" with the existing pipeline.

Reviewers: tejohnson

Subscribers: llvm-commits, joker.eph

Differential Revision: http://reviews.llvm.org/D19773

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268769

31407ba0

[PM] port IR based PGO prof-gen pass to new pass manager · 8aebf44c
Xinliang David Li authored May 06, 2016
```
llvm-svn: 268710
```
8aebf44c

May 05, 2016

Revert "[ThinLTO] Emit individual index files for distributed backends" · 1df2338b

Vitaly Buka authored May 05, 2016

MemorySanitizer: use-of-uninitialized-value in lib/Bitcode/Writer/BitcodeWriter.cpp:364:70
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/12544/steps/check-llvm%20msan/logs/stdio

This reverts commit 0c4a898ea550699d1b2f4fe3767251c8f9a48d52.

llvm-svn: 268660

1df2338b

[ThinLTO] Emit individual index files for distributed backends · 9254ebe3

Teresa Johnson authored May 05, 2016

Summary:
When launching ThinLTO backends in a distributed build (currently
supported in gold via the thinlto-index-only plugin option), emit
an individual index file for each backend process as described here:
http://lists.llvm.org/pipermail/llvm-dev/2016-April/098272.html

The individual index file encodes the summary and module information
required for implementing the importing/exporting decisions made
for a given module in the thin link step.
This is in place of the current mechanism that uses the combined index
to make importing decisions in each back end independently. It is an
enabler for doing global summary based optimizations in the thin link
step (which will be recorded in the individual index files), and reduces
the size of the index that must be sent to each backend process, and
the amount of work to scan it in the backends.

Rather than create entirely new ModuleSummaryIndex structures (and all
the included unique_ptrs) for each backend index file, a map is created
to record all of the GUID and summary pointers needed for a particular
index file. The IndexBitcodeWriter walks this map instead of the full
index (hiding the details of managing the appropriate summary iteration
in a new iterator subclass). This is more efficient than walking the
entire combined index and filtering out just the needed summaries during
each backend bitcode index write.

Depends on D19481.

Reviewers: joker.eph

Subscribers: llvm-commits, joker.eph

Differential Revision: http://reviews.llvm.org/D19556

llvm-svn: 268627

9254ebe3

[PM] Port EliminateAvailableExternally pass to the new pass manager. · 344e838f
Davide Italiano authored May 05, 2016
```
llvm-svn: 268599
```
344e838f
[PM] Port ConstantMerge to the new pass manager. · 164b9bc6
Davide Italiano authored May 05, 2016
```
llvm-svn: 268582
```
164b9bc6

May 04, 2016
- [IPO/ConstantMerge] Convert to static function, to facilitate transition to the new PM. · 17da174b
  Davide Italiano authored May 04, 2016
```
llvm-svn: 268476
```
  17da174b
- [GlobalDCE, Misc] Don't remove functions referenced by ifuncs · 95549497
  David Majnemer authored May 04, 2016
```
We forgot to consider the target of ifuncs when considering if a
function was alive or dead.

N.B. Also update a few auxiliary tools like bugpoint and
verify-uselistorder.

This fixes PR27593.

llvm-svn: 268468
```
  95549497
May 03, 2016

[IPO/ConstantMerge] Garbage collect dead code. NFC. · c91e0b2f
Davide Italiano authored May 03, 2016
```
llvm-svn: 268442
```
c91e0b2f
[IPO/IPCP] Convert to use static functions. NFC. · 296d12cd
Davide Italiano authored May 03, 2016
```
In preparation for porting this pass to the new PM.

llvm-svn: 268429
```
296d12cd
[IPO/GlobalDCE] Port to the new pass manager. · 66228c4c
Davide Italiano authored May 03, 2016
```
Differential Revision:  http://reviews.llvm.org/D19782

llvm-svn: 268425
```
66228c4c

Move "Eliminate Available Externally" immediately after the inliner · 7f7d8be5

Mehdi Amini authored May 03, 2016

This pass is supposed to reduce the size of the IR for compile time
purpose. We should run it ASAP, except when we prepare for LTO or
ThinLTO, and we want to keep them available for link-time inline.

Differential Revision: http://reviews.llvm.org/D19813

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268394

7f7d8be5

ThinLTO: do not import function whose linkage prevents inlining. · 5b85d8d6

Mehdi Amini authored May 03, 2016

There is not point in importing a "weak" or a "linkonce" function
since we won't be able to inline it anyway.
We already had a targeted check for WeakAny, this is using the
same check on GlobalValue as the inline, i.e.
isMayBeOverriddenLinkage()

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268341

5b85d8d6

Revert "ThinLTO: do not import function whose linkage prevents inlining." · 1e918c9c
Mehdi Amini authored May 02, 2016
```
This reverts commit r268315, the tests are not passing.

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268317
```
1e918c9c

ThinLTO: do not import function whose linkage prevents inlining. · bda9b2ae

Mehdi Amini authored May 02, 2016

There is not point in importing a "weak" or a "linkonce" function
since we won't be able to inline it anyway.
We already had a targeted check for WeakAny, this is using the
same check on GlobalValue as the inline, i.e.
isMayBeOverriddenLinkage()

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268315

bda9b2ae

May 02, 2016

ReversePostOrderFunctionAttrs is not modifying the call graph, let's preserve it. · 0ddf404c

Mehdi Amini authored May 02, 2016

When running cc1 with -flto=thin, it is followed by GlobalOpt, which
requires the callgraph. This saves rebuilding one.

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268266

0ddf404c

Move createReversePostOrderFunctionAttrsPass right after the inliner is done · 45c7b3ec

Mehdi Amini authored May 02, 2016

This is where it was originally, until LoopVersioningLICM was
inserted before in r259986, I don't believe it was on purpose.

Differential Revision: http://reviews.llvm.org/D19809

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 268252

45c7b3ec

Apr 30, 2016
- Reapply r268107 after fixing a bug breaks debug build. · 4b2fdcca
  Xinliang David Li authored Apr 29, 2016
```
Makes the new method to set data needed by debug dump.

llvm-svn: 268130
```
  4b2fdcca
Apr 29, 2016

Revert r268107 -- debug build failure · 0552521b
Xinliang David Li authored Apr 29, 2016
```
llvm-svn: 268116
```
0552521b

[inliner]: Refactor inline deferring logic into its own method /NFC · 1ffa28a3

Xinliang David Li authored Apr 29, 2016

The implemented heuristic has a large body of code which better sits
in its own function for better readability. It also allows adding more
heuristics easier in the future.

llvm-svn: 268107

1ffa28a3

Do not read callee name when matching IR to profile as it is not used. · 21aefaec

Dehao Chen authored Apr 29, 2016

Summary: Callee name is not used to identify a callsite now, so do not read it during annotation.

Reviewers: davidxl, dnovillo

Subscribers: dnovillo, danielcdh, llvm-commits

Differential Revision: http://reviews.llvm.org/D19704

llvm-svn: 268069

21aefaec

[GlobalOpt] Propagate operand bundles · fadc6db0

David Majnemer authored Apr 29, 2016

We neglected to transfer operand bundles for some transforms.  These
were found via inspection, I'll try to come up with some test cases.

llvm-svn: 268011

fadc6db0

[DeadArgumentElimination] Propagate operand bundles to promoted call sites · 1a5799fe
David Majnemer authored Apr 29, 2016
```
We neglected to transfer operand bundles when performing argument
promotion.

llvm-svn: 268008
```
1a5799fe
[ArgumentPromotion] Propagate operand bundles to promoted call sites · cd24bb1d
David Majnemer authored Apr 29, 2016
```
We neglected to transfer operand bundles when performing argument
promotion.

This fixes PR27568.

llvm-svn: 267986
```
cd24bb1d

Apr 28, 2016

[PGO] Promote indirect calls to conditional direct calls with value-profile · 6e34c490

Rong Xu authored Apr 27, 2016

This patch implements the transformation that promotes indirect calls to
conditional direct calls when the indirect-call value profile meta-data is
available.

Differential Revision: http://reviews.llvm.org/D17864

llvm-svn: 267815

6e34c490

Apr 27, 2016

[TLI] Unify LibFunc attribute inference. NFCI. · b0624a2c

Ahmed Bougacha authored Apr 27, 2016

Now the pass is just a tiny wrapper around the util. This lets us reuse
the logic elsewhere (done here for BuildLibCalls) instead of duplicating
it.

The next step is to have something like getOrInsertLibFunc that also
sets the attributes.

Differential Revision: http://reviews.llvm.org/D19470

llvm-svn: 267759

b0624a2c

[TLI] Unify LibFunc signature checking. NFCI. · d765a82b

Ahmed Bougacha authored Apr 27, 2016

I tried to be as close as possible to the strongest check that
existed before; cleaning these up properly is left for future work.

Differential Revision: http://reviews.llvm.org/D19469

llvm-svn: 267758

d765a82b

[LoopDist] Add llvm.loop.distribute.enable loop metadata · d2fa4147

Adam Nemet authored Apr 27, 2016

Summary:
D19403 adds a new pragma for loop distribution.  This change adds
support for the corresponding metadata that the pragma is translated to
by the FE.

As part of this I had to rethink the flag -enable-loop-distribute.  My
goal was to be backward compatible with the existing behavior:

  A1. pass is off by default from the optimization pipeline
  unless -enable-loop-distribute is specified

  A2. pass is on when invoked directly from opt (e.g. for unit-testing)

The new pragma/metadata overrides these defaults so the new behavior is:

  B1. A1 + enable distribution for individual loop with the pragma/metadata

  B2. A2 + disable distribution for individual loop with the pragma/metadata

The default value whether the pass is on or off comes from the initiator
of the pass.  From the PassManagerBuilder the default is off, from opt
it's on.

I moved -enable-loop-distribute under the pass.  If the flag is
specified it overrides the default from above.

Then the pragma/metadata can further modifies this per loop.

As a side-effect, we can now also use -enable-loop-distribute=0 from opt
to emulate the default from the optimization pipeline.  So to be precise
this is the new behavior:

  C1. pass is off by default from the optimization pipeline
  unless -enable-loop-distribute or the pragma/metadata enables it

  C2. pass is on when invoked directly from opt
  unless -enable-loop-distribute=0 or the pragma/metadata disables it

Reviewers: hfinkel

Subscribers: joker.eph, mzolotukhin, llvm-commits

Differential Revision: http://reviews.llvm.org/D19431

llvm-svn: 267672

d2fa4147

ThinLTO: do not promote GlobalVariable that have a specific section. · b4e1e829
Mehdi Amini authored Apr 27, 2016
```
Differential Revision: http://reviews.llvm.org/D18298

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 267646
```
b4e1e829