  1. Apr 21, 2016
    • Initial implementation of optimization bisect support. · f0f27929
      Andrew Kaylor authored
      This patch implements an optimization bisect feature, which allows optimizations to be selectively disabled at compile time in order to track down test failures that are caused by incorrect optimizations.
      
      The bisection is enabled using a new command line option (-opt-bisect-limit).  Individual passes that may be skipped call the OptBisect object (via an LLVMContext) to see if they should be skipped based on the bisect limit.  A finer level of control (disabling individual transformations) can be managed through an additional OptBisect method, but this is not yet used.
      
      The skip checking in this implementation is based on (and replaces) the skipOptnoneFunction check.  Where that check was being called, a new call has been inserted in its place which checks the bisect limit and the optnone attribute.  A new function call has been added for module and SCC passes that behaves in a similar way.
      
      Differential Revision: http://reviews.llvm.org/D19172
      
      llvm-svn: 267022
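      
      To make the mechanism concrete, here is a minimal standalone sketch of the counter-gated skip check described above; the class and pass names are illustrative, not LLVM's actual OptBisect API.
      
      ```
      #include <iostream>
      #include <string>
      
      // Toy stand-in for the OptBisect object reached through the LLVMContext:
      // every skippable pass asks it whether it may run, and once the running
      // count exceeds the -opt-bisect-limit value, later passes are skipped.
      class OptBisectSketch {
        int Limit;          // -1 means "no limit": run everything
        int NextIndex = 0;  // running count of skippable passes seen so far
      public:
        explicit OptBisectSketch(int LimitFromFlag) : Limit(LimitFromFlag) {}
      
        bool shouldRunPass(const std::string &Name) {
          int Index = ++NextIndex;
          bool Run = (Limit < 0) || (Index <= Limit);
          std::cout << "BISECT: " << (Run ? "running" : "NOT running")
                    << " pass (" << Index << ") " << Name << "\n";
          return Run;
        }
      };
      
      int main() {
        OptBisectSketch Bisect(2);  // as if -opt-bisect-limit=2 had been passed
        const char *Passes[] = {"SimplifyCFG", "InstCombine", "GVN"};
        for (const char *P : Passes)
          if (Bisect.shouldRunPass(P)) {
            // ... run the optimization; skipped passes leave the IR untouched ...
          }
      }
      ```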
    • ThinLTO/ModuleLinker: add a flag to not always pull-in linkonce when performing importing · bda3c97c
      Mehdi Amini authored
      Summary:
      The function importer has already decided which symbols need to be
      pulled in. Moreover, these magically added ones will not be in the
      export list for the source module, which can confuse the
      internalizer, for instance.
      
      Reviewers: tejohnson, rafael
      
      Subscribers: joker.eph, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D19096
      
      From: Mehdi Amini <mehdi.amini@apple.com>
      llvm-svn: 266948
    • Refine instruction weight annotation algorithm for sample profiler. · a8bae823
      Dehao Chen authored
      Summary:
      This patch refines the instruction weight annotation algorithm:
      1. Do not use dbg_value intrinsics for annotation.
      2. Annotate a call as cold if the call is inlined in the profile but not inlined before annotation. This indicates that the annotation preparation step found no samples for the inlined callsite, so the call should be very cold.
      
      Reviewers: dnovillo, davidxl
      
      Subscribers: mgrang, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D19286
      
      llvm-svn: 266936
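      
      A small illustrative sketch of the two rules above, using hypothetical types rather than the real SampleProfile pass:
      
      ```
      #include <cstdint>
      
      struct CallSiteInfo {
        bool IsDbgValueIntrinsic;   // rule 1: dbg_value intrinsics get no annotation
        bool InlinedInProfile;      // profile contains an inlined-callsite record
        bool InlinedInIR;           // the call was actually inlined before annotation
        std::uint64_t SampleCount;  // samples attributed to the call, if any
      };
      
      // Returns the weight to annotate, or -1 meaning "leave unannotated".
      std::int64_t annotationWeight(const CallSiteInfo &CS) {
        if (CS.IsDbgValueIntrinsic)
          return -1;                                  // rule 1: skip dbg_value
        if (CS.InlinedInProfile && !CS.InlinedInIR)
          return 0;                                   // rule 2: no samples -> very cold
        return static_cast<std::int64_t>(CS.SampleCount);
      }
      ```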
  2. Apr 15, 2016
    • [PR27284] Reverse the ownership between DICompileUnit and DISubprogram. · 75819aed
      Adrian Prantl authored
      Currently each Function points to a DISubprogram and DISubprogram has a
      scope field. For member functions the scope is a DICompositeType. DIScopes
      point to the DICompileUnit to facilitate type uniquing.
      
      Distinct DISubprograms (with isDefinition: true) are not part of the type
      hierarchy and cannot be uniqued. This change removes the subprograms
      list from DICompileUnit and instead adds a pointer to the owning compile
      unit to distinct DISubprograms. This would make it easy for ThinLTO to
      strip unneeded DISubprograms and their transitively referenced debug info.
      
      Motivation
      ----------
      
      Materializing DISubprograms is currently the most expensive operation when
      doing a ThinLTO build of clang.
      
      We want the DISubprogram to be stored in a separate Bitcode block (or the
      same block as the function body) so we can avoid having to expensively
      deserialize all DISubprograms together with the global metadata. If a
      function has been inlined into another subprogram we need to store a
      reference to the block containing the inlined subprogram.
      
      Attached to https://llvm.org/bugs/show_bug.cgi?id=27284 is a python script
      that updates LLVM IR testcases to the new format.
      
      http://reviews.llvm.org/D19034
      <rdar://problem/25256815>
      
      llvm-svn: 266446
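      
      The data-structure change can be pictured with a hypothetical sketch (these are not LLVM's real metadata classes); the ownership edge between compile unit and subprogram simply flips direction:
      
      ```
      #include <vector>
      
      // Before: the compile unit owned the list of subprograms.
      struct SubprogramOld;
      struct CompileUnitOld { std::vector<SubprogramOld *> Subprograms; };
      struct SubprogramOld { /* no link back to its unit */ };
      
      // After: each distinct subprogram points at its owning compile unit, so a
      // ThinLTO backend can drop unneeded subprograms without rewriting the unit.
      struct CompileUnitNew { /* no subprograms list */ };
      struct SubprogramNew { CompileUnitNew *Unit; };
      
      int main() {
        CompileUnitNew CU;
        SubprogramNew SP{&CU};  // the subprogram carries the edge to its unit
        (void)SP;
      }
      ```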
    • [PM] Add a SpeculativeExecution pass for targets with divergent branches. · cf63b64f
      Justin Lebar authored
      Summary:
      This IR pass is helpful for GPUs, and other targets with divergent
      branches.  It's a nop on targets without divergent branches.
      
      Reviewers: chandlerc
      
      Subscribers: llvm-commits, jingyue, rnk, joker.eph, tra
      
      Differential Revision: http://reviews.llvm.org/D18626
      
      llvm-svn: 266399
  3. Apr 12, 2016
    • Add a pass to name anonymous/nameless function · d5faa267
      Mehdi Amini authored
      Summary:
      For correct handling of aliases to nameless functions, we need to be
      able to refer to them through a GUID in the summary. Here we name them
      using a hash of the non-private global names in the module.
      
      Reviewers: tejohnson
      
      Subscribers: joker.eph, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D18883
      
      From: Mehdi Amini <mehdi.amini@apple.com>
      llvm-svn: 266132
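      
      A rough sketch of the naming scheme described above, with hypothetical helper names (the actual pass operates on llvm::Module, not on this toy struct):
      
      ```
      #include <cstdint>
      #include <functional>
      #include <string>
      #include <vector>
      
      struct ToyGlobal { std::string Name; bool IsPrivate; };
      
      // Hash the non-private global names so the result is stable for the module.
      std::string moduleNameHash(const std::vector<ToyGlobal> &Globals) {
        std::uint64_t H = 0;
        for (const ToyGlobal &G : Globals)
          if (!G.IsPrivate && !G.Name.empty())
            H = H * 131 + std::hash<std::string>{}(G.Name);
        return std::to_string(H);
      }
      
      // Give every nameless function a deterministic name derived from that hash,
      // so it can later be referred to through a GUID in the summary.
      void nameAnonymousFunctions(std::vector<ToyGlobal> &Functions) {
        const std::string Hash = moduleNameHash(Functions);
        unsigned Count = 0;
        for (ToyGlobal &F : Functions)
          if (F.Name.empty())
            F.Name = "anon." + Hash + "." + std::to_string(Count++);
      }
      ```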
    • NFC: MergeFunctions return early · f90029bb
      JF Bastien authored
      Same effect, easier to read.
      
      llvm-svn: 266128
    • [ThinLTO] Only compute imports for current module in FunctionImport pass · c86af334
      Teresa Johnson authored
      Summary:
      The function import pass was computing all the imports for all the
      modules in the index, and only using the imports for the current module.
      Change this to instead compute only for the given module. This means
      that the export lists can't be populated, but they weren't being used
      anyway.
      
      Longer term, the linker can collect all the imports and export lists
      and serialize them out for consumption by the distributed backend
      processes which use this pass.
      
      Reviewers: joker.eph
      
      Subscribers: llvm-commits, joker.eph
      
      Differential Revision: http://reviews.llvm.org/D18945
      
      llvm-svn: 266125
    • NFC: MergeFunctions update more comments · 1bb32ac4
      JF Bastien authored
      They are wordy. Some words were wrong.
      
      llvm-svn: 266124
    • MergeFunctions: test alloca better · 4f43cfd2
      JF Bastien authored
      r237193 fixed handling of alloca size / align in MergeFunctions, but only tested one of them and didn't follow FunctionComparator::cmpOperations's usual comparison pattern. It also didn't update Instruction.cpp:haveSameSpecialState, which I'll do separately.
      
      llvm-svn: 266022
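      
      For reference, the usual comparison pattern mentioned above chains three-way comparisons and returns at the first difference; a simplified sketch (cmpNumbers here is a stand-in, not the exact LLVM helper):
      
      ```
      #include <cstdint>
      
      static int cmpNumbers(std::uint64_t L, std::uint64_t R) {
        if (L < R) return -1;
        if (L > R) return 1;
        return 0;
      }
      
      struct AllocaInfo { std::uint64_t AllocatedSize; std::uint64_t Alignment; };
      
      int cmpAllocas(const AllocaInfo &L, const AllocaInfo &R) {
        if (int Res = cmpNumbers(L.AllocatedSize, R.AllocatedSize))
          return Res;                    // sizes differ -> ordering decided here
        return cmpNumbers(L.Alignment, R.Alignment);  // otherwise compare alignment
      }
      ```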
  4. Apr 11, 2016
    • [ThinLTO] Move summary computation from BitcodeWriter to new pass · 2d5487cf
      Teresa Johnson authored
      Summary:
      This is the first step in also serializing the index out to LLVM
      assembly.
      
      The per-module summary written to bitcode is moved out of the bitcode
      writer and to a new analysis pass (ModuleSummaryIndexWrapperPass).
      The pass itself uses a new builder class to compute the index, and the
      builder class is used directly in places where we don't have a pass
      manager (e.g. llvm-as).
      
      Because we are computing summaries outside of the bitcode writer, we no
      longer can use value ids created by the bitcode writer's
      ValueEnumerator. This required changing the reference graph edge type
      to use a new ValueInfo class holding a union between a GUID (combined
      index) and Value* (per-module index). The Value* references are
      converted to the appropriate value IDs during bitcode writing.
      
      Also, this enables removal of the BitWriter library's dependence on the
      Analysis library that was previously required for the summary computation.
      
      Reviewers: joker.eph
      
      Subscribers: joker.eph, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D18763
      
      llvm-svn: 265941
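      
      A hypothetical sketch of the ValueInfo idea (not the exact LLVM class layout): the reference-graph edge stores either a GUID or a Value*, with a tag recording which index kind it belongs to.
      
      ```
      #include <cstdint>
      
      struct Value;                       // stand-in for llvm::Value
      
      class ValueInfoSketch {
        enum Kind { VI_GUID, VI_Value } TheKind;
        union {
          std::uint64_t Id;               // GUID, used in the combined index
          const Value *V;                 // Value*, used in the per-module index
        };
      public:
        explicit ValueInfoSketch(std::uint64_t Guid) : TheKind(VI_GUID), Id(Guid) {}
        explicit ValueInfoSketch(const Value *Val) : TheKind(VI_Value), V(Val) {}
        bool isGUID() const { return TheKind == VI_GUID; }
        std::uint64_t getGUID() const { return Id; }   // only valid if isGUID()
        const Value *getValue() const { return V; }    // only valid if !isGUID()
      };
      ```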
  5. Apr 08, 2016
    • Don't IPO over functions that can be de-refined · 5ce32728
      Sanjoy Das authored
      Summary:
      Fixes PR26774.
      
      If you're aware of the issue, feel free to skip the "Motivation"
      section and jump directly to "This patch".
      
      Motivation:
      
      I define "refinement" as discarding behaviors from a program that the
      optimizer has license to discard.  So transforming:
      
      ```
      void f(unsigned x) {
        unsigned t = 5 / x;
        (void)t;
      }
      ```
      
      to
      
      ```
      void f(unsigned x) { }
      ```
      
      is refinement, since the behavior went from "if x == 0 then undefined
      else nothing" to "nothing" (the optimizer has license to discard
      undefined behavior).
      
      Refinement is a fundamental aspect of many mid-level optimizations done
      by LLVM.  For instance, transforming `x == (x + 1)` to `false` also
      involves refinement since the expression's value went from "if x is
      `undef` then { `true` or `false` } else { `false` }" to "`false`" (by
      definition, the optimizer has license to fold `undef` to any non-`undef`
      value).
      
      Unfortunately, refinement implies that the optimizer cannot assume
      that the implementation of a function it can see has all of the
      behavior an unoptimized or a differently optimized version of the same
      function can have.  This is a problem for functions with comdat
      linkage, where a function can be replaced by an unoptimized or a
      differently optimized version of the same source level function.
      
      For instance, FunctionAttrs cannot assume a comdat function is
      actually `readnone` even if it does not have any loads or stores in
      it; since there may have been loads and stores in the "original
      function" that were refined out in the currently visible variant, and
      at the link step the linker may in fact choose an implementation with
      a load or a store.  As an example, consider a function that does two
      atomic loads from the same memory location, and writes to memory only
      if the two values are not equal.  The optimizer is allowed to refine
      this function by first CSE'ing the two loads, and then folding the
      comparison to always report that the two values are equal.  Such a
      refined variant will look like it is `readonly`.  However, the
      unoptimized version of the function can still write to memory (since
      the two loads //can// result in different values), and selecting the
      unoptimized version at link time will retroactively invalidate
      transforms we may have done under the assumption that the function
      does not write to memory.
      
      Note: this is not just a problem with atomics or with linking
      differently optimized object files.  See PR26774 for more realistic
      examples that involved neither.
      
      This patch:
      
      This change introduces a new predicate on linkage types,
      `GlobalValue::mayBeDerefined`, which returns true if the linkage type
      allows a function to be replaced by a differently optimized variant at
      link time.  It then changes a set of IPO passes to bail out if they see
      such a function.
      
      Reviewers: chandlerc, hfinkel, dexonsmith, joker.eph, rnk
      
      Subscribers: mcrosier, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D18634
      
      llvm-svn: 265762
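      
      A toy illustration of the bail-out (the enum and helpers are illustrative, not LLVM's real GlobalValue interface): an IPO pass first asks whether the linkage permits the body to be replaced by a differently optimized variant at link time, and refuses to draw conclusions from such a body.
      
      ```
      enum class Linkage { External, Internal, LinkOnceODR, WeakAny };
      
      // Interposable / ODR-mergeable bodies may be swapped at link time, so a
      // differently optimized ("de-refined") variant could be chosen instead.
      bool mayBeDerefinedSketch(Linkage L) {
        return L == Linkage::LinkOnceODR || L == Linkage::WeakAny;
      }
      
      struct FunctionSketch { Linkage L; bool HasLoadsOrStores; };
      
      bool canMarkReadNone(const FunctionSketch &F) {
        if (mayBeDerefinedSketch(F.L))
          return false;              // bail out: the visible body is not authoritative
        return !F.HasLoadsOrStores;  // safe to derive the attribute from the body
      }
      ```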
  6. Apr 06, 2016
    • NFC: make AtomicOrdering an enum class · 800f87a8
      JF Bastien authored
      Summary:
      In the context of http://wg21.link/lwg2445 C++ uses the concept of
      'stronger' ordering but doesn't define it properly. This should be fixed
      in C++17 barring a small question that's still open.
      
      The code currently plays fast and loose with the AtomicOrdering
      enum. Using an enum class is one step towards tightening things. I later
      also want to tighten related enums, such as clang's
      AtomicOrderingKind (which should be shared with LLVM as a 'C++ ABI'
      enum).
      
      This change touches a few lines of code which can be improved later, I'd
      like to keep it as NFC for now as it's already quite complex. I have
      related changes for clang.
      
      As a follow-up I'll add:
        bool operator<(AtomicOrdering, AtomicOrdering) = delete;
        bool operator>(AtomicOrdering, AtomicOrdering) = delete;
        bool operator<=(AtomicOrdering, AtomicOrdering) = delete;
        bool operator>=(AtomicOrdering, AtomicOrdering) = delete;
      This is separate so that clang and LLVM changes don't need to be in sync.
      
      Reviewers: jyknight, reames
      
      Subscribers: jyknight, llvm-commits
      
      Differential Revision: http://reviews.llvm.org/D18775
      
      llvm-svn: 265602
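      
      As a small illustration of the point (names here are sketches, not the exact LLVM definitions): with an enum class and the relational operators deleted, callers cannot accidentally treat a larger underlying value as a "stronger" ordering and must go through an explicit predicate instead.
      
      ```
      enum class AtomicOrderingSketch {
        NotAtomic, Unordered, Monotonic, Acquire, Release,
        AcquireRelease, SequentiallyConsistent
      };
      
      // Hypothetical helper: Release is not "at least acquire" even though its
      // underlying value is larger than Acquire's; the relation is not the raw
      // integer order of the enumerators.
      bool isAtLeastAcquire(AtomicOrderingSketch AO) {
        return AO == AtomicOrderingSketch::Acquire ||
               AO == AtomicOrderingSketch::AcquireRelease ||
               AO == AtomicOrderingSketch::SequentiallyConsistent;
      }
      ```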