Commits · 9fb3ab1b1943f59862d0b5d8cd14daf5396962b0 · Lorenzo Albano / LLVM bpEVL

Mar 09, 2017

WholeProgramDevirt: Implement importing for uniform ret val opt. · 0152c815
Peter Collingbourne authored Mar 09, 2017
```
Differential Revision: https://reviews.llvm.org/D29854

llvm-svn: 297350
```
0152c815
WholeProgramDevirt: Implement importing for single-impl devirtualization. · 6d284fab
Peter Collingbourne authored Mar 09, 2017
```
Differential Revision: https://reviews.llvm.org/D29844

llvm-svn: 297333
```
6d284fab

Perform symbol binding for .symver versioned symbols · d8204472

Teresa Johnson authored Mar 09, 2017

Summary:
In a .symver assembler directive like:
.symver name, name2@@nodename
"name2@@nodename" should get the same symbol binding as "name".

While the ELF object writer is updating the symbol binding for .symver
aliases before emitting the object file, not doing so when the module
inline assembly is handled by the RecordStreamer is causing the wrong
behavior in *LTO mode.

E.g. when "name" is global, "name2@@nodename" must also be marked as
global. Otherwise, the symbol is skipped when iterating over the LTO
InputFile symbols (InputFile::Symbol::shouldSkip). So, for example,
when performing any *LTO via the gold-plugin, the versioned symbol
definition is not recorded by the plugin and passed back to the
linker. If the object was in an archive, and there were no other symbols
needed from that object, the object would not be included in the final
link and references to the versioned symbol are undefined.

The llvm-lto2 tests added will give an error about an unused symbol
resolution without the fix.

Reviewers: rafael, pcc

Reviewed By: pcc

Subscribers: mehdi_amini, llvm-commits

Differential Revision: https://reviews.llvm.org/D30485

llvm-svn: 297332

d8204472

Don't merge global constants with non-dbg metadata. · 8537d999

Evgeniy Stepanov authored Mar 09, 2017

!type metadata can not be dropped. An alternative to this is adding
!type metadata from the replaced globals to the replacement, but that
may weaken type tests and make them slower at the same time.

The merged global gets !dbg metadata from replaced globals, and can
end up with multiple debug locations.

llvm-svn: 297327

8537d999

Mar 07, 2017

Fix one-after-the-end type metadata handling in globalsplit. · 7a5cfa9a

Evgeniy Stepanov authored Mar 07, 2017

Itanium ABI may have an address point one byte after the end of a
vtable. When such vtable global is split, the !type metadata needs to
follow the right vtable.

Differential Revision: https://reviews.llvm.org/D30716

llvm-svn: 297236

7a5cfa9a

Mar 06, 2017

Disable gvn-hoist (PR32153) · 254f5fa5
Hans Wennborg authored Mar 06, 2017
```
llvm-svn: 297075
```
254f5fa5

Remove the sample pgo annotation heuristic that uses call count to annotate basic block count. · c632a393

Dehao Chen authored Mar 06, 2017

Summary: We do not need that special handling because the debug info is more accurate now. Performance testing shows no regression on google internal benchmarks.

Reviewers: davidxl, aprantl

Reviewed By: aprantl

Subscribers: llvm-commits, aprantl

Differential Revision: https://reviews.llvm.org/D30658

llvm-svn: 297038

c632a393

Mar 04, 2017

Fix build. · f0bb90b1
Peter Collingbourne authored Mar 04, 2017
```
llvm-svn: 296949
```
f0bb90b1
WholeProgramDevirt: Implement exporting for uniform ret val opt. · 77a8d563
Peter Collingbourne authored Mar 04, 2017
```
Differential Revision: https://reviews.llvm.org/D29846

llvm-svn: 296948
```
77a8d563
WholeProgramDevirt: Implement exporting for single-impl devirtualization. · 2325bb34
Peter Collingbourne authored Mar 04, 2017
```
Differential Revision: https://reviews.llvm.org/D29811

llvm-svn: 296945
```
2325bb34

WholeProgramDevirt: Add any unsuccessful llvm.type.checked.load... · b406baae

Peter Collingbourne authored Mar 04, 2017

WholeProgramDevirt: Add any unsuccessful llvm.type.checked.load devirtualizations to the list of llvm.type.test users.

Any unsuccessful llvm.type.checked.load devirtualizations will be translated
into uses of llvm.type.test, so we need to add the resulting llvm.type.test
intrinsics to the function summaries so that the LowerTypeTests pass will
export them.

Differential Revision: https://reviews.llvm.org/D29808

llvm-svn: 296939

b406baae

Mar 03, 2017

Revert "Re-apply "[GVNHoist] Move GVNHoist to function simplification part of pipeline."" · 9528f8c2
Benjamin Kramer authored Mar 03, 2017
```
This reverts commit r296759. Miscompiles bash.

llvm-svn: 296872
```
9528f8c2

ThinLTOBitcodeWriter: Do not follow operand edges of type GlobalValue when... · 3baa72af

Peter Collingbourne authored Mar 02, 2017

ThinLTOBitcodeWriter: Do not follow operand edges of type GlobalValue when looking for virtual functions.

Such edges may otherwise result in infinite recursion if a pointer to a vtable
is reachable from the vtable itself. This can happen in practice if a TU
defines the ABI types used to implement RTTI, and is itself compiled with RTTI.

Fixes PR32121.

llvm-svn: 296839

3baa72af

Mar 02, 2017

Re-apply "[GVNHoist] Move GVNHoist to function simplification part of pipeline." · 484d7565

Geoff Berry authored Mar 02, 2017

This re-applies r289696, which caused TSan perf regression, which has
since been addressed in separate changes (see PR for details).

See PR31382.

llvm-svn: 296759

484d7565

Feb 28, 2017

Add function importing info from samplepgo profile to the module summary. · a60cdd38

Dehao Chen authored Feb 28, 2017

Summary: For SamplePGO, the profile may contain cross-module inline stacks. As we need to make sure the profile annotation happens when all the hot inline stacks are expanded, we need to pass this info to the module importer so that it can import proper functions if necessary. This patch implemented this feature by emitting cross-module targets as part of function entry metadata. In the module-summary phase, the metadata is used to build call edges that points to functions need to be imported.

Reviewers: mehdi_amini, tejohnson

Reviewed By: tejohnson

Subscribers: davidxl, llvm-commits

Differential Revision: https://reviews.llvm.org/D30053

llvm-svn: 296498

a60cdd38

Feb 24, 2017

[OptDiag] Hide legacy remark ctors · de53bfb9

Adam Nemet authored Feb 23, 2017

These are only used when emitting remarks without ORE directly using the free
functions emitOptimizationRemark*.

llvm-svn: 296037

de53bfb9

Feb 23, 2017

Add call branch annotation for ICP promoted direct call in SamplePGO mode. · cc75d244

Dehao Chen authored Feb 23, 2017

Summary: SamplePGO uses branch_weight annotation to represent callsite hotness. When ICP promotes an indirect call to direct call, we need to make sure the direct call is annotated with branch_weight in SamplePGO mode, so that downstream function inliner can use hot callsite heuristic.

Reviewers: davidxl, eraman, xur

Reviewed By: davidxl, xur

Subscribers: mehdi_amini, llvm-commits

Differential Revision: https://reviews.llvm.org/D30282

llvm-svn: 296028

cc75d244

Use base discriminator in sample pgo profile matching. · 533bc6ea

Dehao Chen authored Feb 23, 2017

Summary: The discriminator has been encoded, and only the base discriminator should be used during profile matching.

Reviewers: dblaikie, davidxl

Reviewed By: dblaikie, davidxl

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D30218

llvm-svn: 295999

533bc6ea

Feb 18, 2017

Increases full-unroll threshold. · 7d230325

Dehao Chen authored Feb 18, 2017

Summary:
The default threshold for fully unroll is too conservative. This patch doubles the full-unroll threshold

This change will affect the following speccpu2006 benchmarks (performance numbers were collected from Intel Sandybridge):

Performance:

403	0.11%
433	0.51%
445	0.48%
447	3.50%
453	1.49%
464	0.75%

Code size:

403	0.56%
433	0.96%
445	2.16%
447	2.96%
453	0.94%
464	8.02%

The compiler time overhead is similar with code size.

Reviewers: davidxl, mkuper, mzolotukhin, hfinkel, chandlerc

Reviewed By: hfinkel, chandlerc

Subscribers: mehdi_amini, zzheng, efriedma, haicheng, hfinkel, llvm-commits

Differential Revision: https://reviews.llvm.org/D28368

llvm-svn: 295538

7d230325

OptDiag: Allow constructing DiagnosticLocation from DISubprograms · 7bc978b5

Justin Bogner authored Feb 18, 2017

This avoids creating a DILocation just to represent a line number,
since creating Metadata is expensive. Creating a DiagnosticLocation
directly is much cheaper.

llvm-svn: 295531

7bc978b5

Feb 17, 2017

WholeProgramDevirt: For VCP use a 32-bit ConstantInt for the byte offset. · 184773d8

Peter Collingbourne authored Feb 17, 2017

A future change will cause this byte offset to be inttoptr'd and then exported
via an absolute symbol. On the importing end we will expect the symbol to be
in range [0,2^32) so that it will fit into a 32-bit relocation. The problem
is that on 64-bit architectures if the offset is negative it will not be in
the correct range once we inttoptr it.

This change causes us to use a 32-bit integer so that it can be inttoptr'd
(which zero extends) into the correct range.

Differential Revision: https://reviews.llvm.org/D30016

llvm-svn: 295487

184773d8

WholeProgramDevirt: Examine the function body when deciding whether functions are readnone. · 37317f12
Peter Collingbourne authored Feb 17, 2017
```
The goal is to get an analysis result even for de-refineable functions.

Differential Revision: https://reviews.llvm.org/D29803

llvm-svn: 295472
```
37317f12

Feb 16, 2017
- PMB: Add an importing WPD pass to the start of the ThinLTO backend pipeline. · 08eb081a
  Peter Collingbourne authored Feb 15, 2017
```
Differential Revision: https://reviews.llvm.org/D30008

llvm-svn: 295260
```
  08eb081a
Feb 15, 2017

Re-apply r295110 and r295144 with a fix for the ASan issue. · 50cbd7cc
Peter Collingbourne authored Feb 15, 2017
```
llvm-svn: 295241
```
50cbd7cc

Revert r295110 and r295144. · eef9b033

Daniel Jasper authored Feb 15, 2017

This fails under ASAN:
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/798/steps/check-llvm%20asan/logs/stdio

llvm-svn: 295162

eef9b033

WholeProgramDevirt: Separate the code that applies optzns from the code that... · e2367415

Peter Collingbourne authored Feb 15, 2017

WholeProgramDevirt: Separate the code that applies optzns from the code that decides whether to apply them. NFCI.

The idea is that the apply* functions will also be called when importing
devirt optimizations.

Differential Revision: https://reviews.llvm.org/D29745

llvm-svn: 295144

e2367415

Feb 14, 2017

WholeProgramDevirt: Change internal vcall data structures to match summary. · 534c0175

Peter Collingbourne authored Feb 14, 2017

Group calls into constant and non-constant arguments up front, and use uint64_t
instead of ConstantInt to represent constant arguments. The goal is to allow
the information from the summary to fit naturally into this data structure in
a future change (specifically, it will be added to CallSiteInfo).

This has two side effects:
- We disallow VCP for constant integer arguments of width >64 bits.
- We remove the restriction that the bitwidth of a vcall's argument and return
  types must match those of the vfunc definitions.
I don't expect either of these to matter in practice. The first case is
uncommon, and the second one will lead to UB (so we can do anything we like).

Differential Revision: https://reviews.llvm.org/D29744

llvm-svn: 295110

534c0175

Do not apply redundant LastCallToStaticBonus · f22fa72e

Taewook Oh authored Feb 14, 2017

Summary:
As written in the comments above, LastCallToStaticBonus is already applied to
the cost if Caller has only one user, so it is redundant to reapply the bonus
here.

If the only user is not a caller, TotalSecondaryCost will not be adjusted
anyway because callerWillBeRemoved is false. If there's no caller at all, we
don't need to care about TotalSecondaryCost because
inliningPreventsSomeOuterInline is false.

Reviewers: chandlerc, eraman

Reviewed By: eraman

Subscribers: haicheng, davidxl, davide, llvm-commits, mehdi_amini

Differential Revision: https://reviews.llvm.org/D29169

llvm-svn: 295075

f22fa72e

ThinLTOBitcodeWriter: Write available_externally copies of VCP eligible functions to merged module. · 002c2d53
Peter Collingbourne authored Feb 14, 2017
```
Differential Revision: https://reviews.llvm.org/D29701

llvm-svn: 295021
```
002c2d53

FunctionAttrs: Factor out a function for querying memory access of a specific... · c45f7f3e

Peter Collingbourne authored Feb 14, 2017

FunctionAttrs: Factor out a function for querying memory access of a specific copy of a function. NFC.

This will later be used by ThinLTOBitcodeWriter to add copies of readnone
functions to the regular LTO module.

Differential Revision: https://reviews.llvm.org/D29695

llvm-svn: 295008

c45f7f3e

[FunctionAttrs] try to extend nonnull-ness of arguments from a callsite back to its parent function · 4f74216d

Sanjay Patel authored Feb 13, 2017

As discussed here:
http://lists.llvm.org/pipermail/llvm-dev/2016-December/108182.html
...we should be able to propagate 'nonnull' info from a callsite back to its parent.

The original motivation for this patch is our botched optimization of "dyn_cast" (PR28430),
but this won't solve that problem.

The transform is currently disabled by default while we wait for clang to work-around
potential security problems:
http://lists.llvm.org/pipermail/cfe-dev/2017-January/052066.html

Differential Revision: https://reviews.llvm.org/D27855

llvm-svn: 294998

4f74216d

Feb 13, 2017

IR: Type ID summary extensions for WPD; thread summary into WPD pass. · 2b33f653

Peter Collingbourne authored Feb 13, 2017

Make the whole thing testable by adding YAML I/O support for the WPD
summary information and adding some negative tests that exercise the
YAML support.

Differential Revision: https://reviews.llvm.org/D29782

llvm-svn: 294981

2b33f653

Feb 10, 2017

[PM] Port ArgumentPromotion to the new pass manager. · addcda48

Chandler Carruth authored Feb 09, 2017

Now that the call graph supports efficient replacement of a function and
spurious reference edges, we can port ArgumentPromotion to the new pass
manager very easily.

The old PM-specific bits are sunk into callbacks that the new PM simply
doesn't use. Unlike the old PM, the new PM simply does argument
promotion and afterward does the update to LCG reflecting the promoted
function.

Differential Revision: https://reviews.llvm.org/D29580

llvm-svn: 294667

addcda48

WholeProgramDevirt: Check that VCP candidate functions are defined before evaluating them. · 17febdbb
Peter Collingbourne authored Feb 09, 2017
```
This was crashing before.

llvm-svn: 294666
```
17febdbb

[PM/LCG] Teach the LazyCallGraph how to replace a function without · aaad9f84

Chandler Carruth authored Feb 09, 2017

disturbing the graph or having to update edges.

This is motivated by porting argument promotion to the new pass manager.
Because of how LLVM IR Function objects work, in order to change their
signature a new object needs to be created. This is efficient and
straight forward in the IR but previously was very hard to implement in
LCG. We could easily replace the function a node in the graph
represents. The challenging part is how to handle updating the edges in
the graph.

LCG previously used an edge to a raw function to represent a node that
had not yet been scanned for calls and references. This was the core
of its laziness. However, that model causes this kind of update to be
very hard:
1) The keys to lookup an edge need to be `Function*`s that would all
   need to be updated when we update the node.
2) There will be some unknown number of edges that haven't transitioned
   from `Function*` edges to `Node*` edges.

All of this complexity isn't necessary. Instead, we can always build
a node around any function, always pointing edges at it and always using
it as the key to lookup an edge. To maintain the laziness, we need to
sink the *edges* of a node into a secondary object and explicitly model
transitioning a node from empty to populated by scanning the function.
This design seems much cleaner in a number of ways, but importantly
there is now exactly *one* place where the `Function*` has to be
updated!

Some other cleanups that fall out of this include having something to
model the *entry* edges more accurately. Rather than hand rolling parts
of the node in the graph itself, we have an explicit `EdgeSequence`
object that gives us exactly the functionality needed. We also have
a consistent place to define the edge iterators and can use them for
both the entry edges and the internal edges of the graph.

The API used to model the separation between a node and its edges is
intentionally very thin as most clients are expected to deal with nodes
that have populated edges. We model this exactly as an optional does
with an additional method to populate the edges when that is
a reasonable thing for a client to do. This is based on API design
suggestions from Richard Smith and David Blaikie, credit goes to them
for helping pick how to model this without it being either too explicit
or too implicit.

The patch is somewhat noisy due to shifting around iterator types and
new syntax for walking the edges of a node, but most of the
functionality change is in the `Edge`, `EdgeSequence`, and `Node` types.

Differential Revision: https://reviews.llvm.org/D29577

llvm-svn: 294653

aaad9f84

De-duplicate some code for creating an AARGetter suitable for the legacy PM. · cea1e4e7
Peter Collingbourne authored Feb 09, 2017
```
I'm about to use this in a couple more places.

Differential Revision: https://reviews.llvm.org/D29793

llvm-svn: 294648
```
cea1e4e7

Feb 09, 2017

Rename LowerTypeTestsSummaryAction to PassSummaryAction. NFCI. · 857aba44

Peter Collingbourne authored Feb 09, 2017

I intend to use the same type with the same semantics in the WholeProgramDevirt
pass.

Differential Revision: https://reviews.llvm.org/D29746

llvm-svn: 294629

857aba44

Feb 08, 2017

ThinLTOBitcodeWriter: Strip debug info from merged module. · 28ffd326

Peter Collingbourne authored Feb 08, 2017

This module will contain nothing but vtable definitions and (soon)
available_externally function definitions, so there is no point in keeping
debug info in the module.

Differential Revision: https://reviews.llvm.org/D28913

llvm-svn: 294511

28ffd326

Feb 07, 2017

LowerTypeTests: Simplify. NFC. · 1ea1fd89
Peter Collingbourne authored Feb 07, 2017
```
llvm-svn: 294273
```
1ea1fd89

Fix the samplepgo indirect call promotion bug: we should not promote a direct call. · 4a9dd702

Dehao Chen authored Feb 06, 2017

Summary: Checking CS.getCalledFunction() == nullptr does not necessary indicate indirect call. We also need to check if CS.getCalledValue() is not a constant.

Reviewers: davidxl

Reviewed By: davidxl

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D29570

llvm-svn: 294260

4a9dd702