Commits · 784929d0454c4df6a98ef6fbbd1d30a6f71f9c16 · Lorenzo Albano / LLVM bpEVL

Feb 08, 2019

Implementation of asm-goto support in LLVM · 784929d0

Craig Topper authored Feb 08, 2019

This patch accompanies the RFC posted here:
http://lists.llvm.org/pipermail/llvm-dev/2018-October/127239.html

This patch adds a new CallBr IR instruction to support asm-goto
inline assembly like gcc as used by the linux kernel. This
instruction is both a call instruction and a terminator
instruction with multiple successors. Only inline assembly
usage is supported today.

This also adds a new INLINEASM_BR opcode to SelectionDAG and
MachineIR to represent an INLINEASM block that is also
considered a terminator instruction.

There will likely be more bug fixes and optimizations to follow
this, but we felt it had reached a point where we would like to
switch to an incremental development model.

Patch by Craig Topper, Alexander Ivchenko, Mikhail Dvoretckii

Differential Revision: https://reviews.llvm.org/D53765

llvm-svn: 353563

784929d0

[CodeExtractor] Restore outputs after creating exit stubs · 0e5dd512

Vedant Kumar authored Feb 08, 2019

When CodeExtractor saves the result of InvokeInst at the first insertion
point of the 'normal destination' basic block, this block can be omitted
in the outlined region, so store is placed outside of the function. The
suggested solution is to process saving outputs after creating exit
stubs for new function, and stores will be placed in that blocks before
return in this case.

Patch by Sergei Kachkov!

Fixes llvm.org/PR40455.

Differential Revision: https://reviews.llvm.org/D57919

llvm-svn: 353562

0e5dd512

[InstrProf] Implement static profdata registration · 987d331f

Reid Kleckner authored Feb 08, 2019

Summary:
The motivating use case is eliminating duplicate profile data registered
for the same inline function in two object files. Before this change,
users would observe multiple symbol definition errors with VC link, but
links with LLD would succeed.

Users (Mozilla) have reported that PGO works well with clang-cl and LLD,
but when using LLD without this static registration, we would get into a
"relocation against a discarded section" situation. I'm not sure what
happens in that situation, but I suspect that duplicate, unused profile
information was retained. If so, this change will reduce the size of
such binaries with LLD.

Now, Windows uses static registration and is in line with all the other
platforms.

Reviewers: davidxl, wmi, inglorion, void, calixte

Subscribers: mgorny, krytarowski, eraman, fedor.sergeev, hiraditya, #sanitizers, dmajor, llvm-commits

Tags: #sanitizers, #llvm

Differential Revision: https://reviews.llvm.org/D57929

llvm-svn: 353547

987d331f

ArgumentPromotion should copy all metadata to new Function · 3ce8112d

Teresa Johnson authored Feb 08, 2019

Summary:
ArgumentPromotion had code to specifically move the dbg metadata over to
the new function, but other metadata such as the function_entry_count
!prof metadata was not. Replace code that moved dbg metadata with a call
to copyMetadata. The old metadata is automatically removed when the old
Function is removed.

Reviewers: davidxl

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57846

llvm-svn: 353537

3ce8112d

[DWARF] LLVM ERROR: Broken function found, while removing Debug Intrinsics. · 08dc50f2

Carlos Alberto Enciso authored Feb 08, 2019

Check that when SimplifyCFG is flattening a 'br', all their debug intrinsic instructions are removed, including any dbg.label referencing a label associated with the basic blocks being removed.

Differential Revision: https://reviews.llvm.org/D57444

llvm-svn: 353511

08dc50f2

[LoopSimplifyCFG] Use DTU.applyUpdates instead of insert/deleteEdge · 6b63d3a2

Max Kazantsev authored Feb 08, 2019

`insert/deleteEdge` methods in DTU can make updates incorrectly in some cases
(see https://bugs.llvm.org/show_bug.cgi?id=40528), and it is recommended to
use `applyUpdates` methods instead when it is needed to make a mass update in CFG.

Differential Revision: https://reviews.llvm.org/D57316
Reviewed By: kuhar

llvm-svn: 353502

6b63d3a2

[CodeExtractor] Update function's assumption cache after extracting blocks from it · 807960e6

Sergey Dmitriev authored Feb 08, 2019

Summary: Assumption cache's self-updating mechanism does not correctly handle the case when blocks are extracted from the function by the CodeExtractor. As a result function's assumption cache may have stale references to the llvm.assume calls that were moved to the outlined function. This patch fixes this problem by removing extracted llvm.assume calls from the function’s assumption cache.

Reviewers: hfinkel, vsk, fhahn, davidxl, sanjoy

Reviewed By: hfinkel, vsk

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57215

llvm-svn: 353500

807960e6

Feb 07, 2019

[InstCombine] Optimize `atomicrmw <op>, 0` into `load atomic` when possible · 96f54de8

Quentin Colombet authored Feb 07, 2019

This commit teaches InstCombine how to replace an atomicrmw operation
into a simple load atomic.
For a given `atomicrmw <op>`, this is possible when:
1. The ordering of that operation is compatible with a load (i.e.,
   anything that doesn't have a release semantic).
2. <op> does not modify the value being stored

Differential Revision: https://reviews.llvm.org/D57854

llvm-svn: 353471

96f54de8

[LV] Remove unnecessary assignment to UserIC. · f557a94a
Florian Hahn authored Feb 07, 2019
```
llvm-svn: 353469
```
f557a94a

[InstCombine] Fix crashing from (icmp (bitcast ([su]itofp X)), Y) · 781d8838

Sanjay Patel authored Feb 07, 2019

This fixes a class of bugs introduced by D44367,
which transforms various cases of icmp (bitcast ([su]itofp X)), Y to icmp X, Y.
If the bitcast is between vector types with a different number of elements,
the current code will produce bad IR along the lines of: icmp <N x i32> ..., <M x i32> <...>.

This patch suppresses the transform if the bitcast changes the number of vector elements.

Patch by: @AndrewScheidecker (Andrew Scheidecker)

Differential Revision: https://reviews.llvm.org/D57871

llvm-svn: 353467

781d8838

[InstCombine] refactor folds for (icmp (bitcast X), Y); NFCI · e7f46c3d
Sanjay Patel authored Feb 07, 2019
```
llvm-svn: 353462
```
e7f46c3d

[LV] Prevent interleaving if computeMaxVF returned None. · ba5acbc4

Florian Hahn authored Feb 07, 2019

As discussed in D57382, interleaving should be avoided if computeMaxVF
returns None, same as we currently do for vectorization.

Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=6477

Reviewers: Ayal, dcaballe, hsaito, mkuper, rengolin

Reviewed By: Ayal

Differential Revision: https://reviews.llvm.org/D57837

llvm-svn: 353461

ba5acbc4

[InstrProf] Avoid reconstructing Triple, NFC · f21c0223
Reid Kleckner authored Feb 07, 2019
```
llvm-svn: 353439
```
f21c0223

[HotColdSplit] With PGO add profile entry metadata to split cold function · c36c10dd

Teresa Johnson authored Feb 07, 2019

Summary:
When compiling with profile data, ensure the split cold function gets
cold function_entry_count metadata (just use 0 since it should be cold).
Otherwise with function sections it will not be placed in the unlikely
text section with other cold code.

Reviewers: vsk

Subscribers: sebpop, hiraditya, davidxl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57900

llvm-svn: 353434

c36c10dd

[LSR] Generate cross iteration indexes · 67756c09

Sam Parker authored Feb 07, 2019

    
Modify GenerateConstantOffsetsImpl to create offsets that can be used
by indexed addressing modes. If formulae can be generated which
result in the constant offset being the same size as the recurrence,
we can generate a pre-indexed access. This allows the pointer to be
updated via the single pre-indexed access so that (hopefully) no
add/subs are required to update it for the next iteration. For small
cores, this can significantly improve performance DSP-like loops.

Differential Revision: https://reviews.llvm.org/D55373

llvm-svn: 353403

67756c09

Feb 06, 2019

[LICM/MSSA] Add promotion to scalars by building an AliasSetTracker with MemorySSA. · 6cba96ed

Alina Sbirlea authored Feb 06, 2019

Summary:
Experimentally we found that promotion to scalars carries less benefits
than sinking and hoisting in LICM. When using MemorySSA, we build an
AliasSetTracker on demand in order to reuse the current infrastructure.
We only build it if less than AccessCapForMSSAPromotion exist in the
loop, a cap that is by default set to 250. This value ensures there are
no runtime regressions, and there are small compile time gains for
pathological cases. A much lower value (20) was found to yield a single
regression in the llvm-test-suite and much higher benefits for compile
times. Conservatively we set the current cap to a high value, but we will
explore lowering it when MemorySSA is enabled by default.

Reviewers: sanjoy, chandlerc

Subscribers: nemanjai, jlebar, Prazek, george.burgess.iv, jfb, jsji, llvm-commits

Differential Revision: https://reviews.llvm.org/D56625

llvm-svn: 353339

6cba96ed

[InstCombine] X | C == C --> (X & ~C) == 0 · 68bc5fb0

Sanjay Patel authored Feb 06, 2019

We should canonicalize to one of these forms,
and compare-with-zero could be more conducive
to follow-on transforms. This also leads to
generally better codegen as shown in PR40611:
https://bugs.llvm.org/show_bug.cgi?id=40611

llvm-svn: 353313

68bc5fb0

[NFC] Simplify check in guard widening · cd48ac36
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353290
```
cd48ac36
[NFC] Factor out detatchment of dead blocks from their erasing · 36b392cb
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353277
```
36b392cb
[LoopSimplifyCFG] Do not count dead exit blocks twice, make CFG simpler · a4ccfc18
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353276
```
a4ccfc18
[NFC] Revert rL353274 · 0d7ad3c9
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353275
```
0d7ad3c9
[NFC] Extend API of DeleteDeadBlock(s) to collect updates without DTU · 61e6ffc3
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353274
```
61e6ffc3
[NFC] Replace readonly SmallVectorImpl with ArrayRef · bad4db8b
Max Kazantsev authored Feb 06, 2019
```
llvm-svn: 353273
```
bad4db8b

[HotColdSplit] Move splitting after instrumented PGO use · 716abbeb

Teresa Johnson authored Feb 06, 2019

Summary:
Follow up to D57082 which moved splitting earlier in the pipeline, in
order to perform it before inlining. However, it was moved too early,
before the IR is annotated with instrumented PGO data. This caused the
splitting to incorrectly determine cold functions.

Move it to just after PGO annotation (still before inlining), in both
pass managers.

Reviewers: vsk, hiraditya, sebpop

Subscribers: mehdi_amini, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57805

llvm-svn: 353270

716abbeb

Move DomTreeUpdater from IR to Analysis · 5f436fc5

Richard Trieu authored Feb 06, 2019

DomTreeUpdater depends on headers from Analysis, but is in IR.  This is a
layering violation since Analysis depends on IR.  Relocate this code from IR
to Analysis to fix the layering violation.

llvm-svn: 353265

5f436fc5

[HotColdSplit] Do not split out `resume` instructions · bd94b428

Vedant Kumar authored Feb 05, 2019

Resumes that are not reachable from a cleanup landing pad are considered
to be unreachable. It’s not safe to split them out.

rdar://47808235

llvm-svn: 353242

bd94b428

Feb 05, 2019

[InstCombine] limit extracting shuffle transform based on uses · cddb1e54

Sanjay Patel authored Feb 05, 2019

As discussed in D53037, this can lead to worse codegen, and we
don't generally expect the backend to be able to optimize
arbitrary shuffles. If there's only one use of the 1st shuffle,
that means it's getting removed, so that should always be
safe.

llvm-svn: 353235

cddb1e54

[PGO] Use a function for creating variable for profile file name. NFC. · ce10d5ea
Rong Xu authored Feb 05, 2019
```
Factored out the code for creating variable for profile file name to
a function.

llvm-svn: 353230
```
ce10d5ea

[DebugInfo][NFCI] Split salvageDebugInfo into helper functions · 84ca706b

Jeremy Morse authored Feb 05, 2019

Some use cases are appearing where salvaging is needed that does not
correspond to an instruction being deleted -- for example an instruction
being sunk, or a Value not being available in a block being isel'd.

Enable more fine grained control over how salavging occurs by splitting
the logic into helper functions, separating things that are specific to
working on DbgVariableIntrinsics from those specific to interpreting IR
and building DIExpressions.

Differential Revision: https://reviews.llvm.org/D57696

llvm-svn: 353156

84ca706b

[LSR] Check SCEV on isZero() after extend. PR40514 · d5e595b7

Max Kazantsev authored Feb 05, 2019

When LSR first adds SCEVs to BaseRegs, it only does it if `isZero()` has
returned false. In the end, in invocation of `InsertFormula`, it asserts that
all values there are still not zero constants. However between these two
points, it makes some transformations, in particular extends them to wider
type.

SCEV does not give us guarantee that if `S` is not a constant zero, then
`sext(S)` is also not a constant zero. It might have missed some optimizing
transforms when it was calculating `S` and then made them when it took `sext`.
For example, it may happen if previously optimizing transforms were limited
by depth or somehow else.

This patch adds a bailout when we may end up with a zero SCEV after extension.

Differential Revision: https://reviews.llvm.org/D57565
Reviewed By: samparker

llvm-svn: 353136

d5e595b7

Fix narrowing issue from r353129 · a9354b2f
Richard Trieu authored Feb 05, 2019
```
llvm-svn: 353134
```
a9354b2f
[SamplePGO][NFC] Minor improvement to replace a temporary vector with a · 4901f371
Wei Mi authored Feb 05, 2019
```
brace-enclosed init list.

Differential Revision: https://reviews.llvm.org/D57726

llvm-svn: 353129
```
4901f371

[SamplePGO] Minor efficiency improvement in samplePGO ICP · 4bdf82ce

Teresa Johnson authored Feb 05, 2019

Summary:
When attaching prof metadata to promoted direct calls in SamplePGO
mode, no need to construct and use a SmallVector to pass a single count
to the ArrayRef parameter, we can simply use a brace-enclosed init list.

This made a small but consistent improvement for a ThinLTO backend
compile I was measuring.

Reviewers: wmi

Subscribers: mehdi_amini, dexonsmith, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57706

llvm-svn: 353123

4bdf82ce

Feb 04, 2019

[SanitizerCoverage] Clang crashes if user declares `__sancov_lowest_stack` variable · 29ac3a5b

Julian Lettner authored Feb 04, 2019

Summary:
If the user declares or defines `__sancov_lowest_stack` with an
unexpected type, then `getOrInsertGlobal` inserts a bitcast and the
following cast fails:
```
Constant *SanCovLowestStackConstant =
       M.getOrInsertGlobal(SanCovLowestStackName, IntptrTy);
SanCovLowestStack = cast<GlobalVariable>(SanCovLowestStackConstant);
```

This variable is a SanitizerCoverage implementation detail and the user
should generally never have a need to access it, so we emit an error
now.

rdar://problem/44143130

Reviewers: morehouse

Differential Revision: https://reviews.llvm.org/D57633

llvm-svn: 353100

29ac3a5b

[InstCombine] Cleanup the TFE/LWE check in AMDGPU SimplifyDemanded · a69146e6

Nicolai Haehnle authored Feb 04, 2019

Summary:
The fix added in r352904 is not quite correct, or rather misleading:

1. When the texfailctrl (TFC) argument was non-constant, the fix assumed
   non-TFE/LWE, which is incorrect.

2. Regardless, this code path cannot even be hit for correct
   TFE/LWE-enabled calls, because those return a struct. Added
   a test case for those for completeness.

Change-Id: I92d314dbc67a2670f6d7adaab765ef45f56a49cf

Reviewers: hliao, dstuttard, arsenm

Subscribers: kzhuravl, jvesely, wdng, yaxunl, tpr, t-tye, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57681

llvm-svn: 353097

a69146e6

[NewPM][MSan] Add Options Handling · 0ee6a933

Philip Pfaffe authored Feb 04, 2019

Summary: This patch enables passing options to msan via the passes pipeline, e.e., -passes=msan<recover;kernel;track-origins=4>.

Reviewers: chandlerc, fedor.sergeev, leonardchan

Subscribers: hiraditya, bollu, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57640

llvm-svn: 353090

0ee6a933

[WarnMissedTransforms] Do not warn about already vectorized loops. · 70560a0a

Michael Kruse authored Feb 04, 2019

LoopVectorize adds llvm.loop.isvectorized, but leaves
llvm.loop.vectorize.enable. Do not consider such a loop for user-forced
vectorization since vectorization already happened -- by prioritizing
llvm.loop.isvectorized except for TM_SuppressedByUser.

Fixes http://llvm.org/PR40546

Differential Revision: https://reviews.llvm.org/D57542

llvm-svn: 353082

70560a0a

[NFC] Make a check in GuardWidening more obvious · 56b57e3f
Max Kazantsev authored Feb 04, 2019
```
llvm-svn: 353038
```
56b57e3f
[NFC] Rename variables to reflect the actual status of GuardWidening · 09802f41
Max Kazantsev authored Feb 04, 2019
```
llvm-svn: 353036
```
09802f41
[NFC] Remove redundant parameters for better readability · 13ab5cbb
Max Kazantsev authored Feb 04, 2019
```
llvm-svn: 353034
```
13ab5cbb