Commits · a373d18eb7d718de1a2155b9d8a5d64d6f74e131 · Lorenzo Albano / LLVM bpEVL

Mar 28, 2018

Transforms: Introduce Transforms/Utils.h rather than spreading the... · a373d18e

David Blaikie authored Mar 28, 2018

Transforms: Introduce Transforms/Utils.h rather than spreading the declarations amongst Scalar.h and IPO.h

Fixes layering - Transforms/Utils shouldn't depend on including a Scalar
or IPO header, because Scalar and IPO depend on Utils.

llvm-svn: 328717

a373d18e

Mar 27, 2018

[LoopUnroll][NFC] Remove redundant canPeel check · b1ad66ff

Max Kazantsev authored Mar 27, 2018

We check `canPeel` twice: when evaluating the number of iterations to be peeled
and within the method `peelLoop` that performs peeling. This method is only
executed if the calculated peel count is positive. Thus, the check in `peelLoop` can
never fail. This patch replaces this check with an assert.

Differential Revision: https://reviews.llvm.org/D44919
Reviewed By: fhahn

llvm-svn: 328615

b1ad66ff

Mar 26, 2018

[LoopUnroll] Fix dangling pointers in SCEV · a5574931

Max Kazantsev authored Mar 26, 2018

Current logic of loop SCEV invalidation in Loop Unroller implicitly relies on
fact that exit count of outer loops cannot rely on exiting blocks of
inner loops, which is true in current implementation of backedge taken count
calculation but is wrong in general. As result, when we only forget the loop that
we have just unrolled, we may still have cached data for its outer loops (in particular,
exit counts) which keeps references on blocks of inner loop that could have been
changed or even deleted.

The attached test demonstrates a situaton when after unrolling of innermost loop
the outermost loop contains a dangling pointer on non-existant block. The problem
shows up when we apply patch https://reviews.llvm.org/D44677 that makes SCEV
smarter about exit count calculation. I am not sure if the bug exists without this patch,
it appears that now it is accidentally correct just because in practice exact backedge
taken count for outer loops with complex control flow inside is never calculated.
But when SCEV learns to do so, this problem shows up.

This patch replaces existing logic of SCEV loop invalidation with a correct one, which
happens to be invalidation of outermost loop (which also leads to invalidation of all
loops inside of it). It is the only way to ensure that no outer loop keeps dangling pointers
on removed blocks, or just outdated information that has changed after unrolling.

Differential Revision: https://reviews.llvm.org/D44818
Reviewed By: samparker

llvm-svn: 328483

a5574931

Mar 24, 2018

Remove unused header from EntryExitInstrumenter · 53f51c1d

David Blaikie authored Mar 24, 2018

Fixes layering, since Transforms/Utils doesn't depend on CodeGen, so
shouldn't include headers from it.

llvm-svn: 328399

53f51c1d

Mar 23, 2018

[LoopUnroll] Simplify induction variables after peeling too. · 52436a58

Florian Hahn authored Mar 23, 2018

Loop peeling also has an impact on the induction variables, so we should
benefit from induction variable simplification after peeling too.

Reviewers: sanjoy, bogner, mzolotukhin, efriedma

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D43878

llvm-svn: 328301

52436a58

Mar 22, 2018

Move the initialization of the Meta Renamer pass over to IPO along with the... · 2965a01e

David Blaikie authored Mar 22, 2018

Move the initialization of the Meta Renamer pass over to IPO along with the rest of it that was moved in r328209

llvm-svn: 328234

2965a01e

[SimplifyCFG] Create attribute for fuzzing-specific optimizations. · 236cdaf8

Matt Morehouse authored Mar 22, 2018

Summary:
When building with libFuzzer, converting control flow to selects or
obscuring the original operands of CMPs reduces the effectiveness of
libFuzzer's heuristics.

This patch provides an attribute to disable or modify certain optimizations
for optimal fuzzing signal.

Provides a less aggressive alternative to https://reviews.llvm.org/D44057.

Reviewers: vitalybuka, davide, arsenm, hfinkel

Reviewed By: vitalybuka

Subscribers: junbuml, mehdi_amini, wdng, javed.absar, hiraditya, llvm-commits, kcc

Differential Revision: https://reviews.llvm.org/D44232

llvm-svn: 328214

236cdaf8

Move MetaRenamer from Transforms/UTils to Transforms/IPO since it implements part of IPO.h · 03684175
David Blaikie authored Mar 22, 2018
```
llvm-svn: 328209
```
03684175

[CloneFunction] Preserve DT in DuplicateInstructionsInSplitBetween. · 3bb822e7

Florian Hahn authored Mar 22, 2018

DuplicateInstructionsInSplitBetween can preserve the DT by passing
through DT to SplitEdge.

Reviewers: sanjoy, junbuml, anna, kuhar

Reviewed By: kuhar

Differential Revision: https://reviews.llvm.org/D44629

llvm-svn: 328189

3bb822e7

Mar 21, 2018

Fix a couple of layering violations in Transforms · 2be39228

David Blaikie authored Mar 21, 2018

Remove #include of Transforms/Scalar.h from Transform/Utils to fix layering.

Transforms depends on Transforms/Utils, not the other way around. So
remove the header and the "createStripGCRelocatesPass" function
declaration (& definition) that is unused and motivated this dependency.

Move Transforms/Utils/Local.h into Analysis because it's used by
Analysis/MemoryBuiltins.cpp.

llvm-svn: 328165

2be39228

Mar 20, 2018

[MustExecute] Move isGuaranteedToExecute and related rourtines to Analysis · 23aed5ef

Philip Reames authored Mar 20, 2018

Next step is to actually merge the implementations and get both implementations tested through the new printer.

llvm-svn: 328055

23aed5ef

Mar 17, 2018

[X86] Added support for nocf_check attribute for indirect Branch Tracking · fdd72fd5

Oren Ben Simhon authored Mar 17, 2018

X86 Supports Indirect Branch Tracking (IBT) as part of Control-Flow Enforcement Technology (CET).
IBT instruments ENDBR instructions used to specify valid targets of indirect call / jmp.
The `nocf_check` attribute has two roles in the context of X86 IBT technology:
1. Appertains to a function - do not add ENDBR instruction at the beginning of the function.
2. Appertains to a function pointer - do not track the target function of this pointer by adding nocf_check prefix to the indirect-call instruction.

This patch implements `nocf_check` context for Indirect Branch Tracking.
It also auto generates `nocf_check` prefixes before indirect branchs to jump tables that are guarded by range checks.

Differential Revision: https://reviews.llvm.org/D41879

llvm-svn: 327767

fdd72fd5

Mar 16, 2018

[LICM/mustexec] Extend first iteration must execute logic to fcmps · 8a106272

Philip Reames authored Mar 16, 2018

This builds on the work from https://reviews.llvm.org/D44287. It turned out supporting fcmp was much easier than I realized, so let's do that now.

As an aside, our -O3 handling of a floating point IVs leaves a lot to be desired. We do convert the float IV to an integer IV, but do so late enough that many other optimizations are missed (e.g. we don't vectorize).

Differential Revision: https://reviews.llvm.org/D44542

llvm-svn: 327722

8a106272

Mar 15, 2018

[LoopUnroll] Peel off iterations if it makes conditions true/false. · fc97b617

Florian Hahn authored Mar 15, 2018

If the loop body contains conditions of the form IndVar < #constant, we
can remove the checks by peeling off #constant iterations.

This improves codegen for PR34364.

Reviewers: mkuper, mkazantsev, efriedma

Reviewed By: mkazantsev

Differential Revision: https://reviews.llvm.org/D43876

llvm-svn: 327671

fc97b617

[LICM] Ignore exits provably not taken on first iteration when computing must execute · a21d5f1e

Philip Reames authored Mar 15, 2018

It is common to have conditional exits within a loop which are known not to be taken on some iterations, but not necessarily all. This patches extends our reasoning around guaranteed to execute (used when establishing whether it's safe to dereference a location from the preheader) to handle the case where an exit is known not to be taken on the first iteration and the instruction of interest *is* known to be taken on the first iteration.

This case comes up in two major ways:
* If we have a range check which we've been unable to eliminate, we frequently know that it doesn't fail on the first iteration.
* Pass ordering. We may have a check which will be eliminated through some sequence of other passes, but depending on the exact pass sequence we might never actually do so or we might miss other optimizations from passes run before the check is finally eliminated.

The initial version (here) is implemented via InstSimplify. At the moment, it catches a few cases, but misses a lot too. I added test cases for missing cases in InstSimplify which I'll follow up on separately. Longer term, we should probably wire SCEV through to here to get much smarter loop aware simplification of the first iteration predicate.

Differential Revision: https://reviews.llvm.org/D44287

llvm-svn: 327664

a21d5f1e

[Debug] Retain both copies of debug intrinsics in HoistThenElseCodeToIf · f4ceef8d

Ulrich Weigand authored Mar 15, 2018

When hoisting common code from the "then" and "else" branches of a condition
to before the "if", the HoistThenElseCodeToIf routine will attempt to merge
the debug location associated with the two original copies of the hoisted
instruction.

This is a problem in the special case where the hoisted instruction is a
debug info intrinsic, since for those the debug location is considered
part of the intrinsic and attempting to modify it may resut in invalid
IR.  This is the underlying cause of PR36410.

This patch fixes the problem by handling debug info intrinsics specially:
instead of hoisting one copy and merging the two locations, the code now
simply hoists both copies, each with its original location intact.  Note
that this is still only done in the case where both original copies are
otherwise (i.e. apart from location metadata) identical.

Reviewed By: aprantl

Differential Revision: https://reviews.llvm.org/D44312

llvm-svn: 327622

f4ceef8d

Mar 13, 2018

[ThinLTO] Clear dllimport when setting dso_local. · f5220fb6

Rafael Espindola authored Mar 13, 2018

This is PR36686.

If a user of a library is LTOed with that library we take the
opportunity to set dso_local, but we don't clear dllimport, which
creates an invalid IR.

llvm-svn: 327408

f5220fb6

[Evaluator] Evaluate load/store with bitcast · 6f42a2cd
Eugene Leviant authored Mar 13, 2018
```
Differential revision: https://reviews.llvm.org/D43457

llvm-svn: 327381
```
6f42a2cd

Mar 09, 2018

Revert "[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf" · 019dd231
Ulrich Weigand authored Mar 09, 2018
```
This reverts commit r327175 as problems in debug info generation were shown.

llvm-svn: 327176
```
019dd231

[Debug] Retain both sets of debug intrinsics in HoistThenElseCodeToIf · fa4e63c0

Ulrich Weigand authored Mar 09, 2018

When hoisting common code from the "then" and "else" branches of a condition
to before the "if", there is no need to require that debug intrinsics match
before moving them (and merging them).  Instead, we can simply always keep
all debug intrinsics from both sides of the "if".

This fixes PR36410, which describes a problem where as a result of the attempt
to merge debug locations for two debug intrinsics we end up with an invalid
intrinsic, where the scope indicated in the !dbg location no longer matches
the scope of the variable tracked by the intrinsic.

In addition, this has the benefit that we no longer throw away information
that is actually still valid, helping to generate better debug data.

Reviewed By: vsk

Differential Revision: https://reviews.llvm.org/D44312

llvm-svn: 327175

fa4e63c0

LowerDbgDeclare: ignore dbg.declares for allocas with volatile access · 5b477be7

Adrian Prantl authored Mar 09, 2018

There is no point in lowering a dbg.declare describing an alloca that
has volatile loads or stores as users, since the alloca cannot be
elided. Lowering the dbg.declare will result in larger debug info that
may also have worse coverage than just describing the alloca.

rdar://problem/34496278

llvm-svn: 327092

5b477be7

Mar 08, 2018
- [NFC] Factor out a helper function for checking if a block has a potential early implicit exit. · fbffd126
  Philip Reames authored Mar 08, 2018
```
llvm-svn: 327065
```
  fbffd126
Mar 06, 2018

[CloneFunction] Support BB == PredBB in DuplicateInstructionsInSplit. · f0a25f72

Florian Hahn authored Mar 06, 2018

In case PredBB == BB and StopAt == BB's terminator, StopAt != &*BI will
fail, because BB's terminator instruction gets replaced.

By using BB.getTerminator() we get the current terminator which we can use
to compare.

Reviewers: sanjoy, anna, reames

Reviewed By: anna

Differential Revision: https://reviews.llvm.org/D43822

llvm-svn: 326779

f0a25f72

Mar 02, 2018

[Utils] Salvage debug info in block simplification · f69baf64

Vedant Kumar authored Mar 02, 2018

In stage2 -O3 builds of llc, this results in small but measurable
increases in the number of variables with locations, and in the number
of unique source variables overall.

(According to llvm-dwarfdump --statistics, there are 123 additional
variables with locations, which is just a 0.006% improvement).

The size of the .debug_loc section of the llc dsym increases by 0.004%.

llvm-svn: 326629

f69baf64

[Utils] Salvage debug info in recursive inst deletion · 334fa574

Vedant Kumar authored Mar 02, 2018

In stage2 -O3 builds of llc, this results in a 0.3% increase in the
number of variables with locations, and a 0.2% increase in the number of
unique source variables overall.

The size of the .debug_loc section of the llc dsym increases by 0.5%.

llvm-svn: 326621

334fa574

Mar 01, 2018
- [SimplifyLibCalls] Update an obviously copy and pasted header comment to match this file. NFC · 2915bc00
  Craig Topper authored Mar 01, 2018
```
llvm-svn: 326475
```
  2915bc00
Feb 28, 2018

[Dominators] Remove verifyDomTree and add some verifying for Post Dom Trees · 7c35de12

David Green authored Feb 28, 2018

Removes verifyDomTree, using assert(verify()) everywhere instead, and
changes verify a little to always run IsSameAsFreshTree first in order
to print good output when we find errors. Also adds verifyAnalysis for
PostDomTrees, which will allow checking of PostDomTrees it the same way
we check DomTrees and MachineDomTrees.

Differential Revision: https://reviews.llvm.org/D41298

llvm-svn: 326315

7c35de12

Feb 23, 2018

[Debug] Add dbg.value intrinsics for PHIs created during LCSSA. · 523c656e

Matt Davis authored Feb 23, 2018

Summary:
This patch is an enhancement to propagate dbg.value information when Phis are created on behalf of LCSSA.
I noticed a case where a value carried across a loop was reported as <optimized out>.

Specifically this case:
```
int bar(int x, int y) {
  return x + y;
}

int foo(int size) {
  int val = 0;
  for (int i = 0; i < size; ++i) {
    val = bar(val, i);  // Both val and i are correct
  }
  return val; // <optimized out>
}
```

In the above case, after all of the interesting computation completes our value
is reported as "optimized out." This change will add a dbg.value to correct this.

This patch also moves the dbg.value insertion routine from LoopRotation.cpp 
into Local.cpp, so that we can share it in both places (LoopRotation and LCSSA).

Reviewers: mzolotukhin, aprantl, vsk, davide

Reviewed By: aprantl, vsk

Subscribers: dberlin, llvm-commits

Differential Revision: https://reviews.llvm.org/D42551

llvm-svn: 325926

523c656e

Feb 22, 2018

[Utils] Avoid a hash table lookup in salvageDI, NFC · 1ceabcf0

Vedant Kumar authored Feb 22, 2018

According to the current coverage report salvageDebugInfo() is called
5.12 million times during testing and almost always returns early.

The early return depends on LocalAsMetadata::getIfExists returning null,
which involves a DenseMap lookup in an LLVMContextImpl. We can probably
speed this up by simply checking the IsUsedByMD bit in Value.

llvm-svn: 325738

1ceabcf0

Feb 19, 2018

Revert "[mem2reg] Use range loops (NFCI)" · d1eabb18
Brian Gesiak authored Feb 19, 2018
```
This reverts commit r325532.

llvm-svn: 325539
```
d1eabb18

[mem2reg] Use range loops (NFCI) · 49a9d1a4

Brian Gesiak authored Feb 19, 2018

Summary:
Several for loops in PromoteMemoryToRegister.cpp leave their increment
expression empty, instead incrementing the iterator within the for loop
body. I believe this is because these loops were previously implemented
as while loops; see https://reviews.llvm.org/rL188327.

Incrementing the iterator within the body of the for loop instead of
in its increment expression makes it seem like the iterator will be
modified or conditionally incremented within the loop, but that is not
the case in these loops.

Instead, use range loops.

Test Plan: `check-llvm`

Reviewers: davide, bkramer

Reviewed By: davide, bkramer

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D43473

llvm-svn: 325532

49a9d1a4

Feb 15, 2018
- [Utils] salvageDI: Add a comment and move a call earlier, NFC · 044b5889
  Vedant Kumar authored Feb 15, 2018
```
llvm-svn: 325280
```
  044b5889
Feb 14, 2018

Pass a module reference to CloneModule. · 71867532

Rafael Espindola authored Feb 14, 2018

It can never be null and most callers were already using references or
std::unique_ptr.

llvm-svn: 325160

71867532

Move llvm::computeLoopSafetyInfo from LICM.cpp to LoopUtils.cpp. NFC · 0d5f9651

David Green authored Feb 14, 2018

Move computeLoopSafetyInfo, defined in Transforms/Utils/LoopUtils.h,
into the corresponding LoopUtils.cpp, as opposed to LICM where it resides
at the moment. This will allow other functions from Transforms/Utils
to reference it.

llvm-svn: 325151

0d5f9651

[Utils] Salvage the debug info of DCE'ed 'and' instructions · 1768957c

Petar Jovanovic authored Feb 14, 2018

Preserve debug info from a dead 'and' instruction with a constant.

Patch by Djordje Todorovic.

Differential Revision: https://reviews.llvm.org/D43163

llvm-svn: 325119

1768957c

Adding a width of the GEP index to the Data Layout. · 945b7e5a

Elena Demikhovsky authored Feb 14, 2018

Making a width of GEP Index, which is used for address calculation, to be one of the pointer properties in the Data Layout.
p[address space]:size:memory_size:alignment:pref_alignment:index_size_in_bits.
The index size parameter is optional, if not specified, it is equal to the pointer size.

Till now, the InstCombiner normalized GEPs and extended the Index operand to the pointer width.
It works fine if you can convert pointer to integer for address calculation and all registered targets do this.
But some ISAs have very restricted instruction set for the pointer calculation. During discussions were desided to retrieve information for GEP index from the Data Layout.
http://lists.llvm.org/pipermail/llvm-dev/2018-January/120416.html

I added an interface to the Data Layout and I changed the InstCombiner and some other passes to take the Index width into account.
This change does not affect any in-tree target. I added tests to cover data layouts with explicitly specified index size.

Differential Revision: https://reviews.llvm.org/D42123

llvm-svn: 325102

945b7e5a

Feb 13, 2018

[Utils] Salvage debug info from all no-op casts · 388fac5d

Vedant Kumar authored Feb 13, 2018

We already try to salvage debug values from no-op bitcasts and inttoptr
instructions: we should handle ptrtoint instructions as well.

This saves an additional 24,444 debug values in a stage2 build of clang,
and (according to llvm-dwarfdump --statistics) provides an additional
289 unique source variables.

llvm-svn: 324982

388fac5d

[Utils] Salvage debug info of DCE'ed mul/sdiv/srem instructions · 4011c26c

Vedant Kumar authored Feb 13, 2018

Here are the number of additional debug values salvaged in a stage2
build of clang:

  63 SALVAGE: MUL
  1250 SALVAGE: SDIV

(No values were salvaged from `srem` instructions in this experiment,
but it's a simple case to handle so we might as well.)

llvm-svn: 324976

4011c26c

[Utils] Salvage debug info of DCE'ed shl/lhsr/ashr instructions · 31ec356a

Vedant Kumar authored Feb 13, 2018

Here are the number of additional debug values salvaged in a stage2
build of clang:

  1912 SALVAGE: ASHR
   405 SALVAGE: LSHR
   249 SALVAGE: SHL

llvm-svn: 324975

31ec356a

[Utils] Salvage the debug info of DCE'ed 'sub' instructions · 47b16c45
Vedant Kumar authored Feb 13, 2018
```
This salvages 14 debug values in a stage2 build of clang.

llvm-svn: 324974
```
47b16c45