Commits · 3aade11252bc574105dc5c401b09f2a2efcda0ae · Lorenzo Albano / LLVM bpEVL

Aug 03, 2016

Add -lowertypetests-bitsets-level to control bitsets generation. · 3aade112

Ivan Krasin authored Aug 03, 2016

Summary:
Sometimes, bitsets could get really large (>300k entries) and
we might want to drop a check, as it would have a too much cost.

Adding a flag to control how much penalty are we willing to pay
for bitsets.

Reviewers: kcc

Differential Revision: https://reviews.llvm.org/D23088

llvm-svn: 277556

3aade112

Support for lifetime begin/end markers in the MemorySSA use optimizer · df10119e

Daniel Berlin authored Aug 03, 2016

Summary: Depends on D23072

Reviewers: george.burgess.iv

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D23076

llvm-svn: 277553

df10119e

[InstCombine] replace dyn_casts with matches; NFCI · ab50a938
Sanjay Patel authored Aug 02, 2016
```
Clean-up before changing this to allow folds for vectors.

llvm-svn: 277538
```
ab50a938

Imported statistics types changes · 47509f61

Piotr Padlewski authored Aug 02, 2016

Reviewers: tejohnson, eraman

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D22980

llvm-svn: 277534

47509f61

Aug 02, 2016

Move to having a single real instructionClobbersQuery · dff31deb

Daniel Berlin authored Aug 02, 2016

Summary: We really want to move towards MemoryLocOrCall (or fix AA) everywhere, but for now, this lets us have a single instructionClobbersQuery.

Reviewers: george.burgess.iv

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D23072

llvm-svn: 277530

dff31deb

[LoopUnroll] Switch the default value of -unroll-runtime-epilog back to its original value. · b2738e41

Michael Zolotukhin authored Aug 02, 2016

As agreed in post-commit review of r265388, I'm switching the flag to
its original value until the 90% runtime performance regression on
SingleSource/Benchmarks/Stanford/Bubblesort is addressed.

llvm-svn: 277524

b2738e41

[LoopVectorize] Change comment for isOutOfScope in collectLoopUniforms, NFC · dc7001af

Wei Mi authored Aug 02, 2016

Update comment for isOutOfScope and add a testcase for uniform value being used
out of scope.

Differential Revision: https://reviews.llvm.org/D23073

llvm-svn: 277515

dc7001af

Fixes for post-commit review comments on r277480 · 26fcea91
Daniel Berlin authored Aug 02, 2016
```
llvm-svn: 277510
```
26fcea91
[IRCE] Rename variable; NFC · 83a72850
Sanjoy Das authored Aug 02, 2016
```
There is nothing "Original" about "OriginalLoopInfo".

llvm-svn: 277506
```
83a72850

[IRCE] Preserve DomTree and LCSSA · f45e03e2

Sanjoy Das authored Aug 02, 2016

This changes IRCE to "preserve" LCSSA and DomTree by recomputing them.
It still does not preserve LoopSimplify.

llvm-svn: 277505

f45e03e2

[LoopUnroll] Ensure we create prolog loops in simplified form. · d9b6ad3c
Michael Zolotukhin authored Aug 02, 2016
```
llvm-svn: 277502
```
d9b6ad3c
MSVC 2013 does not implement C++11 unions properly, so remove the anoymous union for now, · de4be653
Daniel Berlin authored Aug 02, 2016
```
and leave a FIXME.

llvm-svn: 277485
```
de4be653

Rewrite the use optimizer to be less memory intensive and 50% faster. · c43aa5a5

Daniel Berlin authored Aug 02, 2016

Fixes PR28670

Summary:
Rewrite the use optimizer to be less memory intensive and 50% faster.
Fixes PR28670

The new use optimizer works like a standard SSA renaming pass, storing
all possible versions a MemorySSA use could get in a stack, and just
tracking indexes into the stack.
This uses much less memory than caching N^2 alias query results.
It's also a lot faster.

The current version defers phi node walking to the normal walker.

Reviewers: george.burgess.iv

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D23032

llvm-svn: 277480

c43aa5a5

[LV] Generate both scalar and vector integer induction variables · 18d88983

Matthew Simpson authored Aug 02, 2016

This patch enables the vectorizer to generate both scalar and vector versions
of an integer induction variable for a given loop. Previously, we only
generated a scalar induction variable if we knew all its users were going to be
scalar. Otherwise, we generated a vector induction variable. In the case of a
loop with both scalar and vector users of the induction variable, we would
generate the vector induction variable and extract scalar values from it for
the scalar users. With this patch, we now generate both versions of the
induction variable when there are both scalar and vector users and select which
version to use based on whether the user is scalar or vector.

Differential Revision: https://reviews.llvm.org/D22869

llvm-svn: 277474

18d88983

[LV] Untangle the concepts of uniform and scalar · 58f56288

Matthew Simpson authored Aug 02, 2016

This patch refactors the logic in collectLoopUniforms and
collectValuesToIgnore, untangling the concepts of "uniform" and "scalar". It
adds isScalarAfterVectorization along side isUniformAfterVectorization to
distinguish the two. Known scalar values include those that are uniform,
getelementptr instructions that won't be vectorized, and induction variables
and induction variable update instructions whose users are all known to be
scalar.

This patch includes the following functional changes:

- In collectLoopUniforms, we mark uniform the pointer operands of interleaved
  accesses. Although non-consecutive, these pointers are treated like
  consecutive pointers during vectorization.

- In collectValuesToIgnore, we insert a value into VecValuesToIgnore if it
  isScalarAfterVectorization rather than isUniformAfterVectorization. This
  differs from the previous functionaly in that we now add getelementptr
  instructions that will not be vectorized into VecValuesToIgnore.

This patch also removes the ValuesNotWidened set used for induction variable
scalarization since, after the above changes, it is now equivalent to
isScalarAfterVectorization.

Differential Revision: https://reviews.llvm.org/D22867

llvm-svn: 277460

58f56288

[LoadStoreVectorizer] Don't use a linear walk for an existence check in a SmallPtrSet · a0053cc0
Benjamin Kramer authored Aug 02, 2016
```
No functionality change intended.

llvm-svn: 277436
```
a0053cc0
Minor code cleanups. NFC. · db8f6eeb
Junmo Park authored Aug 02, 2016
```
llvm-svn: 277415
```
db8f6eeb

CodeExtractor : Add ability to preserve profile data. · f801575f

Sean Silva authored Aug 02, 2016

Added ability to estimate the entry count of the extracted function and
the branch probabilities of the exit branches.

Patch by River Riddle!

Differential Revision: https://reviews.llvm.org/D22744

llvm-svn: 277411

f801575f

[ADT] NFC: Generalize GraphTraits requirement of "NodeType *" in interfaces to... · b44909ec

Tim Shen authored Aug 01, 2016

[ADT] NFC: Generalize GraphTraits requirement of "NodeType *" in interfaces to "NodeRef", and migrate SCCIterator.h to use NodeRef

Summary: By generalize the interface, users are able to inject more flexible Node token into the algorithm, for example, a pair of vector<Node>* and index integer. Currently I only migrated SCCIterator to use NodeRef, but more is coming. It's a NFC.

Reviewers: dblaikie, chandlerc

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D22937

llvm-svn: 277399

b44909ec

[WebAssembly] Support CFI for WebAssembly target · c64d7655

Derek Schuff authored Aug 01, 2016

Summary: This patch implements CFI for WebAssembly. It modifies the
LowerTypeTest pass to pre-assign table indexes to functions that are
called indirectly, and lowers type checks to test against the
appropriate table indexes. It also modifies the WebAssembly backend to
support a special ".indidx" assembly directive that propagates the table
index assignments out to the linker.

Patch by Dominic Chen

Differential Revision: https://reviews.llvm.org/D21768

llvm-svn: 277398

c64d7655

Aug 01, 2016

[PM] Port SpeculativeExecution to the new PM · c4061861
Michael Kuperstein authored Aug 01, 2016
```
Differential Revision: https://reviews.llvm.org/D23033

llvm-svn: 277393
```
c4061861
[Profile] IR profiling minor cleanup /nfc · d119761b
Xinliang David Li authored Aug 01, 2016
```
Differential Revision: http://reviews.llvm.org/D22995

llvm-svn: 277379
```
d119761b
[LV] Move isGatherOrScatterLegal into LoopVectorizationLegality (NFC) · 228f9731
Matthew Simpson authored Aug 01, 2016
```
llvm-svn: 277376
```
228f9731
[LV] Use getPointerOperand helper where appropriate (NFC) · 1ce88ff6
Matthew Simpson authored Aug 01, 2016
```
llvm-svn: 277375
```
1ce88ff6

[SimplifyCFG] Fix nasty RAUW bug from r277325 · bade86ce

James Molloy authored Aug 01, 2016

Using RAUW was wrong here; if we have a switch transform such as:
  18 -> 6 then
  6 -> 0

If we use RAUW, while performing the second transform the  *transformed* 6
from the first will be also replaced, so we end up with:
  18 -> 0
  6 -> 0

Found by clang stage2 bootstrap; testcase added.

llvm-svn: 277332

bade86ce

[SimplifyCFG] Range reduce switches · b2e436de

James Molloy authored Aug 01, 2016

If a switch is sparse and all the cases (once sorted) are in arithmetic progression, we can extract the common factor out of the switch and create a dense switch. For example:

    switch (i) {
    case 5: ...
    case 9: ...
    case 13: ...
    case 17: ...
    }

can become:

    if ( (i - 5) % 4 ) goto default;
    switch ((i - 5) / 4) {
    case 0: ...
    case 1: ...
    case 2: ...
    case 3: ...
    }

or even better:

   switch ( ROTR(i - 5, 2) {
   case 0: ...
   case 1: ...
   case 2: ...
   case 3: ...
   }

The division and remainder operations could be costly so we only do this if the factor is a power of two, and emit a right-rotate instead of a divide/remainder sequence. Dense switches can be lowered significantly better than sparse switches and can even be transformed into lookup tables.

llvm-svn: 277325

b2e436de

Revert r277313 and r277314. · 423c7149

Sean Silva authored Aug 01, 2016

They seem to trigger an LSan failure:
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/15140/steps/check-llvm%20asan/logs/stdio

Revert "Add the tests for r277313"

This reverts commit r277314.

Revert "CodeExtractor : Add ability to preserve profile data."

This reverts commit r277313.

llvm-svn: 277317

423c7149

Fix - CodeExtractor : Inherit Target Dependent Attributes from the parent function. · a0a802ab

Sean Silva authored Aug 01, 2016

When extracting a set of blocks make sure to inherit all of the target
dependent attributes to make sure that the function will be valid for
lowering. One example is the "target-features" attribute for x86, if the
extracted region has functionality that relies on a specific feature it
will fail to be lowered.
This also allows for extracted functions to be valid for inlining, at
least back into the parent function, as the target attributes are tested
when inlining for compatibility.

Patch by River Riddle!

Differential Revision: https://reviews.llvm.org/D22713

llvm-svn: 277315

a0a802ab

CodeExtractor : Add ability to preserve profile data. · 62089243

Sean Silva authored Aug 01, 2016

Added ability to estimate the entry count of the extracted function and
the branch probabilities of the exit branches.

Patch by River Riddle!

Differential Revision: https://reviews.llvm.org/D22744

llvm-svn: 277313

62089243

Jul 31, 2016
- Fix the MemorySSA updating API to enable people to create memory accesses before removing old ones · 5130cc83
  Daniel Berlin authored Jul 31, 2016
```
llvm-svn: 277309
```
  5130cc83
Jul 29, 2016

[LoopUnroll] Include hotness of region in opt remark · 12937c36

Adam Nemet authored Jul 29, 2016

LoopUnroll is a loop pass, so the analysis of OptimizationRemarkEmitter
is added to the common function analysis passes that loop passes
depend on.

The BFI and indirectly BPI used in this pass is computed lazily so no
overhead should be observed unless -pass-remarks-with-hotness is used.

This is how the patch affects the O3 pipeline:

         Dominator Tree Construction
         Natural Loop Information
         Canonicalize natural loops
         Loop-Closed SSA Form Pass
         Basic Alias Analysis (stateless AA impl)
         Function Alias Analysis Results
         Scalar Evolution Analysis
+        Lazy Branch Probability Analysis
+        Lazy Block Frequency Analysis
+        Optimization Remark Emitter
         Loop Pass Manager
           Rotate Loops
           Loop Invariant Code Motion
           Unswitch loops
         Simplify the CFG
         Dominator Tree Construction
         Basic Alias Analysis (stateless AA impl)
         Function Alias Analysis Results
         Combine redundant instructions
         Natural Loop Information
         Canonicalize natural loops
         Loop-Closed SSA Form Pass
         Scalar Evolution Analysis
+        Lazy Branch Probability Analysis
+        Lazy Block Frequency Analysis
+        Optimization Remark Emitter
         Loop Pass Manager
           Induction Variable Simplification
           Recognize loop idioms
           Delete dead loops
           Unroll loops
...

llvm-svn: 277203

12937c36

Recommitting r275284: add support to inline __builtin_mempcpy · b99d1cc7

Andrew Kaylor authored Jul 29, 2016

Patch by Sunita Marathe

Third try, now following fixes to MSan to handle mempcy in such a way that this commit won't break the MSan buildbots. (Thanks, Evegenii!)

llvm-svn: 277189

b99d1cc7

[EarlyCSE] Correctly handle simplified, but live, instructions · 130b9f99

David Majnemer authored Jul 29, 2016

Some instructions may have their uses replaced with a symbolic constant.
However, the instruction may still have side effects which percludes it
from being removed from the function.  EarlyCSE treated such an
instruction as if it were removed, resulting in PR28763.

llvm-svn: 277114

130b9f99

[ConstnatFolding] Teach the folder how to fold ConstantVector · d536f232

David Majnemer authored Jul 29, 2016

A ConstantVector can have ConstantExpr operands and vice versa.
However, the folder had no ability to fold ConstantVectors which, in
some cases, was an optimization barrier.

Instead, rephrase the folder in terms of Constants instead of
ConstantExprs and teach callers how to deal with failure.

llvm-svn: 277099

d536f232

Added ThinLTO inlining statistics · 84abc74f

Piotr Padlewski authored Jul 29, 2016

Summary:
copypasta doc of ImportedFunctionsInliningStatistics class
 \brief Calculate and dump ThinLTO specific inliner stats.
 The main statistics are:
 (1) Number of inlined imported functions,
 (2) Number of imported functions inlined into importing module (indirect),
 (3) Number of non imported functions inlined into importing module
 (indirect).
 The difference between first and the second is that first stat counts
 all performed inlines on imported functions, but the second one only the
 functions that have been eventually inlined to a function in the importing
 module (by a chain of inlines). Because llvm uses bottom-up inliner, it is
 possible to e.g. import function `A`, `B` and then inline `B` to `A`,
 and after this `A` might be too big to be inlined into some other function
 that calls it. It calculates this statistic by building graph, where
 the nodes are functions, and edges are performed inlines and then by marking
 the edges starting from not imported function.

 If `Verbose` is set to true, then it also dumps statistics
 per each inlined function, sorted by the greatest inlines count like
 - number of performed inlines
 - number of performed inlines to importing module

Reviewers: eraman, tejohnson, mehdi_amini

Subscribers: mehdi_amini, llvm-commits

Differential Revision: https://reviews.llvm.org/D22491

llvm-svn: 277089

84abc74f

[sanitizer] Simplify and future-proof maybeMarkSanitizerLibraryCallNoBuiltin(). · d240a889

Evgeniy Stepanov authored Jul 28, 2016

Sanitizers set nobuiltin attribute on certain library functions to
avoid a situation where such function is neither instrumented nor
intercepted.

At the moment the list of interesting functions is hardcoded. This
change replaces it with logic based on
TargetLibraryInfo::hasOptimizedCodegen and the presense of readnone
function attribute (sanitizers are generally interested in memory
behavior of library functions).

This is expected to be a no-op change: the new logic matches exactly
the same set of functions.

r276771 (currently reverted) added mempcpy() to the list, breaking
MSan tests. With this change, r276771 can be safely re-landed.

llvm-svn: 277086

d240a889

Do not remove empty lifetime.start/lifetime.end ranges · 0ab23cf1

Vitaly Buka authored Jul 28, 2016

Summary:
Asan stack-use-after-scope check should poison alloca even if there is
no access between start and end.

This is possible for code like this:
for (int i = 0; i < 3; i++) {
  int x;
  p = &x;
}

"Loop Invariant Code Motion" will move "p = &x;" out of the loop, making
start/end range empty.

PR27453

Reviewers: eugenis

Differential Revision: https://reviews.llvm.org/D22842

llvm-svn: 277072

0ab23cf1

Should be committed as one CL. · 2fae6a77
Vitaly Buka authored Jul 28, 2016
```
This reverts commits r277068 r277067 r277066.

llvm-svn: 277071
```
2fae6a77

[asan] Add const into few methods · 21a9e573

Vitaly Buka authored Jul 28, 2016

Summary: No functional changes

Reviewers: eugenis

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D22899

llvm-svn: 277069

21a9e573

Do not remove empty lifetime.start/lifetime.end ranges · f0500b6a

Vitaly Buka authored Jul 28, 2016

Summary:
Asan stack-use-after-scope check should poison alloca even if there is
no access between start and end.

This is possible for code like this:
for (int i = 0; i < 3; i++) {
  int x;
  p = &x;
}

"Loop Invariant Code Motion" will move "p = &x;" out of the loop, making
start/end range empty.

PR27453

Reviewers: eugenis

Differential Revision: https://reviews.llvm.org/D22842

llvm-svn: 277068

f0500b6a