Commits · a373d18eb7d718de1a2155b9d8a5d64d6f74e131 · Lorenzo Albano / LLVM bpEVL

Mar 28, 2018

Transforms: Introduce Transforms/Utils.h rather than spreading the... · a373d18e

David Blaikie authored Mar 28, 2018

Transforms: Introduce Transforms/Utils.h rather than spreading the declarations amongst Scalar.h and IPO.h

Fixes layering - Transforms/Utils shouldn't depend on including a Scalar
or IPO header, because Scalar and IPO depend on Utils.

llvm-svn: 328717

a373d18e

[MSan] Introduce ActualFnStart. NFC · 4e7ad080

Alexander Potapenko authored Mar 28, 2018

This is a step towards the upcoming KMSAN implementation patch.
KMSAN is going to prepend a special basic block containing
tool-specific calls to each function. Because we still want to
instrument the original entry block, we'll need to store it in
ActualFnStart.

For MSan this will still be F.getEntryBlock(), whereas for KMSAN
it'll contain the second BB.

llvm-svn: 328697

4e7ad080

[MSan] Add an isStore argument to getShadowOriginPtr(). NFC · e1d58778

Alexander Potapenko authored Mar 28, 2018

This is a step towards the upcoming KMSAN implementation patch.
The isStore argument is to be used by getShadowOriginPtrKernel(),
it is ignored by getShadowOriginPtrUserspace().

Depending on whether a memory access is a load or a store, KMSAN
instruments it with different functions, __msan_metadata_ptr_for_load_X()
and __msan_metadata_ptr_for_store_X().

Those functions may return different values for a single address,
which is necessary in the case the runtime library decides to ignore
particular accesses.

llvm-svn: 328692

e1d58778

Mar 27, 2018

80-line wrap. NFC · 0272cb07
Xin Tong authored Mar 27, 2018
```
llvm-svn: 328660
```
0272cb07

[PGO] Fix branch probability remarks assert · 662f38b1

Rong Xu authored Mar 27, 2018

Fixed counter/weight overflow that leads to an assertion. Also fixed the help
string for pgo-emit-branch-prob option.

Differential Revision: https://reviews.llvm.org/D44809

llvm-svn: 328653

662f38b1

[LV] Add TTI::shouldMaximizeVectorBandwidth to allow enabling it per target · 5d93fdfa

Krzysztof Parzyszek authored Mar 27, 2018

The default implementation returns false and keeps the current behavior.

Differential Revision: https://reviews.llvm.org/D44735

llvm-svn: 328632

5d93fdfa

[LoopUnroll][NFC] Remove redundant canPeel check · b1ad66ff

Max Kazantsev authored Mar 27, 2018

We check `canPeel` twice: when evaluating the number of iterations to be peeled
and within the method `peelLoop` that performs peeling. This method is only
executed if the calculated peel count is positive. Thus, the check in `peelLoop` can
never fail. This patch replaces this check with an assert.
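
The reasoning above can be sketched as follows. This is a minimal standalone model, not the LoopUnroll code: `LoopModel`, `HasPeelableShape`, `computePeelCount` and `tryPeel` are hypothetical names chosen for illustration.

```cpp
#include <cassert>

// Hypothetical stand-in for a loop: only the property canPeel looks at here.
struct LoopModel {
  bool HasPeelableShape;
};

bool canPeel(const LoopModel &L) { return L.HasPeelableShape; }

// The peel count is forced to 0 for loops that cannot be peeled, so a
// positive result already implies canPeel(L).
unsigned computePeelCount(const LoopModel &L, unsigned DesiredCount) {
  return canPeel(L) ? DesiredCount : 0;
}

// Only reached with a positive count, so the second check is an invariant
// and can be an assert instead of a runtime bail-out.
bool peelLoop(const LoopModel &L, unsigned PeelCount) {
  assert(PeelCount > 0 && "peeling is only attempted for a positive count");
  assert(canPeel(L) && "positive peel count implies a peelable loop");
  (void)L;
  (void)PeelCount;
  return true;
}

// Driver: peel only when the computed count is positive.
bool tryPeel(const LoopModel &L, unsigned DesiredCount) {
  unsigned PeelCount = computePeelCount(L, DesiredCount);
  return PeelCount > 0 && peelLoop(L, PeelCount);
}
```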

Differential Revision: https://reviews.llvm.org/D44919
Reviewed By: fhahn

llvm-svn: 328615

b1ad66ff

[IRCE] Enable decreasing loops of non-const bound · 90b7f4f7

Sam Parker authored Mar 27, 2018

As a follow-up to r328480, this updates the logic for the decreasing
safety checks in a similar manner:
- CanBeMax is replaced by CannotBeMaxInLoop which queries
  isLoopEntryGuardedByCond on the maximum value.
- SumCanReachMin is replaced by isSafeDecreasingBound which includes
  some logic from parseLoopStructure and, again, has been updated to
  use isLoopEntryGuardedByCond on the given bounds.

Differential Revision: https://reviews.llvm.org/D44776

llvm-svn: 328613

90b7f4f7

Mar 26, 2018

[InstCombine] improve code comment; NFC · 0e3167cb
Sanjay Patel authored Mar 26, 2018
```
llvm-svn: 328560
```
0e3167cb

[InstCombine] reassociate loop invariant GEP chains to enable LICM · d870aea0

Sebastian Pop authored Mar 26, 2018

This change brings performance of zlib up by 10%. The example below is from a
hot loop in longest_match() from zlib.

do.body:
  %cur_match.addr.0 = phi i32 [ %cur_match, %entry ], [ %2, %do.cond ]
  %idx.ext = zext i32 %cur_match.addr.0 to i64
  %add.ptr = getelementptr inbounds i8, i8* %win, i64 %idx.ext
  %add.ptr2 = getelementptr inbounds i8, i8* %add.ptr, i64 %idx.ext1
  %add.ptr3 = getelementptr inbounds i8, i8* %add.ptr2, i64 -1

In this example %idx.ext1 is loop invariant. It will be moved above the use of
the loop induction variable %idx.ext so that it can be hoisted out of the loop by
LICM. The operands that have loop-carried dependences will be sunk down
in the GEP chain. This patch will produce the following output:

do.body:
  %cur_match.addr.0 = phi i32 [ %cur_match, %entry ], [ %2, %do.cond ]
  %idx.ext = zext i32 %cur_match.addr.0 to i64
  %add.ptr = getelementptr inbounds i8, i8* %win, i64 %idx.ext1
  %add.ptr2 = getelementptr inbounds i8, i8* %add.ptr, i64 -1
  %add.ptr3 = getelementptr inbounds i8, i8* %add.ptr2, i64 %idx.ext
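
The same reassociation can be sketched at the source level. This is an illustrative model only, with hypothetical function names: it shows that the invariant part of the address can be computed once and the loop-variant index applied last, assuming every intermediate pointer stays inside the same buffer.

```cpp
#include <cstddef>

// Before: the loop-variant index is applied first, so every later step
// depends on it and nothing can be hoisted.
const unsigned char *addrBefore(const unsigned char *Win, std::ptrdiff_t Var,
                                std::ptrdiff_t Inv) {
  const unsigned char *P = Win + Var; // depends on the induction variable
  P = P + Inv;                        // invariant offset, but applied late
  return P - 1;
}

// After: the invariant prefix (Win + Inv - 1) no longer depends on Var,
// so it can be computed once outside the loop.
const unsigned char *invariantBase(const unsigned char *Win,
                                   std::ptrdiff_t Inv) {
  return Win + Inv - 1;
}

// Inside the loop only the variant index remains.
const unsigned char *addrAfter(const unsigned char *Base, std::ptrdiff_t Var) {
  return Base + Var;
}
```

Calling `invariantBase` once before the loop and `addrAfter` in each iteration yields the same address as `addrBefore` for in-bounds offsets.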

llvm-svn: 328539

d870aea0

[InstCombine] distribute fmul over fadd/fsub · 4fd4fd61

Sanjay Patel authored Mar 26, 2018

This replaces a large chunk of code that was looking for compound
patterns that include these sub-patterns. Existing tests ensure that
all of the previous examples are still folded as expected.

We still need to loosen the FMF check.
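
As a rough illustration of the sub-pattern, assuming the fold has the shape (X + C1) * C2 --> X * C2 + C1 * C2 with the constant product folded at compile time. The function names are hypothetical. The two forms are not bit-identical for arbitrary floating-point inputs, which is why the transform is gated on fast-math flags.

```cpp
// Shape before distribution: one add followed by one multiply.
double mulOfSum(double X, double C1, double C2) { return (X + C1) * C2; }

// Shape after distribution: C1 * C2 involves only constants, so a compiler
// could fold it, leaving one multiply and one add on X.
double sumOfProducts(double X, double C1, double C2) {
  return X * C2 + C1 * C2;
}
```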

llvm-svn: 328502

4fd4fd61

[InstCombine] check uses before creating instructions for fmul distribution · 2455fef4
Sanjay Patel authored Mar 26, 2018
```
As the tests show, we could create extra instructions without any obvious benefit.

llvm-svn: 328498
```
2455fef4

[LSR] Allow giving priority to post-incrementing addressing modes · 0b377e0a

Krzysztof Parzyszek authored Mar 26, 2018

Implement TTI interface for targets to indicate that the LSR should give
priority to post-incrementing addressing modes.

Combination of patches by Sebastian Pop and Brendon Cahoon.

Differential Revision: https://reviews.llvm.org/D44758

llvm-svn: 328490

0b377e0a

[LoopUnroll] Fix dangling pointers in SCEV · a5574931

Max Kazantsev authored Mar 26, 2018

The current logic of loop SCEV invalidation in the Loop Unroller implicitly relies on
the fact that the exit count of outer loops cannot depend on exiting blocks of
inner loops, which is true in the current implementation of backedge taken count
calculation but is wrong in general. As a result, when we only forget the loop that
we have just unrolled, we may still have cached data for its outer loops (in particular,
exit counts) which keeps references to blocks of the inner loop that could have been
changed or even deleted.

The attached test demonstrates a situation where, after unrolling of the innermost loop,
the outermost loop contains a dangling pointer to a nonexistent block. The problem
shows up when we apply patch https://reviews.llvm.org/D44677 that makes SCEV
smarter about exit count calculation. I am not sure if the bug exists without this patch;
it appears that it is currently accidentally correct just because in practice the exact backedge
taken count for outer loops with complex control flow inside is never calculated.
But when SCEV learns to do so, this problem shows up.

This patch replaces the existing logic of SCEV loop invalidation with a correct one, which
happens to be invalidation of the outermost loop (which also leads to invalidation of all
loops inside of it). It is the only way to ensure that no outer loop keeps dangling pointers
to removed blocks, or just outdated information that has changed after unrolling.
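
The invalidation strategy can be sketched with a toy cache. This is not the ScalarEvolution API: `LoopNode`, `ExitCountCache` and `forgetNest` are hypothetical names, and the cache only records which loops have cached data.

```cpp
#include <set>

// Hypothetical loop-tree node: Parent is null for a top-level loop.
struct LoopNode {
  const LoopNode *Parent;
};

// Walk up the parent chain to the outermost enclosing loop.
const LoopNode *outermost(const LoopNode *L) {
  while (L->Parent)
    L = L->Parent;
  return L;
}

// True if L is Root itself or nested anywhere inside Root.
bool isInside(const LoopNode *L, const LoopNode *Root) {
  for (; L; L = L->Parent)
    if (L == Root)
      return true;
  return false;
}

// Toy cache of loops that currently have cached exit-count data.
struct ExitCountCache {
  std::set<const LoopNode *> Cached;

  // Drop cached data for Root and for every loop nested inside it.
  void forgetNest(const LoopNode *Root) {
    for (auto It = Cached.begin(); It != Cached.end();) {
      if (isInside(*It, Root))
        It = Cached.erase(It);
      else
        ++It;
    }
  }
};
```

After unrolling an inner loop `L`, calling `forgetNest(outermost(&L))` clears every entry that could refer to the changed blocks, while unrelated top-level loops keep their cached data.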

Differential Revision: https://reviews.llvm.org/D44818
Reviewed By: samparker

llvm-svn: 328483

a5574931

[DeadArgElim] Strip allocsize attributes when deleting an argument. · 8840f644

Benjamin Kramer authored Mar 26, 2018

Since allocsize refers to the argument number it gets invalidated when
an argument is removed and the numbers shift.
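
A small model of why the attribute has to go. The types here are hypothetical and only mimic the idea that allocsize stores a parameter index: once a parameter is erased, the stored index may point at a different parameter or past the end, so the sketch drops the attribute rather than trying to remap it.

```cpp
#include <string>
#include <vector>

// Hypothetical function signature model with an allocsize-like attribute
// that refers to a parameter by position.
struct FnModel {
  std::vector<std::string> Params;
  bool HasAllocSize;
  unsigned AllocSizeArg; // index into Params, meaningful if HasAllocSize
};

// Remove one parameter. The remaining indices shift, so the positional
// attribute is stripped instead of being left stale.
void removeParam(FnModel &F, unsigned Idx) {
  F.Params.erase(F.Params.begin() + Idx);
  F.HasAllocSize = false;
}
```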

llvm-svn: 328481

8840f644

[IRCE] Enable increasing loops of variable bounds · 53a423a4

Sam Parker authored Mar 26, 2018

    
CanBeMin is currently used which will report true for any unknown
values, but often a check is performed outside the loop which covers
this situation:
    
for (int i = 0; i < N; ++i)
  ...
    
if (N > 0)
  for (int i = 0; i < N; ++i)
    ...
    
So I've added 'LoopGuardedAgainstMin' which reports whether N is
greater than the minimum value, which then allows loops with a variable
loop count to be optimised. I've also moved the increasing bound
checking into its own function and replaced SumCanReachMax with another
isLoopEntryGuardedByCond function.

llvm-svn: 328480

53a423a4

Mar 25, 2018

[PatternMatch] allow undef elements when matching vector FP +0.0 · 93e64dd9

Sanjay Patel authored Mar 25, 2018

This continues the FP constant pattern matching improvements from:
https://reviews.llvm.org/rL327627
https://reviews.llvm.org/rL327339
https://reviews.llvm.org/rL327307

Several integer constant matchers also have this ability. I'm
separating matching of integer/pointer null from FP positive zero
and renaming/commenting to make the functionality clearer.

llvm-svn: 328461

93e64dd9

[InstCombine] peek through more icmp of FP cast + bitcast · 841aac04
Sanjay Patel authored Mar 25, 2018
```
This is an extension of rL328426 as noted in D44367. 

llvm-svn: 328448
```
841aac04

Mar 24, 2018

[InstCombine] peek through FP casts for sign-bit compares (PR36682) · 745a9c62

Sanjay Patel authored Mar 24, 2018

This pattern came up in PR36682:
https://bugs.llvm.org/show_bug.cgi?id=36682
https://godbolt.org/g/LhuD9A

Equality checks are planned as a follow-up enhancement.
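
A sketch of the equivalence behind this kind of fold, assuming the pattern is a sign-bit test on the bits of a signed-int-to-float conversion and that `float` is a 32-bit IEEE type. The function names are hypothetical. Converting a signed integer to float preserves its sign, so testing the sign bit of the converted value gives the same answer as testing the integer directly.

```cpp
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == sizeof(std::int32_t),
              "sketch assumes a 32-bit float");

// Sign-bit test done the long way: convert to float, reinterpret the bits
// as an integer, and compare against zero.
bool signBitViaFloat(std::int32_t X) {
  float F = static_cast<float>(X);
  std::int32_t Bits;
  std::memcpy(&Bits, &F, sizeof Bits);
  return Bits < 0;
}

// The same test without the cast and the reinterpretation.
bool signBitDirect(std::int32_t X) { return X < 0; }
```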

Differential Revision: https://reviews.llvm.org/D44367

llvm-svn: 328426

745a9c62

[InstCombine] fix formatting; NFC · 286074e8
Sanjay Patel authored Mar 24, 2018
```
llvm-svn: 328425
```
286074e8

Remove unused header from EntryExitInstrumenter · 53f51c1d

David Blaikie authored Mar 24, 2018

Fixes layering, since Transforms/Utils doesn't depend on CodeGen, so
shouldn't include headers from it.

llvm-svn: 328399

53f51c1d

[GuardWidening] Group code by class [NFC] · 6a1f3446
Philip Reames authored Mar 23, 2018
```
llvm-svn: 328387
```
6a1f3446

Mar 23, 2018

Fix Layering, move instrumentation transform headers into Instrumentation subdirectory · 4fe1fe14
David Blaikie authored Mar 23, 2018
```
llvm-svn: 328379
```
4fe1fe14

[PM][FunctionAttrs] add NoUnwind attribute inference to PostOrderFunctionAttrs pass · 6660fd0f

Fedor Sergeev authored Mar 23, 2018

Summary:
This was motivated by the absence of PruneEH functionality in the new PM.
It was decided that a proper way to do PruneEH is to add NoUnwind inference
into PostOrderFunctionAttrs and then perform normal SimplifyCFG on top.

This change generalizes attribute handling implemented for (a removal of)
Convergent attribute, by introducing a generic builder-like class
   AttributeInferer

It registers all the attribute inference requests, storing per-attribute
predicates into a vector, and then goes through an SCC Node, scanning all
the instructions for not breaking attribute assumptions.

The main idea is that as soon as all the instructions from all the functions
of an SCC Node conform to the attribute assumptions, we are free to infer
the attribute as set for all the functions of the SCC Node.

It handles two distinct cases of attributes:
   - those that might break due to derefinement of the function code

     for these attributes we are allowed to apply inference only if all the
     functions are "exact definitions". Example - NoUnwind.

   - those that do not care about derefinement

     for these attributes we are allowed to apply inference as soon as we see
     any function definition. Example - removal of Convergent attribute.
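
The scheme described above can be sketched roughly as follows. This is a simplified standalone model, not the class added by the patch: `InstrModel`, `FuncModel`, `InferenceRequest` and `AttributeInfererSketch` are hypothetical, and the second case is reduced to a flag that says whether every function must be an exact definition.

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical instruction and function models.
struct InstrModel {
  bool MayThrow;
};

struct FuncModel {
  bool IsExactDefinition;
  std::vector<InstrModel> Body;
  std::vector<std::string> Inferred; // attributes inferred so far
};

// One registered inference request with its per-instruction predicate.
struct InferenceRequest {
  std::string Name;
  bool RequiresExactDefinition;
  std::function<bool(const InstrModel &)> InstrBreaksAssumption;
};

class AttributeInfererSketch {
  std::vector<InferenceRequest> Requests;

public:
  void registerAttr(InferenceRequest R) { Requests.push_back(std::move(R)); }

  // Scan every instruction of every function in the SCC. An attribute is
  // inferred for all functions only if no instruction breaks its assumption.
  void run(std::vector<FuncModel> &SCC) {
    for (const InferenceRequest &R : Requests) {
      bool Valid = true;
      for (const FuncModel &F : SCC) {
        if (R.RequiresExactDefinition && !F.IsExactDefinition)
          Valid = false;
        for (const InstrModel &I : F.Body)
          if (R.InstrBreaksAssumption(I))
            Valid = false;
      }
      if (Valid)
        for (FuncModel &F : SCC)
          F.Inferred.push_back(R.Name);
    }
  }
};
```

Registering a request named "nounwind" with `RequiresExactDefinition` set and a predicate that returns `MayThrow` models the NoUnwind case from the list above.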

Also in this commit:
* Converted all the FunctionAttrs tests to use FileCheck and added new-PM
  invocations to them

* FunctionAttrs/convergent.ll test demonstrates a difference in behavior between
   new and old PM implementations. Marked with FIXME.

* PruneEH tests were converted to new-PM as well, using function-attrs+simplify-cfg
  combo as intended

* some of "other" tests were updated since function-attrs now infers 'nounwind'
  even for old PM pipeline

* -disable-nounwind-inference hidden option added as a possible workaround for a supposedly
  rare case when nounwind being inferred by default presents a problem

Reviewers: chandlerc, jlebar

Reviewed By: jlebar

Subscribers: eraman, llvm-commits

Differential Revision: https://reviews.llvm.org/D44415

llvm-svn: 328377

6660fd0f

[InstCombine] simplify code for FP intrinsic shrinking; NFCI · 32381d7c
Sanjay Patel authored Mar 23, 2018
```
llvm-svn: 328372
```
32381d7c

[HWASan] Port HWASan to Linux x86-64 (LLVM) · 83e78414

Alex Shlyapnikov authored Mar 23, 2018

Summary:
Porting HWASan to Linux x86-64, first of the three patches, LLVM part.

The approach is similar to the ARM case: a trap signal is used to communicate
a memory tag check failure. An int3 instruction is used to generate the signal, and
access parameters are stored in a nop [eax + offset] instruction immediately
following the int3 one.

One notable difference is that x86-64 has to untag the pointer before use
due to the lack of a feature comparable to ARM's TBI (Top Byte Ignore).
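
A minimal sketch of tagging and untagging, assuming the tag lives in the top byte of a 64-bit pointer value and that untagging means clearing that byte. The helper names and the plain-mask approach are assumptions for illustration, not the instrumentation the patch emits.

```cpp
#include <cstdint>

// Assumed layout: the tag occupies bits 56..63 of the pointer value.
constexpr unsigned kTagShift = 56;
constexpr std::uint64_t kAddressMask = (std::uint64_t(1) << kTagShift) - 1;

// Place Tag in the top byte of Addr.
std::uint64_t tagPointer(std::uint64_t Addr, std::uint8_t Tag) {
  return (Addr & kAddressMask) | (std::uint64_t(Tag) << kTagShift);
}

// Read the tag back from a tagged pointer value.
std::uint8_t pointerTag(std::uint64_t Ptr) {
  return static_cast<std::uint8_t>(Ptr >> kTagShift);
}

// Without hardware that ignores the top byte, the tag has to be cleared
// before the value is used as an address.
std::uint64_t untagPointer(std::uint64_t Ptr) { return Ptr & kAddressMask; }
```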

Reviewers: eugenis

Subscribers: kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D44699

llvm-svn: 328342

83e78414

Fix a block copying problem in LICM · a237866f
Andrew Kaylor authored Mar 23, 2018
```
Differential Revision: https://reviews.llvm.org/D44817

llvm-svn: 328336
```
a237866f
[InstCombine] reduce code duplication; NFC · 713ca3d3
Sanjay Patel authored Mar 23, 2018
```
llvm-svn: 328323
```
713ca3d3
[InstCombine] improve variable name; NFC · 6de89ce3
Sanjay Patel authored Mar 23, 2018
```
llvm-svn: 328322
```
6de89ce3

[SLP] Stop counting cost of gather sequences with multiple uses · 6c289a1c

Matthew Simpson authored Mar 23, 2018

When building the SLP tree, we look for reuse among the vectorized tree
entries. However, each gather sequence is represented by a unique tree entry,
even though the sequence may be identical to another one. This means, for
example, that a gather sequence with two uses will be counted twice when
computing the cost of the tree. We should only count the cost of the definition
of a gather sequence rather than its uses. During code generation, the
redundant gather sequences are emitted, but we optimize them away with CSE. So
it looks like this problem just affects the cost model.
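
The double counting can be illustrated with a toy cost function. This is a sketch under simplifying assumptions: a gather sequence is modelled as a vector of scalar ids, every element has the same cost, and all names are hypothetical.

```cpp
#include <set>
#include <vector>

// A gather sequence modelled as the ids of the scalars it collects.
using GatherSeq = std::vector<int>;

// Charges every tree entry, so an identical sequence with two uses is
// paid for twice.
int costCountingUses(const std::vector<GatherSeq> &Entries, int CostPerElt) {
  int Cost = 0;
  for (const GatherSeq &E : Entries)
    Cost += static_cast<int>(E.size()) * CostPerElt;
  return Cost;
}

// Charges each distinct sequence once, matching what is left after CSE
// removes the redundant copies.
int costCountingDefinitions(const std::vector<GatherSeq> &Entries,
                            int CostPerElt) {
  std::set<GatherSeq> Seen;
  int Cost = 0;
  for (const GatherSeq &E : Entries)
    if (Seen.insert(E).second)
      Cost += static_cast<int>(E.size()) * CostPerElt;
  return Cost;
}
```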

Differential Revision: https://reviews.llvm.org/D44742

llvm-svn: 328316

6c289a1c

Revert r328307: [IPSCCP] Use constant range information for comparisons of parameters. · f73c3ece
Florian Hahn authored Mar 23, 2018
```
Reverted for now, due to it causing verifier failures.

llvm-svn: 328312
```
f73c3ece

[IPSCCP] Use constant range information for comparisons of parameters. · b1feec08

Florian Hahn authored Mar 23, 2018

For comparisons with parameters, we can use the ParamState lattice
elements, which also provide constant range information. This improves
the code for PR33253 further and gets us closer to using
ValueLatticeElement for all values.

Also, as we are using the range information in the solver directly, we
do not need tryToReplaceWithConstantRange afterwards anymore.
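
How range information can decide a comparison is sketched below. This is a simplified model with inclusive signed ranges, not the solver's lattice or LLVM's ConstantRange class; `Range`, `CmpResult` and `evalSignedLess` are hypothetical names.

```cpp
#include <cstdint>

enum class CmpResult { AlwaysTrue, AlwaysFalse, Unknown };

// Inclusive range of possible signed values, Lo <= Hi.
struct Range {
  std::int64_t Lo;
  std::int64_t Hi;
};

// Evaluate L < R over all values the two ranges allow.
CmpResult evalSignedLess(const Range &L, const Range &R) {
  if (L.Hi < R.Lo)
    return CmpResult::AlwaysTrue; // every value of L is below every value of R
  if (L.Lo >= R.Hi)
    return CmpResult::AlwaysFalse; // no value of L is below any value of R
  return CmpResult::Unknown;       // the ranges overlap
}
```

A parameter known to lie in [0, 9] compared against the constant 10 folds to true, while a parameter in [0, 20] leaves the comparison undecided.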

Reviewers: dberlin, mssimpso, davide, efriedma

Reviewed By: mssimpso

Differential Revision: https://reviews.llvm.org/D43762

llvm-svn: 328307

b1feec08

[LoopUnroll] Simplify induction variables after peeling too. · 52436a58

Florian Hahn authored Mar 23, 2018

Loop peeling also has an impact on the induction variables, so we should
benefit from induction variable simplification after peeling too.

Reviewers: sanjoy, bogner, mzolotukhin, efriedma

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D43878

llvm-svn: 328301

52436a58

Mar 22, 2018

Move SampleProfile.h into IPO along with the rest of the IPO pass headers · 301627f8
David Blaikie authored Mar 22, 2018
```
llvm-svn: 328262
```
301627f8
Finish moving the IPSCCP pass from Scalar to IPO - moving the registration · 376294c2
David Blaikie authored Mar 22, 2018
```
llvm-svn: 328259
```
376294c2

Fix layering between SCCP and IPO SCCP · 3bbf5af0

David Blaikie authored Mar 22, 2018

Transforms/Scalar/SCCP.cpp implemented both the Scalar and IPO SCCP, but
this meant Transforms/Scalar including Transforms/IPO headers, creating
a circular dependency (IPO already depends on Scalar). So move the IPO
SCCP shims out into IPO and make the basic library implementation accessible
from Scalar/SCCP.h, to be used from the IPO/SCCP.cpp implementation.

llvm-svn: 328250

3bbf5af0

Move the initialization of the Meta Renamer pass over to IPO along with the... · 2965a01e

David Blaikie authored Mar 22, 2018

Move the initialization of the Meta Renamer pass over to IPO along with the rest of it that was moved in r328209

llvm-svn: 328234

2965a01e

[InstCombineCalls] Update deprecated API usage (NFC) · 710d7b99

Daniel Neilson authored Mar 22, 2018

Summary:
Just updating a call to MemSetInst::getAlignment() to MemSetInst::getDestAlignment(). The
former has been deprecated.

llvm-svn: 328227

710d7b99

[SimplifyCFG] Create attribute for fuzzing-specific optimizations. · 236cdaf8

Matt Morehouse authored Mar 22, 2018

Summary:
When building with libFuzzer, converting control flow to selects or
obscuring the original operands of CMPs reduces the effectiveness of
libFuzzer's heuristics.

This patch provides an attribute to disable or modify certain optimizations
for optimal fuzzing signal.

Provides a less aggressive alternative to https://reviews.llvm.org/D44057.

Reviewers: vitalybuka, davide, arsenm, hfinkel

Reviewed By: vitalybuka

Subscribers: junbuml, mehdi_amini, wdng, javed.absar, hiraditya, llvm-commits, kcc

Differential Revision: https://reviews.llvm.org/D44232

llvm-svn: 328214

236cdaf8

[LoopPredication] Add profitability check based on BPI · 9b1176b0

Anna Thomas authored Mar 22, 2018

Summary:
LoopPredication is not profitable when the loop is known to always exit
through some block other than the latch block.
A coarse grained latch check can cause loop predication to predicate the
loop, and unconditionally deoptimize.

However, without predicating the loop, the guard may never fail within the
loop during the dynamic execution because the non-latch loop termination
condition exits the loop before the latch condition causes the loop to
exit.
We teach LP about this using the BranchProbabilityInfo pass.

Reviewers: apilipenko, skatkov, mkazantsev, reames

Reviewed by: skatkov

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D44667

llvm-svn: 328210

9b1176b0