Commits · 5d909be91b32cd7627f328844fbee39f80033292 · Lorenzo Albano / LLVM bpEVL

Jan 09, 2018
- [InstCombine] Check for out of range ashr values using APInt before calling getZExtValue · 5d909be9
  Simon Pilgrim authored Jan 09, 2018
```
Reduced from oss-fuzz #5032 test case

llvm-svn: 322078
```
  5d909be9
Jan 08, 2018

AlwaysInliner: Alow setting InsertLifetime in the new-style pass · 6f6846fc
Justin Bogner authored Jan 08, 2018
```
llvm-svn: 322033
```
6f6846fc
ArgPromotion: Allow setting MaxElements in the new-style pass · 92fe563b
Justin Bogner authored Jan 08, 2018
```
llvm-svn: 322025
```
92fe563b

[CVP] Replace incoming values from unreachable blocks with undef. · 9a60d2c1

Davide Italiano authored Jan 08, 2018

This is an attempt of fixing PR35807.
Due to the non-standard definition of dominance in LLVM, where uses in
unreachable blocks are dominated by anything, you can have, in an
unreachable block:

  %patatino = OP1 %patatino, CONSTANT

When `SimplifyInstruction` receives a PHI where an incoming value is of
the aforementioned form, in some cases, loops indefinitely.

What I propose here instead is keeping track of the incoming values
from unreachable blocks, and replacing them with undef. It fixes this
case, and it seems to be good regardless (even if we can't prove that
the value is constant, as it's coming from an unreachable block, we
can ignore it).

Differential Revision:  https://reviews.llvm.org/D41812

llvm-svn: 322006

9a60d2c1

[InstCombine] fold min/max tree with common operand (PR35717) · 31b4b76f

Sanjay Patel authored Jan 08, 2018

There is precedence for factorization transforms in instcombine for FP ops with fast-math. 
We also have similar logic in foldSPFofSPF().

It would take more work to add this to reassociate because that's specialized for binops, 
and min/max are not binops (or even single instructions). Also, I don't have evidence that 
larger min/max trees than this exist in real code, but if we find that's true, we might
want to reorganize where/how we do this optimization.

In the motivating example from https://bugs.llvm.org/show_bug.cgi?id=35717 , we have:

int test(int xc, int xm, int xy) {
  int xk;
  if (xc < xm)
    xk = xc < xy ? xc : xy;
  else
    xk = xm < xy ? xm : xy;
  return xk;
}

This patch solves that problem because we recognize more min/max patterns after rL321672

https://rise4fun.com/Alive/Qjne
https://rise4fun.com/Alive/3yg

Differential Revision: https://reviews.llvm.org/D41603

llvm-svn: 321998

31b4b76f

[SLP] Fix PR35777: Incorrect handling of aggregate values. · 5b9a77d4

Alexey Bataev authored Jan 08, 2018

Summary:
Fixes the bug with incorrect handling of InsertValue|InsertElement
instrucions in SLP vectorizer. Currently, we may use incorrect
ExtractElement instructions as the operands of the original
InsertValue|InsertElement instructions.

Reviewers: mkuper, hfinkel, RKSimon, spatel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41767

llvm-svn: 321994

5b9a77d4

[SLP] Fix PR35628: Count external uses on extra reduction arguments. · 118a0a2c

Alexey Bataev authored Jan 08, 2018

Summary:
If the vectorized value is marked as extra reduction argument, its users
are not considered as external users. Patch fixes this.

Reviewers: mkuper, hfinkel, RKSimon, spatel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41786

llvm-svn: 321993

118a0a2c

Jan 07, 2018

Revert "[SCCP] Manually fold branches on undef." · e15bffe9

Davide Italiano authored Jan 07, 2018

I thought this was responsible for PR35723, but I was
wrong, the issue lies elsewhere. Revert while I debug.

llvm-svn: 321975

e15bffe9

[SLPVectorizer] Reintroduce std::stable_sort(properlyDominates()). · 4c39758a

Davide Italiano authored Jan 07, 2018

The approach was never discussed, I wasn't able to reproduce this
non-determinism, and the original author went AWOL.
After a discussion on the ML, Philip suggested to revert this.

llvm-svn: 321974

4c39758a

[LV][VPlan] NFC patch to move LoopVectorizationPlanner class out of LoopVectorize.cpp · 0f1314c5

Hal Finkel authored Jan 07, 2018

Another small step forward to move VPlan stuff outside of LoopVectorize.cpp.

VPlanBuilder.h is renamed to LoopVectorizationPlanner.h
LoopVectorizationPlanner class is moved from LoopVectorize.cpp to
LoopVectorizationPlanner.h LoopVectorizationCostModel::VectorizationFactor
class is moved to LoopVectorizationPlanner.h (used by the planner class) ---
this needs further streamlining work in later patches and thus all I did was
take it out of the CostModel class and moved to the header file. The callback
function had to stay inside LoopVectorize.cpp since it calls an
InnerLoopVectorizer member function declared in it. Next Steps: Make
InnerLoopVectorizer, LoopVectorizationCostModel, and other classes more modular
and more aligned with VPlan direction, in small increments.

Previous step was: r320900 (https://reviews.llvm.org/D41045)

Patch by Hideki Saito, thanks!

Differential Revision: https://reviews.llvm.org/D41420

llvm-svn: 321962

0f1314c5

[CodeExtractor] Use subset of function attributes for extracted function. · 55be37e7

Florian Hahn authored Jan 07, 2018

In addition to target-dependent attributes, we can also preserve a
white-listed subset of target independent function attributes. The white-list
excludes problematic attributes, most prominently:

* attributes related to memory accesses, as alloca instructions
  could be moved in/out of the extracted block

* control-flow dependent attributes, like no_return or thunk, as the
  relerelevant instructions might or might not get extracted.

Thanks @efriedma and @aemerson for providing a set of attributes that cannot be
propagated.


Reviewers: efriedma, davidxl, davide, silvas

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D41334

llvm-svn: 321961

55be37e7

Jan 06, 2018

[InlineFunction] Preserve calling convention when forwarding VarArgs. · a82eef23

Florian Hahn authored Jan 06, 2018

Reviewers: efriedma, rnk, davide

Reviewed By: rnk, davide

Differential Revision: https://reviews.llvm.org/D41556

llvm-svn: 321943

a82eef23

[InlineFunction] Preserve attributes when forwarding VarArgs. · de10e6e0

Florian Hahn authored Jan 06, 2018

Reviewers: rnk, efriedma

Reviewed By: rnk

Differential Revision: https://reviews.llvm.org/D41555

llvm-svn: 321942

de10e6e0

[InlineFunction] Inline vararg functions that do not access varargs. · 80788d80

Florian Hahn authored Jan 06, 2018

If the varargs are not accessed by a function, we can inline the
function.

Reviewers: dblaikie, chandlerc, davide, efriedma, rnk, hfinkel

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D41335

llvm-svn: 321940

80788d80

[InstCombine] relax use constraint for min/max (~a, ~b) --> ~min/max(a, b) · 26a6fcde

Sanjay Patel authored Jan 06, 2018

In the minimal case, this won't remove instructions, but it still improves
uses of existing values.

In the motivating example from PR35834, it does remove instructions, and
sets that case up to be optimized by something like D41603:
https://reviews.llvm.org/D41603

llvm-svn: 321936

26a6fcde

[Utils] Simplify salvageDebugInfo, NFCI · b2ec02ba

Vedant Kumar authored Jan 05, 2018

Having a single call to findDbgUsers() allows salvageDebugInfo() to
return earlier.

Differential Revision: https://reviews.llvm.org/D41787

llvm-svn: 321915

b2ec02ba

Jan 05, 2018

[InstCombine] add folds for min(~a, b) --> ~max(a, b) · 5b6aacf2

Sanjay Patel authored Jan 05, 2018

Besides the bug of omitting the inverse transform of max(~a, ~b) --> ~min(a, b),
the use checking and operand creation were off. We were potentially creating 
repeated identical instructions of existing values. This led to infinite
looping after I added the extra folds.

By using the simpler m_Not matcher and not creating new 'not' ops for a and b,
we avoid that problem. It's possible that not using IsFreeToInvert() here is
more limiting than the simpler matcher, but there are no tests for anything
more exotic. It's also possible that we should relax the use checking further
to handle a case like PR35834:
https://bugs.llvm.org/show_bug.cgi?id=35834
...but we can make that a follow-up if it is needed. 

llvm-svn: 321882

5b6aacf2

WholeProgramDevirt: Simplify ORE getter mechanism for old PM. NFCI. · 9110cb45
Peter Collingbourne authored Jan 05, 2018
```
llvm-svn: 321841
```
9110cb45
Revert "[JumpThreading] Preservation of DT and LVI across the pass" · cd78ddc1
Reid Kleckner authored Jan 04, 2018
```
This reverts r321825, it causes crashes in Chromium. Reproducer
forthcoming.

llvm-svn: 321832
```
cd78ddc1

Jan 04, 2018

[JumpThreading] Preservation of DT and LVI across the pass · cdad6c0b

Brian M. Rzycki authored Jan 04, 2018

Summary:
See D37528 for a previous (non-deferred) version of this
patch and its description.

Preserves dominance in a deferred manner using a new class
DeferredDominance. This reduces the performance impact of
updating the DominatorTree at every edge insertion and
deletion. A user may call DDT->flush() within JumpThreading
for an up-to-date DT. This patch currently has one flush()
at the end of runImpl() to ensure DT is preserved across
the pass.

LVI is also preserved to help subsequent passes such as
CorrelatedValuePropagation. LVI is simpler to maintain and
is done immediately (not deferred). The code to perfom the
preversation was minimally altered and was simply marked
as preserved for the PassManager to be informed.

This extends the analysis available to JumpThreading for
future enhancements. One example is loop boundary threading.

Reviewers: dberlin, kuhar, sebpop

Reviewed By: kuhar, sebpop

Subscribers: hiraditya, llvm-commits

Differential Revision: https://reviews.llvm.org/D40146

llvm-svn: 321825

cdad6c0b

Add assertion on DT availability during LI update in UpdateAnalysisInformation · 9fca5837

Anna Thomas authored Jan 04, 2018

This came up during discussions in llvm-commits for
rL321653: Check for unreachable preds before updating LI in
UpdateAnalysisInformation

The assert provides hints to passes to require both DT and LI if we plan on
updating LI through this function.

Tests run: make check

llvm-svn: 321805

9fca5837

[InstCombine] safely create a constant of the right type (PR35794) · c63f9014
Sanjay Patel authored Jan 04, 2018
```
llvm-svn: 321801
```
c63f9014

[GVNHoist] Fix: PR35222 gvn-hoist incorrectly erases load in case of a loop · 1f90cae8

Aditya Kumar authored Jan 04, 2018

Reviewers:
    dberlin
    sebpop
    eli.friedman

Differential Revision: https://reviews.llvm.org/D41453

llvm-svn: 321789

1f90cae8

Jan 03, 2018

StructurizeCFG: Fix broken backedge detection · 8070882b

Matt Arsenault authored Jan 03, 2018

The work order was changed in r228186 from SCC order
to RPO with an arbitrary sorting function. The sorting
function attempted to move inner loop nodes earlier. This
was was apparently relying on an assumption that every block
in a given loop / the same loop depth would be seen before
visiting another loop. In the broken testcase, a block
outside of the loop was encountered before moving onto
another block in the same loop. The testcase would then
structurize such that one blocks unconditional successor
could never be reached.

Revert to plain RPO for the analysis phase. This fixes
detecting edges as backedges that aren't really.

The processing phase does use another visited set, and
I'm unclear on whether the order there is as important.
An arbitrary order doesn't work, and triggers some infinite
loops. The reversed RPO list seems to work and is closer
to the order that was used before, minus the arbitary
custom sorting.

A few of the changed tests now produce smaller code,
and a few are slightly worse looking.

llvm-svn: 321751

8070882b

[InstCombine] Check for out of range shift values using APInt before calling getZExtValue · 3bf2d645
Simon Pilgrim authored Jan 03, 2018
```
Reduced from oss-fuzz #4871 test case

llvm-svn: 321748
```
3bf2d645

Jan 02, 2018

[BasicBlockUtils] Check for unreachable preds before updating LI in UpdateAnalysisInformation · bdb94309

Anna Thomas authored Jan 02, 2018

Summary:
We are incorrectly updating the LI when loop-simplify generates
dedicated exit blocks for a loop. The issue is that there's an implicit
assumption that the Preds passed into UpdateAnalysisInformation are
reachable. However, this is not true and breaks LI by incorrectly
updating the header of a loop.

One such case is when we generate dedicated exits when the exit block is
a landing pad (through SplitLandingPadPredecessors). There maybe other
cases as well, since we do not guarantee that Preds passed in are
reachable basic blocks.

The added test case shows how loop-simplify breaks LI for the outer loop (and DT in turn)
after we try to generate the LoopSimplifyForm.

Reviewers: davide, chandlerc, sanjoy

Reviewed By: davide

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41519

llvm-svn: 321653

bdb94309

[InstCombine] Missed optimization in math expression: squashing sqrt functions · a58d8deb

Dmitry Venikov authored Jan 02, 2018

Summary: This patch enables folding under -ffast-math flag sqrt(a) * sqrt(b) -> sqrt(a*b)

Reviewers: hfinkel, spatel, davide

Reviewed By: spatel, davide

Subscribers: davide, llvm-commits

Differential Revision: https://reviews.llvm.org/D41322

llvm-svn: 321637

a58d8deb

Dec 31, 2017
- [SimplifyCFG] Return to the pass manager the correct value. · 86b7949f
  Davide Italiano authored Dec 31, 2017
```
I wanted to commit this with r321603, but I failed to squash
the two commits.

llvm-svn: 321606
```
  86b7949f
- [Utils/Local] Use `auto` when the type is obvious. NFCI. · 0512bf5a
  Davide Italiano authored Dec 31, 2017
```
llvm-svn: 321605
```
  0512bf5a
- [Utils] Remove commented debug message. NFCI. · 5dd1c587
  Davide Italiano authored Dec 31, 2017
```
llvm-svn: 321604
```
  5dd1c587
- [SimplifyCFG] Stop hoisting musttail calls incorrectly. · 9f074fe9
  Davide Italiano authored Dec 31, 2017
```
PR35774.

llvm-svn: 321603
```
  9f074fe9
Dec 30, 2017
- Use phi ranges to simplify code. No functionality change intended. · c7fc81e6
  Benjamin Kramer authored Dec 30, 2017
```
llvm-svn: 321585
```
  c7fc81e6
Dec 29, 2017
- StructurizeCFG: Use phi iterator range · 8dcfa137
  Matt Arsenault authored Dec 29, 2017
```
llvm-svn: 321568
```
  8dcfa137
Dec 28, 2017

Remove superfluous copies in sample profiling. · 24cb28bb
Benjamin Kramer authored Dec 28, 2017
```
No functionliaty change intended.

llvm-svn: 321530
```
24cb28bb
Revert r321377, it causes regression to https://reviews.llvm.org/P8055. · 29697c13
Guozhi Wei authored Dec 28, 2017
```
llvm-svn: 321528
```
29697c13
Avoid int to string conversion in Twine or raw_ostream contexts. · 3a13ed60
Benjamin Kramer authored Dec 28, 2017
```
Some output changes from uppercase hex to lowercase hex, no other functionality change intended.

llvm-svn: 321526
```
3a13ed60

[RewriteStatepoints] Fix incorrect assertion · a13e163a

Max Kazantsev authored Dec 28, 2017

`RewriteStatepointsForGC` iterates over function blocks and their predecessors
in order of declaration. One of outcomes of this is that callsites are placed in
arbitrary order which has nothing to do with travelsar order.

On the other hand, function `recomputeLiveInValues` asserts that bases are
added to `Info.PointerToBase` before their deried pointers are updated. But
if call sites are processed in order different from RPOT, this is not necessarily
true. We cannot guarantee that the base was placed there before every
pointer derived from it. All we can guarantee is that this base was marked as
known base by this point.

This patch replaces the fact that we assert from checking that the base was
added to the map with assert that the base was marked as known base.

Differential Revision: https://reviews.llvm.org/D41593

llvm-svn: 321517

a13e163a

[InstCombine] Check for isa<Instruction> before using cast<> · 472689a1
Simon Pilgrim authored Dec 28, 2017
```
Protects against casts from constexpr etc.

Reduced from oss-fuzz #4788 test case

llvm-svn: 321515
```
472689a1

Revert "[memcpyopt] Teach memcpyopt to optimize across basic blocks" · 6d31001c

Reid Kleckner authored Dec 28, 2017

This reverts r321138. It seems there are still underlying issues with
memdep. PR35519 seems to still be present if debug info is enabled. We
end up losing a memcpy. Somehow during store to memset merging, we
insert the memset after the memcpy or fail to update the memdep analysis
to account for the newly inserted memset of a pair.

Reduced test case:

  #include <assert.h>
  #include <stdio.h>
  #include <string>
  #include <utility>
  #include <vector>

  void do_push_back(
      std::vector<std::pair<std::string, std::vector<std::string>>>* crls) {
    crls->push_back(std::make_pair(std::string(), std::vector<std::string>()));
  }

  int __attribute__((optnone)) main() {
    // Put some data in the vector and then remove it so we take the push_back
    // fast path.
    std::vector<std::pair<std::string, std::vector<std::string>>> crl_set;
    crl_set.push_back({"asdf", {}});
    crl_set.pop_back();
    printf("first word in vector storage: %p\n", *(void**)crl_set.data());

    // Do the push_back which may fail to initialize the data.
    do_push_back(&crl_set);
    auto* first = &crl_set.back().first;
    printf("first word in vector storage (should be zero): %p\n",
           *(void**)crl_set.data());
    assert(first->empty());
    puts("ok");
  }

Compile with libc++, enable optimizations, and enable debug info:
$ clang++ -stdlib=libc++ -g -O2 t.cpp -o t.exe -Wl,-rpath=llvm/build/lib

This program will assert with this change.

llvm-svn: 321510

6d31001c

Dec 27, 2017

[InstCombine] Gracefully handle out of range extractelement indices · e7d032f1

Simon Pilgrim authored Dec 27, 2017

InstSimplify is responsible for handling these, but we shouldn't just assert here.

Reduced from oss-fuzz #4808 test case

llvm-svn: 321489

e7d032f1