Commits · c7fc81e6595865296738fe0f8ffe692ea41b1ffc · Lorenzo Albano / LLVM bpEVL

Dec 30, 2017
- Use phi ranges to simplify code. No functionality change intended. · c7fc81e6
  Benjamin Kramer authored Dec 30, 2017
```
llvm-svn: 321585
```
  c7fc81e6
Dec 28, 2017
- Revert r321377, it causes regression to https://reviews.llvm.org/P8055. · 29697c13
  Guozhi Wei authored Dec 28, 2017
```
llvm-svn: 321528
```
  29697c13
Dec 27, 2017

[Unroll][DebugInfo] Propagate loop body's debug location to epilog preheader · 8af1e1cb

Zhaoshi Zheng authored Dec 26, 2017

NewExit and epilog PreHeader should have the same debug loc as the original loop
body, instead of the original loop exit.

llvm-svn: 321465

8af1e1cb

Dec 24, 2017
- Make helpers static. No functionality change. · 802e6255
  Benjamin Kramer authored Dec 24, 2017
```
llvm-svn: 321425
```
  802e6255
Dec 22, 2017

[SimplifyCFG] Don't do if-conversion if there is a long dependence chain · 33250340

Guozhi Wei authored Dec 22, 2017

If, after if-conversion, most of the instructions in the new BB form a long and slow dependence chain, the result may be slower than cmp/branch, even if the branch has a high miss rate. If-conversion transforms the control dependence into a data dependence. A control dependence can be speculated, so the second part can execute in parallel with the first part on a modern OOO processor, whereas a data dependence forces the second part to wait.

This patch checks for a long dependence chain and gives up on if-conversion if it finds one.
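
The trade-off described above can be sketched at source level. This is a minimal illustration, not code from the patch: `branchForm` and `selectForm` are hypothetical names, and the arithmetic chain is an arbitrary stand-in for a "long and slow dependence chain".

```cpp
#include <cstdint>

// Hypothetical illustration only; not taken from SimplifyCFG.
// Branch form: the chain runs only when `cond` is true, and the processor
// can speculate past the branch while the chain is still in flight.
std::uint32_t branchForm(bool cond, std::uint32_t x) {
  if (cond) {
    x = x * 3u + 1u; // each step depends on the previous one
    x = x * x + 7u;
    x = x / 5u + 2u;
  }
  return x;
}

// If-converted form: the chain always runs, and the final select has a data
// dependence on both the condition and the end of the chain.
std::uint32_t selectForm(bool cond, std::uint32_t x) {
  std::uint32_t t = x * 3u + 1u;
  t = t * t + 7u;
  t = t / 5u + 2u;
  return cond ? t : x;
}
```

Both functions compute the same value; they differ only in whether the result is reached through a branch or through a select that must wait for the whole chain.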

Differential Revision: https://reviews.llvm.org/D39352

llvm-svn: 321377

33250340

Add hasProfileData() to check if a function has profile data. NFC. · a17f2205

Easwaran Raman authored Dec 22, 2017

Summary:
This replaces calls to getEntryCount().hasValue() with hasProfileData
that does the same thing. This refactoring is useful to do before adding
synthetic function entry counts but also a useful cleanup IMO even
otherwise. I have used hasProfileData instead of hasRealProfileData as
David had earlier suggested since I think profile implies "real" and I
use the phrase "synthetic entry count" and not "synthetic profile count"
but I am fine calling it hasRealProfileData if you prefer.

Reviewers: davidxl, silvas

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41461

llvm-svn: 321331

a17f2205

Dec 21, 2017

[SimplifyCFG] Avoid quadratic behavior in the number of predecessors in instruction sinking. · ad371e0c

Michael Zolotukhin authored Dec 21, 2017

If a block has N predecessors, then the current algorithm will try to
sink common code to this block N times (whenever we visit a
predecessor). Every attempt to sink the common code includes going
through all predecessors, so the complexity of the algorithm becomes
O(N^2).
With this patch we try to sink common code only when we visit the block
itself. With this, the complexity goes down to O(N).
As a side effect, the moment the code is sunk is slightly different than
before (the order of simplifications has been changed), which is why I had
to adjust two tests (note that neither of the tests is supposed to test
SimplifyCFG):
* test/CodeGen/AArch64/arm64-jumptable.ll - changes in this test mimic
the changes that previous implementation of SimplifyCFG would do.
* test/CodeGen/ARM/avoid-cpsr-rmw.ll - in this test I disabled common
code sinking by a command line flag.
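
The complexity argument in this message can be illustrated with a toy cost model. It is only a sketch under the assumption that one sink attempt scans every predecessor once; the function names are hypothetical and nothing here is SimplifyCFG code.

```cpp
#include <cstddef>

// Toy cost model, not SimplifyCFG itself.
// Old scheme: one sink attempt per visited predecessor, and every attempt
// walks all predecessors of the block.
std::size_t scansWhenTriggeredPerPredecessor(std::size_t numPreds) {
  std::size_t scans = 0;
  for (std::size_t visit = 0; visit < numPreds; ++visit)
    scans += numPreds; // this attempt scans every predecessor
  return scans;
}

// New scheme: a single sink attempt, made when the block itself is visited.
std::size_t scansWhenTriggeredAtBlock(std::size_t numPreds) {
  return numPreds;
}
```

For a block with 100 predecessors the first model counts 10000 predecessor scans and the second counts 100.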

llvm-svn: 321236

ad371e0c

Dec 20, 2017

[ICP] Expose unconditional call promotion interface · cb35c5d5

Matthew Simpson authored Dec 20, 2017

This patch modifies the indirect call promotion utilities by exposing and using
an unconditional call promotion interface. The unconditional promotion
interface (i.e., call promotion without creating an if-then-else) can be used
if it's known that an indirect call has only one possible callee. The existing
conditional promotion interface uses this unconditional interface to promote an
indirect call after it has been versioned and placed within the "then" block.
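
As a rough source-level analogy for the two interfaces (the real utilities rewrite LLVM IR call sites), the sketch below uses plain function pointers. All names are hypothetical and chosen for illustration only.

```cpp
// Hypothetical source-level analogy; not the LLVM utilities themselves.
using Callee = int (*)(int);

int knownTarget(int x) { return x + 1; }
int otherTarget(int x) { return x * 2; }

// Conditional promotion: the call is versioned with an if-then-else. The
// "then" side holds the direct call, the "else" side keeps the indirect call.
int callConditionallyPromoted(Callee fp, int x) {
  if (fp == &knownTarget)
    return knownTarget(x); // promoted, direct call
  return fp(x);            // original indirect call
}

// Unconditional promotion: valid only when it is known that the indirect
// call has exactly one possible callee, so no versioning is needed.
int callUnconditionallyPromoted(Callee /*fp*/, int x) {
  return knownTarget(x);
}
```

The conditional version stays correct for any callee, while the unconditional version relies entirely on the single-callee assumption.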

A consequence of unconditional promotion is that the fix-up operations for phi
nodes in the normal destination of invoke instructions are changed. This is
necessary because the existing implementation assumed that an invoke had been
versioned, creating a "merge" block where a return value bitcast could be
placed. In the new implementation, the edge between a promoted invoke's parent
block and its normal destination is split if needed to add a bitcast for the
return value. If the invoke is also versioned, the phi node merging the return
value of the promoted and original invoke instructions is placed in the "merge"
block.

Differential Revision: https://reviews.llvm.org/D40751

llvm-svn: 321210

cb35c5d5

Dec 18, 2017
- Fix more inconsistent line endings. NFC. · e4f5d010
  Dimitry Andric authored Dec 18, 2017
```
llvm-svn: 321016
```
  e4f5d010
- [Memcpy Loop Lowering] Remove the fixed int8 lowering. · 5fb624a3
  Sean Fertile authored Dec 18, 2017
```
Switch over to the lowering that uses target supplied operand types.

Differential Revision: https://reviews.llvm.org/D41201

llvm-svn: 320989
```
  5fb624a3
Dec 16, 2017

[Memcpy Loop Lowering] Only calculate residual size/bytes copied when needed. · 68d7f9da

Sean Fertile authored Dec 16, 2017

If the loop operand type is int8 then there will be no residual loop for the
unknown size expansion. Don't create the residual-size and bytes-copied values
when they are not needed.
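
A minimal sketch of the idea, assuming a main loop that copies `loopOpSize` bytes per iteration followed by a byte-wise residual loop. `CopyPlan` and `planCopy` are hypothetical names; the actual lowering emits IR rather than computing a plan like this.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical model of the expansion, not the LLVM implementation.
struct CopyPlan {
  std::size_t loopIterations; // iterations of the main copy loop
  std::size_t residualBytes;  // bytes left for a byte-wise residual loop
};

CopyPlan planCopy(std::size_t copyLen, std::size_t loopOpSize) {
  assert(loopOpSize != 0 && "loop operand size must be non-zero");
  // With a one-byte loop operand the main loop covers every byte, so the
  // residual values never need to be computed.
  if (loopOpSize == 1)
    return CopyPlan{copyLen, 0};
  return CopyPlan{copyLen / loopOpSize, copyLen % loopOpSize};
}
```

For example, a 10-byte copy with a 4-byte operand gives two loop iterations and two residual bytes, while a 1-byte operand gives ten iterations and no residual.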

llvm-svn: 320929

68d7f9da

[SimplifyLibCalls] Inline calls to cabs when it's safe to do so · 2ff24731

Hal Finkel authored Dec 16, 2017

When unsafe algebra is allowed, calls to cabs(r) can be replaced by:

  sqrt(creal(r)*creal(r) + cimag(r)*cimag(r))
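
The expanded form can be written out in standalone C++ as follows. `cabsExpanded` is a hypothetical helper for illustration, not LLVM code. The sketch also suggests why the rewrite needs unsafe algebra: the intermediate squares can overflow even when the true magnitude is representable.

```cpp
#include <cmath>
#include <complex>

// Hypothetical helper: the expanded form of cabs(r) from the formula above.
double cabsExpanded(std::complex<double> r) {
  return std::sqrt(r.real() * r.real() + r.imag() * r.imag());
}
```

For `3 + 4i` this returns 5, matching `std::abs`. For a real part of `1e200` the square overflows to infinity, so the expanded form returns infinity although the magnitude itself is finite.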

Patch by Paul Walker, thanks!

Differential Revision: https://reviews.llvm.org/D40069

llvm-svn: 320901

2ff24731

Dec 15, 2017

[Memcpy Loop Lowering] Insert loop BB in between the split BB. · 42b13343

Sean Fertile authored Dec 15, 2017

The original memcpy expansion inserted the loop basic block in between
the 2 new basic blocks created by splitting the original block the memcpy
call was in. This commit makes the new memcpy expansion do the same to keep the
layout of the IR matching between the old and new implementations.

Differential Revision: https://reviews.llvm.org/D41197

llvm-svn: 320848

42b13343

fix typo in comment and remove inaccurate comment; NFC · c722e265
Sanjay Patel authored Dec 15, 2017
```
llvm-svn: 320838
```
c722e265

Dec 14, 2017

[SimplifyCFG] don't sink common insts too soon (PR34603) · 0ab0c1a2

Sanjay Patel authored Dec 14, 2017

This should solve:
https://bugs.llvm.org/show_bug.cgi?id=34603
...by preventing SimplifyCFG from altering redundant instructions before early-cse has a chance to run.
It changes the default (canonical-forming) behavior of SimplifyCFG, so we're only doing the
sinking transform later in the optimization pipeline.

Differential Revision: https://reviews.llvm.org/D38566

llvm-svn: 320749

0ab0c1a2

[LV] Support efficient vectorization of an induction with redundant casts · 4750c785

Dorit Nuzman authored Dec 14, 2017

D30041 extended SCEVPredicateRewriter to improve handling of Phi nodes whose
update chain involves casts; PSCEV can now build an AddRecurrence for some
forms of such phi nodes, under the proper runtime overflow test. This means
that we can identify such phi nodes as an induction, and the loop-vectorizer
can now vectorize such inductions, however inefficiently. The vectorizer
doesn't know that it can ignore the casts, and so it vectorizes them.

This patch records the casts in the InductionDescriptor, so that they could
be marked to be ignored for cost calculation (we use VecValuesToIgnore for
that) and ignored for vectorization/widening/scalarization (i.e. treated as
TriviallyDead).

In addition to marking all these casts to be ignored, we also need to make
sure that each cast is mapped to the right vector value in the vector loop body
(be it a widened, vectorized, or scalarized induction). So whenever an
induction phi is mapped to a vector value (during vectorization/widening/
scalarization), we also map the respective cast instruction (if it exists) to that
vector value. (If the phi-update sequence of an induction involves more than one
cast, then the above mapping to vector value is relevant only for the last cast
of the sequence as we allow only the "last cast" to be used outside the
induction update chain itself).

This is the last step in addressing PR30654.

llvm-svn: 320672

4750c785

Dec 13, 2017

Reverting [JumpThreading] Preservation of DT and LVI across the pass · 580bc3c8

Brian M. Rzycki authored Dec 13, 2017

Stage 2 bootstrap failed:
http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules-2/builds/14434

llvm-svn: 320641

580bc3c8

Remove redundant includes from lib/Transforms. · 6af4f232
Michael Zolotukhin authored Dec 13, 2017
```
llvm-svn: 320628
```
6af4f232

[JumpThreading] Preservation of DT and LVI across the pass · d989af98

Brian M. Rzycki authored Dec 13, 2017

Summary:
See D37528 for a previous (non-deferred) version of this
patch and its description.

Preserves dominance in a deferred manner using a new class
DeferredDominance. This reduces the performance impact of
updating the DominatorTree at every edge insertion and
deletion. A user may call DDT->flush() within JumpThreading
for an up-to-date DT. This patch currently has one flush()
at the end of runImpl() to ensure DT is preserved across
the pass.

LVI is also preserved to help subsequent passes such as
CorrelatedValuePropagation. LVI is simpler to maintain and
is done immediately (not deferred). The code to perform the
preservation was minimally altered and was simply marked
as preserved for the PassManager to be informed.

This extends the analysis available to JumpThreading for
future enhancements. One example is loop boundary threading.

Reviewers: dberlin, kuhar, sebpop

Reviewed By: kuhar, sebpop

Subscribers: hiraditya, llvm-commits

Differential Revision: https://reviews.llvm.org/D40146

llvm-svn: 320612

d989af98

Dec 12, 2017

Split IndirectBr critical edges before PGO gen/use passes. · f3bda1da

Hiroshi Yamauchi authored Dec 12, 2017

Summary:
The PGO gen/use passes currently fail with an assert failure if there's a
critical edge whose source is an IndirectBr instruction and that edge
needs to be instrumented.

To avoid this in certain cases, split IndirectBr critical edges in the PGO
gen/use passes. This works for blocks with single indirectbr predecessors,
but not for those with multiple indirectbr predecessors (splitting an
IndirectBr critical edge isn't always possible.)

Reviewers: davidxl, xur

Reviewed By: davidxl

Subscribers: efriedma, llvm-commits, mehdi_amini

Differential Revision: https://reviews.llvm.org/D40699

llvm-svn: 320511

f3bda1da

Dec 10, 2017
- [SimplifyLibCalls] propagate FMF when folding pow(x, -1.0) call · b23e1481
  Sanjay Patel authored Dec 10, 2017
```
Follow-up for a bug that's similar to:
https://bugs.llvm.org/show_bug.cgi?id=35601

llvm-svn: 320312
```
  b23e1481
- [SimplifyLibCalls] propagate FMF when folding pow(x, 2.0) call (PR35601) · 09ec3434
  Sanjay Patel authored Dec 10, 2017
```
This should fix the larger problem with sqrt shown in:
https://bugs.llvm.org/show_bug.cgi?id=35601

llvm-svn: 320310
```
  09ec3434
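
The two pow folds named in the commit titles above can be sketched at source level. This is an illustration only: the helper names are hypothetical, and plain C++ cannot express the fast-math flags (FMF) that the commits propagate onto the replacement instructions.

```cpp
#include <cmath>

// Hypothetical helpers showing the shape of the folds; not LLVM code.

// pow(x, 2.0) folded to a multiply.
double powSquareFolded(double x) { return x * x; }

// pow(x, -1.0) folded to a reciprocal.
double powReciprocalFolded(double x) { return 1.0 / x; }
```

In the IR, the point of both commits is that the new multiply or divide should carry the same fast-math flags as the original call, which this sketch leaves out.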
Dec 09, 2017

[InlineFunction] Set debug loc for call to forward varargs. · c5bebffe

Florian Hahn authored Dec 09, 2017

Reviewers: aprantl, dblaikie, rnk

Reviewed By: rnk

Subscribers: eraman, llvm-commits, JDevlieghere

Differential Revision: https://reviews.llvm.org/D40432

llvm-svn: 320252

c5bebffe

Dec 08, 2017

Generalize llvm::replaceDbgDeclare and actually support the use-case that · d1317017
Adrian Prantl authored Dec 08, 2017
```
is mentioned in the documentation (inserting a deref before the plus_uconst).

llvm-svn: 320203
```
d1317017

[CodeExtractor] Add debug locations for new call and branch instrs. · e5089e2e

Florian Hahn authored Dec 08, 2017

Summary:
If a partially inlined function has debug info, we have to add debug
locations to the call instruction calling the outlined function.
We use the debug location of the first instruction in the outlined
function, as the introduced call transfers control to this statement and
there is no other equivalent line in the source code.

We also use the same debug location for the branch instruction added
to jump from artificial entry block for the outlined function, which just
jumps to the first actual basic block of the outlined function.

Reviewers: davide, aprantl, rriddle, dblaikie, danielcdh, wmi

Reviewed By: aprantl, rriddle, danielcdh

Subscribers: eraman, JDevlieghere, llvm-commits

Differential Revision: https://reviews.llvm.org/D40413

llvm-svn: 320199

e5089e2e

Dec 06, 2017

[PGO] Make indirect call promotion a utility · e363d2ce

Matthew Simpson authored Dec 06, 2017

This patch factors out the main code transformation utilities in the pgo-driven
indirect call promotion pass and places them in Transforms/Utils. The change is
intended to be a non-functional change, letting non-pgo-driven passes share a
common implementation with the existing pgo-driven pass.

The common utilities are used to conditionally promote indirect call sites to
direct call sites. They perform the underlying transformation, and do not
consider profile information. The pgo-specific details (e.g., the computation
of branch weight metadata) have been left in the indirect call promotion pass.

Differential Revision: https://reviews.llvm.org/D40658

llvm-svn: 319963

e363d2ce

[InlineFunction] Only replace call if there are VarArgs to forward. · 115d9916

Florian Hahn authored Dec 06, 2017

Summary:
There is no need to replace the original call instruction if no
VarArgs need to be forwarded.

Reviewers: davide, rnk, majnemer, efriedma

Reviewed By: efriedma

Subscribers: eraman, llvm-commits

Differential Revision: https://reviews.llvm.org/D40412

llvm-svn: 319947

115d9916

[LoopUtils] simplify createTargetReduction(); NFCI · 3e069f57
Sanjay Patel authored Dec 06, 2017
```
llvm-svn: 319946
```
3e069f57
[LoopUtils] fix variable name to match FMF vocabulary; NFC · 1ea7b6f7
Sanjay Patel authored Dec 06, 2017
```
llvm-svn: 319928
```
1ea7b6f7

Dec 05, 2017

Bail out of a SimplifyCFG switch table opt at undef values. · 0a3e9806

Mikael Holmen authored Dec 05, 2017

Summary:
A true or false result is expected from a comparison, but it seems the possibility of undef was overlooked, which could lead to a failed assert. This is fixed by this patch by bailing out if we encounter undef.

The bug is old and the assert has been there since the end of 2014, so it seems this is unusual enough to forego optimization.

Patch by JesperAntonsson.

Reviewers: spatel, eeckstein, hans

Reviewed By: hans

Subscribers: uabelho, llvm-commits

Differential Revision: https://reviews.llvm.org/D40639

llvm-svn: 319768

0a3e9806

Dec 04, 2017

Move splitIndirectCriticalEdges() to BasicBlockUtils.h. · 9364fa34

Hiroshi Yamauchi authored Dec 04, 2017

Summary:
Move splitIndirectCriticalEdges() from CodeGenPrepare to BasicBlockUtils.h so
that it can be called from other places.

Reviewers: davidxl

Reviewed By: davidxl

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D40750

llvm-svn: 319689

9364fa34

[BypassSlowDivision] Improve our handling of divisions by constants · aa92cae1

Sanjoy Das authored Dec 04, 2017

(This reapplies r314253.  r314253 was reverted on r314482 because of a
correctness regression on P100, but that regression was identified to be
something else.)

Summary:
Don't bail out on constant divisors for divisions that can be narrowed without
introducing control flow.  This gives us a 32 bit multiply instead of an
emulated 64 bit multiply in the generated PTX assembly.
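
A rough source-level picture of a division that can be narrowed without control flow, assuming the dividend is zero-extended from 32 bits and the divisor is a constant that fits in 32 bits. The function names and the divisor 7 are hypothetical choices for illustration.

```cpp
#include <cstdint>

// Hypothetical illustration; not code from BypassSlowDivision.
// Written as a 64-bit division, but the dividend's upper 32 bits are known
// to be zero because it is zero-extended from a 32-bit value.
std::uint64_t divWide(std::uint32_t x) {
  return static_cast<std::uint64_t>(x) / 7u;
}

// The same computation as a 32-bit division; no runtime check is needed
// because both operands are statically known to fit in 32 bits.
std::uint64_t divNarrow(std::uint32_t x) {
  return x / 7u;
}
```

Both functions return the same value for every 32-bit input, which is what makes the narrowing safe without a runtime range check.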

Reviewers: jlebar

Subscribers: jholewinski, mcrosier, llvm-commits

Differential Revision: https://reviews.llvm.org/D38265

llvm-svn: 319677

aa92cae1

Dec 01, 2017

[IndVars] Fix a bug introduced in r317012 · 6260cf71

Philip Reames authored Dec 01, 2017

Turns out we can have comparisons which are indirect users of the induction variable that we can make invariant. In this case, there is no loop invariant value contributing and we'd fail an assert.

The test case was found by a Java fuzzer and reduced. It's a real corner case. You have to have a static loop which we've already proven only executes once, but haven't broken the backedge on, and an inner phi whose result can be constant folded by SCEV using exit count reasoning but not proven by isKnownPredicate. To my knowledge, only the fuzzer has hit this case.

llvm-svn: 319583

6260cf71

Revert r319537: Bail out of a SimplifyCFG switch table opt at undef values. · 9c13c8b6
Mikael Holmen authored Dec 01, 2017
```
Broke build bots so reverting.

llvm-svn: 319539
```
9c13c8b6

Bail out of a SimplifyCFG switch table opt at undef values. · 9f047795

Mikael Holmen authored Dec 01, 2017

Summary:
A true or false result is expected from a comparison, but it seems the possibility of undef was overlooked, which could lead to a failed assert. This is fixed by this patch by bailing out if we encounter undef.

The bug is old and the assert has been there since the end of 2014, so it seems this is unusual enough to forego optimization.

Patch by: JesperAntonsson

Reviewers: spatel, eeckstein, hans

Reviewed By: hans

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D40639

llvm-svn: 319537

9f047795

Mark all library options as hidden. · 8065f0b9

Zachary Turner authored Dec 01, 2017

These command line options are not intended for public use, and often
don't even make sense in the context of a particular tool anyway. About
90% of them are already hidden, but when people add new options they
forget to hide them, so if you were to make a brand new tool today, link
against one of LLVM's libraries, and run tool -help you would get a
bunch of junk that doesn't make sense for the tool you're writing.

This patch hides these options. The real solution is to not have
libraries defining command line options, but that's a much larger effort
and not something I'm prepared to take on.

Differential Revision: https://reviews.llvm.org/D40674

llvm-svn: 319505

8065f0b9

Nov 28, 2017

EntryExitInstrumenter: set DebugLocs on the inserted call instructions (PR35412) · ca46db95
Hans Wennborg authored Nov 28, 2017
```
Apparently the verifier requires that inlineable calls in a function
with debug info have debug locations.

llvm-svn: 319199
```
ca46db95

This reverts commit r319096 and r319097. · c06f55e1

Rafael Espindola authored Nov 28, 2017

Revert "[SROA] Propagate !range metadata when moving loads."
Revert "[Mem2Reg] Clang-format unformatted parts of this file. NFCI."

Davide says they broke a bot.

llvm-svn: 319131

c06f55e1

Nov 27, 2017

[Mem2Reg] Clang-format unformatted parts of this file. NFCI. · 824d71a9
Davide Italiano authored Nov 27, 2017
```
llvm-svn: 319097
```
824d71a9

[SROA] Propagate !range metadata when moving loads. · b5d59e73

Davide Italiano authored Nov 27, 2017

This tries to propagate !range metadata to a pre-existing load
when a load is optimized out. This is done instead of adding an
assume because converting loads to and from assumes creates a
lot of IR.

Patch by Ariel Ben-Yehuda.

Differential Revision:  https://reviews.llvm.org/D37216

llvm-svn: 319096

b5d59e73