Commits · eef95f2746c3347b8dad19091ffb82a88d73acd3 · Lorenzo Albano / LLVM bpEVL

May 13, 2020

[BrachProbablityInfo] Set edge probabilities at once. NFC. · eef95f27

Yevgeny Rouban authored May 13, 2020

Hide the method that allows setting probability for particular
edge and introduce a public method that sets probabilities for
all outgoing edges at once.
Setting individual edge probability is error prone. More over
it is difficult to check that the total probability is 1.0
because there is no easy way to know when the user finished
setting all the probabilities.

Reviewers: yamauchi, ebrevnov
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D79396

eef95f27

Add nomerge function attribute to supress tail merge optimization in simplifyCFG · cb22ab74

Zequan Wu authored May 12, 2020

We want to add a way to avoid merging identical calls so as to keep the
separate debug-information for those calls. There is also an asan
usecase where having this attribute would be beneficial to avoid
alternative work-arounds.

Here is the link to the feature request:
https://bugs.llvm.org/show_bug.cgi?id=42783.

`nomerge` is different from `noline`. `noinline` prevents function from
inlining at callsites, but `nomerge` prevents multiple identical calls
from being merged into one.

This patch adds `nomerge` to disable the optimization in IR level. A
followup patch will be needed to let backend understands `nomerge` and
avoid tail merge at backend.

Reviewed By: asbirlea, rnk

Differential Revision: https://reviews.llvm.org/D78659

cb22ab74

May 11, 2020

[AssumeBundles] fix crashes · 78d85c20

Tyker authored May 11, 2020

Summary:
this patch fixe crash/asserts found in the test-suite.
the AssumeptionCache cannot be assumed to have all assumes contrary to what i tought.
prevent generation of information for terminators, because this can create broken IR in transfromation where we insert the new terminator before removing the old one.

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79458

78d85c20

[NFC][DwarfDebug] Add test for variables with a single location which · da100de0

OCHyams authored May 07, 2020

don't span their entire scope.

The previous commit (6d1c40c1) is an older version of the test.

Reviewed By: aprantl, vsk

Differential Revision: https://reviews.llvm.org/D79573

da100de0

May 10, 2020

[AssumeBundles] Remove non-determinisme from assume builder · 5957e058

Tyker authored May 10, 2020

Summary:
The assume builder was non-deterministic when working on unamed values.
this patch fixes this.

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: hiraditya, mgrang, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78616

5957e058

[AssumeBundles] Prevent generation of some redundant assumes · 821a0f23

Tyker authored May 10, 2020

Summary: with this patch the assume salvageKnowledge will not generate assume if all knowledge is already available in an assume with valid context. assume bulider can also in some cases update an existing assume with better information.

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78014

821a0f23

[LAA] Move runtime-check generation to Transforms/Utils/loopUtils (NFC) · 8528186b

Florian Hahn authored May 10, 2020

Currently LAA's uses of ScalarEvolutionExpander blocks moving the
expander from Analysis to Transforms. Conceptually the expander does not
fit into Analysis (it is only used for code generation) and
runtime-check generation also seems to be better suited as a
transformation utility.

Reviewers: Ayal, anemet

Reviewed By: Ayal

Differential Revision: https://reviews.llvm.org/D78460

8528186b

May 08, 2020

Re-commit: Mark values as trivially dead when their only use is a start or end lifetime intrinsic. · f65f566a

zoecarver authored May 08, 2020

Summary:
If the only use of a value is a start or end lifetime intrinsic then mark the intrinsic as trivially dead. This should allow for that value to then be removed as well.

Currently, this only works for allocas, globals, and arguments.

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79355

f65f566a

[SimplifyCFG] Remap rewritten debug intrinsic operands. · b38d77f1

Ricky Zhou authored May 08, 2020

FoldBranchToCommonDest clones instructions to a different basic block,
but handles debug intrinsics in a separate path. Previously, when
cloning debug intrinsics, their operands were not updated to reference
the correct cloned values. As a result, we would emit debug.value
intrinsics with broken operand references which are discarded in later
passes. This leads to incorrect debuginfo that reports incorrect values
for variables.

Fix this by remapping debug intrinsic operands when cloning them.

Fixes https://bugs.llvm.org/show_bug.cgi?id=45667.

Differential Revision: https://reviews.llvm.org/D79602

b38d77f1

May 07, 2020

SplitIndirectBrCriticalEdges: Fix Branch Probability update · b921543c

Yevgeny Rouban authored May 07, 2020

Splitting critical edges for indirect branches
the SplitIndirectBrCriticalEdges() function may break branch
probabilities if target basic block happens to have unset
a probability for any of its successors. That is because in
such cases the getEdgeProbability(Target) function returns
probability 1/NumOfSuccessors and it is called after Target
was split (thus Target has a single successor). As the result
the correspondent successor of the split block gets
probability 100% but 1/NumOfSuccessors is expected (or better
be left unset).

Reviewers: yamauchi
Differential Revision: https://reviews.llvm.org/D78806

b921543c

May 06, 2020

[LoopUnrollAndJam] Changed safety checks to consider more than 2-levels · 0a52401a

Whitney Tsang authored May 06, 2020

loop nest.

Summary: As discussed in https://reviews.llvm.org/D73129.

Example
Before unroll and jam:

for
  A
  for
    B
    for
      C
    D
  E
After unroll and jam (currently):

for
  A
  A'
  for
    B
    for
      C
    D
    B'
    for
      C'
    D'
  E
  E'
After unroll and jam (Ideal):

for
  A
  A'
  for
    B
    B'
    for
      C
      C'
    D
    D'
  E
  E'
This is the first patch to change unroll and jam to work in the ideal
way.
This patch change the safety checks needed to make sure is safe to
unroll and jam in the ideal way.

Reviewer: dmgreen, jdoerfert, Meinersbur, kbarton, bmahjour, etiotto
Reviewed By: Meinersbur
Subscribers: fhahn, hiraditya, zzheng, llvm-commits, anhtuyen, prithayan
Tag: LLVM
Differential Revision: https://reviews.llvm.org/D76132

0a52401a

Revert "Mark values as trivially dead when their only use is a start or end lifetime intrinsic." · 1998e796
zoecarver authored May 06, 2020
```
This reverts commit 95aa28cc.
```
1998e796

Mark values as trivially dead when their only use is a start or end lifetime intrinsic. · 95aa28cc

zoecarver authored May 06, 2020

Summary:
If the only use of a value is a start or end lifetime intrinsic then mark the intrinsic as trivially dead. This should allow for that value to then be removed as well.

Currently, this only works for allocas, globals, and arguments.

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79355

95aa28cc

May 05, 2020

[InstCombine] Allow denormal C in pow(C,y) -> exp2(log2(C)*y) · 22829ab5

Jay Foad authored May 05, 2020

We check that C is finite and strictly positive, but there's no need to
check that it's normal too. exp2 should be just as accurate on denormals
as pow is.

Differential Revision: https://reviews.llvm.org/D79413

22829ab5

[InstCombine] Remove hasOneUse check for pow(C,x) -> exp2(log2(C)*x) · fa2783d7

Jay Foad authored May 05, 2020

I don't think there's any good reason not to do this transformation when
the pow has multiple uses.

Differential Revision: https://reviews.llvm.org/D79407

fa2783d7

[CallGraphUpdater] Removed references to calles when deleting function · f637334d

Sergey Dmitriev authored May 04, 2020

Summary: Otherwise we can get unaccounted references to call graph nodes.

Reviewers: jdoerfert, sstefan1

Reviewed By: jdoerfert

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79382

f637334d

May 04, 2020

[SLC] Allow llvm.pow(x,2.0) -> x*x etc even if no pow() lib func · e737847b

Jay Foad authored Apr 30, 2020

optimizePow does not create any new calls to pow, so it should work
regardless of whether the pow library function is available. This allows
it to optimize the llvm.pow intrinsic on targets with no math library.

Based on a patch by Tim Renouf.

Differential Revision: https://reviews.llvm.org/D68231

e737847b

May 03, 2020

[ICP] Handling must tail calls in indirect call promotion · 911e06f5

Hongtao Yu authored May 01, 2020

Per the IR convention, a musttail call must precede a ret with an optional bitcast. This was violated by the indirect call promotion optimization which could result an IR like:

    ; <label>:2192:
      br i1 %2198, label %2199, label %2201, !dbg !226012, !prof !229483

    ; <label>:2199:                                   ; preds = %2192
      musttail call fastcc void @foo(i8* %2195), !dbg !226012
      br label %2202, !dbg !226012

    ; <label>:2201:                                   ; preds = %2192
      musttail call fastcc void %2197(i8* %2195), !dbg !226012
      br label %2202, !dbg !226012

    ; <label>:2202:                                   ; preds = %605, %2201, %2199
      ret void, !dbg !229485

This is being fixed in this change where the return statement goes together with the promoted indirect call. The code generated is like:

    ; <label>:2192:
      br i1 %2198, label %2199, label %2201, !dbg !226012, !prof !229483

    ; <label>:2199:                                   ; preds = %2192
      musttail call fastcc void @foo(i8* %2195), !dbg !226012
      ret void, !dbg !229485

    ; <label>:2201:                                   ; preds = %2192
      musttail call fastcc void %2197(i8* %2195), !dbg !226012
      ret void, !dbg !229485

Differential Revision: https://reviews.llvm.org/D79258

911e06f5

May 02, 2020

[MergeFuncs] Don't merge shufflevectors with different masks · 60e9ee16

Nikita Popov authored May 01, 2020

When the shufflevector mask operand was converted into special
instruction data, the FunctionComparator was not updated to
account for this. As such, MergeFuncs will happily merge
shufflevectors with different masks.

This fixes https://bugs.llvm.org/show_bug.cgi?id=45773.

Differential Revision: https://reviews.llvm.org/D79261

60e9ee16

Apr 30, 2020

[LoopVersioning] Update setAliasChecks to take ArrayRef argument (NFC). · 19ab53f1
Florian Hahn authored Apr 30, 2020
```
This cleanup was suggested as part of D78458.
```
19ab53f1

[InlineFunction] Disable emission of alignment assumptions by default · b74c6d2c

Nikita Popov authored Mar 25, 2020

In D74183 clang started emitting alignment for sret parameters
unconditionally. This caused a 1.5% compile-time regression on
tramp3d-v4. The reason is that we now generate many instance of IR like

    %ptrint = ptrtoint %class.GuardLayers* %guards_m to i64
    %maskedptr = and i64 %ptrint, 3
    %maskcond = icmp eq i64 %maskedptr, 0
    tail call void @llvm.assume(i1 %maskcond)

to preserve the alignment information during inlining. Based on IR
analysis, these assumptions also regress optimization. The attached
phase ordering test case illustrates two issues: One are instruction
count based optimization heuristics, which are affected by the four
additional instructions of the assumption. The other is blocking of
SROA due to ptrtoint casts (PR45763).

We already encountered the same problem in Rust, where we (unlike
Clang) generally prefer to emit alignment information absolutely
everywhere it is available. We were only able to do this after
hardcoding -preserve-alignment-assumptions-during-inlining=false,
because we were seeing significant optimization and compile-time
regressions otherwise.

This patch disables -preserve-alignment-assumptions-during-inlining
by default, because we should not be punishing people for adding
more alignment annotations.

Once the assume bundle work shakes out and we can represent (and use)
alignment assumptions using assume bundles, it should be possible to
re-enable this with reduced overhead.

Differential Revision: https://reviews.llvm.org/D76886

b74c6d2c

[NFC] Rename *ByValOrInalloca* to *PassPointeeByValue* · a90948fd

Arthur Eubanks authored Apr 29, 2020

Summary: In preparation for preallocated.

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79152

a90948fd

[llvm][NFC] Use CallBase explicitly instead of Instruction in FunctionComparator · 3ab319b2

Mircea Trofin authored Apr 29, 2020

Reviewers: dblaikie, craig.topper

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79098

3ab319b2

Apr 29, 2020

[PGO][PGSO] Prep for enabling non-cold code size opts under non-partial-profile sample PGO. · 18319868

Hiroshi Yamauchi authored Apr 27, 2020

Summary:
- Distinguish between partial-profile and non-partial-profile sample PGO.
- Add a flag for partial-profile sample PGO.
- Tune the sample PGO cutoff.
- No default behavior change (yet).

Reviewers: davidxl

Subscribers: eraman, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78949

18319868

[llvm][NFC] Change parameter type to more specific CallBase in IndirectCallPromotion · e61247c0

Mircea Trofin authored Apr 28, 2020

Reviewers: dblaikie, craig.topper, wmi

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79047

e61247c0

Apr 28, 2020

[LAA] Move CheckingPtrGroup/PointerCheck outside class (NFC). · 616657b3

Florian Hahn authored Apr 28, 2020

This allows forward declarations of PointerCheck, which in turn reduce
the number of times LoopAccessAnalysis needs to be included.

Ultimately this helps with moving runtime check generation to
Transforms/Utils/LoopUtils.h, without having to include it there.

Reviewers: anemet, Ayal

Reviewed By: Ayal

Differential Revision: https://reviews.llvm.org/D78458

616657b3

[TTI] Add TargetCostKind argument to getUserCost · e9c9329a

Sam Parker authored Apr 27, 2020

There are several different types of cost that TTI tries to provide
explicit information for: throughput, latency, code size along with
a vague 'intersection of code-size cost and execution cost'.

The vectorizer is a keen user of RecipThroughput and there's at least
'getInstructionThroughput' and 'getArithmeticInstrCost' designed to
help with this cost. The latency cost has a single use and a single
implementation. The intersection cost appears to cover most of the
rest of the API.

getUserCost is explicitly called from within TTI when the user has
been explicit in wanting the code size (also only one use) as well
as a few passes which are concerned with a mixture of size and/or
a relative cost. In many cases these costs are closely related, such
as when multiple instructions are required, but one evident diverging
cost in this function is for div/rem.

This patch adds an argument so that the cost required is explicit,
so that we can make the important distinction when necessary.

Differential Revision: https://reviews.llvm.org/D78635

e9c9329a

[IR] Replace all uses of CallBase::getCalledValue() with getCalledOperand(). · a58b62b4

Craig Topper authored Apr 27, 2020

This method has been commented as deprecated for a while. Remove
it and replace all uses with the equivalent getCalledOperand().

I also made a few cleanups in here. For example, to removes use
of getElementType on a pointer when we could just use getFunctionType
from the call.

Differential Revision: https://reviews.llvm.org/D78882

a58b62b4

[llvm][NFC] Use CallBase instead of Instruction in ProfileSummaryInfo · cb56e9b9

Mircea Trofin authored Apr 27, 2020

Summary:
getProfileCount requires the parameter be a valid CallBase, and its uses
reflect that.

Reviewers: dblaikie, craig.topper, wmi

Subscribers: eraman, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78940

cb56e9b9

Add IR constructs for preallocated (inalloca replacement) · 3b0450ac

Arthur Eubanks authored Feb 14, 2020

Add llvm.call.preallocated.{setup,arg} instrinsics.
Add "preallocated" operand bundle which takes a token produced by llvm.call.preallocated.setup.
Add "preallocated" parameter attribute, which is like byval but without the copy.

Verifier changes for these IR constructs.

See https://github.com/rnk/llvm-project/blob/call-setup-docs/llvm/docs/CallSetup.md

Subscribers: hiraditya, jdoerfert, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D74651

3b0450ac

Apr 27, 2020
- [NFC] UnifyLoopExits: correctly skip expensive checks · 84887636
  Sameer Sahasrabuddhe authored Apr 27, 2020
  
  84887636
Apr 25, 2020

[AssumeBundles] Refactor asssume builder · e5f8a77c

Tyker authored Apr 24, 2020

Summary:
refactor assume bulider for the next patch.
the assume builder now generate only one assume per attribute kind and per value they are on. to do this it takes the highest. this is desirable because currently, for all attributes the higest value is the most valuable.

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78013

e5f8a77c

Give helpers internal linkage. NFC. · 1d42764d
Benjamin Kramer authored Apr 25, 2020

1d42764d

[CodeExtractor] Fix extraction of a value used only by intrinsics outside of region · 64249f17

Ehud Katz authored Apr 25, 2020

We should only skip `lifetime` and `dbg` intrinsics when searching for users.
Other intrinsics are legit users that can't be ignored.

Without this fix, the testcase would result in an invalid IR. `memcpy`
will have a reference to the, now, external value (local to the
extracted loop function).

Fix PR42194

Differential Revision: https://reviews.llvm.org/D78749

64249f17

Apr 24, 2020
- [NFC] Refactor SimplifyCFG to make propagating information easier. · 97ecd91e
  Tyker authored Apr 24, 2020
```
Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D77742
```
  97ecd91e
- [CallSite removal][Transform] Replace CallSite with CallBase in Utils. NFC · 81c5e83f
  Craig Topper authored Apr 23, 2020
```
Differential Revision: https://reviews.llvm.org/D78780
```
  81c5e83f
Apr 23, 2020

[SVE] Remove calls to isScalable from Transforms · 7ca56c90

Christopher Tetreault authored Apr 23, 2020

Reviewers: efriedma, chandlerc, reames, aprantl, sdesmalen

Reviewed By: efriedma

Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D77756

7ca56c90

[Debugify] Do not require named metadata to be present when stripping · 2fa656cd
Vedant Kumar authored Apr 17, 2020
```
This allows -mir-strip-debug to be run without -debugify having run
before.
```
2fa656cd

[MachineDebugify] Insert synthetic DBG_VALUE instructions · 2a5675f1

Vedant Kumar authored Apr 13, 2020

Summary:
Teach MachineDebugify how to insert DBG_VALUE instructions.  This can
help find bugs causing CodeGen differences when debug info is present.
DBG_VALUE instructions are only emitted when -debugify-level is set to
locations+variables.

There is essentially no attempt made to match up DBG_VALUE register
operands with the local variables they ought to correspond to. I'm not
sure how to improve the situation. In some cases (MachineMemOperand?)
it's possible to find the IR instruction a MachineInstr corresponds to,
but in general this seems to call for "undoing" the work done by ISel.

Reviewers: dsanders, aprantl

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78135

2a5675f1

Apr 22, 2020

[SVE] Add new VectorType subclasses · 2dea3f12

Christopher Tetreault authored Apr 22, 2020

Summary:
Introduce new types for fixed width and scalable vectors.

Does not remove getNumElements yet so as to not break code during transition
period.

Reviewers: deadalnix, efriedma, sdesmalen, craig.topper, huntergr

Reviewed By: sdesmalen

Subscribers: jholewinski, arsenm, jvesely, nhaehnle, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, kerbowa, Joonsoo, grosul1, frgossen, lldb-commits, tschuett, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm, #lldb

Differential Revision: https://reviews.llvm.org/D77587

2dea3f12