Commits · fa26a37d36991dd19276c2112ae03e962debe0e4 · Lorenzo Albano / LLVM bpEVL

Jan 15, 2020

[NFC][IndVarSimplify] remove duplicate code in widenWithVariantLoadUseCodegen. · 34ba96a3

Zhongduo Lin authored Jan 15, 2020

Summary: Duplicate code in widenWithVariantLoadUseCodegen is removed and also use assert to check unknown extension type as it should be filtered out by the pre condition check before calling this function.

Reviewers: az, sanjoy, sebpop, efriedma, javed.absar, sanjoy.google

Reviewed By: efriedma

Subscribers: hiraditya, llvm-commits, amehsan

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72652

34ba96a3

DebugInfo: Factor out logic to update locations in MD_loop metadata, NFC · a2cc80bc

Vedant Kumar authored Jan 15, 2020

Factor out the logic needed to update debug locations contained within
MD_loop metadata.

This refactor is preparation for a future change that also needs to
rewrite MD_loop metadata.

rdar://45507940

a2cc80bc

[Loop Peeling] Add possibility to enable peeling on loop nests. · 3f3017e1

Arkady Shlykov authored Jan 15, 2020

Summary:
Current peeling implementation bails out in case of loop nests.
The patch introduces a field in TargetTransformInfo structure that
certain targets can use to relax the constraints if it's
profitable (disabled by default).
Also additional option is added to enable peeling manually for
experimenting and testing purposes.

Reviewers: fhahn, lebedev.ri, xbolva00

Reviewed By: xbolva00

Subscribers: xbolva00, hiraditya, zzheng, llvm-commits

Differential Revision: https://reviews.llvm.org/D70304

3f3017e1

[InstCombine] reassociate fsub+fsub into fsub+fadd · 3180af43

Sanjay Patel authored Jan 15, 2020

As discussed in the motivating PR44509:
https://bugs.llvm.org/show_bug.cgi?id=44509

...we can end up with worse code using fast-math than without.
This is because the reassociate pass greedily transforms fsub
into fneg/fadd and apparently (based on the regression tests
seen here) expects instcombine to clean that up if it wasn't
profitable. But we were missing this fold:

(X - Y) - Z --> X - (Y + Z)

There's another, more specific case that I think we should
handle as shown in the "fake" fneg test (but missed with a real
fneg), but that's another patch. That may be tricky to get
right without conflicting with existing transforms for fneg.

Differential Revision: https://reviews.llvm.org/D72521

3180af43

[Attributor] AAValueConstantRange: Value range analysis using constant range · 188f9a34

Hideto Ueno authored Jan 15, 2020

Summary:
This patch introduces `AAValueConstantRange`, which answers a possible range for integer value in a specific program point.
One of the motivations is propagating existing `range` metadata. (I think we need to change the situation that `range` metadata cannot be put to Argument).

The state is a tuple of `ConstantRange` and it is initialized to (known, assumed) = ([-∞, +∞], empty).

Currently, AAValueConstantRange is created in `getAssumedConstant` method when `AAValueSimplify` returns `nullptr`(worst state).

Supported
 - BinaryOperator(add, sub, ...)
 - CmpInst(icmp eq, ...)
 - !range metadata

`AAValueConstantRange` is not intended to extend to polyhedral range value analysis.

Reviewers: jdoerfert, sstefan1

Reviewed By: jdoerfert

Subscribers: phosek, davezarzycki, baziotis, hiraditya, javed.absar, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D71620

188f9a34

Jan 14, 2020

[InstCombine] Fix worklist management when removing guard intrinsic · 04e58615

Nikita Popov authored Jan 11, 2020

When multiple guard intrinsics are merged into one, currently the
result of eraseInstFromFunction() is returned -- however, this
should only be done if the current instruction is being removed.
In this case we're removing a different instruction and should
instead report that the current one has been modified by returning it.

For this test case, this reduces the number of instcombine iterations
from 5 to 2 (the minimum possible).

Differential Revision: https://reviews.llvm.org/D72558

04e58615

[NewPM] Port MergeFunctions pass · 41033186

Nikita Popov authored Jan 10, 2020

This ports the MergeFunctions pass to the NewPM. This was rather
straightforward, as no analyses are used.

Additionally MergeFunctions needs to be conditionally enabled in
the PassBuilder, but I left that part out of this patch.

Differential Revision: https://reviews.llvm.org/D72537

41033186

[InstCombine] Fix infinite loop due to bitcast <-> phi transforms · 65c0805b

Nikita Popov authored Jan 13, 2020

Fix for https://bugs.llvm.org/show_bug.cgi?id=44245.

The optimizeBitCastFromPhi() and FoldPHIArgOpIntoPHI() end up
fighting against each other, because optimizeBitCastFromPhi()
assumes that bitcasts of loads will get folded. This doesn't
happen here, because a dangling phi node prevents the one-use
fold in https://github.com/llvm/llvm-project/blob/master/llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp#L620-L628 from triggering.

This patch fixes the issue by explicitly performing the load
combine as part of the bitcast of phi transform. Other attempts
to force the load to be combined first were ultimately too
unreliable.

Differential Revision: https://reviews.llvm.org/D71164

65c0805b

[InstCombine] Make combineLoadToNewType a method; NFC · b4dd928f
Nikita Popov authored Jan 13, 2020
```
So it can be reused as part of other combines.
In particular for D71164.
```
b4dd928f

[InstCombine] Fix user iterator invalidation in bitcast of phi transform · 652cd7c1

Nikita Popov authored Jan 13, 2020

This fixes the issue encountered in D71164. Instead of using a
range-based for, manually iterate over the users and advance the
iterator beforehand, so we do not skip any users due to iterator
invalidation.

Differential Revision: https://reviews.llvm.org/D72657

652cd7c1

[ThinLTO/WPD] Remove an overly-aggressive assert · 2cefb939

Teresa Johnson authored Jan 13, 2020

Summary:
An assert added to the index-based WPD was trying to verify that we only
have multiple vtables for a given guid when they are all non-external
linkage. This is too conservative because we may have multiple external
vtable with the same guid when they are in comdat. Remove the assert,
as we don't have comdat information in the index, the linker should
issue an error in this case.

See discussion on D71040 for more information.

Reviewers: evgeny777, aganea

Subscribers: mehdi_amini, inglorion, hiraditya, steven_wu, dexonsmith, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72648

2cefb939

[InstCombine] Let combineLoadToNewType preserve ABI alignment of the load (PR44543) · 3e32b7e1

Juneyoung Lee authored Jan 15, 2020

Summary:
If aligment on `LoadInst` isn't specified, load is assumed to be ABI-aligned.
And said aligment may be different for different types.
So if we change load type, but don't pay extra attention to the aligment
(i.e. keep it unspecified), we may either overpromise (if the default aligment
of the new type is higher), or underpromise (if the default aligment
of the new type is smaller).

Thus, if no alignment is specified, we need to manually preserve the implied ABI alignment.

This addresses https://bugs.llvm.org/show_bug.cgi?id=44543 by making combineLoadToNewType preserve ABI alignment of the load.

Reviewers: spatel, lebedev.ri

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72710

3e32b7e1

Removed PointerUnion3 and PointerUnion4 aliases in favor of the variadic template · 2948ec5c
Dmitri Gribenko authored Jan 14, 2020

2948ec5c

Revert "Recommit "[GlobalOpt] Pass DTU to removeUnreachableBlocks instead of recomputing."" · 192cce10

Florian Hahn authored Jan 14, 2020

This reverts commit a03d7b0f.

As discussed in D68298, this causes a compile-time regression, in case
the DTs requested are not used elsewhere in GlobalOpt. We should only
get the DTs if they are available here, but this seems not possible with
the legacy pass manager from a module pass.

192cce10

Make helper functions static or move them into anonymous namespaces. NFC. · df186507
Benjamin Kramer authored Jan 14, 2020

df186507

Jan 13, 2020

[PGO][CHR] Guard against 0-to-0 branch weight and avoid division by zero crash. · 7b9f8e17

Hiroshi Yamauchi authored Jan 13, 2020

Summary: This fixes a crash in internal builds under SamplePGO.

Reviewers: davidxl

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72653

7b9f8e17

[ThinLTO/WPD] Fix index-based WPD for alias vtables · 31441a3e

Teresa Johnson authored Dec 04, 2019

Summary:
A recent fix in D69452 fixed index based WPD in the presence of
available_externally vtables. It added a cast of the vtable def
summary to a GlobalVarSummary. However, in some cases one def may be an
alias, in which case we need to get the base object before casting,
otherwise we will crash.

Reviewers: evgeny777, steven_wu, aganea

Subscribers: mehdi_amini, inglorion, hiraditya, dexonsmith, arphaman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D71040

31441a3e

Jan 11, 2020
- Fix uninitialized value clang static analyzer warning. NFC. · 2740b2d5
  Simon Pilgrim authored Jan 11, 2020
  
  2740b2d5
- DSE: fix bug where we would only check libcalls for name rather than whole decl · 87407fc0
  Nuno Lopes authored Jan 11, 2020
  
  87407fc0
- [InstCombine] Preserve nuw on sub of geps (PR44419) · 0e322c8a
  Nikita Popov authored Jan 01, 2020
```
Fix https://bugs.llvm.org/show_bug.cgi?id=44419 by preserving the
nuw on sub of geps. We only do this if the offset has a multiplication
as the final operation, as we can't be sure the operations is nuw
in the other cases without more thorough analysis.

Differential Revision: https://reviews.llvm.org/D72048
```
  0e322c8a
Jan 10, 2020

Add support for __declspec(guard(nocf)) · bdd88b7e

Andrew Paverd authored Jan 10, 2020

Summary:
Avoid using the `nocf_check` attribute with Control Flow Guard. Instead, use a
new `"guard_nocf"` function attribute to indicate that checks should not be
added on indirect calls within that function. Add support for
`__declspec(guard(nocf))` following the same syntax as MSVC.

Reviewers: rnk, dmajor, pcc, hans, aaron.ballman

Reviewed By: aaron.ballman

Subscribers: aaron.ballman, tomrittervg, hiraditya, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D72167

bdd88b7e

Don't use dyn_cast_or_null if we know the pointer is nonnull. · 2e66405d
Simon Pilgrim authored Jan 09, 2020
```
Fix clang static analyzer null dereference warning by using dyn_cast instead.
```
2e66405d
[LV] Silence unused variable warning in Release builds. NFC. · 498856fc
Benjamin Kramer authored Jan 10, 2020

498856fc

[LV] VPValues for memory operation pointers (NFCI) · 8647a72c

Gil Rapaport authored Nov 28, 2019

Memory instruction widening recipes use the pointer operand of their load/store
ingredient for generating the needed GEPs, making it difficult to feed these
recipes with pointers based on other ingredients or none at all.
This patch modifies these recipes to use a VPValue for the pointer instead, in
order to reduce ingredient def-use usage by ILV as a step towards full
VPlan-based def-use relations. The recipes are constructed with VPValues bound
to these ingredients, maintaining current behavior.

Differential revision: https://reviews.llvm.org/D70865

8647a72c

Jan 09, 2020

[NFCI][LoopUnrollAndJam] Changing LoopUnrollAndJamPass to a function · d27a15fe

Whitney Tsang authored Jan 09, 2020

pass.

Summary: This patch changes LoopUnrollAndJamPass to a function pass, and
keeps the loops traversal order same as defined in
FunctionToLoopPassAdaptor LoopPassManager.h.

The next patch will change the loop traversal to outer to inner order,
so more loops can be transform.

Discussion in llvm-dev mailing list:
https://groups.google.com/forum/#!topic/llvm-dev/LF4rUjkVI2g
Reviewer: dmgreen, jdoerfert, Meinersbur, kbarton, bmahjour, etiotto
Reviewed By: dmgreen
Subscribers: hiraditya, zzheng, llvm-commits
Tag: LLVM
Differential Revision: https://reviews.llvm.org/D72230

d27a15fe

[InstCombine] Z / (1.0 / Y) => (Y * Z) · 6c04ef47

@raghesh (Raghesh Aloor) authored Jan 09, 2020

This is a special case of Z / (X / Y) => (Y * Z) / X, with X = 1.0.
The m_OneUse check is avoided because even in the case of the
multiple uses for 1.0/Y, the number of instructions remain the same
and a division is replaced by a multiplication.

Differential Revision: https://reviews.llvm.org/D72319

6c04ef47

[Matrix] Update shape propagation to iterate until done. · ccf24225

Florian Hahn authored Jan 09, 2020

This patch updates the shape propagation to iterate until no new shape
information is discovered.

As initial seed for the forward propagation, we use the matrix intrinsic
instructions. Both propagateShapeForward and propagateShapeBackward
return new work lists, with the instructions to be used for the next
iteration. When propagating forward, we record all instructions we added
new shape information for. When propagating backward, we record all
users of instructions we added new shape information for.

Reviewers: anemet, Gerolf, reames, hfinkel, andrew.w.kaylor

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D70901

ccf24225

[Matrix] Propagate and use shape information for loads. · 7adf6644

Florian Hahn authored Jan 09, 2020

This patch extends to shape propagation to also include load
instructions and implements shape aware lowering for vector loads.

Reviewers: anemet, Gerolf, reames, hfinkel, andrew.w.kaylor

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D70900

7adf6644

[LoopUtils][NFC] Minor refactoring in getLoopEstimatedTripCount. · f0abe820
Evgeniy Brevnov authored Dec 31, 2019

f0abe820

[Matrix] Implement back-propagation of shape information. · 459ad8e9

Florian Hahn authored Jan 09, 2020

This patch extends the shape propagation for matrix operations to also
propagate the shape of instructions to their operands.

Reviewers: anemet, Gerolf, reames, hfinkel, andrew.w.kaylor

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D70899

459ad8e9

[LV] Still vectorise when tail-folding can't find a primary inducation variable · 8f188745

Sjoerd Meijer authored Jan 09, 2020

This addresses a vectorisation regression for tail-folded loops that are
counting down, e.g. loops as simple as this:

  void foo(char *A, char *B, char *C, uint32_t N) {
    while (N > 0) {
      *C++ = *A++ + *B++;
       N--;
    }
  }

These are loops that can be vectorised, but when tail-folding is requested, it
can't find a primary induction variable which we do need for predicating the
loop. As a result, the loop isn't vectorised at all, which it is able to do
when tail-folding is not attempted. So, this adds a check for the primary
induction variable where we decide how to lower the scalar epilogue. I.e., when
there isn't a primary induction variable, a scalar epilogue loop is allowed
(i.e. don't request tail-folding) so that vectorisation could still be
triggered.

Having this check for the primary induction variable make sense anyway, and in
addition, in a follow-up of this I will look into discovering earlier the
primary induction variable for counting down loops, so that this can also be
tail-folded.

Differential revision: https://reviews.llvm.org/D72324

8f188745

[Attributor][FIX] Carefully change invokes to calls (after manifest) · a4088c75

Johannes Doerfert authored Jan 07, 2020

Before we manually inserted unreachable early but that could lead to
broken PHI nodes. Now we use the existing late modification
functionality.

a4088c75

[Attributor][FIX] Avoid dangling value pointers during code modification · 1e46eb74

Johannes Doerfert authored Jan 07, 2020

When we replace instructions with unreachable we delete instructions. We
now avoid dangling pointers to those deleted instructions in the
`ToBeChangedToUnreachableInsts` set. Other modification collections
might need to be updated in the future as well.

1e46eb74

Jan 08, 2020

Revert "[JumpThreading] Thread jumps through two basic blocks" · 2d258ed9

Kazu Hirata authored Jan 08, 2020

It looks like my patch breaks the sanitizer-windows build:

http://lab.llvm.org:8011/builders/sanitizer-windows/builds/56324

This reverts commit ead81592.

2d258ed9

[JumpThreading] Thread jumps through two basic blocks · ead81592

Kazu Hirata authored Jan 08, 2020

Summary:
This patch teaches JumpThreading.cpp to thread through two basic
blocks like:

  bb3:
    %var = phi i32* [ null, %bb1 ], [ @a, %bb2 ]
    %tobool = icmp eq i32 %cond, 0
    br i1 %tobool, label %bb4, label ...

  bb4:
    %cmp = icmp eq i32* %var, null
    br i1 %cmp, label bb5, label bb6

by duplicating basic blocks like bb3 above.  Once we duplicate bb3 as
bb3.dup and redirect edge bb2->bb3 to bb2->bb3.dup, we have:

  bb3:
    %var = phi i32* [ @a, %bb2 ]
    %tobool = icmp eq i32 %cond, 0
    br i1 %tobool, label %bb4, label ...

  bb3.dup:
    %var = phi i32* [ null, %bb1 ]
    %tobool = icmp eq i32 %cond, 0
    br i1 %tobool, label %bb4, label ...

  bb4:
    %cmp = icmp eq i32* %var, null
    br i1 %cmp, label bb5, label bb6

Then the existing code in JumpThreading.cpp can thread edge
bb3.dup->bb4 through bb4 and eventually create bb3.dup->bb5.

Reviewers: wmi

Subscribers: hiraditya, jfb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D70247

ead81592

Revert "[InstCombine] fold zext of masked bit set/clear" · b212eb71

Kadir Cetinkaya authored Jan 08, 2020

This reverts commit a041c4ec.

This looks like a non-trivial change and there has been no code
reviews (at least there were no phabricator revisions attached to the
commit description). It is also causing a regression in one of our
downstream integration tests, we haven't been able to come up with a
minimal reproducer yet.

b212eb71

[GVN/FP] Considate logic for reasoning about equality vs equivalance for floats · 312a532d

Philip Reames authored Jan 07, 2020

Factor out common logic into some reasonable commented helper functions. In the process, ensure that the in-block vs cross-block cases are handled the same. They previously weren't.

Differential Revision: https://reviews.llvm.org/D67126

312a532d

Jan 07, 2020

[InstCombine] try to pull 'not' of select into compare operands · f8962571

Sanjay Patel authored Jan 07, 2020

not (select ?, (cmp TPred, ?, ?), (cmp FPred, ?, ?) -->
     select ?, (cmp TPred', ?, ?), (cmp FPred', ?, ?)

If both sides of the select are cmps, we can remove an instruction.
The case where only side is a cmp is deferred to a possible
follow-on patch.

We have a more general 'isFreeToInvert' analysis, but I'm not seeing
a way to use that more widely without inducing infinite looping
(opposing transforms).
Here, we flip the compare predicates directly, so we should not have
any danger by creating extra intermediate 'not' ops.

Alive proofs:
https://rise4fun.com/Alive/jKa

Name: both select values are compares - invert predicates
  %tcmp = icmp sle i32 %x, %y
  %fcmp = icmp ugt i32 %z, %w
  %sel = select i1 %cond, i1 %tcmp, i1 %fcmp
  %not = xor i1 %sel, true
=>
  %tcmp_not = icmp sgt i32 %x, %y
  %fcmp_not = icmp ule i32 %z, %w
  %not = select i1 %cond, i1 %tcmp_not, i1 %fcmp_not

Name: false val is compare - invert/not
  %fcmp = icmp ugt i32 %z, %w
  %sel = select i1 %cond, i1 %tcmp, i1 %fcmp
  %not = xor i1 %sel, true
=>
  %tcmp_not = xor i1 %tcmp, -1
  %fcmp_not = icmp ule i32 %z, %w
  %not = select i1 %cond, i1 %tcmp_not, i1 %fcmp_not

Differential Revision: https://reviews.llvm.org/D72007

f8962571

Fix "use of uninitialized variable" static analyzer warnings. NFCI. · bd1dc6a3
Simon Pilgrim authored Jan 06, 2020

bd1dc6a3

Add Triple::isX86() · 6904cd94

Fangrui Song authored Jan 06, 2020

Reviewed By: craig.topper, skan

Differential Revision: https://reviews.llvm.org/D72247

6904cd94