Commits · fe0197e194a64f950602fb50736b6648a9e5b2a9 · Lorenzo Albano / LLVM bpEVL

Oct 07, 2020

[InstCombine] Add checks for and(logicalshift(zext(x),undef),y) cases · fe0197e1
Simon Pilgrim authored Oct 07, 2020
```
Prep work before some cleanup in narrowMaskedBinOp
```

[LAA] Use DL to get element size for bound computation. · a73166a4

Florian Hahn authored Oct 07, 2020

Currently LAA uses getScalarSizeInBits to compute the size of an element
when computing the end bound of an access.

This does not work as expected for pointers to pointers, because
getScalarSizeInBits will return 0 for pointer types.

By using DataLayout to get the size of the element we can also correctly
handle pointer element types.
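
The difference between the two size queries can be sketched with a toy model in plain C++. `ToyType`, `scalarSizeInBits`, `storeSizeInBytes`, and `endBound` are hypothetical stand-ins rather than LLVM classes or the actual LAA code. The 8-byte pointer width is an assumed data layout, and the sketch assumes the end bound is the address of the last accessed element plus the element size.

```cpp
#include <cassert>
#include <cstdint>

// Toy stand-in for an IR type: either an N-bit integer or a pointer.
struct ToyType {
  bool IsPointer;
  unsigned IntBits; // only meaningful for integer types
};

// Assumption for this sketch: pointers are 8 bytes wide.
constexpr uint64_t kAssumedPointerBytes = 8;

// Mirrors the behaviour described above: pointer types report 0 bits.
unsigned scalarSizeInBits(const ToyType &T) {
  return T.IsPointer ? 0 : T.IntBits;
}

// Data-layout-style query: every element type, pointers included, has a size.
uint64_t storeSizeInBytes(const ToyType &T) {
  return T.IsPointer ? kAssumedPointerBytes : (T.IntBits + 7) / 8;
}

// Assumed shape of the bound: last element address plus the element size.
uint64_t endBound(uint64_t LastElementAddr, uint64_t ElementBytes) {
  return LastElementAddr + ElementBytes;
}
```

With a pointer element type, the bit-size query contributes 0 bytes, so the computed end coincides with the start of the last element, while the layout-based query extends it by the pointer width.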

Note the changes to the existing test, which seems to also use the wrong
offset for the end.

Fixes PR47751.

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D88953


[llvm][mlir] Promote the experimental reduction intrinsics to be first class intrinsics. · 322d0afd

Amara Emerson authored Oct 02, 2020

This change renames the intrinsics to not have "experimental" in the name.

The autoupgrader will handle legacy intrinsics.

Relevant ML thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html

Differential Revision: https://reviews.llvm.org/D88787


[MemCpyOpt] Add additional callslot test cases (NFC) · 7a01fc5a
Nikita Popov authored Oct 06, 2020
```
For cases where the destination is captured.
```
[NFC][InstCombine] Autogenerate a few tests being affected by upcoming patch · bef27e50
Roman Lebedev authored Oct 07, 2020

[Tests] Precommit test showing gap around load forwarding of vectors in instcombine · 14d5ee63
Philip Reames authored Oct 07, 2020


InstCombine: Negator: don't rely on complexity sorting already being performed (PR47752) · fed0f890

Roman Lebedev authored Oct 07, 2020

In some cases, we can negate an instruction if only one of its operands
negates. Previously, we assumed that constants would have been
canonicalized to RHS already, but that isn't guaranteed to happen,
because of InstCombine worklist visitation order,
as the added test (previously-hanging) shows.

So if we only need to negate a single operand,
we should ourselves ensure that we try the constant operand first.
Do that by re-doing the complexity sorting ourselves,
when we actually care about it.

Fixes https://bugs.llvm.org/show_bug.cgi?id=47752


[InstCombine] Tweak funnel by constant tests for better shl/lshr commutation coverage · dce03e30
Simon Pilgrim authored Oct 07, 2020

[LAA] Add test for PR47751, which currently uses wrong bounds. · 20cfd5fa
Florian Hahn authored Oct 07, 2020

[Test] Add one more test where we can avoid creating trunc · 85a6f8fc
Max Kazantsev authored Oct 07, 2020


[SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try... · 7fa503ef

Roman Lebedev authored Oct 07, 2020

[SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try getTypePartition() before falling back to largest integral use type (PR47592)

And another step towards transforms not introducing inttoptr and/or
ptrtoint casts that weren't there already.

In this case, when load/store uses have conflicting types,
instead of falling back to the iN, we can try to use allocated sub-type.
As discussed, this isn't the best idea overall (we shouldn't rely on
allocated type), but it works fine as a temporary measure.

I've measured: at `-O3`, on vanilla llvm test-suite + RawSpeed,
this results in +0.05% bitcasts, -5.51% inttoptr
and -1.05% ptrtoint (at the end of the middle-end opt pipeline).

See https://bugs.llvm.org/show_bug.cgi?id=47592

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D88788


[Test] Add test showing that we can avoid inserting trunc/zext · 0c009e09
Max Kazantsev authored Oct 07, 2020


[PowerPC] implement target hook getTgtMemIntrinsic · f0560870

Chen Zheng authored Sep 27, 2020

This patch lets optimization passes recognize PowerPC-related memory intrinsics.

Reviewed By: steven.zhang

Differential Revision: https://reviews.llvm.org/D88373


[Attributor] Use smarter way to determine alignment of GEPs · 7993d611
Johannes Doerfert authored Sep 12, 2020
```
Use the same logic that exists in other places to deal with base case GEPs.

Add the original Attributor talk example.
```

[Attributor] Ignore read accesses to constant memory · c4cfe7a4

Johannes Doerfert authored Sep 09, 2020

The old function attribute deduction pass ignores reads of constant
memory and we need to copy this behavior to replace the pass completely.
The first step is constant globals. TBAA can also describe constant
accesses and there are other possibilities. We might want to consider
asking the alias analyses that are available but for now this is simpler
and cheaper.


[Attributor] Give up early on AANoReturn::initialize · 3f540c05

Johannes Doerfert authored Sep 07, 2020

If the function is not assumed `noreturn` we should not wait for an
update to mark the call site as "may-return".

This has two kinds of consequences:
  - We have fewer iterations in many tests.
  - We have fewer deductions based on "known information" (since we ask
    earlier, point 1, and therefore assumed information is not "known"
    yet).
The latter is an artifact that we might want to tackle properly at some
point but which is not easily fixable right now.


Oct 06, 2020

[MemCpyOpt] Use dereferenceable pointer helper · 616f5450

Nikita Popov authored Oct 04, 2020

The call slot optimization has some home-grown code for checking
whether the destination is dereferenceable. Replace this with the
generic isDereferenceableAndAlignedPointer() helper.

I'm not checking alignment here, because that is currently handled
separately and may be an enforced alignment for allocas. The clean
way of integrating that part would probably be to accept a callback
in isDereferenceableAndAlignedPointer() for the actual isAligned check,
which would then have a chance to use an enforced alignment instead.

This allows the destination to be a GEP (among other things), though
the two open TODOs may prevent it from working in practice.

Differential Revision: https://reviews.llvm.org/D88805


[MemCpyOpt] Check for throwing calls during call slot optimization · 6b441ca5

Nikita Popov authored Oct 04, 2020

When performing call slot optimization for a non-local destination,
we need to check whether there may be throwing calls between the
call and the copy. Otherwise, the early write to the destination
may be observable by the caller.
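
Why the early write matters can be illustrated with a small standalone C++ sketch. It is hand-written to mimic the shape of the code before and after the transformation and is not taken from MemCpyOpt or its tests. `Payload`, `produce`, `mayThrow`, and both `...Shape` functions are hypothetical names.

```cpp
#include <cassert>
#include <cstring>
#include <stdexcept>

struct Payload {
  int Values[4];
};

// The call whose result slot is being optimized.
void produce(Payload *Out) {
  for (int I = 0; I < 4; ++I)
    Out->Values[I] = I + 1;
}

// A call between the producer and the copy that may unwind.
void mayThrow(bool ShouldThrow) {
  if (ShouldThrow)
    throw std::runtime_error("unwinding past the copy");
}

// Original shape: write to a local temporary, copy to the destination last.
void originalShape(Payload *Dest, bool ShouldThrow) {
  Payload Tmp;
  produce(&Tmp);
  mayThrow(ShouldThrow);
  std::memcpy(Dest, &Tmp, sizeof(Tmp));
}

// Shape after forwarding the destination into the call: the write happens
// before the potentially throwing call.
void callSlotOptimizedShape(Payload *Dest, bool ShouldThrow) {
  produce(Dest);
  mayThrow(ShouldThrow);
}

// Returns true if a caller that catches the exception still sees the
// destination in its initial, zeroed state.
bool destUntouchedAfterThrow(void (*Fn)(Payload *, bool)) {
  Payload Dest = {};
  try {
    Fn(&Dest, true);
  } catch (const std::runtime_error &) {
    // The caller recovers and inspects Dest below.
  }
  for (int V : Dest.Values)
    if (V != 0)
      return false;
  return true;
}
```

In this sketch the caller can tell the two shapes apart once the intermediate call throws, which is the situation the added check is meant to rule out for non-local destinations.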

This was already done for call slot optimization of load/store,
but not for memcpys. For the sake of clarity, I'm moving this check
into the common optimization function, even if that does need an
additional instruction scan for the load/store case.

As efriedma pointed out, this check is not sufficient due to
potential accesses from another thread. This case is left as a TODO.

Differential Revision: https://reviews.llvm.org/D88799


[SimplifyLibCalls] Optimize mempcpy_chk to mempcpy · 86429c4e
Dávid Bolvanský authored Oct 05, 2020


[test][InstCombine][NewPM] Fix InstCombine tests under NPM · 8df17b4d

Arthur Eubanks authored Sep 23, 2020

Some of these depended on analyses being present that aren't provided
automatically in NPM.

early_dce_clobbers_callgraph.ll was previously inlining a noinline function?

cast-call-combine.ll relied on the legacy always-inline pass being a
CGSCC pass and getting rerun.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D88187


[Attributor][FIX] Move assertion to make it not trivially fail · 4a7a9884

Johannes Doerfert authored Sep 09, 2020

The idea of this assertion was to check the simplified value before we
assign it, not after, which caused this to trivially fail all the time.


[Attributor][FIX] Dead return values are not `noundef` · 04f69513

Johannes Doerfert authored Sep 08, 2020

When we assume a return value is dead we might still visit return
instructions via `Attributor::checkForAllReturnedValuesAndReturnInsts(..)`.
When we do so the "returned value" is potentially simplified to `undef`
as it is the assumed "returned value". This is a problem if there was a
preexisting `noundef` attribute that will only be removed as we manifest
the `undef` return value. We should not use this combination to derive
`unreachable` though. Two test cases fixed.


[Attributor][NFC] Ignore benign uses in AAMemoryBehaviorFloating · 957094e3

Johannes Doerfert authored Sep 07, 2020

In AAMemoryBehaviorFloating we used to track benign uses in a SetVector.
With this change we look through benign uses eagerly to reduce the
number of elements (=Uses) we look at during an update.

The test does not actually fail prior to this commit but I already wrote
it so I kept it.


[VPlan] Add vplan native path vectorization test case for inner loop reduction · cef0de5e

Mauri Mustonen authored Oct 06, 2020

Regarding this bug I posted earlier: https://bugs.llvm.org/show_bug.cgi?id=47035

After reading through the LLVM source code and getting familiar with VPlan, I was able to vectorize the code by enabling the VPlan native path. After talking with @fhahn, he suggested that I contribute this as a test case. So here it is. I tried to follow the available guides on how to do this as best I could. I modified the IR code by hand to have clearer variable names instead of numbers.

One thing I'd like input on: are the current CHECK lines sufficient to verify that the inner loop has been vectorized properly?

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D87564


[AttributeFuncs] Consider `noundef` in `typeIncompatible` · ef48436e

Johannes Doerfert authored Sep 08, 2020

Drop `noundef` for return values that are replaced by void and make it
illegal to put `noundef` on a void value.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D87306


[AttributeFuncs] Consider `align` in `typeIncompatible` · 2a078c30

Johannes Doerfert authored Sep 08, 2020

Alignment attributes need to be dropped for non-pointer values.
This also introduces a check into the verifier to ensure you don't use
`align` on anything but a pointer. Test needed to be adjusted
accordingly.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D87304


[GVN LoadPRE] Extend the scope of optimization by using context to prove safety of speculation · b9888980

Serguei Katkov authored Oct 02, 2020

Use context to prove that a load can be safely executed at the point to which it is being hoisted.

Postpone the decision about the safety of speculative load execution until we know
where the load is hoisted, and check safety in that context.

Reviewers: nikic, fhahn, mkazantsev, lebedev.ri, efriedma, reames
Reviewed By: reames, mkazantsev
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D88725


[MLInliner] Factor out logging · 36bb1fb1

Mircea Trofin authored Oct 02, 2020

Factored out the logging facility, to allow its reuse outside the
inliner.

Differential Revision: https://reviews.llvm.org/D88770


Oct 05, 2020

Revert "Outline non returning functions unless a longjmp" · 9afb1c56

Vedant Kumar authored Oct 05, 2020

This reverts commit 20797989.

This patch (https://reviews.llvm.org/D69257) cannot complete a stage2
build due to the change:

```
CI->getCalledFunction()->getName().contains("longjmp")
```

There are several concrete issues here:

  - The callee may not be a function, so `getCalledFunction` can assert.
  - The called value may not have a name, so `getName` can assert.
  - There's no distinction made between "my_longjmp_test_helper" and the
    actual longjmp libcall.
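
The three issues listed above can be made concrete with a toy model in standalone C++. `ToyFunction`, `ToyCall`, `callsLongjmp`, and `nameContainsLongjmp` are hypothetical names and not LLVM APIs. The sketch only illustrates the null-callee, missing-name, and substring-match problems; it does not address the layering concern and is not a proposed replacement for the reverted patch.

```cpp
#include <cassert>
#include <string>

// Toy stand-ins, not LLVM classes.
struct ToyFunction {
  bool HasName;     // a value is not required to have a name
  std::string Name; // only meaningful when HasName is true
};

struct ToyCall {
  const ToyFunction *DirectCallee; // nullptr models an indirect call
};

// Defensive variant: tolerates indirect calls and unnamed callees, and
// compares against the exact name instead of searching for a substring.
bool callsLongjmp(const ToyCall &Call) {
  const ToyFunction *Callee = Call.DirectCallee;
  if (!Callee)
    return false; // the callee may not be a function
  if (!Callee->HasName)
    return false; // the called value may not have a name
  return Callee->Name == "longjmp";
}

// Substring match in the style of the quoted check, with null checks added
// only so that this sketch can run; it still matches unrelated helpers.
bool nameContainsLongjmp(const ToyCall &Call) {
  const ToyFunction *Callee = Call.DirectCallee;
  return Callee && Callee->HasName &&
         Callee->Name.find("longjmp") != std::string::npos;
}
```

Under this model, a helper named "my_longjmp_test_helper" is matched by the substring variant but not by the exact-name variant.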

At a higher level, there's a serious layering problem here. The
splitting pass makes policy decisions in a general way (e.g. based on
attributes or profile data). Special-casing certain names breaks the
layering. It subverts the work of library maintainers (who may now need
to opt-out of unexpected optimization behavior for any affected
functions) and can lead to inconsistent optimization behavior (as not
all llvm passes special-case ".*longjmp.*" in the same way).

The patch may need significant revision to address these issues.

But the immediate issue is that this crashes while compiling llvm's unit
tests in a stage2 build (due to the `getName` problem).


[InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which... · e00f189d

Roman Lebedev authored Oct 05, 2020

[InstCombine] Revert rL226781 "Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available." (PR47592)

(it was introduced in https://lists.llvm.org/pipermail/llvm-dev/2015-January/080956.html)

This canonicalization seems dubious.

Most importantly, while it does not create `inttoptr` casts by itself,
it may cause them to appear later, see e.g. D88788.

I think it's pretty obvious that it is an undesirable outcome;
by now we've established that seemingly no-op `inttoptr`/`ptrtoint` casts
are not no-op, and are no longer eager to look past them.
Which e.g. means that given
```
%a = load i32
%b = inttoptr %a
%c = inttoptr %a
```
we likely won't be able to tell that `%b` and `%c` are the same thing.

As we can see in D88789 / D88788 / D88806 / D75505,
we can't really teach SCEV about this (not without the https://bugs.llvm.org/show_bug.cgi?id=47592 at least)
And we can't recover the situation post-inlining in instcombine.

So it really does look like this fold is actively breaking
otherwise-good IR, in a way that is not recoverable.
And that means, this fold isn't helpful in exposing the passes
that are otherwise unaware of these patterns it produces.

Thus, I propose to simply not perform such a canonicalization.
The original motivational RFC does not state what larger problem
that canonicalization was trying to solve, so I'm not sure
how this plays out in the larger picture.

On vanilla llvm test-suite + RawSpeed, this results in
an increase of asm instructions and final object size by ~0.05%,
and decreases the final count of bitcasts by 4.79% (-28990),
of ptrtoint casts by 15.41% (-3423),
and of inttoptr casts by 25.59% (-6919, *sic*).
Overall, there are 0.04% fewer IR blocks and 0.39% fewer instructions.

See https://bugs.llvm.org/show_bug.cgi?id=47592

Differential Revision: https://reviews.llvm.org/D88789


Revert "[SLC] Optimize mempcpy_chk to mempcpy" · a4bae56a
Dávid Bolvanský authored Oct 05, 2020
```
This reverts commit 3f1fd59d.
```

[SLC] Optimize mempcpy_chk to mempcpy · 3f1fd59d

Dávid Bolvanský authored Oct 05, 2020

As reported in PR46735:

void* f(void *d, const void *s, size_t l)
{
    return __builtin___mempcpy_chk(d, s, l, __builtin_object_size(d, 0));
}

This can be optimized to `return mempcpy(d, s, l);`.

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D86019


[InstCombine] Handle GEP inbounds in select op replacement (PR47730) · 3641d375

Nikita Popov authored Oct 05, 2020

When retrying the "simplify with operand replaced" select
optimization without poison flags, also handle inbounds on GEPs.

Of course, this particular example would also be safe to transform
while keeping inbounds, but the underlying machinery does not
know this (yet).


[InstCombine] Add test for PR47730 · 0f8e4a5e
Nikita Popov authored Oct 05, 2020


Revert "[DebugInfo] Improve dbg preservation in LSR." · 9d630297

Nikita Popov authored Oct 05, 2020

This reverts commit a3caf7f6.

The ReleaseLTO-g test-suite configuration has been failing
to build since this commit, because clang segfaults while
building 7zip.


[InstCombine] Extend 'shift with constants' vector tests · 5ba084c4

Simon Pilgrim authored Oct 05, 2020

Added missing test coverage for shl(add(and(lshr(x,c1),c2),y),c1) -> add(and(x,c2<<c1),shl(y,c1)) combine

Rename tests, as 'foo' and 'bar' aren't very extensible

Added vector tests with undefs and nonuniform constants
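
The combine named in the first line of this message can be checked numerically on 8-bit values. The following is a standalone C++ sketch, not code from InstCombine or its tests; `beforeCombine`, `afterCombine`, and `combineHolds` are hypothetical names, and the sample masks are arbitrary.

```cpp
#include <cassert>
#include <cstdint>

// Left-hand side: shl(add(and(lshr(x, c1), c2), y), c1) on 8-bit values.
uint8_t beforeCombine(uint8_t X, uint8_t Y, unsigned C1, uint8_t C2) {
  assert(C1 < 8 && "shift amount must be below the bit width");
  uint8_t Shifted = static_cast<uint8_t>(X >> C1);
  uint8_t Masked = static_cast<uint8_t>(Shifted & C2);
  uint8_t Sum = static_cast<uint8_t>(Masked + Y);
  return static_cast<uint8_t>(Sum << C1);
}

// Right-hand side: add(and(x, c2 << c1), shl(y, c1)).
uint8_t afterCombine(uint8_t X, uint8_t Y, unsigned C1, uint8_t C2) {
  assert(C1 < 8 && "shift amount must be below the bit width");
  uint8_t Mask = static_cast<uint8_t>(C2 << C1);
  uint8_t ShiftedY = static_cast<uint8_t>(Y << C1);
  return static_cast<uint8_t>((X & Mask) + ShiftedY);
}

// Compares both sides for every x, y, and shift amount, over a few sample
// masks chosen for this sketch.
bool combineHolds() {
  const uint8_t Masks[] = {0x00, 0x0F, 0x55, 0xFF};
  for (unsigned C1 = 0; C1 < 8; ++C1)
    for (uint8_t C2 : Masks)
      for (unsigned X = 0; X < 256; ++X)
        for (unsigned Y = 0; Y < 256; ++Y) {
          uint8_t X8 = static_cast<uint8_t>(X);
          uint8_t Y8 = static_cast<uint8_t>(Y);
          if (beforeCombine(X8, Y8, C1, C2) != afterCombine(X8, Y8, C1, C2))
            return false;
        }
  return true;
}
```

The two sides agree because `c2 << c1` already has its low `c1` bits clear, so masking `x` with it has the same effect as shifting right, masking with `c2`, and shifting back.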


[InstCombine] Add or(shl(v,and(x,bw-1)),lshr(v,bw-and(x,bw-1))) funnel shift tests · 2efd9fd6
Simon Pilgrim authored Oct 05, 2020
```
If we know the shift amount is less than the bitwidth we should be able to convert this to a funnel shift
```
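
The pattern in the title can be compared against a plain rotate in standalone C++. This is a sketch on 8-bit values with hypothetical names (`maskedShiftPattern`, `rotateLeft`, `patternIsRotate`), not code from the tests. It widens to 32-bit arithmetic so that a right shift by 8 stays defined in C++ when the masked amount is 0, and it makes no claim about how InstCombine treats that case.

```cpp
#include <cassert>
#include <cstdint>

constexpr unsigned kBitWidth = 8;

// or(shl(v, and(x, bw-1)), lshr(v, bw - and(x, bw-1))), computed in 32-bit
// arithmetic and truncated back to 8 bits.
uint8_t maskedShiftPattern(uint8_t V, unsigned X) {
  unsigned Amt = X & (kBitWidth - 1);
  uint32_t Wide = V;
  uint32_t Hi = Wide << Amt;
  uint32_t Lo = Wide >> (kBitWidth - Amt);
  return static_cast<uint8_t>(Hi | Lo);
}

// Reference rotate-left by X modulo the bit width, one bit at a time. A
// funnel shift left with both inputs equal to V is such a rotate.
uint8_t rotateLeft(uint8_t V, unsigned X) {
  unsigned Amt = X % kBitWidth;
  uint8_t Result = V;
  for (unsigned I = 0; I < Amt; ++I) {
    unsigned TopBit = (Result >> (kBitWidth - 1)) & 1u;
    Result = static_cast<uint8_t>((Result << 1) | TopBit);
  }
  return Result;
}

// Checks the pattern against the reference for all values and a range of
// shift amounts, including amounts at or above the bit width.
bool patternIsRotate() {
  for (unsigned V = 0; V < 256; ++V)
    for (unsigned X = 0; X < 64; ++X)
      if (maskedShiftPattern(static_cast<uint8_t>(V), X) !=
          rotateLeft(static_cast<uint8_t>(V), X))
        return false;
  return true;
}
```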

Revert SVML support for sqrt · 89e8a8b2

Wenlei He authored Oct 05, 2020

As was brought up in D87169 by @craig.topper we shouldn't map llvm.sqrt to svml since there is a faster native instruction.
https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_mm_sqrt_p&expand=5824,5823,5356,5823,5825,5365,5356

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D88620


[LV] Regenerate test. NFC · ff86acbb

David Green authored Oct 05, 2020

This just reruns the update script to add the new
[[LOOP0:!llvm.loop !.*]] checks to remove them from
other diffs.


[ValueTracking] canCreateUndefOrPoison - use APInt to check bounds instead of getZExtValue(). · 2cd7b0e1
Simon Pilgrim authored Oct 05, 2020
```
Fixes OSS Fuzz #26135
```