Commits · bfd512160fe091bdd45199e5db884a24cd9d5f59 · Lorenzo Albano / LLVM bpEVL

May 17, 2020

[InstCombine] improve analysis of FP->int->FP to eliminate fpextend · bfd51216

Sanjay Patel authored May 17, 2020

This was originally in D79116.
Converting from a narrow-enough FP source value to integer and
back to FP guarantees that the conversion to FP is exact because
of UB/poison-on-overflow.

This was suggested in PR36617:
https://bugs.llvm.org/show_bug.cgi?id=36617#c19

bfd51216

[LoopUnroll] Extend test case with additional loop with larger TC. · b54a6633
Florian Hahn authored May 17, 2020

b54a6633
[LoopUnroll] Precommit test for PR459393. · 9e2a99e5
Florian Hahn authored May 17, 2020

9e2a99e5

[AMDGPU] Enable base pointer. · 7c4e711e

Christudasan Devadasan authored Apr 21, 2020

When the callee requires a dynamic stack realignment,
it is not possible to correcty access the incoming
stack arguments using the stack pointer. We reserve a
base pointer in such cases to access the function arguments
inside the callee. The base pointer will hold the incoming
stack pointer value before any kind of delta added to it.

Reviewed By: arsenm, scott.linder

Differential Revision: https://reviews.llvm.org/D78811

7c4e711e

[OpenMP] Fix race condition in the completion/freeing of detached tasks · d23131a3

Joachim Protze authored May 17, 2020

Spurious assertion failures are symptoms of a race condition for the handling
of detached tasks:
Assertion failure at kmp_tasking.cpp(3744): taskdata->td_flags.complete == 1.
Assertion failure at kmp_tasking.cpp(710): taskdata->td_flags.executing == 0.

in the case of detach=true, all accesses to taskdata in __kmp_task_finish need
to happen before (~line 873):

taskdata->td_flags.proxy = TASK_PROXY;

This assignment signals to __kmp_fulfill_event, that the task will need to be
freed there. So, conceptionally the ownership of taskdata is moved.

Reviewed By: AndreyChurbanov

Differential Revision: https://reviews.llvm.org/D79702

d23131a3

[Inliner][NFC] silence gcc 'overloaded-virtual' warning on hiding of Pass::doInitialization · f93a6aae

Fedor Sergeev authored May 17, 2020

When compiling with -Werror=overloaded-virtual, gcc emits this:
====
llvm/include/llvm/Pass.h:102:16: error: ‘virtual bool llvm::Pass::doInitialization(llvm::Module&)’ was hidden [-Werror=overloaded-virtual]
   virtual bool doInitialization(Module &)  { return false; }
                ^~~~~~~~~~~~~~~~
In file included from llvm/lib/Transforms/IPO/Inliner.cpp:20:0:
llvm/include/llvm/Transforms/IPO/Inliner.h:38:8: error:   by ‘virtual bool llvm::LegacyInlinerBase::doInitialization(llvm::CallGraph&)’ [-Werror=overloaded-virtual]
   bool doInitialization(CallGraph &CG) override;
        ^~~~~~~~~~~~~~~~
====

This is an old issue which has just started biting our downstream after
a slight rearrangement of includes around Inliner.
Fixing it similar to how doFinalization was done years ago.

f93a6aae

[LLVM][AVR] Support for R_AVR_6 fixup · 1335737e

Dylan McKay authored May 17, 2020

Summary: Handle the emission of `R_AVR_6` ELF relocation type.

Reviewers: dylanmckay

Reviewed By: dylanmckay

Subscribers: hiraditya, Jim, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78721

Patch by @LemonBoy https://reviews.llvm.org/p/LemonBoy/

1335737e

[AVR] Fix I/O instructions on XMEGA · 1420f4ef

Dylan McKay authored May 17, 2020

Summary:
On XMEGA, I/O address space is same as data address space - there is no 0x20 offset,
because CPU General Purpose Registers are not mapped in data address space.

From https://en.wikipedia.org/wiki/AVR_microcontrollers
> In the XMEGA variant, the working register file is not mapped into the data address space; as such, it is not possible to treat any of the XMEGA's working registers as though they were SRAM. Instead, the I/O registers are mapped into the data address space starting at the very beginning of the address space.

Reviewers: dylanmckay

Reviewed By: dylanmckay

Subscribers: hiraditya, Jim, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D77207

Patch by Vlastimil Labsky.

1420f4ef

[Driver] Render -T for Gnu.cpp · 3841ed41
Fangrui Song authored May 16, 2020
```
clang -T a.lds a.c currently does not render -T.
```
3841ed41
[MLIR][cmake] use LINK_LIBS PUBLIC for MLIRStandardOpsTransforms · efa70843
Stephen Neuendorffer authored May 16, 2020
```
Without this LLVM_LINK_LLVM_DYLIB is broken

Differential Revision: https://reviews.llvm.org/D80074
```
efa70843
[llvm-xray] consumeError when trying big-endian · 3dbbbcc8
Fangrui Song authored May 16, 2020
```
Follow-up of rL341226.

Fixes "Expected<T> must be checked before access or destruction"
```
3dbbbcc8

[NFC] Run clang-format on ISDOpcodes.h · 8092c8fe

Arthur Eubanks authored May 15, 2020

Subscribers: jfb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80050

8092c8fe

[Compiler-rt] Emit error if builtins library cannot be found · 2fe66bdb

Yi Kong authored May 16, 2020

Since setting COMPILER_RT_USE_BUILTINS_LIBRARY would remove -z,defs
flag, missing builtins library would continue to build unnoticed.
Explicitly emit an error in such case.

Differential Revision: https://reviews.llvm.org/D79470

2fe66bdb

Fix a few doc typos to cycle bots. · 3735505e
Nico Weber authored May 16, 2020

3735505e
Try to heal bots after https://reviews.llvm.org/D79655 · bc98dc12
Nico Weber authored May 16, 2020

bc98dc12

[LegalizeDAG] Use MachinePointerInfo::getUnknownStack in place of... · 796ae8cf

Craig Topper authored May 16, 2020

[LegalizeDAG] Use MachinePointerInfo::getUnknownStack in place of MachinePointerInfo() in a couple places. NFC

We know the pointer somewhere on the stack, we just don't know
exactly where since the index may be variable.

Differential Revision: https://reviews.llvm.org/D80060

796ae8cf

May 16, 2020

AllocaInst should store Align instead of MaybeAlign. · 4f04db4b

Eli Friedman authored May 15, 2020

Along the lines of D77454 and D79968. Unlike loads and stores, the
default alignment is getPrefTypeAlign, to match the existing handling in
various places, including SelectionDAG and InstCombine.

Differential Revision: https://reviews.llvm.org/D80044

4f04db4b

[X86] Replace selectScalarSSELoad ComplexPattern with PatFrags to handle the 3... · 135b8778

Craig Topper authored May 16, 2020

[X86] Replace selectScalarSSELoad ComplexPattern with PatFrags to handle the 3 types of loads we currently match.

This ensures we create mem operands for these instructions fixing PR45949.

Unfortunately, it increases the size of X86GenDAGISel.inc, but some dag
combine canonicalization could reduce the types of load we need to match.

135b8778

Harden IR and bitcode parsers against infinite size types. · 0ec5f501

Eli Friedman authored May 16, 2020

If isSized is passed a SmallPtrSet, it uses that set to catch infinitely
recursive types (for example, a struct that has itself as a member).
Otherwise, it just crashes on such types.

0ec5f501

Revert "[nfc] test commit" · accd9af8
faisal vali authored May 16, 2020
```
This reverts commit 0ee46e85.
```
accd9af8
[nfc] test commit · 0ee46e85
faisal vali authored May 16, 2020

0ee46e85

Expose IRGen API to add the default IR attributes to a function definition. · 32870a84

John McCall authored May 16, 2020

I've also made a stab at imposing some more order on where and how we add
attributes; this part should be NFC. I wasn't sure whether the CUDA use
case for libdevice should propagate CPU/features attributes, so there's a
bit of unnecessary duplication.

32870a84

The release notes for ObjCBreakBeforeNestedBlockParam was placed between the... · 49c9a68d

mydeveloperday authored May 16, 2020

The release notes for ObjCBreakBeforeNestedBlockParam was placed between the release note for IndentCaseBlocks and its example code

Remove other whitespace and line limit issues and double blank line issues

49c9a68d

[VectorCombine] forward walk through instructions to improve chaining of transforms · 81e9ede3

Sanjay Patel authored May 16, 2020

This is split off from D79799 - where I was proposing to fully iterate
over a function until there are no more transforms. I suspect we are
still going to want to do something like that eventually.

But we can achieve the same gains much more efficiently on the current
set of regression tests just by reversing the order that we visit the
instructions.

This may also reduce the motivation for D79078, but we are still not
getting the optimal pattern for a reduction.

81e9ede3

[PhaseOrdering] add vector reduction tests; NFC · 43017ceb
Sanjay Patel authored May 16, 2020
```
These are based on tests originally included in:
D79078
```
43017ceb
[InstCombine] Clean up alignment handling (NFC) · 604f4497
Nikita Popov authored May 16, 2020
```
Now that load/store alignment is required, we can simplify code
in some places.
```
604f4497

[ARM] Patterns for VQSHRN · 2123bb84

David Green authored May 16, 2020

Given a VQMOVN(VSHR), we can fold that into a VQSHRN simply enough using
a few tablegen patterns.

Differential Revision: https://reviews.llvm.org/D77720

2123bb84

[VectorCombine] add reduction-like patterns; NFC · 6211830f
Sanjay Patel authored May 16, 2020
```
These are based on tests originally included in:
D79078
```
6211830f
[AArch64] Precommit tests for D77316 · 9a055479
Jay Foad authored May 16, 2020

9a055479

[x86][CGP] try to hoist funnel shift above select-of-splats · 5be37cb1

Sanjay Patel authored May 15, 2020

This is basically the same patch as D63233, but converted to
funnel shifts rather than regular shifts. I did not see a
way to effectively share code for these 2 cases though.

This follows D79718 and D79827 to re-fix PR37426 because
that gets canonicalized to funnel shift intrinsics in IR.

I did draft an alternative patch as an enhancement to
"shouldSinkOperands()", but that was awkward because
we have to key the transform from the select, but then
look at both its users and its operands.

5be37cb1

[ARM] Combines for VMOVN · 72f1fb2e

David Green authored May 16, 2020

This adds two combines for VMOVN, one to fold
VMOVN[tb](c, VQMOVNb(a, b)) => VQMOVN[tb](c, b)
The other to perform demand bits analysis on the lanes of a VMOVN. We
know that only the bottom lanes of the second operand and the top or
bottom lanes of the Qd operand are needed in the result, depending on if
the VMOVN is bottom or top.

Differential Revision: https://reviews.llvm.org/D77718

72f1fb2e

[ARM] MVE saturating truncates · 2e1fbf85

David Green authored May 16, 2020

This adds some custom lowering for VQMOVN, an instruction that can be
used to perform saturating truncates from a pair of min(max(X, -0x8000),
0x7fff), providing those constants are correct. This leaves a VQMOVNBs
which saturates the value and inserts that into the bottom lanes of an
existing vector. We then need to do something with the other lanes,
extending the value using a vmovlb.

Ideally, as will often be the case, only the bottom lane of what remains
will be demanded, allowing the vmovlb to be removed. Which should mean
the instruction is either equal or a win most of the time, and allows
some extra follow-up folding to happen.

Differential Revision: https://reviews.llvm.org/D77590

2e1fbf85

DIEHash.cpp - remove headers explicitly included in DIEHash.h. NFC. · 22891378
Simon Pilgrim authored May 16, 2020
```
Don't duplicate module header includes.
```
22891378

AggressiveAntiDepBreaker.cpp - remove headers explicitly included in... · 25656332

Simon Pilgrim authored May 16, 2020

AggressiveAntiDepBreaker.cpp - remove headers explicitly included in AggressiveAntiDepBreaker.h. NFC.

Don't duplicate module header includes.

25656332

LLParser.cpp - remove headers explicitly included in LLParser.h. NFC. · 43bf2be4
Simon Pilgrim authored May 16, 2020
```
Don't duplicate module header includes.
```
43bf2be4
Fix -Wdocumentation warning. NFC. · be6847b1
Simon Pilgrim authored May 16, 2020
```
Remove non-existant DataLayoutCallback param comment.
```
be6847b1
[ARM] Extra VQMOVN/VQSHRN tests. NFC · 42a9ca02
David Green authored May 16, 2020

42a9ca02

[mlir][spirv] Handle debuginfo for control flow ops. · 0dc91bfd

Denis Khalikov authored May 16, 2020

Summary:
Handle debuginfo for control flow operations: spv.Selection,
spv.Loop, spv.BranchOp, spv.BranchConditional.

Differential Revision: https://reviews.llvm.org/D79931

0dc91bfd

[ValueTracking] Fix computeKnownBits() with bitwidth-changing ptrtoint · d86fff6a

Nikita Popov authored May 01, 2020

computeKnownBitsFromAssume() currently asserts if m_V matches a
ptrtoint that changes the bitwidth. Because InstCombine
canonicalizes ptrtoint instructions to use explicit zext/trunc,
we never ran into the issue in practice. I'm adding unit tests,
as I don't know if this can be triggered via IR anywhere.

Fix this by calling anyextOrTrunc(BitWidth) on the computed
KnownBits. Note that we are going from the KnownBits of the
ptrtoint result to the KnownBits of the ptrtoint operand,
so we need to truncate if the ptrtoint zexted and anyext if
the ptrtoint truncated.

Differential Revision: https://reviews.llvm.org/D79234

d86fff6a

[libcxx testing] Remove ALLOW_RETRIES from last futures test · 3f66bb20

David Zarzycki authored May 16, 2020

Like other uses of ALLOW_RETRIES, this test tried to verify that an API
returned "quickly" but quick is not safe to define given slow and/or
busy machines.

Instead, we now verify that these "wait" APIs actually wait, which the
old test did not.

3f66bb20