Commits · 24562c6588bf45d4ca51a4934dc2e220f16130b1 · Lorenzo Albano / LLVM bpEVL

Mar 29, 2020

[InstCombine] Add tests for trunc (extelt x); (NFC) · 24562c65
Daan Sprenkels authored Mar 29, 2020
```
Baseline tests for D76983 (PR45314)

Differential Revision: https://reviews.llvm.org/D77024
```
24562c65
[X86] Add sse4.2 command lines to min/max reduction tests. · 2451e4c5
Craig Topper authored Mar 29, 2020
```
SSE4.2 has the pcmpgtq instruction which we will use in
vXi64 reductions when it's available.
```
2451e4c5
[ARMMVE] Create fewer temporary SmallVectors · 6e0afb5f
Benjamin Kramer authored Mar 29, 2020
```
Shrinks clang by 40k.
```
6e0afb5f
Don't claim template names that name non-templates are undeclared. · a5458bb0
Richard Smith authored Mar 29, 2020

a5458bb0
[ELF][test] Improve arm-exidx-output.s to test different output text sections · 00c76f34
Fangrui Song authored Mar 18, 2020
```
Delete arm-exidx-link.s which is now covered by arm-exidx-output.s

Differential Revision: https://reviews.llvm.org/D76409
```
00c76f34
[ARM] VMOV.64 immediate tests. NFC · 7c1a6873
David Green authored Mar 28, 2020

7c1a6873
[gn build] Port 854f268c · 6628c525
LLVM GN Syncbot authored Mar 29, 2020

6628c525
[MC] Move deprecation infos from MCTargetDesc to MCInstrInfo · 854f268c
Benjamin Kramer authored Mar 29, 2020
```
This allows emitting it only when the feature is used by a target.
Shrinks Release+Asserts clang by 900k.
```
854f268c

[clangd] Handle clang-tidy suppression comments for diagnostics inside macro expansions · b9d9968f

Nathan Ridge authored Mar 29, 2020

Summary:
Not handling this was a side-effect of being overly cautious when trying
to avoid reading files for which clangd doesn't have the source mapped.

Fixes https://github.com/clangd/clangd/issues/266

Reviewers: sammccall

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet,
usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D75286

b9d9968f

clang-format fixes in ClangTidyDiagnosticConsumer.cpp and DiagnosticsTests.cpp · 15f1fe15

Nathan Ridge authored Mar 29, 2020

Subscribers: jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D77023

15f1fe15

[X86][AVX] Combine 128/256-bit lane shuffles with zeroable upper subvectors to... · 9c8ec99c

Simon Pilgrim authored Mar 29, 2020

[X86][AVX] Combine 128/256-bit lane shuffles with zeroable upper subvectors to EXTRACT_SUBVECTOR (PR40720)

As explained on PR40720, EXTRACTF128 is always as good/better than VPERM2F128/SHUF128, and we can use the implicit zeroing of the uppers.

9c8ec99c

Fix -Wdocumentation warning. NFC. · fe0723dc
Simon Pilgrim authored Mar 29, 2020
```
gcc was misinterpreting the template code snippet as html.
```
fe0723dc
[X86] Add isAnyZero shuffle mask helper · 8206c50c
Simon Pilgrim authored Mar 29, 2020

8206c50c

[InstCombine] Erase old mul when creating umulo · 8253a86b

Nikita Popov authored Mar 29, 2020

As we don't return the result of replaceInstUsesWith(), we are
responsible for erasing the instruction.

There is a small subtlety here in that we need to do this after
the other uses of Builder, which uses the original multiply as
the insertion point.

NFC apart from worklist order changes.

8253a86b

[InstCombine] Use replaceOperand() in demanded elements simplification · 53d20907

Nikita Popov authored Mar 29, 2020

To make sure that dead operands get DCEd. This fixes the largest
source of leftover dead operands we see in tests.

NFC apart from worklist changes.

53d20907

[MLIR] Add missing asserts in interchangeLoops util, doc comment update · 4e4ea2cd

Uday Bondhugula authored Mar 29, 2020

Add missing assert checks for input to mlir::interchangeLoops utility.
Rename interchangeLoops -> permuteLoops; update doc comments to clarify
inputs / return val. Other than the assert checks, this is NFC.

Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>

Differential Revision: https://reviews.llvm.org/D77003

4e4ea2cd

[InstCombine] Use replaceOperand() in assoc cast simplification · 0c871400
Nikita Popov authored Mar 29, 2020
```
To make sure the old operands are DCEd.

NFC apart from worklist order.
```
0c871400

[InstCombine] Erase old add when optimizing add overflow · a9ddcd64

Nikita Popov authored Mar 29, 2020

We don't return the replaceInstUsesWith() result, so we're
responsible for cleaning up.

NFC apart from worklist order changes.

a9ddcd64

Introduce support for lib function aligned_alloc in TLI / memory builtins · c0955edf

Uday Bondhugula authored Mar 28, 2020

aligned_alloc is a standard library function: it is part of the C11
standard and has been in glibc since 2.16. It has semantics similar to
malloc/calloc for several analyses/transforms. This patch introduces
aligned_alloc in target library info and memory builtins. Subsequent
patches will make other passes aware of it and fix https://bugs.llvm.org/show_bug.cgi?id=44062

This change will also be useful to LLVM generators that need to allocate
buffers of vector elements larger than 16 bytes (e.g. 256-bit ones),
for which glibc malloc does not typically provide element-boundary alignment.

Signed-off-by: Uday Bondhugula <uday@polymagelabs.com>

Differential Revision: https://reviews.llvm.org/D76970

c0955edf
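
For readers unfamiliar with the function, here is a minimal sketch of the use case the message describes: a buffer aligned for 256-bit (32-byte) vector elements. It assumes a C library that provides C11 `aligned_alloc` (such as glibc 2.16 or later); the helper names `allocAlignedFloats` and `isAligned` are hypothetical and do not come from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdlib.h>

// Hypothetical helper (not part of LLVM): allocate room for `count` floats on
// a 32-byte boundary, suitable for 256-bit vector loads and stores.
float *allocAlignedFloats(std::size_t count) {
  constexpr std::size_t Alignment = 32;
  // C11 asks for the size to be an integral multiple of the alignment,
  // so round the byte count up to the next multiple of 32.
  std::size_t bytes = count * sizeof(float);
  std::size_t rounded = (bytes + Alignment - 1) / Alignment * Alignment;
  // aligned_alloc may return a null pointer on failure; the caller releases
  // the buffer with free(), exactly as for malloc.
  return static_cast<float *>(aligned_alloc(Alignment, rounded));
}

// True if `p` sits on an `alignment`-byte boundary.
// `alignment` must be a nonzero power of two.
bool isAligned(const void *p, std::size_t alignment) {
  assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
  return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}
```

Because the result is released with `free()`, an optimizer that already understands malloc/free pairs can plausibly treat `aligned_alloc` the same way once it is listed as a known allocation function, which is the motivation the message gives.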

GlobalISel: Add matcher for G_SHL · cce3d96b
Matt Arsenault authored Mar 29, 2020

cce3d96b
AMDGPU/GlobalISel: Remove redundant virtual · d15723ef
Matt Arsenault authored Mar 28, 2020

d15723ef
AMDGPU: Fix using wrong instruction for FP conversion · ab7a4106
Matt Arsenault authored Mar 29, 2020
```
This was never actually hit, but FTRUNC was clearly not the intent
here.
```
ab7a4106
AMDGPU: Add some additional tests for v_cvt_ubyte* formation · 0b68ca51
Matt Arsenault authored Mar 29, 2020
```
Use functions now that we have them for less boilerplate in the
output.
```
0b68ca51
AMDGPU: Fix typo · 97bbe7ad
Matt Arsenault authored Mar 29, 2020

97bbe7ad
[VectorCombine] skip debug intrinsics first for efficiency · fc3cc8a4
Sanjay Patel authored Mar 29, 2020

fc3cc8a4
[InstCombine] make test independent of branch undef/UB; NFC · febcb24f
Sanjay Patel authored Mar 29, 2020

febcb24f
[X86][AVX] Add tests for 512-bit shuffle patterns that could reduce to subvector extractions · 443dcc0e
Simon Pilgrim authored Mar 29, 2020

443dcc0e
Remove unnecessary empty comments from test check lines. NFC. · b44f0704
Simon Pilgrim authored Mar 29, 2020

b44f0704

[InstCombine] Simplify select of cmpxchg transform · 26fa3375

Nikita Popov authored Mar 29, 2020

Rather than converting to a dummy select with equal true and false
ops, just directly return the resulting value.

As a side-effect, this fixes missing DCE of the previously replaced
operand.

26fa3375

[OpenMP] set_bits iterator yields unsigned elements, no reference (NFC). · 99913ef3

Florian Hahn authored Mar 29, 2020

BitVector::set_bits() returns an iterator range yielding unsigned
elements, which will always be copied, while const & gives the impression
that there will be no copy. Newer versions of clang complain:

warning: loop variable 'SetBitsIt' is always a copy because the range of type 'iterator_range<llvm::BitVector::const_set_bits_iterator>' (aka 'iterator_range<const_set_bits_iterator_impl<llvm::BitVector> >') does not return a reference [-Wrange-loop-analysis]

Reviewers: jdoerfert, rnk

Reviewed By: rnk

Differential Revision: https://reviews.llvm.org/D77010

99913ef3
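
The situation behind this warning can be reproduced without LLVM. The sketch below uses a toy class, `ToyBitVector`, which is an assumption made for illustration and not the real `llvm::BitVector`: its iterator computes each index on the fly and returns it by value, so a `const unsigned &` loop variable could only ever bind to a temporary copy.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-in for llvm::BitVector (illustration only): iterating set_bits()
// yields the indices of the set bits *by value*.
class ToyBitVector {
  std::vector<bool> Bits;

public:
  explicit ToyBitVector(std::size_t N) : Bits(N, false) {}

  void set(std::size_t Idx) {
    assert(Idx < Bits.size());
    Bits[Idx] = true;
  }

  class SetBitsIterator {
    const std::vector<bool> *Bits;
    std::size_t Pos;

    // Advance Pos to the next set bit, or to the end if there is none.
    void skipUnset() {
      while (Pos < Bits->size() && !(*Bits)[Pos])
        ++Pos;
    }

  public:
    SetBitsIterator(const std::vector<bool> *B, std::size_t P)
        : Bits(B), Pos(P) {
      skipUnset();
    }
    // Returns a value, not a reference: no unsigned is stored anywhere.
    unsigned operator*() const { return static_cast<unsigned>(Pos); }
    SetBitsIterator &operator++() {
      ++Pos;
      skipUnset();
      return *this;
    }
    bool operator!=(const SetBitsIterator &O) const { return Pos != O.Pos; }
  };

  struct SetBitsRange {
    SetBitsIterator First, Last;
    SetBitsIterator begin() const { return First; }
    SetBitsIterator end() const { return Last; }
  };

  SetBitsRange set_bits() const {
    return {SetBitsIterator(&Bits, 0), SetBitsIterator(&Bits, Bits.size())};
  }
};

// Writing `for (const unsigned &Idx : BV.set_bits())` would compile, but Idx
// would refer to a fresh temporary each iteration. Taking the loop variable
// by value states what actually happens.
unsigned sumSetBitIndices(const ToyBitVector &BV) {
  unsigned Sum = 0;
  for (unsigned Idx : BV.set_bits())
    Sum += Idx;
  return Sum;
}
```

For a small trivially copyable type such as `unsigned`, the by-value form costs nothing extra; the change is about making the copy visible to the reader.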

[InstCombine] Fix worklist management in varargs transform · 28f67bd5

Nikita Popov authored Mar 29, 2020

Add a replaceUse() helper to mirror replaceOperand() for the
rare cases where we're working directly on uses.

NFC apart from worklist order changes.

28f67bd5

[InstCombine] Erase original add when creating saddo · 6f07a9e8

Nikita Popov authored Mar 29, 2020

Usually when we replaceInstUsesWith() we also return the original
instruction, and InstCombine will take care of erasing it. Here
we don't do that, so we need to manually erase it.

NFC apart from worklist order changes.

6f07a9e8

[InstCombine] Use replaceOperand() in a few more places · 1e363023
Nikita Popov authored Mar 29, 2020
```
To make sure the old operands get DCEd.

NFC apart from worklist order changes.
```
1e363023

[X86][AVX] Combine 128-bit lane shuffles with a zeroable upper half to EXTRACT_SUBVECTOR (PR40720) · 7734e4b3

Simon Pilgrim authored Mar 29, 2020

As explained on PR40720, EXTRACTF128 is always as good/better than VPERM2F128, and we can use the implicit zeroing of the upper half.

I've added some extra tests to vector-shuffle-combining-avx2.ll to make sure we don't lose coverage.

7734e4b3

[X86] Rename matchShuffleAsByteRotate to matchShuffleAsElementRotate. NFC. · da4c7db7

Simon Pilgrim authored Mar 29, 2020

This was an inner helper function for the real matchShuffleAsByteRotate function, but it is more generic and is used directly for VALIGN lowering which doesn't work at the byte level.

da4c7db7
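
To illustrate the distinction between element-level and byte-level rotation, here is a simplified single-input sketch. It is not LLVM's implementation: the function names `matchElementRotate` and `elementRotateToBytes` are hypothetical, and the real helper also handles two-input shuffles and 128-bit lanes, which this toy version ignores.

```cpp
#include <cassert>
#include <vector>

// Simplified sketch (not LLVM's code): returns R in [0, N) if every defined
// Mask[I] equals (I + R) % N, or -1 when no single rotation fits. A mask
// entry of -1 means "undefined" and matches any rotation. A fully undefined
// mask also returns -1, since it constrains nothing.
int matchElementRotate(const std::vector<int> &Mask) {
  const int N = static_cast<int>(Mask.size());
  int Rotation = -1;
  for (int I = 0; I < N; ++I) {
    if (Mask[I] < 0)
      continue; // undefined element: no constraint
    assert(Mask[I] < N && "single-input mask index out of range");
    int Candidate = (Mask[I] - I + N) % N;
    if (Rotation < 0)
      Rotation = Candidate; // first defined element fixes the rotation
    else if (Rotation != Candidate)
      return -1; // two elements disagree: not a rotation
  }
  return Rotation;
}

// A byte-level caller would scale the element rotation by the element size,
// whereas an element-level instruction can use the rotation directly.
int elementRotateToBytes(int Rotation, int ElementSizeInBytes) {
  return Rotation * ElementSizeInBytes;
}
```

Keeping the matcher in units of elements lets both kinds of caller share it, with only the byte-level caller applying the scale factor.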

[X86][AVX] Add X86ISD::VALIGN target shuffle decode support · 10439f9e
Simon Pilgrim authored Mar 29, 2020
```
Allows us to combine VALIGN instructions with other shuffles - the combiner doesn't create VALIGN yet though.
```
10439f9e

[mlir] NFC: fix trivial typo in documents · b632bd88

Kazuaki Ishizaki authored Mar 29, 2020

Reviewers: mravishankar, antiagainst, nicolasvasilache, herhut, aartbik, mehdi_amini, bondhugula

Reviewed By: mehdi_amini, bondhugula

Subscribers: bondhugula, jdoerfert, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, bader, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D76993

b632bd88

[VPlan] Use one VPWidenRecipe per original IR instruction. (NFC). · 49d00824

Florian Hahn authored Mar 29, 2020

This patch changes VPWidenRecipe to only store a single original IR
instruction. This is the first required step towards modeling its
operands as VPValues and also towards breaking it up into a
VPInstruction.

Discussed as part of D74695.

Reviewers: Ayal, gilr, rengolin

Reviewed By: gilr

Differential Revision: https://reviews.llvm.org/D76988

49d00824

[PostOrderIterator] Use SmallVector to store stack; NFC · 6ba63510
Nikita Popov authored Mar 29, 2020
```
We use a SmallPtrSet to track visited nodes, so use a SmallVector
of the same size for the stack.
```
6ba63510
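
The pairing of a visited set with an explicit stack can be sketched with standard containers. This is a generic iterative post-order traversal, not the code in PostOrderIterator.h: `std::unordered_set` and `std::vector` stand in for `SmallPtrSet` and `SmallVector`, and the `Graph` alias and `postOrder` function are names assumed for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <utility>
#include <vector>

// Adjacency list: G[N] lists the successors of node N.
using Graph = std::vector<std::vector<int>>;

// Iterative post-order depth-first traversal starting at Root.
std::vector<int> postOrder(const Graph &G, int Root) {
  assert(Root >= 0 && static_cast<std::size_t>(Root) < G.size());
  std::vector<int> Order;
  std::unordered_set<int> Visited;
  // Each stack entry is a node plus the index of its next unexplored successor.
  std::vector<std::pair<int, std::size_t>> Stack;

  Visited.insert(Root);
  Stack.emplace_back(Root, 0);
  while (!Stack.empty()) {
    int Node = Stack.back().first;
    std::size_t NextChild = Stack.back().second;
    if (NextChild < G[Node].size()) {
      ++Stack.back().second; // remember progress before descending
      int Succ = G[Node][NextChild];
      if (Visited.insert(Succ).second) // true only on first visit
        Stack.emplace_back(Succ, 0);
    } else {
      Order.push_back(Node); // all successors done: emit in post-order
      Stack.pop_back();
    }
  }
  return Order;
}
```

Both containers grow with the number of nodes reached, and the stack can never hold more entries than the visited set, which is consistent with the message's choice to give them the same inline size.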

[X86] X86CallFrameOptimization - generalize slow push code path · a7115d51

Simon Pilgrim authored Mar 29, 2020

Replace the explicit isAtom() || isSLM() test with the more general (and more specific) slowTwoMemOps() check to avoid the use of the PUSHrmm push-from-memory case.

This is actually very tricky to test in anything but quite complex code, but the atomic-idempotent.ll tests seem to be the most straightforward to use.

Differential Revision: https://reviews.llvm.org/D76239

a7115d51