- Jul 29, 2020
-
Bruno Ricci authored
-
Bruno Ricci authored
In general Decl::getASTContext() is relatively expensive and here the changes are non-invasive. NFC.
-
Tres Popp authored
Differential Revision: https://reviews.llvm.org/D84832
-
Sanjay Patel authored
There's a slight difference in functionality with the new CHECK lines: before, we allowed either -0.0 or 0.0 for maxnum/minnum. That matches the definition, but we should always get a deterministic result from constant folding within the compiler, so now we assert that we got the single expected result in all cases.
-
Victor Campos authored
Fix testcase introduced in d1a3396b.
-
Simon Pilgrim authored
-
Victor Campos authored
A list of target features is disabled when there is no hardware floating-point support. This is the case when one of the following options is passed to clang:

- -mfloat-abi=soft
- -mfpu=none

However, this option list was missing the "+nofp" extension that can be specified in -march flags, such as "-march=armv8-a+nofp". This patch also disables the unsupported target features when nofp is passed to -march. Differential Revision: https://reviews.llvm.org/D82948
-
Andrew Ng authored
Differential Revision: https://reviews.llvm.org/D84749
-
Simon Pilgrim authored
This will simplify target overrides, and matches what we do for most integer intrinsic costs.
-
Stephan Herhut authored
`std.dim` currently only accepts ranked memrefs and `std.rank` is limited to tensors. Differential Revision: https://reviews.llvm.org/D84790
-
David Green authored
This patch uses the feature added in D79162 to fix the cost of a sext/zext of a masked load, or a trunc for a masked store. Previously, those were considered cheap or even free, but it's not the case as we cannot split the load in the same way we would for normal loads. This updates the costs to better reflect reality, and adds a test for it in test/Analysis/CostModel/ARM/cast.ll. It also adds a vectorizer test that showcases the improvement: in some cases, the vectorizer will now choose a smaller VF when tail-predication is enabled, which results in better codegen. (Because if it were to use a higher VF in those cases, the code we see above would be generated, and the vmovs would block tail-predication later in the process, resulting in very poor codegen overall) Original Patch by Pierre van Houtryve Differential Revision: https://reviews.llvm.org/D79163
-
David Green authored
Currently, getCastInstrCost has limited information about the cast it's rating, often just the opcode and types. Sometimes there is a context instruction as well, but it isn't trustworthy: for instance, when the vectorizer is rating a plan, it calls getCastInstrCost with the old instructions when, in fact, it's trying to evaluate the cost of the instruction post-vectorization. Thus, the current system can get the cost of certain casts incorrect, as the correct cost can vary greatly based on the context in which the cast is used. For example, if the vectorizer queries getCastInstrCost to evaluate the cost of a sext(load) with tail predication enabled, getCastInstrCost will think it's free most of the time, but it's not always free. On ARM MVE, a VLD2 group cannot be extended like a normal VLDR can. Similar situations can come up with how masked loads can be extended when being split. To fix that, this patch adds a new parameter to getCastInstrCost to give it a hint about the context of the cast. It adds a CastContextHint enum which contains the type of the load/store being created by the vectorizer - one for each of the types it can produce. Original patch by Pierre van Houtryve Differential Revision: https://reviews.llvm.org/D79162
-
David Sherwood authored
I have added tests to CodeGen/AArch64/sve-intrinsics-int-arith.ll for doing simple integer add operations on tuple types. Since these tests introduced new warnings due to incorrect use of getVectorNumElements() I have also fixed up these warnings in the same patch. These fixes are:

1. In narrowExtractedVectorBinOp I have changed the code to bail out early for scalable vector types, since we've not yet hit a case that proves the optimisations are profitable for scalable vectors.
2. In DAGTypeLegalizer::WidenVecRes_CONCAT_VECTORS I have replaced calls to getVectorNumElements with getVectorMinNumElements in cases that work with scalable vectors. For the other cases I have added asserts that the vector is not scalable because we should not be using shuffle vectors and build vectors in such cases.

Differential revision: https://reviews.llvm.org/D84016
-
Sjoerd Meijer authored
Optimize the selection of some specific immediates by materializing them with sub/mvn instructions instead of loading them from the constant pool. Patch by Ben Shi, powerman1st@163.com. Differential Revision: https://reviews.llvm.org/D83745
-
Matt Arsenault authored
-
Matt Arsenault authored
-
Matt Arsenault authored
Remove the custom node boilerplate. Not sure why this tried to handle the LDS atomic stuff.
-
Chris Gyurgyik authored
-
David Sherwood authored
Previous patches fixed up all the warnings in this test: llvm/test/CodeGen/AArch64/sve-sext-zext.ll and this change simply checks that no new warnings are added in future. Differential revision: https://reviews.llvm.org/D83205
-
David Sherwood authored
In DAGTypeLegalizer::SplitVecOp_EXTRACT_SUBVECTOR I have replaced calls to getVectorNumElements with getVectorMinNumElements, since this code path works for both fixed and scalable vector types. For scalable vectors the index will be multiplied by VSCALE. Fixes warnings in this test: sve-sext-zext.ll Differential revision: https://reviews.llvm.org/D83198
-
Alex Zinenko authored
The current modeling of LLVM IR types in MLIR is based on the LLVMType class that wraps a raw `llvm::Type *` and delegates uniquing, printing and parsing to LLVM itself. This model makes thread-safe type manipulation hard and is being progressively replaced with a cleaner MLIR model that replicates the type system. In the new model, LLVMType will no longer have an underlying LLVM IR type. Restrict access to this type in the current model in preparation for the change. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D84389
-
Florian Hahn authored
Without asserts, it might take a long time for the tests to crash. Only run them with assert builds.
-
Yevgeny Rouban authored
In addition to removing phi nodes this patch removes any landing pad that the dead exit block might have. Without this fix, the Verifier complains that a new switch instruction jumps to a block with a landing pad. Differential Revision: https://reviews.llvm.org/D84320
-
Pushpinder Singh authored
cmake was still considering the empty value of ${fake_version_inc} even if it was not defined. Reviewed By: vsapsai Differential Revision: https://reviews.llvm.org/D82847
-
Simon Pilgrim authored
-
Georgii Rymar authored
This introduces the printRelocationsHelper() which now contains the common code used by both GNU and LLVM output styles. Differential revision: https://reviews.llvm.org/D83935
-
Hafiz Abid Qadeer authored
Libunwind uses _LIBUNWIND_IS_BAREMETAL in a lot of places but there is no cmake variable to set it. This patch adds such a variable. It is quite like what LIBCXXABI_BAREMETAL does in libcxxabi. Reviewed By: compnerd, #libunwind Differential Revision: https://reviews.llvm.org/D84759
-
Frederik Gossen authored
Operating on indices and extent tensors directly, the type conversion is no longer needed for the supported cases. Differential Revision: https://reviews.llvm.org/D84442
-
Stephan Herhut authored
This adds conversions for const_size and to_extent_tensor. Also, cast-like operations are now folded away if the source and target types are the same. Differential Revision: https://reviews.llvm.org/D84745
-
Simon Pilgrim authored
If the mask input to getV4X86ShuffleImm8 only refers to a single source element (+ undefs) then canonicalize to a full broadcast. getV4X86ShuffleImm8 defaults to inline values for undefs, which can be useful for shuffle widening/narrowing but does leave SimplifyDemanded* calls thinking the shuffle depends on unnecessary elements. I'm still investigating what we should do more generally to avoid these undemanded elements, but the broadcast case was a simpler win.
-
Frederik Gossen authored
Differential Revision: https://reviews.llvm.org/D84441
-
Xing GUO authored
This patch makes the check lines stricter.
-
Xing GUO authored
Normally, we use yaml::Hex* to describe the length, offsets, address/segment size. NFC.
-
Kirill Bobyrev authored
Some buildbots require explicit clangdSupport dependency: http://lab.llvm.org:8011/builders/llvm-avr-linux/builds/3996/steps/build%20stage%201/logs/stdio
-
George Mitenkov authored
Conversion of `spv.BranchConditional` now supports branch weights that are mapped to weights vector in `llvm.cond_br`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84657
-
Nathan Ridge authored
Summary: It returned an invalid location in case of a constrained-parameter with no explicit arguments. Reviewers: hokein Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D84613
-
Juneyoung Lee authored
-
Stephan Bergmann authored
...which is set based on HAVE_RPC_XDR_H. At least Fedora 32 does not have a /usr/include/rpc/xdr.h, so it failed this test, which was introduced with <https://reviews.llvm.org/D83358> "[Sanitizers] Add interceptor for xdrrec_create". Differential Revision: https://reviews.llvm.org/D84740
-
George Mitenkov authored
Added a check for the 'Function' storage class in the `spv.globalVariable` verifier, since it can only be used with `spv.Variable`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84731
-
George Mitenkov authored
This patch adds support for Volatile and Nontemporal memory accesses to `spv.Load` and `spv.Store`. These attributes are modelled with `volatile` and `nontemporal` flags. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D84739
-