Commits · 9e43e7ec111d9029da1b2098733f67a239e4edef · Raul Torres / llvm-target-spread

Jun 07, 2021

[clang] Fix using-enum breakage · 9e43e7ec

Nathan Sidwell authored Jun 07, 2021

This fixes a build breakage.  I managed to attach this particular
change to the wrong diff in the stack when rebasing.  And flubbed
testing :(

Differential Revision: https://reviews.llvm.org/D101777

9e43e7ec

[AMDGPU] Use s_add_i32 for address additions · 96e1fcb1

Sebastian Neubauer authored Jun 07, 2021

This allows the add instruction to be converted to s_addk_i32, and to
v_add_nc_u32 rather than v_add_co_u32 when it is converted to a VALU
instruction.

Differential Revision: https://reviews.llvm.org/D103322

96e1fcb1

[test] Use host platform specific error message substitution · 7e176ff2

Abhina Sreeskantharajan authored Jun 07, 2021

This testcase is failing on z/OS because the regex doesn't match the spelling. This patch modifies the testcase to use the error substitution so it will pass on all platforms.

Reviewed By: jhenderson

Differential Revision: https://reviews.llvm.org/D103804

7e176ff2

[Constants] Extend support for scalable-vector splats · fd3b5569

Fraser Cormack authored May 31, 2021

This patch extends the various "isXXX" functions of the `Constant` class
to include scalable-vector splats.

In several "isXXX" functions, code that was separately inspecting
`ConstantVector` and `ConstantDataVector` was unified to use
`getSplatValue`, which already includes support for said splats.

In the various "isNotXXX" functions, code was added to check whether the
scalar splat value -- if any -- satisfies the predicate.

An extra fix for `isNotMinSignedValue` was included, as it previously
crashed when passed a scalable-vector type because it unconditionally
cast to `FixedVectorType`.

These changes address numerous missed optimizations, a compiler crash
mentioned above and -- perhaps most egregiously -- an infinite loop in
InstCombine due to the compiler breaking canonical form when it failed
to pick up on a splat in a select instruction.

Test cases have been added to cover as many of these functions as
possible, though existing coverage is slim; it doesn't appear that there
are any in-tree uses of `Constant::isNegativeZeroValue`, for example.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D103421

fd3b5569

[clangd] Bump recommended gRPC version (1.33.2 -> 1.36.3) · d12000ca

Kirill Bobyrev authored Jun 07, 2021

Context: https://github.com/clangd/clangd/pull/783

Reviewed By: kadircet

Differential Revision: https://reviews.llvm.org/D103393

d12000ca

[clang][NFC] Break out enum completion from other type context completion · 84ab3155

Nathan Sidwell authored May 04, 2021

This pre-patch for using-enum breaks out the enum completion that it
will need from the existing scope completion logic.

Differential Revision: https://reviews.llvm.org/D102239

84ab3155

[clang][NFC] Break out BaseUsingDecl from UsingDecl · ddda05ad

Nathan Sidwell authored May 03, 2021

This is a pre-patch for adding using-enum support.  It breaks out
the shadow decl handling of UsingDecl to a new intermediate base
class, BaseUsingDecl, altering the decl hierarchy to:

def BaseUsing : DeclNode<Named, "", 1>;
  def Using : DeclNode<BaseUsing>;
def UsingPack : DeclNode<Named>;
def UsingShadow : DeclNode<Named>;
  def ConstructorUsingShadow : DeclNode<UsingShadow>;

Differential Revision: https://reviews.llvm.org/D101777

ddda05ad

[AMDGPU] Increase alignment of LDS globals if necessary before LDS lowering. · 52ffbfdf

hsmahesha authored Jun 07, 2021

Before packing LDS globals into a sorted structure, make sure that
their alignment is properly updated based on their size. This ensures
that the members of the sorted structure are properly aligned, which
further reduces the probability of unaligned LDS access.

Reviewed By: rampitec

Differential Revision: https://reviews.llvm.org/D103261

52ffbfdf

[InstCombine] Missed optimization for pow(x, y) * pow(x, z) with fast-math · 7736c193

Daniil Seredkin authored Jun 07, 2021

If FP reassociation (fast-math) is allowed, then LLVM is free to do the
following transformation: pow(x, y) * pow(x, z) -> pow(x, y + z).
This patch adds this transformation and tests for it.
For more details, see https://bugs.llvm.org/show_bug.cgi?id=47205

It handles two cases:

1. When operands of fmul are different instructions

%4 = call reassoc float @llvm.pow.f32(float %0, float %1)
%5 = call reassoc float @llvm.pow.f32(float %0, float %2)
%6 = fmul reassoc float %5, %4
-->
%3 = fadd reassoc float %1, %2
%4 = call reassoc float @llvm.pow.f32(float %0, float %3)

2. When operands of fmul are the same instruction

%4 = call reassoc float @llvm.pow.f32(float %0, float %1)
%5 = fmul reassoc float %4, %4
-->
%3 = fadd reassoc float %1, %1
%4 = call reassoc float @llvm.pow.f32(float %0, float %3)
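
The identity behind this fold can be checked numerically. The following is a minimal C++ sketch, not the InstCombine code: the helper names `productOfPows`, `powOfSum` and `agreeWithin` are hypothetical, and the base is assumed to be positive so that `std::pow` stays well defined. The two forms agree only up to rounding, which is why the fold is gated on the reassoc flag.

```cpp
#include <cassert>
#include <cmath>

// Unfolded form: pow(x, y) * pow(x, z).
double productOfPows(double x, double y, double z) {
  assert(x > 0.0 && "sketch assumes a positive base");
  return std::pow(x, y) * std::pow(x, z);
}

// Folded form: pow(x, y + z).
double powOfSum(double x, double y, double z) {
  assert(x > 0.0 && "sketch assumes a positive base");
  return std::pow(x, y + z);
}

// True when the two forms agree within a relative tolerance.
bool agreeWithin(double x, double y, double z, double relTol) {
  const double a = productOfPows(x, y, z);
  const double b = powOfSum(x, y, z);
  return std::fabs(a - b) <= relTol * std::fmax(std::fabs(a), std::fabs(b));
}
```

Calling `agreeWithin(x, y, y, tol)` corresponds to case 2, where both fmul operands are the same pow call.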

Differential Revision: https://reviews.llvm.org/D102574

7736c193

[MLIR][SPIRV] Use getAsmResultName(...) hook for AddressOfOp. · 2def12eb

KareemErgawy authored Jun 07, 2021

Implements better naming for results of spv.mlir.addressof ops by making it
inherit from OpAsmOpInterface and implementing the associated
getAsmResultName(...) hook.

Reviewed By: antiagainst

Differential Revision: https://reviews.llvm.org/D103594

2def12eb

[clang] Fix a crash during code completion · 721476e6

Adam Czachorowski authored Jun 01, 2021

During code completion, lookupInDeclContext() calls
CodeCompletionDeclConsumer::FoundDecl(), which can mutate StoredDeclsMap,
over which lookupInDeclContext() iterates. This can lead to invalidation
of iterators and an assert()-crash.

Example code where this happens:
 #include <list>
 int main() {
   std::list<int>;
   std::^
 }
with code completion on ^ with -std=c++20.

I do not have a repro case that does not need the standard library.

This fix stores pointers to NamedDecls in a temporary vector, then
visits them outside of the main loop, when StoredDeclsMap iterators are
gone.
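
The two-phase pattern described above can be sketched outside of clang. This is a minimal illustration, not the actual patch: `DeclTable`, `consume` and `visitAll` are hypothetical names, a `std::unordered_map` stands in for StoredDeclsMap, and the sketch snapshots keys where the real fix stores pointers to NamedDecls.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

using DeclTable = std::unordered_map<std::string, int>;

// Hypothetical consumer: visiting a name may add a new entry to the table,
// which can rehash it and invalidate any live iterators.
void consume(DeclTable &table, const std::string &name) {
  assert(table.count(name) == 1 && "visited name must be in the table");
  table.emplace(name + ".shadow", 0);
}

// Two-phase visit: snapshot the names first, then call the consumer only
// once no iterator into the table is alive.
void visitAll(DeclTable &table) {
  std::vector<std::string> pending;
  pending.reserve(table.size());
  for (const auto &entry : table)
    pending.push_back(entry.first);
  for (const std::string &name : pending)
    consume(table, name);
}
```

Calling `consume` directly inside the first loop would mutate the table while it is being iterated, which is the hazard the commit describes.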

Differential Revision: https://reviews.llvm.org/D103472

721476e6

ExternalASTSource.h - remove unused StringRef and <string> includes. NFCI. · 8b58092d

Simon Pilgrim authored Jun 07, 2021

8b58092d

[gn build] fix syntax error from 50bb1b93 · cf29cdcc

Nico Weber authored Jun 07, 2021

cf29cdcc

[clangd] Drop TestTUs dependency on gtest · 4728aca9

Kadir Cetinkaya authored Jun 04, 2021

TestTU now prints errors to llvm::errs and aborts on failures via
llvm_unreachable, rather than executing ASSERT_FALSE.

We'd like to make use of these testing libraries in different test suites that
might be compiled with a different gtest version than the one LLVM has.

Differential Revision: https://reviews.llvm.org/D103685

4728aca9

[AArch64][SVE] Improve codegen for dupq SVE ACLE intrinsics · 60c9b5f3

Bradley Smith authored May 20, 2021

Use llvm.experimental.vector.insert instead of storing into an alloca
when generating code for these intrinsics. This defers the codegen of
the generated vector to instruction selection, allowing existing
shufflevector style optimizations to apply.

Additionally, introduce a new target transform that can recognise fixed
predicate patterns in the svbool variants of these intrinsics.

Differential Revision: https://reviews.llvm.org/D103082

60c9b5f3

[mlir][linalg] Add padding helper functions to PadTensorOp · fe0befb1

Matthias Springer authored Jun 07, 2021

Add helper functions to quickly check for zero low/high padding.

Differential Revision: https://reviews.llvm.org/D103781

fe0befb1

[LV] Update more target-specific tests after 23c2f2e6. · 8344e215

Florian Hahn authored Jun 07, 2021

8344e215

[Matrix] Add -matrix-allow-contract=false to tests. · 87c99d2b

Florian Hahn authored Jun 07, 2021

Explicitly specify contract behavior, so the tests are independent of
the current default of the flag.

87c99d2b

[mlir] Add offset/stride helper functions to OffsetSizeAndStrideOpInterface · 6e7bbdd6

Matthias Springer authored Jun 07, 2021

* Add hasUnitStride and hasZeroOffset to OffsetSizeAndStrideOpInterface. These functions are useful for various patterns. E.g., some vectorization patterns apply only for tensor ops with zero offsets and/or unit stride.
* Add getConstantIntValue and isEqualConstantInt helper functions, which are useful for implementing the two above functions, as well as various patterns.

Differential Revision: https://reviews.llvm.org/D103763

6e7bbdd6

[AMDGPU][Libomptarget] Remove atlc global · 4f8bc7ca

Pushpinder Singh authored Jun 07, 2021

This global struct used to hold various flags for monitoring the
initialization of hsa.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D103795

4f8bc7ca

[OpenCL] Add const attribute to ctz() builtins · 9b14670f

Stuart Brady authored Mar 01, 2021

Reviewed By: svenvh

Differential Revision: https://reviews.llvm.org/D97725

9b14670f

[llvm] Add interface to order inlining · 4a0de622

Liqiang Tao authored Jun 07, 2021

This patch abstracts Calls in Inliner:run() into InlineOrder.
With this patch, it's possible to customize the inlining order,
e.g. to use a queue or a priority queue.
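
To illustrate what a pluggable order buys, here is a minimal C++ sketch under stated assumptions. It is not the LLVM interface: `CallSite`, `CallOrder`, `FifoOrder`, `CheapestFirstOrder` and `drain` are hypothetical names, and the integer `cost` is an invented ordering key.

```cpp
#include <cassert>
#include <deque>
#include <queue>
#include <vector>

// Hypothetical stand-in for a call site: an id plus a cost used for ordering.
struct CallSite {
  int id;
  int cost;
};

// Abstract worklist that an inliner loop would pull call sites from.
class CallOrder {
public:
  virtual ~CallOrder() = default;
  virtual bool empty() const = 0;
  virtual void push(CallSite cs) = 0;
  virtual CallSite pop() = 0;
};

// Queue-based order: call sites come out in insertion order.
class FifoOrder : public CallOrder {
  std::deque<CallSite> calls;

public:
  bool empty() const override { return calls.empty(); }
  void push(CallSite cs) override { calls.push_back(cs); }
  CallSite pop() override {
    assert(!calls.empty() && "pop on empty order");
    CallSite cs = calls.front();
    calls.pop_front();
    return cs;
  }
};

// Priority-queue-based order: the lowest-cost call site comes out first.
class CheapestFirstOrder : public CallOrder {
  struct CostGreater {
    bool operator()(const CallSite &a, const CallSite &b) const {
      return a.cost > b.cost;
    }
  };
  std::priority_queue<CallSite, std::vector<CallSite>, CostGreater> calls;

public:
  bool empty() const override { return calls.empty(); }
  void push(CallSite cs) override { calls.push(cs); }
  CallSite pop() override {
    assert(!calls.empty() && "pop on empty order");
    CallSite cs = calls.top();
    calls.pop();
    return cs;
  }
};

// Driver loop that depends only on the abstract interface.
std::vector<int> drain(CallOrder &order) {
  std::vector<int> ids;
  while (!order.empty())
    ids.push_back(order.pop().id);
  return ids;
}
```

Because `drain` only sees `CallOrder`, swapping the queue for the priority queue changes the visiting order without touching the loop.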

Reviewed By: kazu

Differential Revision: https://reviews.llvm.org/D103315

4a0de622

[lld/mac] Implement support for searching dylibs with @rpath/ in install name · c5ffe979

Nico Weber authored Jun 06, 2021

Also adjust a few comments, and move the DylibFile comment talking about
umbrella next to the parameter again.

Differential Revision: https://reviews.llvm.org/D103783

c5ffe979

[clang] NFC: test for undefined behaviour in RawComment::getFormattedText() · aa0d7179

Dmitry Polukhin authored Jun 04, 2021

This diff adds a test case for the issue fixed in https://reviews.llvm.org/D77468,
for which no regression test was added at the time. On Clang 9 the issue
caused a crash in clangd during code completion.

Test Plan: check-clang-unit

Differential Revision: https://reviews.llvm.org/D103722

aa0d7179

[NFC] Fix semantic discrepancy for MVT::LAST_VALUETYPE · 1da2c7d2

Guillaume Chatelet authored Jun 07, 2021

Differential Revision: https://reviews.llvm.org/D103251

1da2c7d2

[PhaseOrdering] Update tests after 23c2f2e6. · 131343d3

Florian Hahn authored Jun 07, 2021

131343d3

ASTConcept.h - remove unused <string> include. NFCI. · 30a89a75

Simon Pilgrim authored Jun 07, 2021

30a89a75

[SimpleLoopBoundSplit] Split Bound of Loop which has conditional branch with IV · a2a0ac42

Jingu Kang authored May 06, 2021

This pass transforms loops that contain a conditional branch on the induction
variable. For example, it transforms the code on the left into the code on the right:

                             newbound = min(n, c)
 while (iv < n) {            while(iv < newbound) {
   A                           A
   if (iv < c)                 B
     B                         C
   C                         }
 }                           if (iv != n) {
                               while (iv < n) {
                                 A
                                 C
                               }
                             }
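
The equivalence of the two forms can be checked with a small C++ sketch. This is an illustration under assumptions, not the pass itself: the induction variable is assumed to start at 0 and step by 1, the blocks A, B and C are modelled as characters appended to a trace, and the function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <string>

// Left-hand form: one loop with a branch on the induction variable.
std::string originalLoop(int n, int c) {
  std::string trace;
  for (int iv = 0; iv < n; ++iv) {
    trace += 'A';
    if (iv < c)
      trace += 'B';
    trace += 'C';
  }
  return trace;
}

// Right-hand form: the bound is split at min(n, c), so neither loop
// needs the branch any more.
std::string splitLoop(int n, int c) {
  std::string trace;
  int iv = 0;
  const int newBound = std::min(n, c);
  for (; iv < newBound; ++iv)
    trace += "ABC";
  if (iv != n) {
    for (; iv < n; ++iv)
      trace += "AC";
  }
  return trace;
}

// Aborts via assert if the two forms disagree for the given bounds.
void checkSplit(int n, int c) {
  assert(originalLoop(n, c) == splitLoop(n, c));
}
```

With n = 5 and c = 2, for instance, both forms run A, B, C for the first two iterations and A, C for the remaining three.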

Differential Revision: https://reviews.llvm.org/D102234

a2a0ac42

[Clang] Support a user-defined __dso_handle · b31f41e7

Andrew Savonichev authored May 24, 2021

This fixes PR49198: Wrong usage of __dso_handle in user code leads to
a compiler crash.

When Init is an address of the global itself, we need to track it
across RAUW. Otherwise the initializer can be destroyed if the global
is replaced.

Differential Revision: https://reviews.llvm.org/D101156

b31f41e7

[LV] Mark increment of main vector loop induction variable as NUW. · 23c2f2e6

Florian Hahn authored Jun 07, 2021

This patch marks the induction increment of the main induction variable
of the vector loop as NUW when not folding the tail.

If the tail is not folded, we know that End - Start >= Step (either
statically or through the minimum iteration checks). We also know that both
Start % Step == 0 and End % Step == 0. We exit the vector loop if %IV +
%Step == %End. Hence we must exit the loop before %IV + %Step unsigned
overflows and we can mark the induction increment as NUW.
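
The argument can be exercised on a narrow type. The following C++ sketch is an illustration only: it assumes an 8-bit unsigned induction variable with End >= Start, and `incrementNeverWraps` is a hypothetical helper, not part of the vectorizer.

```cpp
#include <cassert>
#include <cstdint>

// Walks an 8-bit induction variable from start in steps of step, exiting
// when iv + step == end as described above. Returns true if no executed
// increment wrapped past 255.
bool incrementNeverWraps(std::uint8_t start, std::uint8_t end,
                         std::uint8_t step) {
  // Preconditions stated in the commit message.
  assert(step != 0);
  assert(start % step == 0 && end % step == 0);
  assert(end >= start && end - start >= step);

  std::uint8_t iv = start;
  while (true) {
    // Compute the increment in a wider type so a wrap can be detected.
    const unsigned wide = static_cast<unsigned>(iv) + step;
    if (wide > 255u)
      return false; // the 8-bit add would have wrapped
    iv = static_cast<std::uint8_t>(wide);
    if (iv == end)
      return true; // loop exit reached before any wrap
  }
}
```

Under the stated preconditions every increment lands on a multiple of the step that is at most End, so the exit is always reached first.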

This should make SCEV return more precise bounds for the created vector
loops, used by later optimizations, like late unrolling.

At the moment quite a few tests still need to be updated, but before
doing so I'd like to get initial feedback to make sure I am not missing
anything.

Note that this could probably be further improved by using information
from the original IV.

An attempt at modeling the assumption in Alive2:
https://alive2.llvm.org/ce/z/H_DL_g

Part of a set of fixes required for PR50412.

Reviewed By: mkazantsev

Differential Revision: https://reviews.llvm.org/D103255

23c2f2e6

[AMDGPU] Fix MC tests for v_fmaak_f16 and v_fmamk_f16 · 9e9edede

Jay Foad authored Jun 04, 2021

This looks like a mistake when the tests were committed in r363946.
There were two sets of tests for the f32 variant of these instructions,
instead of one set for f16 and one set for f32.

Differential Revision: https://reviews.llvm.org/D103699

9e9edede

[mlir][linalg] Cleanup LinalgOp usage in comprehensive bufferization. · caf26612

Tobias Gysi authored Jun 07, 2021

Replace the uses of deprecated Structured Op Interface methods in ComprehensiveBufferize.cpp. This patch is based on https://reviews.llvm.org/D103394.

Differential Revision: https://reviews.llvm.org/D103520

caf26612

[OpenCL] Fix missing addrspace on implicit move assignment operator · 438cf557

Ole Strohm authored Jun 07, 2021

This fixes the missing address space on `this` in the implicit move
assignment operator.
The function called here is an abstraction around the removed lines that
also sets the address space correctly.
The call is copied from CopyConstructor, CopyAssignment and MoveConstructor,
all of which use this function, and now MoveAssignment does too.

Fixes: PR50259

Reviewed By: svenvh

Differential Revision: https://reviews.llvm.org/D103252

438cf557

[AMDGPU][Libomptarget] Rework logic for locating kernarg pools · f5f329a3

Pushpinder Singh authored Jun 03, 2021

The previous logic always used the first kernarg pool found to allocate
kernel args. This patch changes it to use only a kernarg pool that
has non-zero size. The logic is also reworked to not use any globals.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D103600

f5f329a3

Fixed the build failure of yaml2obj in XCOFFEmitter.cpp: · bcb20aa7

Esme-Yi authored Jun 07, 2021

  error: ambiguous overload for 'operator=='
  (operand types are 'llvm::yaml::Hex16' and 'llvm::XCOFF::MagicNumber')
     Is64Bit = Obj.Header.Magic == XCOFF::XCOFF64;

bcb20aa7

[yaml2obj] Initial the support of yaml2obj for 32-bit XCOFF. · 50bb1b93

Esme-Yi authored Jun 07, 2021

Summary: The patch implements the mapping of the YAML
information to an XCOFF object file, enabling the yaml2obj
tool for XCOFF. Currently only 32-bit is supported.

Reviewed By: jhenderson, shchenz

Differential Revision: https://reviews.llvm.org/D95505

50bb1b93

[lld/mac] Implement support for searching dylibs with @loader_path/ in install name · 52489021

Nico Weber authored Jun 06, 2021

Differential Revision: https://reviews.llvm.org/D103779

52489021

[lld/mac] Implement support for searching dylibs with @executable_path/ in install name · a48bd587

Nico Weber authored Jun 06, 2021

Differential Revision: https://reviews.llvm.org/D103775

a48bd587

[lld/mac] Rename DylibFile::dylibName to DylibFile::installName · 7def7006

Nico Weber authored Jun 06, 2021

The flag to set it is called `-install_name`, and it's called `installName` in tbd files.

No behavior change.

Differential Revision: https://reviews.llvm.org/D103776

7def7006

[lld/mac] Use fewer magic numbers in magic $ld$ handling code · e9104374

Nico Weber authored Jun 06, 2021

Also simplify a conditional and de-alias a variable.
Minor cleanups, no behavior change.

Differential Revision: https://reviews.llvm.org/D103774

e9104374