Commits · 37ca7a795b277c20c02a218bf44052278c03344b · Lorenzo Albano / LLVM bpEVL

Oct 16, 2021

Fix missing failures in clang-ppc64be* and retry fixing clang-x64-windows-msvc · 37ca7a79
Juneyoung Lee authored Oct 16, 2021

37ca7a79

[MLIR] Generalize Affine dependence analysis using Affine Relations · 52d6c5df

Groverkss authored Oct 16, 2021

This patch removes code very specific to affine dependence analysis and
refactors it as a FlatAfffineRelation.

A FlatAffineRelation represents a set of ordered pairs (domain -> range) where
"domain" and "range" are tuples of identifiers. These relations are used to
represent an "access relation" for memory access on a memref. An access
relation maps elements of an iteration domain to the element(s) of an array
domain accessed by that iteration of the associated statement through some
array reference. The dependence relation representing the dependence
constraints between two memory accesses can be built by composing the access
relation of the destination access by the inverse of the access relation of
source access.

This patch does not change the functionality of the existing dependence
analysis in checkMemrefAccessDependence, but refactors it to use
FlatAffineRelations to deduplicate code and enable code reuse for future
development of features like scheduling, value-based dependence analysis, etc.

Reviewed By: bondhugula

Differential Revision: https://reviews.llvm.org/D110563

52d6c5df

Fix lit test failures in clang-ppc* and clang-x64-windows-msvc · 9aa6c72b
Juneyoung Lee authored Oct 16, 2021

9aa6c72b
Resolve lit failures in clang after 8ca4b3ef 's land · 705387c5
Juneyoung Lee authored Oct 16, 2021

705387c5

[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and... · 8ca4b3ef

Juneyoung Lee authored Oct 15, 2021

[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default (2)

This patch updates test files after D105169.
Autogenerated test codes are changed by `utils/update_cc_test_checks.py,` and non-autogenerated test codes are changed as follows:

(1) I wrote a python script that (partially) updates the tests using regex: {F18594904} The script is not perfect, but I believe it gives hints about which patterns are updated to have `noundef` attached.

(2) The remaining tests are updated manually.

Reviewed By: eugenis

Differential Revision: https://reviews.llvm.org/D108453

8ca4b3ef

[Clang/Test]: Rename enable_noundef_analysis to disable-noundef-analysis and turn it off by default · 80dba72a

Juneyoung Lee authored Oct 15, 2021

Turning on `enable_noundef_analysis` flag allows better codegen by removing freeze instructions.
I modified clang by renaming `enable_noundef_analysis` flag to `disable-noundef-analysis` and turning it off by default.

Test updates are made as a separate patch: D108453

Reviewed By: eugenis

Differential Revision: https://reviews.llvm.org/D105169

80dba72a

[Polly][docs] Fix Sphinx warning. · da2e1f62
Michael Kruse authored Oct 15, 2021
```
ReStructured Text is not Markdown.
```
da2e1f62
[X86] Add more tests for D111858. NFC · f6cd43c0
Craig Topper authored Oct 15, 2021
```
Add tests with sub instead of neg.
```
f6cd43c0

[WebAssembly] Add prototype relaxed laneselect instructions · da079428

Zhi An Ng authored Oct 15, 2021

Add i8x16, i16x8, i32x4, i64x2 laneselect instructions. These are only
exposed as builtins, and require user opt-in.

da079428

[mlir] Add folder for shape.add · 965ec6db
Jacques Pienaar authored Oct 15, 2021

965ec6db

[mlir][sparse] run less combinations of SpMM in test (to reduce runtime) · e9b1c974

Aart Bik authored Oct 15, 2021

This revision also adds a few passes to the sparse compiler part to unify the transformation sequence with all other paths we currently use.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D111900

e9b1c974

[MLIR][TOSA] Drop "OnTensors" suffix · efc6fe96

Geoffrey Martin-Noble authored Oct 15, 2021

This is the only lowering to Linalg Tosa has, so it's needlessly
verbose. Likely this was a carry over from IREE's usage where we
originally lowered to linalg on buffers (the only linalg that existed at
the time), so the everything on tensors needed the suffix. We're dropping
it in IREE also, having transitioned entirely to using Linalg on
tensors.

Reviewed By: sjarus

Differential Revision: https://reviews.llvm.org/D111911

efc6fe96

[ELF] Require two-dash form for --pack-dyn-relocs · f8ee74fc
Fangrui Song authored Oct 15, 2021
```
LLD specific options can be more rigid.
Also add a test.
```
f8ee74fc

[clang] fix typo correction not looking for candidates in base classes. · 489561d4

Matheus Izvekov authored Oct 14, 2021



RecordMemberExprValidator was not looking through ElaboratedType
nodes when looking for candidates which occur in base classes.

Signed-off-by: Matheus Izvekov <mizvekov@gmail.com>

Reviewed By: rsmith

Differential Revision: https://reviews.llvm.org/D111830

489561d4

Revert "[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols" · 1830ec94
Anshil Gandhi authored Oct 15, 2021
```
This reverts commit 03375a3f.
```
1830ec94

$Lawrence D'\''Anna's avatar$

Fix Xcode project for debugserver · 4594f811

Lawrence D'\''Anna authored Oct 15, 2021

It seems StringConvert.cpp was moved, and the Xcode project file
wasn't updated.

Reviewed By: JDevlieghere

Differential Revision: https://reviews.llvm.org/D111910

4594f811

Oct 15, 2021

[ConstantRange] Compute precise shl range for single elements · 587493b4

Nikita Popov authored Oct 15, 2021

For the common case where the shift amount is constant (a single
element range) we can easily compute a precise range (up to
unsigned envelope), so do that.

587493b4

[HIP] Relax conditions for address space cast in builtin args · f92db6d3

Anshil Gandhi authored Oct 15, 2021

Allow (implicit) address space casting between LLVM-equivalent
target address spaces.

Reviewed By: yaxunl, tra

Differential Revision: https://reviews.llvm.org/D111734

f92db6d3

[NFC] Make Assume2KnowledgeMap's typedef more precise · 2a2432e9
Arthur Eubanks authored Oct 15, 2021

2a2432e9

[InstCombine] generalize fold for mask-with-signbit-splat, part 2 · a49f5386

Sanjay Patel authored Oct 15, 2021

This removes an over-specified fold. The more general transform
was added with:
727e642e

There's a difference on an existing test that shows a potentially
unnecessary use limit on an icmp fold.

That fold is in InstCombinerImpl::foldICmpSubConstant(), and IIRC
there was some back-and-forth on it and similar folds because they
could cause analysis/passes (SCEV, LSR?) to miss optimizations.

Differential Revision: https://reviews.llvm.org/D111410

a49f5386

[AMDGPU] Precommit fused-bitlogic.ll test. NFC. · cd538a6b
Stanislav Mekhanoshin authored Oct 15, 2021

cd538a6b

[ConstantRange] Support checking optimality for subset of inputs (NFC) · 9eb8040a

Nikita Popov authored Oct 15, 2021

We always want to check correctness, but for some operations we
can only guarantee optimality for a subset of inputs. Accept an
additional predicate that determines whether optimality for a
given pair of ranges should be checked.

9eb8040a

Revert "[HIP] Relax conditions for address space cast in builtin args" · 53fc5100
Anshil Gandhi authored Oct 15, 2021
```
This reverts commit 3b48e117.
```
53fc5100

[InstCombine] generalize fold for mask-with-signbit-splat · 727e642e

Sanjay Patel authored Oct 15, 2021

(iN X s>> (N-1)) & Y --> (X < 0) ? Y : 0

https://alive2.llvm.org/ce/z/qeYhdz

I was looking at a missing abs() transform and found my way to this
generalization of an existing fold that was added with D67799.
As discussed in that review, we want to make sure codegen handles
this difference well, and for all of the targets/types that I
spot-checked, it looks good.

I am leaving the existing fold in place in this commit because
it covers a potentially missing icmp fold, but I plan to remove
that as a follow-up commit as suggested during review.

Differential Revision: https://reviews.llvm.org/D111410

727e642e

[HIP] Relax conditions for address space cast in builtin args · 3b48e117

Anshil Gandhi authored Oct 15, 2021

Allow (implicit) address space casting between LLVM-equivalent
target address spaces.

Reviewed By: yaxunl

Differential Revision: https://reviews.llvm.org/D111734

3b48e117

[BasicAA] Rename ExtendedValue to CastedValue (NFC) · 0c52c271

Nikita Popov authored Oct 15, 2021

As suggested on D110977, rename ExtendedValue to CastedValue,
because it will contain more than just extensions in the future.

0c52c271

[ConstantRange] Better diagnostic for correctness test failure (NFC) · 82e858d1

Nikita Popov authored Oct 15, 2021

Print a friendly error message including the inputs, result and
not-contained element if an exhaustive correctness test fails,
same as we do if the optimality test fails.

82e858d1

[modules] Make a module map referenced by a system map a system one too. · d0e7bdc2

Volodymyr Sapsai authored Oct 08, 2021

Mimic the behavior of including headers where a system includer makes an
includee a system header too.

rdar://84049469

Differential Revision: https://reviews.llvm.org/D111476

d0e7bdc2

[VectorCombine] Add option to only run scalarization transforms. · 4a1d63d7

Florian Hahn authored Oct 15, 2021

This patch adds a pass option to only run transforms that scalarize
vector operations and do not create new vector instructions.

When running VectorCombine early in the pipeline introducing new vector
operations can have negative effects, like blocking loop or SLP
vectorization. To avoid regressions, restrict the early VectorCombine
run (when using -enable-matrix) to only perform scalarization and not
introduce new vector operations.

This is done as option to the pass directly, which is then set when
adding the pass to the pipeline. This is done for the new pass manager
only.

Reviewed By: spatel

Differential Revision: https://reviews.llvm.org/D111800

4a1d63d7

[compiler-rt/profile] Hide __llvm_profile_raw_version · 69708477

Pirama Arumuga Nainar authored Oct 15, 2021

Hide __llvm_profile_raw_version so as not to resolve reference from a
dependent shared object.  Since libclang_rt.profile is added later in
the command line, a definition of __llvm_profile_raw_version is not
included if it is provided from an earlier object, e.g.  from a shared
dependency.

This causes an extra dependence edge where if libA.so depends on libB.so
and both are coverage-instrumented, libA.so uses libB.so's definition of
__llvm_profile_raw_version.  This leads to a runtime link failure if the
libB.so available at runtime does not provide this symbol (but provides
the other dependent symbols).  Such a scenario can occur in Android's
mainline modules.
E.g.:
  ld -o libB.so libclang_rt.profile-x86_64.a
  ld -o libA.so -l B libclang_rt.profile-x86_64.a

libB.so has a global definition of __llvm_profile_raw_version.  libA.so
uses libB.so's definition of __llvm_profile_raw_version.  At runtime,
libB.so may not be coverage-instrumented (i.e. not export
__llvm_profile_raw_version) so runtime linking of libA.so will fail.

Marking this symbol as hidden forces each binary to use the definition
of __llvm_profile_raw_version from libclang_rt.profile.

Differential Revision: https://reviews.llvm.org/D111759

69708477

[WebAssembly] Add import info to `dylink` section of shared libraries · 659a0839

Sam Clegg authored Oct 07, 2021

See https://github.com/WebAssembly/tool-conventions/pull/175

Differential Revision: https://reviews.llvm.org/D111345

659a0839

[SelectionDAG] Fix typo in option help · cfd155c4
Mingming Liu authored Oct 15, 2021
```
Reviewed By: MaskRay

Differential Revision: https://reviews.llvm.org/D111867
```
cfd155c4

[HIP] [AlwaysInliner] Disable AlwaysInliner to eliminate undefined symbols · 03375a3f

Anshil Gandhi authored Oct 15, 2021

By default clang emits complete contructors as alias of base constructors if they are the same.
The backend is supposed to emit symbols for the alias, otherwise it causes undefined symbols.
@yaxunl observed that this issue is related to the llvm options `-amdgpu-early-inline-all=true`
and `-amdgpu-function-calls=false`. This issue is resolved by only inlining global values
with internal linkage. The `getCalleeFunction()` in AMDGPUResourceUsageAnalysis also had
to be extended to support aliases to functions. inline-calls.ll was corrected appropriately.

Reviewed By: yaxunl, #amdgpu

Differential Revision: https://reviews.llvm.org/D109707

03375a3f

[lld/mac] Mark private externs with GOT relocs as LOCAL in indirect symbtab · 4e572db0

Nico Weber authored Oct 14, 2021

prepareSymbolRelocation() in Writer.cpp adds both symbols that need binding and
symbols relocated with a pointer relocation to the got.

Pointer relocations are emitted for non-movq GOTPCREL(%rip) loads. (movqs
become GOT_LOADs so that the linker knows they can be relaxed to leaqs, while
others, such as addq, become just GOT -- a pointer relocation -- since they
can't be relaxed in that way).

For example, this C file produces a private_extern GOT relocation when
compiled with -O2 with clang:

extern const char kString[];
const char* g(int a) { return kString + a; }

Linkers need to put pointer-relocated symbols into the GOT, but ld64 marks them
as LOCAL in the indirect symbol table. This matters, since `strip -x` looks at
the indirect symbol table when deciding what to strip.

The indirect symtab emitting code was assuming that only symbols that need
binding are in the GOT, but pointer relocations where there too. Hence, the
code needs to explicitly check if a symbol is a private extern.

Fixes https://crbug.com/1242638, which has some more information in comments 14
and 15. With this patch, the output of `nm -U` on Chromium Framework after
stripping now contains just two symbols when using lld, just like with ld64.

Differential Revision: https://reviews.llvm.org/D111852

4e572db0

[amdgpu] Fix a crash case when preserving MDT in SILowerControlFlow · bacddf47

Michael Liao authored Oct 14, 2021

- When a redundant MBB is being erased from MDT, check whether its
  single successor is dominiated by it. If yes, update that successor's
  idom before erasing MBB; otherwise, it implies MBB is a leaf node and
  could be erased directly.

Reviewed By: foad

Differential Revision: https://reviews.llvm.org/D111831

bacddf47

[ubsan] Remove REQUIRED from some TestCases · e0f3a3b2

Vitaly Buka authored Oct 14, 2021

It's not obvious why they are needed, and tests pass.

Reviewed By: lebedev.ri

Differential Revision: https://reviews.llvm.org/D111859

e0f3a3b2

[clang] Pass -clear-ast-before-backend in Clang::ConstructJob() · 47eb99aa

Arthur Eubanks authored Oct 06, 2021

This clears the memory used for the Clang AST before we run LLVM passes.

https://llvm-compile-time-tracker.com/compare.php?from=d0a5f61c4f6fccec87fd5207e3fcd9502dd59854&to=b7437fee79e04464dd968e1a29185495f3590481&stat=max-rss
shows significant memory savings with no slowdown (in fact -O0 slightly speeds up).

For more background, see
https://lists.llvm.org/pipermail/cfe-dev/2021-September/068930.html.

Turn this off for the interpreter since it does codegen multiple times.

Differential Revision: https://reviews.llvm.org/D111270

47eb99aa

[SystemZ] Handle huge immediates in SystemZInstrInfo::loadImmediate(). · ccbfcfda
Jonas Paulsson authored Oct 15, 2021
```
This is needed during isel pseudo expansion in order not to crash on huge
immediates.

Review: Ulrich Weigand
```
ccbfcfda
[clang] Use llvm::is_contained (NFC) · 6a154e60
Kazu Hirata authored Oct 15, 2021

6a154e60

NFC: Remove wayward MIR tests from lib/Target · 59b94c4a

Jessica Paquette authored Oct 15, 2021

These were put in lib/Target instead of tests.

Thankfully dupes of them already existed in the tests directory.

So, just delete them.

59b94c4a