Commits · 628f5c9da29b64777b96cb6787c06b14d288a792 · Lorenzo Albano / LLVM bpEVL

Mar 19, 2021

[mlir] Add a roundtrip test for 'linalg.tiled_loop' on buffers. · 628f5c9d
Alexander Belyaev authored Mar 18, 2021
```
https://llvm.discourse.group/t/rfc-add-linalg-tileop/2833

Differential Revision: https://reviews.llvm.org/D98900
```
628f5c9d

[mlir] Remove ConvertKernelFuncToBlob · 74ffe8dc

Christian Sigg authored Mar 19, 2021

All users have been converted to gpu::SerializeToBlobPass.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D98928

74ffe8dc

[NVPTX] Fix warning, remove extra ";" [NFC] · 6d22ba48

Mikael Holmen authored Mar 19, 2021

gcc complained with
../lib/Target/NVPTX/NVPTXLowerArgs.cpp:203:2: warning: extra ';' [-Wpedantic]
  203 | };
      |  ^

6d22ba48

[InstCombine] Add unit test with @llvm.annotation. · 926cca96
Clement Courbet authored Mar 19, 2021
```
In preparation for https://reviews.llvm.org/D98925
```
926cca96

[lit] Pass the USERPROFILE variable through on Windows · 9de63b2e

Martin Storsjö authored Mar 18, 2021

When running in a Windows Container, the Git for Windows Unix tools
(C:\Program Files\Git\usr\bin) just hang if this variable isn't
passed through.

Currently, running the LLVM/clang tests in a Windows Container fails
if that directory is added to the path, but succeeds after this change.
(After this change, the previously used GnuWin tools can be left out
entirely, too, as lit automatically picks up the Git for Windows tools
if necessary.)

Differential Revision: https://reviews.llvm.org/D98858

9de63b2e

[libcxx] [test] Explicitly check that some env vars are ignored in the temp_dir_path test · c9fc1a97
Martin Storsjö authored Mar 16, 2021
```
This was suggested in the review of D98139.

Differential Revision: https://reviews.llvm.org/D98696
```
c9fc1a97

[lit] Handle plain negations directly in the internal shell · d09adfd3

Martin Storsjö authored Mar 18, 2021

Keep running "not --crash" via the external "not" executable, but
for plain negations, and for cases that use the shell "!" operator,
just skip that argument and invert the return code.

The libcxx tests only use the shell operator "!" for negations,
never the "not" executable, because libcxx tests can be run without
having a fully built llvm tree available providing the "not"
executable.

This allows using the internal shell for libcxx tests.

Differential Revision: https://reviews.llvm.org/D98859

d09adfd3

[Test] Precommit one more test · a1d6c652
Max Kazantsev authored Mar 19, 2021

a1d6c652

[mlir] Remove mlir-rocm-runner · a825fb2c

Christian Sigg authored Mar 19, 2021

This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396.

I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though.

Reviewed By: whchung

Differential Revision: https://reviews.llvm.org/D98447

a825fb2c

[Test] Precommit test · 4ee4f9bf
Max Kazantsev authored Mar 19, 2021

4ee4f9bf
[NFC] Move function up in code · 8eefa07f
Max Kazantsev authored Mar 19, 2021

8eefa07f
[NFC] Factor out utility function for finding common dom of user set · 8bb952b5
Max Kazantsev authored Mar 19, 2021

8bb952b5
Revert "[WoA][MSVC] Use default linker setting in MSVC-compatible driver" · ce97d8e6
Petr Hosek authored Mar 18, 2021
```
This reverts commit ace56d41 which
broke builders that set CLANG_DEFAULT_LINKER.
```
ce97d8e6
[X86] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds · c241659d
Fangrui Song authored Mar 18, 2021

c241659d

[mlir] Support use-def cycles in graph regions during regionDCE · f178c13f

Andrew Young authored Mar 18, 2021

When deleting operations in DCE, the algorithm uses a post-order walk of
the IR to ensure that value uses were erased before value defs. Graph
regions do not have the same structural invariants as SSA CFG, and this
post order walk could delete value defs before uses.  This problem is
guaranteed to occur when there is a cycle in the use-def graph.

This change stops DCE from visiting the operations and blocks in any
meaningful order.  Instead, we rely on explicitly dropping all uses of a
value before deleting it.

Reviewed By: mehdi_amini, rriddle

Differential Revision: https://reviews.llvm.org/D98919

f178c13f

[mlir] Fix Python bindings tests failure in Debug mode after D98474 · 270a336f

Vladislav Vinogradov authored Mar 19, 2021

Add extra `type.isa<FloatType>()` check to `FloatAttr::get(Type, double)` method.
Otherwise it tries to call `type.cast<FloatType>()`, which fails with assertion in Debug mode.

The `!type.isa<FloatType>()` case just redirercts the call to `FloatAttr::get(Type, APFloat)`,
which will perform the actual check and emit appropriate error.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D98764

270a336f

[IndVars] Provide eliminateIVComparison with context · 16370e02

Max Kazantsev authored Mar 19, 2021

We can prove more predicates when we have a context when eliminating ICmp.
As first (and very obvious) approximation we can use the ICmp instruction itself,
though in the future we are going to use a common dominator of all its users.
Need some refactoring before that.

Observed ~0.5% negative compile time impact.

Differential Revision: https://reviews.llvm.org/D98697
Reviewed By: lebedev.ri

16370e02

[UniqueLinkageName] Use consistent checks when mangling symbo linkage name and debug linkage name. · fc1812a0

Hongtao Yu authored Mar 17, 2021

C functions may be declared and defined in different prototypes like below. This patch unifies the checks for mangling names in symbol linkage name emission and debug linkage name emission so that the two names are consistent.

static int go(int);

static int go(a) int a;
{
  return a;
}

Test Plan:

Differential Revision: https://reviews.llvm.org/D98799

fc1812a0

[CSSPGO] Add attribute metadata for context profile · 1410db70

Wenlei He authored Feb 19, 2021

This changes adds attribute field for metadata of context profile. Currently we have an inline attribute that indicates whether the leaf frame corresponding to a context profile was inlined in previous build.

This will be used to help estimating inlining and be taken into account when trimming context. Changes for that in llvm-profgen will follow. It will also help tuning.

Differential Revision: https://reviews.llvm.org/D98823

1410db70

[SCEV] Add false->any implication · fff1363b

Max Kazantsev authored Mar 19, 2021

By definition of Implication operator, `false -> true` and `false -> false`. It means that
`false` implies any predicate, no matter true or false. We don't need to go any further
trying to prove the statement we need and just always say that `false` implies it in this case.

In practice it means that we are trying to prove something guarded by `false` condition,
which means that this code is unreachable, and we can safely prove any fact or perform any
transform in this code.

Differential Revision: https://reviews.llvm.org/D98706
Reviewed By: lebedev.ri

fff1363b

Fix example in documentation. · d8ab7ad3
Richard Smith authored Mar 18, 2021

d8ab7ad3
Improve documentation for the [[clang::lifetimebound]] attribute. · 5c689e4b
Richard Smith authored Mar 18, 2021

5c689e4b

Don't assume that stepping out of a function will land on the next line. · 71c4da83

Jim Ingham authored Mar 18, 2021

For instance, some recent clang emits this code on x86_64:

    0x100002b99 <+57>: callq  0x100002b40               ; step_out_of_here at main.cpp:11
->  0x100002b9e <+62>: xorl   %eax, %eax
    0x100002ba0 <+64>: popq   %rbp
    0x100002ba1 <+65>: retq

and the "xorl %eax, %eax" is attributed to the same line as the callq.  Since
step out is supposed to stop just on returning from the function, you can't guarantee
it will end up on the next line.  I changed the test to check that we were either
on the call line or on the next line, since either would be right depending on the
debug information.

71c4da83

Add a couple of missing attribute query methods [NFC] · fa26da05
Philip Reames authored Mar 18, 2021

fa26da05

[WebAssembly] Remove experimental instructions from wasm_simd128.h · cbab2cd6

Thomas Lively authored Mar 18, 2021

These experimental builtin functions and the feature macro they were gated
behind have been removed.

Reviewed By: aheejin

Differential Revision: https://reviews.llvm.org/D98907

cbab2cd6

[RISCV] Spilling for Zvlsseg registers. · aa8d33a6

Hsiangkai Wang authored Mar 15, 2021

For Zvlsseg, we create several tuple register classes. When spilling for
these tuple register classes, we need to iterate NF times to load/store
these tuple registers.

Differential Revision: https://reviews.llvm.org/D98629

aa8d33a6

[SanitizerCoverage] Make __start_/__stop_ symbols extern_weak · 9558456b

Fangrui Song authored Mar 18, 2021

On ELF, we place the metadata sections (`__sancov_guards`, `__sancov_cntrs`,
`__sancov_bools`, `__sancov_pcs` in section groups (either `comdat any` or
`comdat noduplicates`).

With `--gc-sections`, LLD since D96753 and GNU ld `-z start-stop-gc` may garbage
collect such sections. If all `__sancov_bools` are discarded, LLD will error
`error: undefined hidden symbol: __start___sancov_cntrs` (other sections are similar).

```
% cat a.c
void discarded() {}
% clang -fsanitize-coverage=func,trace-pc-guard -fpic -fvisibility=hidden a.c -shared -fuse-ld=lld -Wl,--gc-sections
...
ld.lld: error: undefined hidden symbol: __start___sancov_guards
>>> referenced by a.c
>>>               /tmp/a-456662.o:(sancov.module_ctor_trace_pc_guard)
```

Use the `extern_weak` linkage (lowered to undefined weak symbols) to avoid the
undefined error.

Differential Revision: https://reviews.llvm.org/D98903

9558456b

[RISCV] Correct the output chain in lowerFixedLengthVectorMaskedLoadToRVV · c9861f72

Craig Topper authored Mar 18, 2021

We returned the input chain instead of the output chain from the
new load. This bypasses the load in the chain. I haven't found a
good way to test this yet. IR order prevents my initial attempts
at causing reordering.

c9861f72

[dfsan] Add -dfsan-fast-8-labels flag · d10f173f

George Balatsouras authored Mar 16, 2021

This is only adding support to the dfsan instrumentation pass but not
to the runtime.

Added more RUN lines for testing: for each instrumentation test that
had a -dfsan-fast-16-labels invocation, a new invocation was added
using fast8.

Reviewed By: stephan.yichao.zhao

Differential Revision: https://reviews.llvm.org/D98734

d10f173f

[mlir][tosa] Add lowering for tosa.rescale to linalg.generic · 286a9d46

Rob Suderman authored Mar 18, 2021

This adds a tosa.apply_scale operation that handles the scaling operation
common to quantized operatons. This scalar operation is lowered
in TosaToStandard.

We use a separate ApplyScale factorization as this is a replicable pattern
within TOSA. ApplyScale can be reused within pool/convolution/mul/matmul
for their quantized variants.

Tests are added to both tosa-to-standard and tosa-to-linalg-on-tensors
that verify each pass is correct.

Reviewed By: silvas

Differential Revision: https://reviews.llvm.org/D98753

286a9d46

Recommit "[AArch64][GlobalISel] Fold constants into G_GLOBAL_VALUE" · 0ca83730

Jessica Paquette authored Mar 18, 2021

This reverts commit 962b73dd.

This commit was reverted because of some internal SPEC test failures.

It turns out that this wasn't actually relevant to anything in open source, so
it's safe to recommit this.

0ca83730

Mar 18, 2021

[mlir][tosa] Add tosa.concat to subtensor inserts lowering · 5627564f

Rob Suderman authored Mar 17, 2021

Includes lowering for tosa.concat with indice computation with subtensor insert
operations. Includes tests along two different indices.

Differential Revision: https://reviews.llvm.org/D98813

5627564f

Fix test case in b4a8c0eb · 80df56f7
Yuanfang Chen authored Mar 18, 2021

80df56f7

[DAGCombiner][RISCV] Teach visitMGATHER/MSCATTER to remove gather/scatters... · 182b831a

Craig Topper authored Mar 18, 2021

[DAGCombiner][RISCV] Teach visitMGATHER/MSCATTER to remove gather/scatters with all zeros masks that use SPLAT_VECTOR.

Previously only all zeros BUILD_VECTOR was recognized.

182b831a

[LTO][MC] Discard non-prevailing defined symbols in module-level assembly · b4a8c0eb

Yuanfang Chen authored Mar 18, 2021

This is the alternative approach to D96931.

In LTO, for each module with inlineasm block, prepend directive ".lto_discard <sym>, <sym>*" to the beginning of the inline
asm. ".lto_discard" is both a module inlineasm block marker and (optionally) provides a list of symbols to be discarded.

In MC while emitting for inlineasm, discard symbol binding & symbol
definitions according to ".lto_disard".

Reviewed By: MaskRay

Differential Revision: https://reviews.llvm.org/D98762

b4a8c0eb

[OpenMP] Fixed a crash in hidden helper thread · 2df65f87

Shilei Tian authored Mar 18, 2021

It is reported that after enabling hidden helper thread, the program
can hit the assertion `new_gtid < __kmp_threads_capacity` sometimes. The root
cause is explained as follows. Let's say the default `__kmp_threads_capacity` is
`N`. If hidden helper thread is enabled, `__kmp_threads_capacity` will be offset
to `N+8` by default. If the number of threads we need exceeds `N+8`, e.g. via
`num_threads` clause, we need to expand `__kmp_threads`. In
`__kmp_expand_threads`, the expansion starts from `__kmp_threads_capacity`, and
repeatedly doubling it until the new capacity meets the requirement. Let's
assume the new requirement is `Y`.  If `Y` happens to meet the constraint
`(N+8)*2^X=Y` where `X` is the number of iterations, the new capacity is not
enough because we have 8 slots for hidden helper threads.

Here is an example.
```
#include <vector>

int main(int argc, char *argv[]) {
  constexpr const size_t N = 1344;
  std::vector<int> data(N);

#pragma omp parallel for
  for (unsigned i = 0; i < N; ++i) {
    data[i] = i;
  }

#pragma omp parallel for num_threads(N)
  for (unsigned i = 0; i < N; ++i) {
    data[i] += i;
  }

  return 0;
}
```
My CPU is 20C40T, then `__kmp_threads_capacity` is 160. After offset,
`__kmp_threads_capacity` becomes 168. `1344 = (160+8)*2^3`, then the assertions
hit.

Reviewed By: protze.joachim

Differential Revision: https://reviews.llvm.org/D98838

2df65f87

[SelectionDAG] Don't pass a scalable vector to MachinePointerInfo::getWithOffset in a unit test. · 305a0bad

Craig Topper authored Mar 18, 2021

Suppresses an implicit TypeSize to uint64_t conversion warning.

We might be able to just not offset it since we're writing to a
Fixed stack object, but I wasn't sure so I just did what
DAGTypeLegalizer::IncrementPointer does.

Reviewed By: sdesmalen

Differential Revision: https://reviews.llvm.org/D98736

305a0bad

[lli] Add Orc greedy mode as -jit-kind=orc · e1579894

Stefan Gränitz authored Mar 18, 2021

In the existing OrcLazy mode, modules go through partitioning and outgoing calls are replaced by reexport stubs that resolve on call-through. In greedy mode that this patch unlocks for lli, modules materialize as a whole and trigger materialization for all required symbols recursively. This is useful for testing (e.g. D98785) and it's more similar to the way MCJIT works.

e1579894

[mlir] Fix build failure due to 1a572f45 · 44f24f39
thomasraoux authored Mar 18, 2021

44f24f39
[AMDGPU] Remove cpol, tfe, and swz from MUBUF patterns · edd6da10
Stanislav Mekhanoshin authored Mar 15, 2021
```
These are always selected as 0 anyway.

Differential Revision: https://reviews.llvm.org/D98663
```
edd6da10