Commits · 52b0db22f8cfb594c32389224570681d2d2c2f21 · Lorenzo Albano / LLVM bpEVL

Jun 17, 2020

[InlineCost] PrinterPass prints constants to which instructions are simplified · 52b0db22

Kirill Naumov authored Jun 02, 2020

This patch enables printing of constants to see which instructions were
constant-folded. Needed for tests and better visual analysis of the
inliner's work.

Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev

Reviewed By: mtrofin

Differential Revision: https://reviews.llvm.org/D81024

52b0db22

[InlineCost] InlineCostAnnotationWriterPass introduced · 37e06e8f

Kirill Naumov authored Jun 11, 2020

This class makes it possible to see the inliner's decisions, for better
verification and testing of optimizations. To use it, pass the flag
"-passes="print<inline-cost>"".

Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev

Reviewed By: mtrofin

Differential revision: https://reviews.llvm.org/D81743

37e06e8f

[clang-tidy] warnings-as-error no longer exits with ErrorCount · ccd12700

Nathan James authored Jun 17, 2020

When using `-warnings-as-errors`, if any warnings are promoted to errors, clang-tidy exits with the number of warnings. This really isn't needed and can cause issues when the number of warnings doesn't fit into 8 bits, as POSIX terminals aren't designed to handle more than that.
This addresses https://bugs.llvm.org/show_bug.cgi?id=46305.
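
To illustrate why the warning count makes a poor exit code, here is a minimal sketch. It assumes the usual POSIX behaviour where a parent process only observes the low 8 bits of a child's exit value; the helper names `observedExitStatus` and `looksLikeSuccess` are hypothetical and are not part of clang-tidy.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: models the status a POSIX parent observes, which
// keeps only the low 8 bits of the value the child passed to exit().
inline std::uint8_t observedExitStatus(int exitValue) {
  return static_cast<std::uint8_t>(static_cast<unsigned>(exitValue) & 0xFFu);
}

// Returning the raw error count is lossy: any multiple of 256 promoted
// warnings would be reported to the caller as exit status 0.
inline bool looksLikeSuccess(int errorCount) {
  assert(errorCount >= 0 && "an error count cannot be negative");
  return observedExitStatus(errorCount) == 0;
}
```

Under this assumption, `looksLikeSuccess(256)` is true even though 256 errors were reported, which is the kind of issue the commit avoids by no longer exiting with the count.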

Bug originally added in D15528

Reviewed By: aaron.ballman

Differential Revision: https://reviews.llvm.org/D81953

ccd12700

Revert "GlobalISel: Make LLT constructors constexpr" · 81cbe0ca

Hans Wennborg authored Jun 17, 2020

This reverts commit 5a95be22.

It causes GCC 5.3 to segfault:

In file included from /work/llvm.monorepo/llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp:357:0: lib/Target/AArch64/AArch64GenGlobalISel.inc:189:17: in constexpr expansion of ‘llvm::LLT::scalar(16u)’
lib/Target/AArch64/AArch64GenGlobalISel.inc:205:1: internal compiler error: Segmentation fault

81cbe0ca

[OPENMP]Fix overflow during counting the number of iterations. · 08029595

Alexey Bataev authored Jun 04, 2020

Summary:
OpenMP loops are normalized and transformed into loops from 0 to the
maximum number of iterations. In some cases, the original scheme may
overflow while calculating the number of iterations. If it is unknown
whether overflow can occur (the bounds are not constant, so we cannot
determine if there is an overflow), cast the original type to the
unsigned type.
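
As a rough illustration of the idea (not clang's actual codegen), the sketch below counts the iterations of `for (int32_t i = lb; i < ub; i += step)` in unsigned arithmetic. The helper name `tripCount`, the fixed 32-bit types, and the restriction to positive steps are assumptions made for the example.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: iteration count of
//   for (int32_t i = lb; i < ub; i += step)
// computed in unsigned arithmetic so that "ub - lb" cannot overflow.
inline std::uint32_t tripCount(std::int32_t lb, std::int32_t ub,
                               std::int32_t step) {
  assert(step > 0 && "this sketch only handles positive steps");
  if (lb >= ub)
    return 0;
  // Unsigned subtraction wraps modulo 2^32, which gives the exact distance
  // whenever lb < ub, even for the full range INT32_MIN..INT32_MAX.
  const std::uint32_t distance =
      static_cast<std::uint32_t>(ub) - static_cast<std::uint32_t>(lb);
  const std::uint32_t ustep = static_cast<std::uint32_t>(step);
  // Round up without forming distance + step - 1, which could itself wrap.
  return distance / ustep + (distance % ustep != 0 ? 1u : 0u);
}
```

With signed `int32_t` arithmetic, `INT32_MAX - INT32_MIN` would overflow; in the unsigned form the same subtraction yields 4294967295, the true distance.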

Reviewers: jdoerfert

Subscribers: yaxunl, guansong, sstefan1, openmp-commits, cfe-commits, caomhin

Tags: #clang, #openmp

Differential Revision: https://reviews.llvm.org/D81881

08029595

[OPENMP50]Codegen for scan directive in for simd regions. · 34ee2549

Alexey Bataev authored Jun 16, 2020

Summary:
Added codegen for scan directives in for simd regions.

Emits the code for the directive with inscan reductions.
Original code:
```
 #pragma omp for simd reduction(inscan, op : ...)
for(...) {
  <input phase>;
  #pragma omp scan (in)exclusive(...)
  <scan phase>
}
```
is transformed to something like:
```
size num_iters = <num_iters>;
<type> buffer[num_iters];
 #pragma omp for simd
for (i: 0..<num_iters>) {
  <input phase>;
  buffer[i] = red;
}
 #pragma omp barrier
for (int k = 0; k != ceil(log2(num_iters)); ++k)
for (size cnt = last_iter; cnt >= pow(2, k); --cnt)
  buffer[cnt] op= buffer[cnt-pow(2,k)];
 #pragma omp for simd
for (i: 0..<num_iters>) {
  red = InclusiveScan ? buffer[i] : buffer[i-1];
  <scan phase>;
}
```
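
The middle step of that scheme, the log-step combination of `buffer`, can be sketched serially as follows. This is only an illustration under stated assumptions: the reduction op is fixed to `+`, the helper names `inclusiveScanInPlace` and `scanPhaseInput` are hypothetical, and the exclusive case at index 0 returns the identity 0 (the pseudocode above does not spell that case out).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: in-place inclusive scan with "+" as the reduction op.
inline void inclusiveScanInPlace(std::vector<int> &buffer) {
  const std::size_t n = buffer.size();
  // Pass k combines each element with the one 2^k positions before it, so
  // ceil(log2(n)) passes suffice; "stride" plays the role of pow(2, k).
  for (std::size_t stride = 1; stride < n; stride *= 2) {
    // Walk from the last element down so that buffer[cnt - stride] still
    // holds the value from the previous pass when it is read.
    for (std::size_t cnt = n - 1; cnt >= stride; --cnt)
      buffer[cnt] += buffer[cnt - stride];
  }
}

// Value of the reduction variable seen by the scan phase of iteration i.
// Assumption: for an exclusive scan, iteration 0 sees the identity (0).
inline int scanPhaseInput(const std::vector<int> &buffer, std::size_t i,
                          bool inclusive) {
  assert(i < buffer.size());
  if (inclusive)
    return buffer[i];
  return i == 0 ? 0 : buffer[i - 1];
}
```

Iterating `cnt` downwards is what makes the in-place update safe: each pass reads only positions that have not yet been overwritten in that pass.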

Reviewers: jdoerfert

Reviewed By: jdoerfert

Subscribers: yaxunl, guansong, sstefan1, cfe-commits, caomhin

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81658

34ee2549

[SCCP] Add a few more additional sext tests (NFC). · 6aae8ef1
Florian Hahn authored Jun 17, 2020

6aae8ef1
Remove global std::strings. NFCI. · df9a51da
Benjamin Kramer authored Jun 17, 2020

df9a51da

Follow up of rGe345d547a0d5, and attempt to pacify buildbot: · c1034d04

Sjoerd Meijer authored Jun 17, 2020

"error: 'get' is deprecated: The base class version of get with the scalable
argument defaulted to false is deprecated."

Changed VectorType::get() -> FixedVectorType::get().

c1034d04

Recommit "[LV] Emit @llvm.get.active.lane.mask for tail-folded loops" · e345d547
Sjoerd Meijer authored Jun 17, 2020
```
Fixed ARM regression test.

Please see the original commit message rG47650451738c for details.
```
e345d547

[SYCL][OpenMP] Implement thread-local storage restriction · 0bdcd95b

Mariya Podchishchaeva authored Jun 17, 2020

Summary:
SYCL and OpenMP prohibit thread-local storage in device code,
so this commit ensures that an error is emitted for device code and not
emitted for host code when the host target supports it.

Reviewers: jdoerfert, erichkeane, bader

Reviewed By: jdoerfert, erichkeane

Subscribers: guansong, riccibruno, ABataev, yaxunl, ebevhan, Anastasia, sstefan1, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81641

0bdcd95b

[LSR] Filter for postinc formulae · 076e08aa

David Green authored May 29, 2020

In more complicated loops we can easily hit the complexity limits of
loop strength reduction. If we do and filtering occurs, it's all too
easy to remove the wrong formulae for post-inc preferring accesses due
to it attempting to maximise register re-use. The patch adds an
alternative filtering step when the target is preferring postinc to pick
postinc formulae instead, hopefully lowering the complexity to below the
limit so that aggressive filtering is not needed.

There is also a change in here to stop considering existing addrecs as
free under postinc. We should already be modelling them as a reg so
don't want it to cause us to get the cost wrong. (I'm not sure that code
makes sense in general, but there are X86 tests specifically for it
where it seems to be helping so have left it around for the standard
non-post-inc case).

Differential Revision: https://reviews.llvm.org/D80273

076e08aa

[llvm-readobj] - Do not crash when GnuHashTable->symndx is greater than the dynamic symbols count. · 88c8581d

Georgii Rymar authored Jun 16, 2020

`Elf_GnuHash_Impl` has the following method:

```
ArrayRef<Elf_Word> values(unsigned DynamicSymCount) const {
  return ArrayRef<Elf_Word>(buckets().end(), DynamicSymCount - symndx);
}
```

When `DynamicSymCount` is less than `symndx`, we return an array with a huge, invalid size.
This patch fixes the issue and adds an assert. The assert helped to fix an issue
in one of the test cases.
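
The wrap-around can be reproduced in isolation. The sketch below is not the llvm-readobj code: `uncheckedValueCount` and `gnuHashValueCount` are hypothetical stand-ins for the size computation in `values()`, and fixed 32-bit unsigned types are assumed for both operands.

```cpp
#include <cassert>
#include <cstdint>

// What the unchecked expression evaluates to: unsigned subtraction wraps
// modulo 2^32 when dynamicSymCount < symndx.
inline std::uint32_t uncheckedValueCount(std::uint32_t dynamicSymCount,
                                         std::uint32_t symndx) {
  return dynamicSymCount - symndx;
}

// Hypothetical checked variant: reports failure instead of producing a
// wrapped-around length, leaving "count" untouched in that case.
inline bool gnuHashValueCount(std::uint32_t dynamicSymCount,
                              std::uint32_t symndx, std::uint32_t &count) {
  if (dynamicSymCount < symndx)
    return false;
  count = dynamicSymCount - symndx;
  return true;
}
```

For example, 3 dynamic symbols with `symndx` equal to 5 gives an unchecked length of 4294967294 elements, whereas the checked variant rejects the input.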

Differential revision: https://reviews.llvm.org/D81937

88c8581d

[llvm-readobj] - Split the printGnuHashTable(). NFCI. · e8299a80

Georgii Rymar authored Jun 16, 2020

`printGnuHashTable` contains the code to check the GNU hash table.
This patch splits it out into a `getGnuHashTableChains` helper
(and reorders slightly to reduce).

Differential revision: https://reviews.llvm.org/D81928

e8299a80

[AMDGPU] Fix failure in VCC spilling · ac8a2f13

Carl Ritson authored Jun 17, 2020

Spills of VCC (SGPR64) will fail with the new SGPR spill code,
because the super register is not correctly resolved.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D81224

ac8a2f13

[CallPrinter] Remove static constructor. · 547b6da7
Benjamin Kramer authored Jun 17, 2020
```
No need to have std::string here. NFC.
```
547b6da7
[SCCP] Precommit some sext tests (NFC). · b1130c4f
Florian Hahn authored Jun 12, 2020

b1130c4f
[lldb] Remove xfail aarch64/linux from TestBuiltinTrap.py · e29b3151
Muhammad Omair Javaid authored Jun 17, 2020
```
The underlying clang bug seems to have been fixed, and the test is
consistently passing on the aarch64-linux buildbot.
```
e29b3151

Return "[InstCombine] Simplify compare of Phi with constant inputs against a constant" · 5bf0858c

Sam Parker authored Jun 17, 2020

I originally reverted the patch because it was causing performance
issues, but now I think it's just enabling simplify-cfg to do
something that I don't want instead :)

Sorry for the noise.

This reverts commit 3e39760f.

5bf0858c

[NFC] Run clang-format on clang/test/OpenMP/nvptx_target_codegen.cpp · 93cd4115
Alexey Bader authored Jun 17, 2020

93cd4115

[FileCheck] Implement * and / operators for ExpressionValue. · 95db1e7f

Paul Walker authored Jun 01, 2020

Subscribers: arichardson, hiraditya, thopre, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80915

95db1e7f

[IR] Don't copy profile metadata in createCallMatchingInvoke() · 16ad6eeb

Hans Wennborg authored Jun 17, 2020

The invoke instruction can have profile metadata with branch_weights,
which does not make sense for a call instruction and will be
rejected by the verifier.

Differential revision: https://reviews.llvm.org/D81996

16ad6eeb

Fix LoopIdiomRecognize pass return status · 1cafd8a5

serge-sans-paille authored Jun 04, 2020

Introduce a helper class to aggregate the cleanup in case of rollback.

Differential Revision: https://reviews.llvm.org/D81230

1cafd8a5

Revert "[LV] Emit @llvm.get.active.mask for tail-folded loops" · d4e183f6
Sjoerd Meijer authored Jun 17, 2020
```
This reverts commit 47650451
while I investigate the build bot failures.
```
d4e183f6
[NFC] Add API for edge domination check in dom tree · 4ac9a690
Max Kazantsev authored Jun 17, 2020

4ac9a690
[SCCP] Move common code to simplify basic block to helper (NFC). · 773353be
Florian Hahn authored Jun 17, 2020
```
Reviewers: efriedma, davide

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D81755
```
773353be

[LV] Emit @llvm.get.active.mask for tail-folded loops · 47650451

Sjoerd Meijer authored Jun 10, 2020

This emits new IR intrinsic @llvm.get.active.mask for tail-folded vectorised
loops if the intrinsic is supported by the backend, which is checked by
querying TargetTransform hook emitGetActiveLaneMask.

This intrinsic creates a mask representing active and inactive vector lanes,
which is used by the masked load/store instructions that are created for
tail-folded loops. The semantics of @llvm.get.active.mask are described here in
LangRef:

https://llvm.org/docs/LangRef.html#llvm-get-active-lane-mask-intrinsics

This intrinsic is also used to provide a hint to the backend. That is, the
second argument of the intrinsic represents the back-edge taken count of the
loop. For MVE, for example, we use that to set up tail-predication, which is a
new form of predication in MVE for vector loops that implicitly predicates the
last vector loop iteration by implicitly setting active/inactive lanes, i.e.
the tail loop is predicated. In order to set up a tail-predicated vector loop,
we need to know the number of data elements processed by the vector loop, which
corresponds to the trip count of the scalar loop, which we can now reconstruct
using @llvm.get.active.mask.

Differential Revision: https://reviews.llvm.org/D79100

47650451

[TTI] Refactor emitGetActiveLaneMask · 20835cff

Sjoerd Meijer authored Jun 09, 2020

Refactor TTI hook emitGetActiveLaneMask and remove the unused arguments
as suggested in D79100.

20835cff

[CallPrinter] Handle freq = 0 case · 3847737f

Kirill Bobyrev authored Jun 17, 2020

Improvement of the following revision:
bbc629eb

This might still be problematic if freq = 0, so it's better to check for
that.

3847737f

[CallPrinter] Fix maxFreq = 0 case · bbc629eb

Kirill Bobyrev authored Jun 17, 2020

llvm::getHeatColor becomes a problem when maxFreq = 0 -> freq = 0 =>
log2(double(freq)) / log2(maxFreq) -> log2(0.) / log2(0.) which
results in illegal instruction on some architectures.
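
A minimal sketch of the arithmetic, assuming IEEE 754 floating point: `log2(0.)` is negative infinity, so the ratio becomes NaN, and converting NaN to an integer afterwards is undefined behaviour in C++. The helpers `rawHeatRatio` and `guardedHeatRatio` are hypothetical and are not LLVM's `getHeatColor`; the extra `maxFreq <= 1` guard is an assumption of this sketch, added because `log2(1)` is 0 and would also make the denominator zero.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Unguarded ratio as written in the commit message.
inline double rawHeatRatio(std::uint64_t freq, std::uint64_t maxFreq) {
  return std::log2(double(freq)) / std::log2(double(maxFreq));
}

// Hypothetical guarded variant: returns 0 for the degenerate inputs
// instead of dividing infinities or dividing by zero.
inline double guardedHeatRatio(std::uint64_t freq, std::uint64_t maxFreq) {
  assert(freq <= maxFreq && "freq is expected to be bounded by maxFreq");
  if (freq == 0 || maxFreq <= 1)
    return 0.0;
  return std::log2(double(freq)) / std::log2(double(maxFreq));
}
```

Checking the inputs before taking logarithms keeps the result inside [0, 1], so a later conversion to a colour index stays well defined.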

Problematic revision: https://reviews.llvm.org/D77172

bbc629eb

[SveEmitter] Add builtins for svtbl2 · e51c1d06

Sander de Smalen authored Jun 16, 2020

Reviewers: david-arm, efriedma, c-rhodes

Reviewed By: c-rhodes

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81462

e51c1d06

[clangd] Depend on llvm-config for lit tests · af3d8245
Kadir Cetinkaya authored Jun 17, 2020

af3d8245

[MemDep] Also remove load instructions from NonLocalDesCache. · e4b58ea8

Florian Hahn authored Jun 17, 2020

Currently, load instructions are added to the cache for invariant pointer
group dependencies, but only pointer values are removed. That
leads to dangling AssertingVHs in the test case below, where we delete a
load from an invariant pointer group. We should also remove the entries
from the cache.

Fixes PR46054.

Reviewers: efriedma, hfinkel, asbirlea

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D81726

e4b58ea8

Use explicitly unsigned zero to prevent a warning · 8bc8d2d6
Serge Pavlov authored Jun 17, 2020

8bc8d2d6
[Test] Add missing opportunity for replacement of select with Phi · 9465dd5d
Max Kazantsev authored Jun 17, 2020

9465dd5d

[DebugInfo] Unify Cursor usage for all debug line opcodes · b21794a9

James Henderson authored Jun 10, 2020

This is a natural extension of the previous changes to use the Cursor
class independently in the standard and extended opcode paths, and in
turn allows delaying error handling until the entire line has been
printed in verbose mode, removing interleaved output in some cases.

Reviewed by: MaskRay, JDevlieghere

Differential Revision: https://reviews.llvm.org/D81562

b21794a9

[gn build] Port 6754a0e2 · d1b4e6a0
LLVM GN Syncbot authored Jun 17, 2020

d1b4e6a0

[SafeStack,NFC] Fix names after files move · d812efb1

Vitaly Buka authored Jun 15, 2020

Summary: Depends on D81831.

Reviewers: eugenis, pcc

Reviewed By: eugenis

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81832

d812efb1

[SafeStack,NFC] Move SafeStackColoring code · 6754a0e2

Vitaly Buka authored Jun 15, 2020

Summary:
This code is going to be used in StackSafety.
This patch is file move with minimal changes. Identifiers
will be fixed in the followup patch.

Reviewers: eugenis, pcc

Reviewed By: eugenis

Subscribers: mgorny, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81831

6754a0e2

[SystemZ] Bugfix in storeLoadCanUseBlockBinary(). · d3f7448e

Jonas Paulsson authored Jun 11, 2020

Check that the MemoryVT of LoadA matches that of LoadB.

This fixes https://bugs.llvm.org/show_bug.cgi?id=46239.

Review: Ulrich Weigand

Differential Revision: https://reviews.llvm.org/D81671

d3f7448e