Commits · 6e78a3086a7f563cc55d2ba83a8697b3320857fb · Lorenzo Albano / LLVM bpEVL

Jun 16, 2020

[OPENMP50]Codegen for scan directive in for simd regions. · 6e78a308

Alexey Bataev authored Jun 11, 2020

Summary:
Added codegen for scan directives in for simd regions.

Emits the code for the directive with inscan reductions.
Original code:
```
 #pragma omp for simd reduction(inscan, op : ...)
for(...) {
  <input phase>;
  #pragma omp scan (in)exclusive(...)
  <scan phase>
}
```
is transformed to something like:
```
size num_iters = <num_iters>;
<type> buffer[num_iters];
 #pragma omp for simd
for (i: 0..<num_iters>) {
  <input phase>;
  buffer[i] = red;
}
 #pragma omp barrier
for (int k = 0; k != ceil(log2(num_iters)); ++k)
for (size cnt = last_iter; cnt >= pow(2, k); --cnt)
  buffer[cnt] op= buffer[cnt-pow(2,k)];
 #pragma omp for simd
for (i: 0..<num_iters>) {
  red = InclusiveScan ? buffer[i] : buffer[i-1];
  <scan phase>;
}
```
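
A minimal serial sketch of the buffer-based scan described above, assuming `+` as the reduction operation and `0` as its identity. The function name `scan_with_buffer` is hypothetical, and the handling of the first iteration of an exclusive scan (using the identity value) is an assumption that the pseudocode does not spell out. It models only the data flow, not the OpenMP worksharing or the code Clang actually emits.

```cpp
#include <cstddef>
#include <vector>

// Serial model of the transformation: fill a buffer in the input phase,
// combine it in ceil(log2(n)) passes, then read it back in the scan phase.
std::vector<long> scan_with_buffer(const std::vector<long>& input, bool inclusive) {
  const std::size_t n = input.size();
  std::vector<long> buffer(n);

  // Input phase: every iteration stores its private reduction value.
  for (std::size_t i = 0; i < n; ++i)
    buffer[i] = input[i];

  // Combination passes: step takes the values pow(2, k) for k = 0, 1, ...
  // Walking cnt downwards means buffer[cnt - step] still holds the value
  // from the previous pass when it is read.
  for (std::size_t step = 1; step < n; step *= 2)
    for (std::size_t cnt = n - 1; cnt >= step; --cnt)
      buffer[cnt] += buffer[cnt - step];

  // Scan phase: an inclusive scan reads buffer[i], an exclusive scan reads
  // buffer[i-1]. Assumption: iteration 0 of an exclusive scan sees the identity.
  std::vector<long> result(n);
  for (std::size_t i = 0; i < n; ++i)
    result[i] = inclusive ? buffer[i] : (i == 0 ? 0 : buffer[i - 1]);
  return result;
}
```

For the input `{1, 2, 3, 4, 5}` the buffer becomes `{1, 3, 6, 10, 15}` after three passes, which is the inclusive prefix sum.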

Reviewers: jdoerfert

Subscribers: yaxunl, guansong, sstefan1, cfe-commits, caomhin

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81658

6e78a308

Revert "remove gold linker" · 8d4a806e
Yuanfang Chen authored Jun 16, 2020
```
This reverts commit 719c87ed.

Checked in by accident. Sorry.
```
8d4a806e
[Clang] Add a "#pragma unroll" test case for correct error reporting · 8c6c606c
Yuanfang Chen authored Jun 16, 2020
```
For PR46336.
```
8c6c606c
remove gold linker · 719c87ed
Yuanfang Chen authored Jun 10, 2020

719c87ed

[OPENMP]Fix PR46347: several ordered directives in a single region. · 3488e8c2

Alexey Bataev authored Jun 16, 2020

Summary:
According to OpenMP, during execution of an iteration of a worksharing-loop or a loop nest within a worksharing-loop, simd, or worksharing-loop SIMD region, a thread must not execute more than one ordered region corresponding to an ordered construct without a depend clause.
An error needs to be reported in this case.

Reviewers: jdoerfert

Subscribers: yaxunl, guansong, sstefan1, cfe-commits, caomhin

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81951

3488e8c2

[SVE] Eliminate calls to default-false VectorType::get() from Vectorize · ff628f5f

Christopher Tetreault authored Jun 16, 2020

Reviewers: efriedma, fhahn, spatel, sdesmalen, kmclaughlin

Reviewed By: efriedma

Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81521

ff628f5f

GlobalISel: Fix not failing on widening G_INSERT_VECTOR_ELT · e4f19d1d

Matt Arsenault authored Jun 16, 2020

This doesn't actually handle type idx 0, but was reporting Legalized
on it. No test changes because nothing was trying to use this.

e4f19d1d

[PowerPC] Add -m[no-]power10-vector clang and llvm option · 37e72f47

Ahsan Saghir authored Jun 16, 2020

Summary: This patch adds a command-line option for enabling power10-vector support.

Reviewers: hfinkel, nemanjai, lei, amyk, #powerpc

Reviewed By: lei, amyk, #powerpc

Subscribers: wuzish, kbarton, hiraditya, shchenz, cfe-commits, llvm-commits

Tags: #llvm, #clang, #powerpc

Differential Revision: https://reviews.llvm.org/D80758

37e72f47

[Matrix] Add align info to some more loads/stores (NFC). · 08f62ff8

Florian Hahn authored Jun 16, 2020

Some tests were missing alignment info. Subsequent changes properly
preserve the set alignment. Set it properly beforehand, to avoid
unnecessary test changes.

08f62ff8

[lit] Improve consistency for showing result groups · 7837de13

Julian Lettner authored Jun 16, 2020

Before this change we showed all result groups with a code that was not
explicitly hard-coded set.  This set missed the FLAKYPASS result code.

Let's generalize the code to always show failures and the additionally
requested result codes.

7837de13

Driver: Accept multiple --config options if filenames are the same · d970ab63

Tom Stellard authored Jun 16, 2020

Summary:
We're trying to use the --config options to pass distro specific
options for Fedora via the CFLAGS variable.  However, some projects
end up using the CFLAGS variable multiple times in their command line,
which leads to an error when --config is used.

This patch resolves this issue by allowing more than one --config option
on the command line as long as the file names are the same.
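
The rule can be sketched as follows. This is an illustration only, not the clang driver's code: `ConfigChoice` and `pick_config_file` are hypothetical names, only the two-token `--config <file>` spelling is handled, and file names are compared as plain strings (an assumption).

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical result type for the sketch.
struct ConfigChoice {
  bool ok = true;    // false when two different file names were given
  std::string file;  // empty when no --config option was seen
};

// Accepts repeated "--config <file>" pairs as long as every <file> is identical.
ConfigChoice pick_config_file(const std::vector<std::string>& args) {
  ConfigChoice choice;
  for (std::size_t i = 0; i + 1 < args.size(); ++i) {
    if (args[i] != "--config")
      continue;
    const std::string& name = args[++i];  // consume the option's value
    if (choice.file.empty()) {
      choice.file = name;
    } else if (choice.file != name) {
      choice.ok = false;  // conflicting config files: report an error
      return choice;
    }
  }
  return choice;
}
```

With this rule, a command line that repeats `--config fedora.cfg` because `CFLAGS` was expanded twice is accepted, while mixing two different file names is still rejected.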

Reviewers: sepavloff, hfinkel

Reviewed By: sepavloff

Subscribers: cfe-commits, llvm-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81424

d970ab63

[mlir] refactor Linalg LoopNestBuilder to use common infra · b4bc72af

Alex Zinenko authored Jun 16, 2020

Recent work has introduced support for constructing loops via `::build` with
callbacks that construct loop bodies using only the core OpBuilder. This is now
supported on all loop types that Linalg lowers to. Refactor LoopNestBuilder in
Linalg to rely on this functionality instead of using a custom EDSC-based
approach to creating loop nests.

The specialization targeting parallel loops is also simplified by factoring out
the recursive call into a separate static function and considering only two
alternatives: top-level loop is parallel or sequential.

This removes the last remaining in-tree use of edsc::LoopBuilder, which is now
deprecated and will be removed soon.

Differential Revision: https://reviews.llvm.org/D81873

b4bc72af

[mlir] Introduce callback-based builders to SCF Parallel and Reduce ops · 3adced34

Alex Zinenko authored Jun 16, 2020

Similarly to `scf::ForOp`, introduce additional `function_ref` arguments to
`::build` functions of SCF `ParallelOp` and `ReduceOp`. The provided functions
will be called to construct the body of the respective operations while
constructing the operation itself. Exercise them in LoopUtils.
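
The callback-based style can be illustrated with a toy model. Nothing here is MLIR API: `ToyBuilder`, `build_for`, and `demo_loop` are hypothetical, and `std::function` stands in for `function_ref` so that the sketch is self-contained.

```cpp
#include <functional>
#include <string>
#include <vector>

// Toy stand-in for an IR builder: it only records textual "ops".
struct ToyBuilder {
  std::vector<std::string> ops;
  void emit(const std::string& op) { ops.push_back(op); }
};

// Callback that populates a loop body; it receives the builder and the
// name of the induction variable.
using BodyFn = std::function<void(ToyBuilder&, const std::string&)>;

// Creates the loop and invokes the callback while the loop is being built,
// so the caller never has to reposition the builder manually.
void build_for(ToyBuilder& builder, int lower, int upper, const BodyFn& body) {
  builder.emit("for %i = " + std::to_string(lower) + " to " +
               std::to_string(upper) + " {");
  if (body)
    body(builder, "%i");
  builder.emit("}");
}

// Usage: the body is supplied as a lambda at the call site.
std::vector<std::string> demo_loop() {
  ToyBuilder builder;
  build_for(builder, 0, 4, [](ToyBuilder& b, const std::string& iv) {
    b.emit("  use " + iv);
  });
  return builder.ops;
}
```

The point of the pattern is that the body is constructed inside the call that creates the enclosing operation, which keeps nested loop construction readable.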

Differential Revision: https://reviews.llvm.org/D81872

3adced34

GlobalISel: Use early return and reduce indentation · 8a3340d2
Matt Arsenault authored Jun 15, 2020

8a3340d2
[MLIR] Add documentation for generate-check-lines.py · b877f33d
Tim Shen authored Jun 16, 2020

b877f33d

GlobalISel: Make special case handling clearer · 91bec1d3

Matt Arsenault authored Jun 16, 2020

The special case here is really G_UNMERGE_VALUES, not G_EXTRACT. The
other opcodes can hardcode index 1 like G_EXTRACT.

91bec1d3

GlobalISel: Use Register · d98a7c3c
Matt Arsenault authored Jun 16, 2020

d98a7c3c
[MLIR] Remove generated spaces at eof for generate-test-checks.py. · a6150de4
Tim Shen authored Jun 16, 2020

a6150de4

[MLIR] Rework generate-test-checks.py to attach CHECK lines to the source (test) file. · 25b38067

Tim Shen authored Jun 15, 2020

Summary:
This patch adds --source flag to indicate the source file. Then it tries to find insert
points in the source file and insert corresponding checks at those places.

Example output from Tensorflow XLA:

// -----

// CHECK-LABEL:   func @main.3(
// CHECK-SAME:                 %[[VAL_0:.*]]: memref<2x2xf32> {xla_lhlo.params = 0 : index},
// CHECK-SAME:                 %[[VAL_1:.*]]: memref<16xi8> {xla_lhlo.alloc = 0 : index, xla_lhlo.liveout = true}) {
// CHECK:           %[[VAL_2:.*]] = constant 0 : index
// CHECK:           %[[VAL_3:.*]] = constant 0 : index
// CHECK:           %[[VAL_4:.*]] = std.view %[[VAL_1]]{{\[}}%[[VAL_3]]][] : memref<16xi8> to memref<2x2xf32>
// CHECK:           "xla_lhlo.tanh"(%[[VAL_0]], %[[VAL_4]]) : (memref<2x2xf32>, memref<2x2xf32>) -> ()
// CHECK:           return
// CHECK:         }
func @main(%value0: tensor<2x2xf32>) -> tensor<2x2xf32> {
  %res = "xla_hlo.tanh"(%value0) : (tensor<2x2xf32>) -> tensor<2x2xf32>
  return %res : tensor<2x2xf32>
}

Differential Revision: https://reviews.llvm.org/D81903

25b38067

Fix ubsan error in tblgen with signed left shift · 3f0c9c16

Stanislav Mekhanoshin authored Jun 16, 2020

UBSAN complains when tblgen performs SHL of a negative
value.
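
For context, left-shifting a negative signed value is undefined behavior before C++20. One common way to avoid it, shown here as a sketch and not necessarily the change made in this patch, is to shift in the unsigned domain. The name `shl_wrapping` is hypothetical; the sketch assumes a shift amount below 64 and relies on the unsigned-to-signed conversion being modular, which is implementation-defined before C++20 but holds on mainstream compilers.

```cpp
#include <cstdint>

// Shifts in the unsigned domain, where overflow wraps instead of being
// undefined, then converts the bit pattern back to a signed value.
// Assumption: amount < 64.
std::int64_t shl_wrapping(std::int64_t value, unsigned amount) {
  return static_cast<std::int64_t>(static_cast<std::uint64_t>(value) << amount);
}
```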

Differential Revision: https://reviews.llvm.org/D81952

3f0c9c16

[TLI] Add four C++17 delete variants. · 6bc2b042

Hiroshi Yamauchi authored Jun 10, 2020

Summary:
delete(void*, unsigned int, align_val_t)
delete(void*, unsigned long, align_val_t)
delete[](void*, unsigned int, align_val_t)
delete[](void*, unsigned long, align_val_t)
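
These are the sized, aligned deallocation functions `operator delete(void*, std::size_t, std::align_val_t)` and its array form, with `std::size_t` being `unsigned int` on 32-bit targets and `unsigned long` on typical 64-bit ones. A small sketch of code that calls them directly (C++17; the helper names are hypothetical, and the alignment is assumed to be a power of two):

```cpp
#include <cstddef>
#include <cstdint>
#include <new>

// Allocates with the aligned operator new and releases the memory with the
// sized, aligned operator delete. Returns whether the pointer was aligned.
bool aligned_sized_roundtrip(std::size_t size, std::size_t alignment) {
  const std::align_val_t al{alignment};
  void* p = ::operator new(size, al);
  const bool aligned = reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
  ::operator delete(p, size, al);  // delete(void*, size_t, align_val_t)
  return aligned;
}

// Same round trip through the array forms.
bool aligned_sized_array_roundtrip(std::size_t size, std::size_t alignment) {
  const std::align_val_t al{alignment};
  void* p = ::operator new[](size, al);
  const bool aligned = reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
  ::operator delete[](p, size, al);  // delete[](void*, size_t, align_val_t)
  return aligned;
}
```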

Differential Revision: https://reviews.llvm.org/D81853

6bc2b042

[AIX][compiler-rt] Pick the right form of COMPILER_RT_ALIAS for AIX · 8aef01ee

David Tenty authored Jun 16, 2020

Summary: we use the alias attribute, similar to what is done for ELF.

Reviewers: ZarkoCA, jasonliu, hubert.reinterpretcast, sfertile

Reviewed By: jasonliu

Subscribers: dberris, aheejin, mstorsjo, #sanitizers

Tags: #sanitizers

Differential Revision: https://reviews.llvm.org/D81120

8aef01ee

[lldb/Python] Fix the infinitely looping Python prompt bug · 4dd3dfe8

Jonas Devlieghere authored Jun 16, 2020

Executing the commands below will get you bombarded by a wall of Python
command prompts (>>> ).

$ echo 'foo' | ./bin/lldb -o script
$ cat /tmp/script
script
print("foo")
$ lldb --source /tmp/script

The issue is that our custom input reader doesn't handle EOF. According
to the Python documentation [1], file.readline always includes a trailing
newline character unless the file ends with an incomplete line. An empty
string signals EOF. This patch raises an EOFError when that happens.

[1] https://docs.python.org/2/library/stdtypes.html#file.readline

Differential revision: https://reviews.llvm.org/D81898

4dd3dfe8

[VectorCombine] scalarize compares with insertelement operand(s) · ed67f5e7

Sanjay Patel authored Jun 16, 2020

Generalize scalarization (recently enhanced with D80885)
to allow compares as well as binops.
Similar to binops, we are avoiding scalarization of a loaded
value because that could avoid a register transfer in codegen.
This requires 1 extra predicate that I am aware of: we do not
want to scalarize the condition value of a vector select. That
might also invert a transform that we do in instcombine that
prefers a vector condition operand for a vector select.

I think this is the final step in solving PR37463:
https://bugs.llvm.org/show_bug.cgi?id=37463

Differential Revision: https://reviews.llvm.org/D81661

ed67f5e7

[libc++] Don't trigger unsigned conversion warnings in std::advance · 12b01ab7

Louis Dionne authored Jun 08, 2020

The Standard documents the signature of std::advance as

    template <class Iter, class Distance>
    constexpr void advance(Iter& i, Distance n);

Furthermore, it does not appear to put any restriction on what the type
of Distance should be. While it is understood that it should usually
be std::iterator_traits::difference_type, I couldn't find any wording
that mandates that. Similarly, I couldn't find wording that forces the
distance to be a signed type.

This patch changes std::advance to accept any type in the second argument,
which appears to be what the Standard mandates. We then coerce it to the
iterator's difference type, but that's an implementation detail.
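
A minimal sketch of the coercion idea, assuming a wrapper around `std::advance`. The names `advance_any` and `element_after_advance` are hypothetical, and this is not the libc++ implementation.

```cpp
#include <iterator>
#include <vector>

// Accepts any distance type and converts it explicitly to the iterator's
// difference type, so an unsigned argument does not trigger an implicit
// sign-conversion warning further down.
template <class Iter, class Distance>
void advance_any(Iter& it, Distance n) {
  using Diff = typename std::iterator_traits<Iter>::difference_type;
  std::advance(it, static_cast<Diff>(n));
}

// Usage: advance with an unsigned distance and read the element reached.
// Assumption: n is smaller than values.size().
int element_after_advance(const std::vector<int>& values, unsigned n) {
  auto it = values.begin();
  advance_any(it, n);
  return *it;
}
```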

Differential Revision: https://reviews.llvm.org/D81425

12b01ab7

[Clang] Skip adding begin source location for PragmaLoopHint'd loop when · 4676cf44
Yuanfang Chen authored Jun 16, 2020
```
the range start is already set

The range start could be set already in some invalid cases. Fixes
PR46336.
```
4676cf44

[AArch64][GlobalISel] Avoid creating redundant ubfx when selecting G_ZEXT · 7caa9caa

Jessica Paquette authored Jun 15, 2020

When selecting 32 b -> 64 b G_ZEXTs, we don't have to always emit the extend.

If the instruction feeding into the G_ZEXT implicitly zero extends the high
half of the register, we can just emit a SUBREG_TO_REG instead.

Differential Revision: https://reviews.llvm.org/D81897

7caa9caa

[lldb/Test] Create dir if it doesn't yet exist in getReproducerArtifact · e4a84590

Jonas Devlieghere authored Jun 16, 2020

The type tests use this method to store the golden output. This currently
fails if the reproducer directory hasn't yet been created.

e4a84590

[OPENMP][DOCS]Update status of the supported constructs, NFC. · 993c43ae
Alexey Bataev authored Jun 16, 2020

993c43ae
[Format] Add more proto enclosing function names · f1ef237d
Sam McCall authored Jun 16, 2020

f1ef237d
[mlir][shape] Add a func to populate ShapeToShape patterns. · 7a9258e9
Alexander Belyaev authored Jun 16, 2020
```
Differential Revision: https://reviews.llvm.org/D81933
```
7a9258e9

[analyzer][MallocChecker] PR46253: Correctly recognize standard realloc · 1614e354

Kristóf Umann authored Jun 12, 2020

https://bugs.llvm.org/show_bug.cgi?id=46253

This is an obvious hack because realloc isn't any more affected than other
functions modeled by MallocChecker (or any user of CallDescription really),
but the nice solution will take some time to implement.

Differential Revision: https://reviews.llvm.org/D81745

1614e354

[GlobalISel] Delete unused variable after r353432 · 4799fb63
Fangrui Song authored Jun 16, 2020

4799fb63

Fix debug line info when line markers are present inside macros. · 56262a74

Leandro Vaz authored May 21, 2020

Compiling assembly files when newlines are reduced to line markers within a `.macro` context will generate wrong information in the `.debug_line` section.
This patch fixes this issue by evaluating line markers within the macro scope but not when they are used and evaluated.

Reviewed By: probinson

Differential Revision: https://reviews.llvm.org/D80381

56262a74

GlobalISel: Add a note to G_BITCAST documentation · 59ce6ffe
Matt Arsenault authored Jun 07, 2020
```
This is currently different from the IR rules.
```
59ce6ffe
GlobalISel: Make LLT constructors constexpr · 5a95be22
Matt Arsenault authored Jun 06, 2020

5a95be22

[OpenMP][OMPT] Add callbacks for doacross loops · cbea3690

Joachim Protze authored Jun 15, 2020

Adds the callbacks for ordered with source/sink dependencies.

The test for task dependencies changed, because callback.h now actually prints
the passed dependencies and the test also checks for the address.

Reviewed by: hbae

Differential Revision: https://reviews.llvm.org/D81807

cbea3690

[mlir][Linalg] Retire C++ MatmulOp in favor of a linalg-ods-gen'd op. · eae76fae

Nicolas Vasilache authored Jun 16, 2020

Summary:
This revision replaces MatmulOp, now that DRR rules have been dropped.
This revision also fixes minor parsing bugs and plugs a few holes to get e2e paths working (e.g. library call emission).

During the replacement the i32 version had to be dropped because only the EDSC operators +, *, etc. support type inference.

Deciding on a type-polymorphic behavior, and implementing it, is left for future work.

Reviewers: aartbik

Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, msifontes

Tags: #mlir

Differential Revision: https://reviews.llvm.org/D81935

eae76fae

[Matrix] Specify missing alignment in tests (NFC). · e02c9649

Florian Hahn authored Jun 16, 2020

Some tests were missing alignment info. Subsequent changes properly
preserve the set alignment. Set it properly beforehand, to avoid
unnecessary test changes.

It also updates cases where an alignment of 16 was specified, instead of
the vector element type alignment.

e02c9649

[MLIR][NFC] Inline lambda to work around gcc 9.1,9.2 bug · 6cd23205

Kiran Chandramohan authored Jun 16, 2020

gcc 9.1/9.2 has a bug (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90538)
which leads to an incorrect error when expanding parameter packs multiple
times in a lambda. Inlining this lambda to work around this issue.

Reviewed By: rriddle, CarolineConcatto

Differential Revision: https://reviews.llvm.org/D81828

6cd23205