Commits · 3a506b31a341585a21b21c42253ea9fc54c55b37 · Lorenzo Albano / LLVM bpEVL

Mar 21, 2021

Change OwningRewritePatternList to carry an MLIRContext with it. · 3a506b31

Chris Lattner authored Mar 20, 2021

This updates the codebase to pass the context when creating an instance of
OwningRewritePatternList, and starts removing extraneous MLIRContext
parameters. There are many many more to be removed.

Differential Revision: https://reviews.llvm.org/D99028

3a506b31

Mar 20, 2021

[mlir] Additional folding for SelectOp · 7219b31d

Butygin authored Mar 12, 2021

* Fold SelectOp when both true and false args are same SSA value
* Fold some cmp + select patterns

Differential Revision: https://reviews.llvm.org/D98576

7219b31d

[mlir] Canonicalize IfOp with trivial `then` and `else` bodies to list of SelectOp's · 5657f93e

Butygin authored Mar 12, 2021

* Do we need a threshold on maximum number of Yeild arguments processed (maximum number of SelectOp's to be generated)?
* Had to modify some old IfOp tests to not get optimized by this pattern

Differential Revision: https://reviews.llvm.org/D98592

5657f93e

Update syntax for amx.tile_muli to use two Unit attr to mark the zext case · cdb6eb7e

Mehdi Amini authored Mar 20, 2021

This makes the annotation tied to the operand and the use of a keyword
more explicit/readable on what it means.

Differential Revision: https://reviews.llvm.org/D99001

cdb6eb7e

Mar 19, 2021

[mlir][Linalg] Make LLVM_DEBUG region bigger to avoid warnings in Release builds · 6327a7cf
Benjamin Kramer authored Mar 19, 2021
```
Transforms.cpp:586:16: error: unused variable 'v' [-Werror,-Wunused-variable]
    for (Value v : operands)
               ^
```
6327a7cf
[mlir][Linalg] NFC - Expose helper function `substituteMin`. · 5b2d8503
Nicolas Vasilache authored Mar 19, 2021

5b2d8503
[mlir] Add a roundtrip test for 'linalg.tiled_loop' on buffers. · 628f5c9d
Alexander Belyaev authored Mar 18, 2021
```
https://llvm.discourse.group/t/rfc-add-linalg-tileop/2833

Differential Revision: https://reviews.llvm.org/D98900
```
628f5c9d

[mlir] Remove mlir-rocm-runner · a825fb2c

Christian Sigg authored Mar 19, 2021

This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396.

I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though.

Reviewed By: whchung

Differential Revision: https://reviews.llvm.org/D98447

a825fb2c

Mar 18, 2021

Revert "Revert "[mlir] Add linalg.fill bufferization conversion"" · fcc1ce00
Lei Zhang authored Mar 18, 2021
```
This reverts commit c69550c1 with
proper fix applied.
```
fcc1ce00

Revert "[mlir] Add linalg.fill bufferization conversion" · c69550c1

Mehdi Amini authored Mar 18, 2021

This reverts commit 32a744ab.

CI is broken:

test/Dialect/Linalg/bufferize.mlir:274:12: error: CHECK: expected string not found in input
 // CHECK: %[[MEMREF:.*]] = tensor_to_memref %[[IN]] : memref<?xf32>
           ^

c69550c1

[mlir] Add linalg.fill bufferization conversion · 32a744ab

Eugene Zhulenev authored Mar 15, 2021

`BufferizeAnyLinalgOp` fails because `FillOp` is not a `LinalgGenericOp` and it fails while reading operand sizes attribute.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D98671

32a744ab

[mlir][linalg] Extend linalg vectorization to support non-identity input maps · 16947650

thomasraoux authored Mar 16, 2021

This propagates the affine map to transfer_read op in case it is not a
minor identity map.

Differential Revision: https://reviews.llvm.org/D98523

16947650

[mlir] Fix typo in SCF.cpp (NFC) · 4c782a24
lorenzo chelini authored Mar 18, 2021

4c782a24
[mlir][linalg] Add support for memref inputs/outputs for `linalg.tiled_loop`. · 28379915
Alexander Belyaev authored Mar 18, 2021
```
Also use `ArrayAttr` to pass iterator pass to the TiledLoopOp builder.

Differential Revision: https://reviews.llvm.org/D98871
```
28379915

[MLIR][OpenMP] Pretty printer and parser for omp.wsloop · de155f4a

David Truby authored Mar 17, 2021



Co-authored-by: Kiran Chandramohan <kiran.chandramohan@arm.com>

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D92327

de155f4a

[MLIR] Canonicalize broadcast operations on single shapes · 1ce70c15

Frederik Gossen authored Mar 18, 2021

This covers cases that are not folded away because the extent tensor type
becomes more concrete in the process.

Differential Revision: https://reviews.llvm.org/D98782

1ce70c15

Mar 17, 2021

[mlir][ODS] Support specialized Attribute class for Enums · fee90542

Vladislav Vinogradov authored Feb 27, 2021

Add a feature to `EnumAttr` definition to generate
specialized Attribute class for the particular enumeration.

This class will inherit `StringAttr` or `IntegerAttr` and
will override `classof` and `getValue` methods.

With this class the enumeration predicate can be checked with simple
RTTI calls (`isa`, `dyn_cast`) and it will return the typed enumeration
directly instead of raw string/integer.

Based on the following discussion:
https://llvm.discourse.group/t/rfc-add-enum-attribute-decorator-class/2252

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D97836

fee90542

[mlir] scf::ForOp: Drop iter arguments (and corresponding result) with no use · 0a74a716

lorenzo chelini authored Mar 17, 2021

'ForOpIterArgsFolder' can now remove iterator arguments (and corresponding
results) with no use.

Example:

```
%cst = constant 32 : i32

%0:2 = scf.for %arg1 = %lb to %ub step %step iter_args(%arg2 = %arg0, %arg3 = %cst)
  -> (i32, i32) {
  %1 = addu %arg2, %cst : i32
  scf.yield %1, %1 : i32, i32
}

use(%0#0)

```

%arg3 is not used in the block, and its corresponding result `%0#1` has no use,
thus remove the iter argument.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D98711

0a74a716

Mar 16, 2021

[mlir][PDL] Add support for variadic operands and results in the PDL Interpreter · 3a833a0e

River Riddle authored Mar 16, 2021

This revision extends the PDL Interpreter dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant:
* pdl_interp.check_types : Compare a range of types with a known range.
* pdl_interp.create_types : Create a constant range of types.
* pdl_interp.get_operands : Get a range of operands from an operation.
* pdl_interp.get_results : Get a range of results from an operation.
* pdl_interp.switch_types : Switch on a range of types.

This revision handles adding support in the interpreter dialect and the conversion from PDL to PDLInterp. Support for variadic operands and results in the bytecode will be added in a followup revision.

Differential Revision: https://reviews.llvm.org/D95722

3a833a0e

[mlir][PDL] Add support for variadic operands and results in PDL · 1eb6994d

River Riddle authored Mar 16, 2021

This revision extends the PDL dialect to add support for variadic operands and results, with ranges of these values represented via the recently added !pdl.range type. To support this extension, three new operations have been added that closely match the single variant:
* pdl.operands : Define a range of input operands.
* pdl.results : Extract a result group from an operation.
* pdl.types : Define a handle to a range of types.

Support for these in the pdl interpreter dialect and byte code will be added in followup revisions.

Differential Revision: https://reviews.llvm.org/D95721

1eb6994d

[mlir][pdl] Remove CreateNativeOp in favor of a more general ApplyNativeRewriteOp. · 02c4c0d5

River Riddle authored Mar 16, 2021

This has a numerous amount of benefits, given the overly clunky nature of CreateNativeOp:
* Users can now call into arbitrary rewrite functions from inside of PDL, allowing for more natural interleaving of PDL/C++ and enabling for more of the pattern to be in PDL.
* Removes the need for an additional set of C++ functions/registry/etc. The new ApplyNativeRewriteOp will use the same PDLRewriteFunction as the existing RewriteOp. This reduces the API surface area exposed to users.

This revision also introduces a new PDLResultList class. This class is used to provide results of native rewrite functions back to PDL. We introduce a new class instead of using a SmallVector to simplify the work necessary for variadics, given that ranges will require some changes to the structure of PDLValue.

Differential Revision: https://reviews.llvm.org/D95720

02c4c0d5

[mlir][pdl] Restructure how results are represented. · 242762c9

River Riddle authored Mar 16, 2021

Up until now, results have been represented as additional results to a pdl.operation. This is fairly clunky, as it mismatches the representation of the rest of the IR constructs(e.g. pdl.operand) and also isn't a viable representation for operations returned by pdl.create_native. This representation also creates much more difficult problems when factoring in support for variadic result groups, optional results, etc. To resolve some of these problems, and simplify adding support for variable length results, this revision extracts the representation for results out of pdl.operation in the form of a new `pdl.result` operation. This operation returns the result of an operation at a given index, e.g.:

```
%root = pdl.operation ...
%result = pdl.result 0 of %root
```

Differential Revision: https://reviews.llvm.org/D95719

242762c9

[mlir] NFC - Expose GlobalCreator so it can be reused. · b661788b
Nicolas Vasilache authored Mar 16, 2021

b661788b
[mlir]: Add canonicalization for dim of 1D alloc of size rank. · 2995e161
Adrian Kuegel authored Mar 03, 2021
```
Differential Revision: https://reviews.llvm.org/D97542
```
2995e161

scf::ForOp: Fold away iterator arguments with no use and for which the... · fd7eee64

Lorenzo Chelini authored Mar 16, 2021

scf::ForOp: Fold away iterator arguments with no use and for which the corresponding input is yielded

Enhance 'ForOpIterArgsFolder' to remove unused iteration arguments in a
scf::ForOp. If the block argument corresponding to the given iterator has no
use and the yielded value equals the input, we fold it away.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D98503

fd7eee64

[mlir][amx] Add Intel AMX dialect (architectural-specific vector dialect) · 6ad7b97e

Aart Bik authored Mar 15, 2021

The Intel Advanced Matrix Extensions (AMX) provides a tile matrix
multiply unit (TMUL), a tile control register (TILECFG), and eight
tile registers TMM0 through TMM7 (TILEDATA). This new MLIR dialect
provides a bridge between MLIR concepts like vectors and memrefs
and the lower level LLVM IR details of AMX.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D98470

6ad7b97e

Mar 15, 2021

[mlir] fix shared-lib build fallout of · 0fb4a201

Alex Zinenko authored Mar 15, 2021

The patch in question broke the build with shared libraries due to
missing dependencies, one of which would have been circular between
MLIRStandard and MLIRMemRef if added. Fix this by moving more code
around and swapping the dependency direction. MLIRMemRef now depends on
MLIRStandard, but MLIRStandard does _not_ depend on MLIRMemRef.
Arguably, this is the right direction anyway since numerous libraries
depend on MLIRStandard and don't necessarily need to depend on
MLIRMemref.

Other otable changes include:
- some EDSC code is moved inline to MemRef/EDSC/Intrinsics.h because it
  creates MemRef dialect operations;
- a utility function related to shape moved to BuiltinTypes.h/cpp
  because it only realtes to shaped types and not any particular dialect
  (standard dialect is erroneously believed to contain MemRefType);
- a Python test for the standard dialect is disabled completely because
  the ops it tests moved to the new MemRef dialect, but it is not
  exposed to Python bindings, and the change for that is non-trivial.

0fb4a201

[MLIR] Create memref dialect and move dialect-specific ops from std. · e2310704

Julian Gross authored Feb 10, 2021

Create the memref dialect and move dialect-specific ops
from std dialect to this dialect.

Moved ops:
AllocOp -> MemRef_AllocOp
AllocaOp -> MemRef_AllocaOp
AssumeAlignmentOp -> MemRef_AssumeAlignmentOp
DeallocOp -> MemRef_DeallocOp
DimOp -> MemRef_DimOp
MemRefCastOp -> MemRef_CastOp
MemRefReinterpretCastOp -> MemRef_ReinterpretCastOp
GetGlobalMemRefOp -> MemRef_GetGlobalOp
GlobalMemRefOp -> MemRef_GlobalOp
LoadOp -> MemRef_LoadOp
PrefetchOp -> MemRef_PrefetchOp
ReshapeOp -> MemRef_ReshapeOp
StoreOp -> MemRef_StoreOp
SubViewOp -> MemRef_SubViewOp
TransposeOp -> MemRef_TransposeOp
TensorLoadOp -> MemRef_TensorLoadOp
TensorStoreOp -> MemRef_TensorStoreOp
TensorToMemRefOp -> MemRef_BufferCastOp
ViewOp -> MemRef_ViewOp

The roadmap to split the memref dialect from std is discussed here:
https://llvm.discourse.group/t/rfc-split-the-memref-dialect-from-std/2667

Differential Revision: https://reviews.llvm.org/D98041

e2310704

[MLIR] Add canonicalization for `shape.broadcast` · b55f424f

Frederik Gossen authored Mar 15, 2021

Remove redundant operands and fold if only one left.

Differential Revision: https://reviews.llvm.org/D98402

b55f424f

Mar 13, 2021

[mlir][sparse] disable nonunit stride dense vectorization · e7ee4eaa

Aart Bik authored Mar 12, 2021

This is a temporary work-around to get our all-annotations-all-flags
stress testing effort run clean. In the long run, we want to provide
efficient implementations of strided loads and stores though

Reviewed By: bixia

Differential Revision: https://reviews.llvm.org/D98563

e7ee4eaa

Mar 12, 2021

[mlir] Annotate functions used only in debug mode with LLVM_ATTRIBUTE_UNUSED · 39b2cd40

Eugene Zhulenev authored Mar 12, 2021

Functions used only in `assert` cause warnings in release mode

Reviewed By: mehdi_amini, dcaballe, ftynse

Differential Revision: https://reviews.llvm.org/D98476

39b2cd40

[mlir] Fix ConstantOp verifier · 849f8183

Marius Brehler authored Mar 08, 2021

This restricts the attributes to integers for constants of type
IndexType. So far an attribute like StringAttr as in

  %c1 = constant "" : index

is valid.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D98216

849f8183

[mlir][Vector] Lowering of transfer_read/write to vector.load/store · fd2b0896

Sergei Grechanik authored Mar 11, 2021

This patch introduces progressive lowering patterns for rewriting
vector.transfer_read/write to vector.load/store and vector.broadcast
in certain supported cases.

Reviewed By: dcaballe, nicolasvasilache

Differential Revision: https://reviews.llvm.org/D97822

fd2b0896

Replace use of OperationState with builder::create in GPU Kernel Outlining (NFC) · e1364f10
Mehdi Amini authored Mar 12, 2021
```
OperationState is a low level API that is rarely indicated, the builder
API convenient wrapper is preferred when possible.
```
e1364f10

Reland: [mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. · 0fd0fb53

Diego Caballero authored Mar 10, 2021

This patch adds support for vectorizing loops with 'iter_args' when those loops
are not a vector dimension. This allows vectorizing outer loops with an inner
'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args'
loops are vector dimensions would require more work (e.g., analysis,
generating horizontal reduction, etc.) not included in this patch.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D97892

0fd0fb53

Mar 11, 2021

Reland: [mlir][Vector][Affine] Improve affine vectorizer algorithm · 96891f04

Diego Caballero authored Mar 10, 2021

This patch replaces the root-terminal vectorization approach implemented in the
Affine vectorizer with a topological order approach that vectorizes all the
operations within the target loop nest. These are the most important changes
introduced by the new algorithm:
  * Removed tracking of root and terminal ops. Existing vectorization
    functionality is preserved and extended so that loop nests without
    root-terminal chains can be vectorized.
  * Vectorizing a loop nest now only requires a single topological traversal.
  * A new vector loop nest is incrementally built along the vectorization
    process. The original scalar loop is kept intact. No cloning guard is needed
    to recover the scalar loop if vectorization fails. This approach also
    simplifies the challenging task of replacing a loop operation amid the
    vectorization process without invalidating the analysis information that
    depends on the original loop.
  * Vectorization of specific operations has been implemented as independent,
    preparing them to be moved to a potential vectorization interface.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D97442

96891f04

[mlir][StorageUniquer] Properly call the destructor on non-trivially destructible storage instances · 31bb8efd

River Riddle authored Mar 11, 2021

This allows for storage instances to store data that isn't uniqued in the context, or contain otherwise non-trivial logic, in the rare situations that they occur. Storage instances with trivial destructors will still have their destructor skipped. A consequence of this is that the storage instance definition must be visible from the place that registers the type.

Differential Revision: https://reviews.llvm.org/D98311

31bb8efd

[mlir][Vector][Affine] Fix heap-use-after-free in vectorizer · ed193bce

Diego Caballero authored Mar 11, 2021

This patch fixes a heap-use-after-free introduced by the recent changes
in the vectorizer: https://reviews.llvm.org/rG95db7b4aeaad590f37720898e339a6d54313422f
The problem is due to the way candidate loops are visited. All candidate loops
are pattern-matched beforehand using the 'NestedMatch' utility. These matches may
intersect with each other so it may happen that we try to vectorize a loop that
was previously vectorized. The new vectorization algorithm replaces the original
loops that are vectorized with new loops and, therefore, any reference to the
original loops in the pre-computed matches becomes invalid.

This patch fixes the problem by classifying the candidate matches into buckets
before vectorization. Each bucket contains all the matches that intersect. The
vectorizer uses these buckets to make sure that we only vectorize *one* match from
each bucket, at most.

Differential Revision: https://reviews.llvm.org/D98382

ed193bce

[mlir] Introduce data layout modeling subsystem · 3ba14fa0

Alex Zinenko authored Mar 11, 2021

Data layout information allows to answer questions about the size and alignment
properties of a type. It enables, among others, the generation of various
linear memory addressing schemes for containers of abstract types and deeper
reasoning about vectors. This introduces the subsystem for modeling data
layouts in MLIR.

The data layout subsystem is designed to scale to MLIR's open type and
operation system. At the top level, it consists of attribute interfaces that
can be implemented by concrete data layout specifications; type interfaces that
should be implemented by types subject to data layout; operation interfaces
that must be implemented by operations that can serve as data layout scopes
(e.g., modules); and dialect interfaces for data layout properties unrelated to
specific types. Built-in types are handled specially to decrease the overall
query cost.

A concrete default implementation of these interfaces is provided in the new
Target dialect. Defaults for built-in types that match the current behavior are
also provided.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D97067

3ba14fa0

[mlir] Add LLVM loop codegen options to control software pipelining · b4a516cc

Arpith C. Jacob authored Mar 11, 2021

Support specifying the II and disabling pipelining.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D98420

b4a516cc