- Oct 27, 2020
-
-
River Riddle authored
This class represents a rewrite pattern list that has been frozen and is thus immutable. It replaces the uses of OwningRewritePatternList in pattern-driver-related APIs, such as dialect conversion. As PDL becomes more prevalent, this API will allow optimizing a set of patterns once, without needing to do so on every run of a pass. Differential Revision: https://reviews.llvm.org/D89104
-
River Riddle authored
There are several pieces of pattern rewriting infra in IR/ that really shouldn't be there. This revision moves those pieces to a better location so that they are easier to evolve in the future (e.g. with PDL). More concretely, this revision does the following:
* Create a Transforms/GreedyPatternRewriteDriver.h and move the apply*andFold methods there. The definitions for these methods are already in Transforms/, so it doesn't make sense for the declarations to be in IR.
* Create a new lib/Rewrite library and move PatternApplicator there. This new library will be focused on applying rewrites, and will also include compiling rewrites with PDL.
Differential Revision: https://reviews.llvm.org/D89103
-
MaheshRavishankar authored
Adds support for:
- dropping unit-dimension loops for indexed_generic ops;
- folding consecutive folding (or expanding) reshapes when the result (or src) is a scalar;
- fixes to indexed_generic -> generic fusion when zero-dim tensors are involved.
Differential Revision: https://reviews.llvm.org/D90118
-
- Oct 26, 2020
-
-
Alex Zinenko authored
The alignment attribute in the 'alloca' op treats the value '0' as 'unset'. When parsing the custom form of the 'alloca' op, ignore the alignment attribute if its value is '0' instead of actually creating it, which would otherwise produce a textually different yet semantically equivalent form in the output. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D90179
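For illustration, the round-trip behavior described above can be sketched as follows (a hedged sketch — the exact LLVM-dialect syntax of that period is approximated, and `%size` is a hypothetical operand):

```
// An alloca written with a zero alignment attribute ...
%0 = llvm.alloca %size x i64 {alignment = 0 : i64} : (i64) -> !llvm.ptr
// ... now parses as if the attribute were absent, so it prints back as:
%0 = llvm.alloca %size x i64 : (i64) -> !llvm.ptr
// Both forms are semantically equivalent; the parser no longer creates
// the attribute, keeping the textual form stable.
```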
-
Thomas Raoux authored
Based on a discourse discussion, fix the doc string and remove examples with incorrect semantics. Also fix the insert_map semantics by adding the missing operand for the vector we are inserting into. Differential Revision: https://reviews.llvm.org/D89563
-
Nicolas Vasilache authored
This revision allows fusing the producer of input tensors into the consumer under a tiling transformation (which produces subtensors). Many pieces are still missing (e.g. support for init_tensors, better refactoring of the LinalgStructuredOp interface support, merging implementations and reusing code), but this still allows getting started. The greedy pass itself is just for testing purposes and will be extracted into a separate test pass. Differential Revision: https://reviews.llvm.org/D89491
-
- Oct 24, 2020
-
-
Mehdi Amini authored
This has been deprecated for over a month now, and removal was announced in: https://llvm.discourse.group/t/rfc-revamp-dialect-registration/1559/11 Differential Revision: https://reviews.llvm.org/D86356
-
- Oct 23, 2020
-
-
Mehdi Amini authored
This reverts commit b22e2e4c while investigating broken builds.
-
Mehdi Amini authored
This has been deprecated for over a month now, and removal was announced in: https://llvm.discourse.group/t/rfc-revamp-dialect-registration/1559/11 Differential Revision: https://reviews.llvm.org/D86356
-
Thomas Raoux authored
Add a folder for the case where the ExtractStridedSliceOp source comes from a chain of InsertStridedSliceOps. Also add a folder for the trivial case where the ExtractStridedSliceOp is a no-op. Differential Revision: https://reviews.llvm.org/D89850
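The chain-folding case can be sketched roughly as follows (a hedged illustration — the shapes, offsets, and attribute spelling are illustrative, not taken from the patch):

```
// Insert a 2x4 slice into an 8x4 vector at offset [2, 0].
%0 = vector.insert_strided_slice %small, %big
       {offsets = [2, 0], strides = [1, 1]}
       : vector<2x4xf32> into vector<8x4xf32>
// An extract that exactly matches the inserted slice folds to %small:
%1 = vector.extract_strided_slice %0
       {offsets = [2, 0], sizes = [2, 4], strides = [1, 1]}
       : vector<8x4xf32> to vector<2x4xf32>
// The trivial case: an extract covering the whole source is a no-op
// and folds to the source vector.
```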
-
Thomas Raoux authored
Differential Revision: https://reviews.llvm.org/D89853
-
- Oct 22, 2020
-
-
Alexander Belyaev authored
Reuse most code for printing/parsing/verification from SubViewOp. https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D89720
-
Alexander Belyaev authored
https://llvm.discourse.group/t/rfc-standard-memref-cast-ops/1454/15 Differential Revision: https://reviews.llvm.org/D89784
-
- Oct 21, 2020
-
-
rdzhabarov authored
[mlir] Simplify DRR matching patterns with equal operands for operators where applicable. Added documentation. The diff https://reviews.llvm.org/D89254 introduced implicit matching between operands with the same name. Differential Revision: https://reviews.llvm.org/D89598
-
Lei Zhang authored
MLIRTransforms is needed to provide BufferizeTypeConverter definitions.
-
Sean Silva authored
A "structural" type conversion is one where the underlying ops are completely agnostic to the actual types involved and simply need to update their types. An example of this is shape.assuming -- the shape.assuming op and the corresponding shape.assuming_yield op need to update their types according to the TypeConverter, but otherwise don't care what type conversions are happening. Also, the previous conversion code would not correctly materialize conversions for the shape.assuming_yield op. This should have caused a verification failure, but shape.assuming's verifier wasn't calling RegionBranchOpInterface::verifyTypes (which cannot be called automatically as part of the trait verification, and requires being called manually). This patch also adds that verification. Differential Revision: https://reviews.llvm.org/D89833
-
Sean Silva authored
A "structural" type conversion is one where the underlying ops are completely agnostic to the actual types involved and simply need to update their types. An example of this is scf.if -- the scf.if op and the corresponding scf.yield ops need to update their types according to the TypeConverter, but otherwise don't care what type conversions are happening. To test the structural type conversions, it is convenient to define a bufferize pass for a dialect, which exercises them nicely. Differential Revision: https://reviews.llvm.org/D89757
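A structural conversion of scf.if under a tensor-to-memref TypeConverter can be sketched like this (a hedged illustration — operand names and shapes are hypothetical, and the op only rewrites its own result and yield types):

```
// Before: scf.if carries tensor-typed results.
%r = scf.if %cond -> (tensor<4xf32>) {
  scf.yield %t0 : tensor<4xf32>
} else {
  scf.yield %t1 : tensor<4xf32>
}
// After the structural conversion: the result and yield types are
// updated to the converted type; the control flow is untouched.
%r2 = scf.if %cond -> (memref<4xf32>) {
  scf.yield %m0 : memref<4xf32>
} else {
  scf.yield %m1 : memref<4xf32>
}
```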
-
Christian Sigg authored
Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89262
-
- Oct 20, 2020
-
-
Federico Lebrón authored
Differential Revision: https://reviews.llvm.org/D89825
-
Geoffrey Martin-Noble authored
Without this I get a warning about not all paths returning. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D89760
-
Sean Silva authored
It's unfortunate that this requires adding a dependency on the scf dialect to std bufferization (and hence to all of std transforms). This is a bit perilous. We might want a lib/Transforms/Bufferize/ with a separate bufferization library per dialect? Differential Revision: https://reviews.llvm.org/D89667
-
Sean Silva authored
Add bufferizations for extract_element and tensor_from_elements. Differential Revision: https://reviews.llvm.org/D89594
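The extract_element bufferization can be sketched roughly as follows (a hedged sketch using the std-dialect op names of that period; the shapes and SSA names are illustrative):

```
// Before bufferization:
%e = extract_element %t[%i] : tensor<4xf32>
// After (roughly): the tensor operand is materialized as a memref,
// and the element is read with a load.
%m = tensor_to_memref %t : memref<4xf32>
%e2 = load %m[%i] : memref<4xf32>
```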
-
- Oct 19, 2020
-
-
Christian Sigg authored
AllReduceLowering is currently the only GPU rewrite pattern, but more are coming. This is a preparation change. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89370
-
- Oct 18, 2020
-
-
River Riddle authored
```
...
cond_br %cond, ^bb1(...), ^bb2(...)
...
^bb1: // has single predecessor
...
cond_br %cond, ^bb3(...), ^bb4(...)
```
->
```
...
cond_br %cond, ^bb1(...), ^bb2(...)
...
^bb1: // has single predecessor
...
br ^bb3(...)
```
Differential Revision: https://reviews.llvm.org/D89604
-
- Oct 16, 2020
-
-
River Riddle authored
The initial goal of this interface is to fix the current problems with verifying symbol user operations, but it can extend beyond that in the future. The current problems with the verification of symbol uses are:
* Extreme inefficiency: most current symbol users perform the symbol lookup using the slow O(N) string-compare methods, which can lead to extremely long verification times in large modules.
* Breaking the constraints of the verification pass: if the symbol reference is not flat (and even if it is flat, in some cases), an operation's verifier is not permitted to touch the referenced operation, because it may be in the process of being mutated by a different thread within the pass manager.
The new SymbolUserOpInterface exposes a method `verifySymbolUses` that will be invoked from the parent symbol table to allow verifying the constraints of any referenced symbols. This method is passed a `SymbolTableCollection` to allow O(1) lookups of any necessary symbol operation. Differential Revision: https://reviews.llvm.org/D89512
-
Thomas Raoux authored
Add unroll support for the transfer read and transfer write operations. This allows picking the ideal memory-access size for a given target. Differential Revision: https://reviews.llvm.org/D89289
-
- Oct 15, 2020
-
-
Sean Silva authored
The opposite of tensor_to_memref is tensor_load.
- Add some basic tensor_load/tensor_to_memref folding.
- Add source/target materializations to BufferizeTypeConverter.
- Add an example std bufferization pattern/pass that shows how the materializations work together (more std bufferization patterns to come in subsequent commits).
- In coming commits, I'll document how to write composable bufferization passes/patterns and update the other in-tree bufferization passes to match this convention. The populate* functions will of course continue to be exposed for power users.
The naming of tensor_load/tensor_to_memref and their pretty forms is not very intuitive. I'm open to any suggestions here. One key observation is that the memref type must always be the one specified in the pretty form, since the tensor type can be inferred from the memref type but not vice versa. With this, I've been able to replace all my custom bufferization type converters in npcomp with BufferizeTypeConverter! Part of the plan discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89437
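The basic folding mentioned above can be sketched roughly like this (a hedged illustration — the shape and SSA names are hypothetical):

```
// A round-trip through a memref is a no-op on the tensor:
%m  = tensor_to_memref %t : memref<4xf32>
%t2 = tensor_load %m : memref<4xf32>
// %t2 folds to %t. The memref type appears in both pretty forms because
// the tensor type is inferable from it, but not vice versa.
```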
-
Stephan Herhut authored
Parsing of a scalar subview did not create the required static_offsets attribute. This also adds support for folding scalar subviews away. Differential Revision: https://reviews.llvm.org/D89467
-
MaheshRavishankar authored
Each hardware that supports SPV_C_CooperativeMatrixNV has a list of configurations that are supported natively. Add an attribute to `spv.target_env` to specify the supported configurations. Reviewed By: antiagainst, ThomasRaoux Differential Revision: https://reviews.llvm.org/D89364
-
- Oct 14, 2020
-
-
MaheshRavishankar authored
The current fusion on tensors fuses reshape ops with generic ops by linearizing the indexing maps of the fused tensor in the generic op. This has some limitations:
- It only works for static shapes.
- The resulting indexing map has a linearization that could potentially prevent fusion later on (e.g. tile + fuse).
Instead, try to fuse the reshape consumer (producer) with the generic op producer (consumer) by expanding the dimensionality of the generic op when the reshape is expanding (folding). This approach conflicts with the linearization approach, so the expansion method is used instead of the linearization method. Further refactoring changes the fusion on tensors to be a collection of patterns. Differential Revision: https://reviews.llvm.org/D89002
-
Sean Silva authored
Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89271
-
Sean Silva authored
Once we have tensor_to_memref ops suitable for type materializations, this pass can be split into a generic type conversion pattern. Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89258
-
Sean Silva authored
Part of the refactor discussed in: https://llvm.discourse.group/t/what-is-the-strategy-for-tensor-memref-conversion-bufferization/1938/17 Differential Revision: https://reviews.llvm.org/D89261
-
Irina Dobrescu authored
Differential Revision: https://reviews.llvm.org/D87684
-
Nicolas Vasilache authored
-
Nicolas Vasilache authored
This revision adds a programmable codegen strategy from linalg based on staged rewrite patterns. Testing is exercised on a simple linalg.matmul op. Differential Revision: https://reviews.llvm.org/D89374
-
- Oct 13, 2020
-
-
Alberto Magni authored
Update linalg-to-loops lowering for pooling operations to perform padding of the input when specified by the corresponding attribute. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D88911
-
Stella Stamenova authored
The build of MLIR occasionally fails (especially on Windows) because there is a missing dependency between MLIRLLVMIR and MLIROpenMPOpsIncGen:
1) LLVMDialect.cpp includes LLVMDialect.h
2) LLVMDialect.h includes OpenMPDialect.h
3) OpenMPDialect.h includes OpenMPOpsDialect.h.inc, OpenMPOpsEnums.h.inc and OpenMPOps.h.inc
The OpenMP .inc files are generated by MLIROpenMPOpsIncGen, so MLIRLLVMIR, which builds LLVMDialect.cpp, should depend on MLIROpenMPOpsIncGen. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D89275
-
Nicolas Vasilache authored
TensorConstantOp bufferization currently uses the vector dialect to store constant data into memory. Due to natural vector size and alignment properties, this is problematic with n>1-D vectors whose most minor dimension is not naturally aligned. Instead, this revision linearizes the constant and introduces a linalg.reshape to go back to the desired shape. This is still to be considered a workaround, and a better longer-term solution will probably involve `llvm.global`. Differential Revision: https://reviews.llvm.org/D89311
-
Christian Sigg authored
This combines two separate ops (D88972: `gpu.create_token`, D89043: `gpu.host_wait`) into one. I do, after all, like the idea of combining the two ops, because it matches exactly the pattern we are going to have in the other gpu ops that will implement the AsyncOpInterface (launch_func, copies, alloc): if the op is async, we return a !gpu.async.token; otherwise, we synchronize with the host and don't return a token. The use cases for `gpu.wait async` and `gpu.wait` are further apart than those of e.g. `gpu.h2d async` and `gpu.h2d`, but I like the consistent meaning of the `async` keyword in GPU ops. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89160
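The two forms of the combined op can be sketched as follows (a hedged illustration of the pattern described above; the exact printed syntax is approximate):

```
// Async form: returns a token that later async GPU ops can depend on.
%token = gpu.wait async
// Sync form: blocks the host until the listed async dependencies
// complete, and returns no token.
gpu.wait [%token]
```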
-