- Dec 02, 2020
Christian Sigg authored
Given that OpState already implicitly converts to Operation*, this seems reasonable. The alternative would be to add more functions to OpState which forward to Operation.

Reviewed By: rriddle, ftynse

Differential Revision: https://reviews.llvm.org/D92266
-
- Nov 26, 2020
Aart Bik authored
This change gives sparse compiler clients more control over selecting individual types for the pointers and indices in the sparse storage schemes. Narrower width obviously results in smaller memory footprints, but the range should always suffice for the maximum number of entries or index value.

Reviewed By: penpornk

Differential Revision: https://reviews.llvm.org/D92126
-
Sean Silva authored
It still had the old name from before ElementwiseMappable was added.
-
- Nov 25, 2020
Aart Bik authored
This CL adds the ability to request different parallelization strategies for the generated code. Every "parallel" loop is a candidate, and is converted to a parallel op if it is an actual for-loop (not a while) and the strategy allows dense/sparse outer/inner parallelization. This will connect directly with the work of @ezhulenev on parallel loops. Still TBD: vectorization strategy.

Reviewed By: penpornk

Differential Revision: https://reviews.llvm.org/D91978
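As a rough sketch (op names, bounds, and memrefs here are illustrative placeholders, not taken from the patch), an outer "parallel" dense loop selected by such a strategy would be emitted as an scf.parallel op rather than a sequential scf.for:

```mlir
// Illustrative only: %c0, %c1, %n, %a, and %b are assumed defined elsewhere.
scf.parallel (%i) = (%c0) to (%n) step (%c1) {
  %v = load %a[%i] : memref<?xf64>
  store %v, %b[%i] : memref<?xf64>
  scf.yield
}
```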
-
- Nov 24, 2020
Aart Bik authored
Generalizes invariant handling to anything defined outside the Linalg op (parameters and SSA computations). Fixes a bug that used the parameter number as the tensor number.

Reviewed By: penpornk

Differential Revision: https://reviews.llvm.org/D91985
-
Nicolas Vasilache authored
Print part of an op of the form:

```
<optional-offset-prefix> `[` offset-list `]`
<optional-size-prefix> `[` size-list `]`
<optional-stride-prefix> `[` stride-list `]`
```

Also address some leftover nits.

Differential revision: https://reviews.llvm.org/D92031
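For reference, a subview op printed in this offsets/sizes/strides form might look like the following sketch (the SSA names, memref types, and layout map are illustrative):

```mlir
// offsets [%o0, %o1], sizes [%s0, %s1], strides [%st0, %st1]
%1 = subview %0[%o0, %o1] [%s0, %s1] [%st0, %st1]
    : memref<8x16xf32> to memref<?x?xf32, #map>
```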
-
Alexander Belyaev authored
Differential Revision: https://reviews.llvm.org/D92014
-
- Nov 23, 2020
Nicolas Vasilache authored
-
MaheshRavishankar authored
Exposing some utility functions from Linalg to allow for promotion of fused views outside of the core tile+fuse logic. This is an alternative to patch D91322, which adds the promotion logic to the tileAndFuse method. The downside of that approach is that it is not easily customizable based on needs.

Differential Revision: https://reviews.llvm.org/D91503
-
MaheshRavishankar authored
Enhance the tile+fuse logic to allow fusing a sequence of operations, and make sure the value used to obtain the tile shape is a SubViewOp/SubTensorOp. The current logic used to get the loop bounds depends on the use of the `getOrCreateRange` method on `SubViewOp` and `SubTensorOp`, so make sure that the value/dim used to compute the range comes from such ops. This fix is a reasonable WAR, but a better fix would be to make `getOrCreateRange` a method of `ViewInterface`.

Differential Revision: https://reviews.llvm.org/D90991
-
Nicolas Vasilache authored
Differential Revision: https://reviews.llvm.org/D91956
-
Nicolas Vasilache authored
This revision refactors code used in various Linalg transformations and makes it a first-class citizen of the LinalgStructureOpInterface. This is in preparation for allowing more advanced Linalg behavior, but is otherwise NFC.

Differential revision: https://reviews.llvm.org/D91863
-
- Nov 21, 2020
Aart Bik authored
Adds tests for full sum reduction (tensors summed up into scalars) and the well-known sampled dense-dense matrix product. Refines the optimization rules slightly to handle the summation better.

Reviewed By: penpornk

Differential Revision: https://reviews.llvm.org/D91818
-
- Nov 20, 2020
Thomas Raoux authored
Add a transformation to forward a transfer_write into a transfer_read operation, and to remove a dead transfer_write when it is overwritten before being read.

Differential Revision: https://reviews.llvm.org/D91321
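A minimal sketch of the forwarding, assuming the write and read use the same buffer and indices with no interleaved write (values and shapes here are illustrative):

```mlir
// Before: the vector stored by transfer_write is read straight back.
vector.transfer_write %v, %buf[%c0, %c0] : vector<4xf32>, memref<4x4xf32>
%r = vector.transfer_read %buf[%c0, %c0], %pad : memref<4x4xf32>, vector<4xf32>
// After the transformation, uses of %r are replaced by %v; if %buf has
// no other readers, the now-dead transfer_write can be erased as well.
```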
-
Mikhail Goncharov authored
This reverts commit f8284d21.

Revert "[mlir][Linalg] NFC: Expose some utility functions used for promotion."
This reverts commit 0c59f515.

Revert "Remove unused isZero function"
This reverts commit 0f9f0a40.

Commit f8284d21 led to multiple failures in IREE compilation.
-
Geoffrey Martin-Noble authored
Unused since https://reviews.llvm.org/D91503 and triggering -Wunused-function.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D91838
-
MaheshRavishankar authored
Exposing some utility functions from Linalg to allow for promotion of fused views outside of the core tile+fuse logic. This is an alternative to patch D91322, which adds the promotion logic to the tileAndFuse method. The downside of that approach is that it is not easily customizable based on needs.

Differential Revision: https://reviews.llvm.org/D91503
-
MaheshRavishankar authored
Enhance the tile+fuse logic to allow fusing a sequence of operations.

Differential Revision: https://reviews.llvm.org/D90991
-
MaheshRavishankar authored
Differential Revision: https://reviews.llvm.org/D91749
-
- Nov 19, 2020
River Riddle authored
* Move ops to BuiltinOps.h
* Add file comments
-
Lei Zhang authored
This commit starts a new pass and patterns for converting Linalg named ops to generic ops. This enables us to leverage the flexibility of generic ops during transformations. Right now only linalg.conv is supported; others will be added when useful.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D91357
-
Aart Bik authored
Rationale: make sure preconditions are already tested during verification. Currently, the only way a sparse rewriting rule can fail is if (1) the linalg op does not have sparse annotations, or (2) a yet-to-be-handled operation is encountered inside the op.

Reviewed By: penpornk

Differential Revision: https://reviews.llvm.org/D91748
-
- Nov 18, 2020
MaheshRavishankar authored
Differential Revision: https://reviews.llvm.org/D91502
-
- Nov 17, 2020
Aart Bik authored
As discussed in https://llvm.discourse.group/t/mlir-support-for-sparse-tensors/2020, this CL is the start of sparse tensor compiler support in MLIR. Starting with a "dense" kernel expressed in the Linalg dialect together with per-dimension sparsity annotations on the tensors, the compiler automatically lowers the kernel to sparse code using the methods described in Fredrik Kjolstad's thesis.

Many details are still TBD. For example, the sparse "bufferization" is purely done locally, since we don't have a global solution for propagating sparsity yet. Furthermore, code to input and output the sparse tensors is missing. Nevertheless, with some hand modifications, the generated MLIR can already be easily converted into runnable code.

Reviewed By: nicolasvasilache, ftynse

Differential Revision: https://reviews.llvm.org/D90994
-
Stephan Herhut authored
This enables the use of fusion on buffers in partially lowered programs.

Differential Revision: https://reviews.llvm.org/D91613
-
River Riddle authored
These includes have been deprecated in favor of BuiltinDialect.h, which contains the definitions of ModuleOp and FuncOp.

Differential Revision: https://reviews.llvm.org/D91572
-
- Nov 16, 2020
Nicolas Vasilache authored
scf.parallel is currently not a good fit for tiling on tensors. Instead, provide a path to parallelism directly through scf.for. For now, this transformation ignores the distribution scheme and always does a block-cyclic mapping (where block is the tile size).

Differential revision: https://reviews.llvm.org/D90475
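A hedged sketch of the shape of the generated loop, with hypothetical names and a single tiled dimension (%lb, %ub, the tile size %ts, and the tensors are placeholders, not output of the patch):

```mlir
// Each iteration processes one tile; a distribution scheme would assign
// iterations to processors block-cyclically (block = tile size).
%res = scf.for %i = %lb to %ub step %ts
    iter_args(%acc = %init) -> (tensor<?xf32>) {
  %tile = subtensor %t[%i] [%ts] [1] : tensor<?xf32> to tensor<?xf32>
  // ... compute on %tile, then insert the result back ...
  %upd = subtensor_insert %tile into %acc[%i] [%ts] [1]
      : tensor<?xf32> into tensor<?xf32>
  scf.yield %upd : tensor<?xf32>
}
```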
-
- Nov 14, 2020
Aart Bik authored
Motivated by a refactoring in the new sparse code (yet to be merged), this avoids some lengthy code duplication.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D91465
-
Sean Silva authored
That way, it runs in parallel across functions.
-
- Nov 13, 2020
MaheshRavishankar authored
Using LinalgOp will reduce the repeated Operation <-> LinalgOp conversions.

Differential Revision: https://reviews.llvm.org/D91101
-
River Riddle authored
[mlir][Interfaces] Add implicit casts from concrete operation types to the interfaces they implement.

This removes the need to have an explicit `cast<>`, given that we always know it is an instance of the interface.

Differential Revision: https://reviews.llvm.org/D91304
-
- Nov 12, 2020
Sean Silva authored
We lower them to a std.global_memref (uniqued by constant value) + a std.get_global_memref to produce the corresponding memref value. This allows removing Linalg's somewhat hacky lowering of tensor constants, now that std properly supports this.

Differential Revision: https://reviews.llvm.org/D91306
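A rough sketch of this lowering (the symbol name and shapes are illustrative, and the exact attribute placement is approximated from the std dialect of that era):

```mlir
// Before: %cst = constant dense<[1.0, 2.0]> : tensor<2xf32>
// After: a uniqued constant global plus a lookup at the use site.
global_memref "private" constant @__constant_2xf32
    : memref<2xf32> = dense<[1.0, 2.0]>

func @use() -> memref<2xf32> {
  %0 = get_global_memref @__constant_2xf32 : memref<2xf32>
  return %0 : memref<2xf32>
}
```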
-
Sean Silva authored
It was incorrect in the presence of a tensor argument with multiple uses. The bufferization of subtensor_insert was writing into a converted memref operand, but there is no guarantee that the converted memref for that operand is safe to write into. In this case, the same converted memref is written to in-place by the subtensor_insert bufferization, violating the tensor-level semantics.

I left some comments in a TODO about ways forward on this. I will be working actively on this problem in the coming days.

Differential Revision: https://reviews.llvm.org/D91371
-
MaheshRavishankar authored
This change does two main things:

1) An operation might have multiple dependences to the same producer. Not tracking them correctly can result in incorrect code generation with fusion. To rectify this, the dependence tracking needs to also include the operand number in the consumer.
2) Improve the logic used to find the fused loops, making it easier to follow.

The only constraint for fusion is that linalg ops (on buffers) have update semantics for the result. Fusion should be such that each iteration of the fused loop (which is also a tiled loop) touches only one (disjoint) tile of the output. This could be relaxed by allowing for recomputation, which is the default when operands are tensors, or can be made legal with promotion of the fused view (in the future).

Differential Revision: https://reviews.llvm.org/D90579
-
Aart Bik authored
This CL integrates the new sparse annotations (heretofore merely added as fully transparent attributes) more tightly with the generic linalg op, in order to add verification of the annotations' consistency as well as to make other passes more aware of their presence (in the long run, rewriting rules must preserve the integrity of the annotations).

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D91224
-
- Nov 10, 2020
Sean Silva authored
This patch converts elementwise ops on tensors to linalg.generic ops with the same elementwise op in the payload (except rewritten to operate on scalars, obviously). This is a great form for later fusion to clean up. E.g.

```
// Compute: %arg0 + %arg1 - %arg2
func @f(%arg0: tensor<?xf32>, %arg1: tensor<?xf32>, %arg2: tensor<?xf32>) -> tensor<?xf32> {
  %0 = addf %arg0, %arg1 : tensor<?xf32>
  %1 = subf %0, %arg2 : tensor<?xf32>
  return %1 : tensor<?xf32>
}
```

Running this through `mlir-opt -convert-std-to-linalg -linalg-fusion-for-tensor-ops` we get:

```
func @f(%arg0: tensor<?xf32>, %arg1: tensor<?xf32>, %arg2: tensor<?xf32>) -> tensor<?xf32> {
  %0 = linalg.generic {indexing_maps = [#map0, #map0, #map0, #map0], iterator_types = ["parallel"]}
      ins(%arg0, %arg1, %arg2 : tensor<?xf32>, tensor<?xf32>, tensor<?xf32>) {
  ^bb0(%arg3: f32, %arg4: f32, %arg5: f32):  // no predecessors
    %1 = addf %arg3, %arg4 : f32
    %2 = subf %1, %arg5 : f32
    linalg.yield %2 : f32
  } -> tensor<?xf32>
  return %0 : tensor<?xf32>
}
```

So the elementwise ops on tensors have nicely collapsed into a single linalg.generic, which is the form we want for further transformations.

Differential Revision: https://reviews.llvm.org/D90354
-
- Nov 09, 2020
Nicolas Vasilache authored
This revision adds support for bufferization by using a mix of `tensor_load`, `subview`, `linalg.copy` and `tensor_to_memref`.
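A minimal sketch of how these four ops might compose during such a bufferization (all SSA names, shapes, and the layout map are illustrative, not taken from the revision):

```mlir
// Materialize a tensor in a buffer, copy it into a view of the
// destination buffer, and read the overall result back as a tensor.
%src  = tensor_to_memref %t : memref<?xf32>
%view = subview %dst[%off] [%sz] [1] : memref<?xf32> to memref<?xf32, #map>
linalg.copy(%src, %view) : memref<?xf32>, memref<?xf32, #map>
%res  = tensor_load %dst : memref<?xf32>
```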
-
- Nov 06, 2020
Sean Silva authored
I ran into this pattern when converting elementwise ops like `addf %arg0, %arg : tensor<?xf32>` to linalg. Redundant arguments can also easily arise from linalg-fusion-for-tensor-ops.

Also, fix some small bugs in the logic in LinalgStructuredOpsInterface.td.

Differential Revision: https://reviews.llvm.org/D90812
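A sketch of the redundant-argument pattern (the maps and types are illustrative): a linalg.generic that takes the same value twice can drop the duplicate operand and reuse a single block argument:

```mlir
// Before: %t is passed twice, producing two identical block arguments.
%0 = linalg.generic {indexing_maps = [#map, #map, #map],
                     iterator_types = ["parallel"]}
    ins(%t, %t : tensor<?xf32>, tensor<?xf32>) {
^bb0(%a: f32, %b: f32):
  %s = addf %a, %b : f32
  linalg.yield %s : f32
} -> tensor<?xf32>
// After canonicalization (sketch): a single input %t, with one block
// argument %a used in place of both %a and %b.
```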
-
Mehdi Amini authored
-
- Nov 05, 2020
Nicolas Vasilache authored
The LinalgDependenceGraph and alias analysis provide the necessary analysis for the Linalg fusion on buffers case. However, this is not enough for linalg on tensors, which require proper memory effects to play nicely with DCE and other transformations. This revision adds side effects to Linalg ops that were previously missing, and has two consequences:

1. one example in the copy removal pass now fails, since the linalg.generic op has side effects and the pass does not perform alias analysis / distinguish between reads and writes.
2. a few examples in fusion-tensor.mlir need to return the resulting tensor, otherwise DCE automatically kicks in as part of greedy pattern application.

Differential Revision: https://reviews.llvm.org/D90762
-