- Jan 22, 2021
-
-
MaheshRavishankar authored
Fusion by expansion of generic/indexed_generic operations with tensor_reshape ops is disabled when the reshape only adds/removes unit dimensions, since such fusion merely introduces unit-trip-count loops. Differential Revision: https://reviews.llvm.org/D94626
-
MaheshRavishankar authored
Differential Revision: https://reviews.llvm.org/D93086
-
MaheshRavishankar authored
With Linalg on tensors, the dependence between operations can be from the result of the producer to the consumer. This change is an NFC refactoring of LinalgDependenceGraphElem to allow representing both OpResult and OpOperand*, i.e. a dependence from a producer result to a consumer. Differential Revision: https://reviews.llvm.org/D95208
-
Lei Zhang authored
spv.Ordered/spv.Unordered are meant for the OpenCL Kernel capability. For the Vulkan Shader capability, we should use spv.IsNan to check whether a number is NaN. Add a new pattern for converting `std.cmpf ord|uno` to spv.IsNan and bump the pattern converting to spv.Ordered/spv.Unordered to a higher benefit. The SPIR-V target environment will properly select between these two patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95237
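As a rough illustration (a sketch only; the SSA names and the exact shape of the pattern output are assumptions, and `cmpf` is written in the quoted-predicate syntax of this era):

```mlir
// std input: true iff %a and %b compare unordered, i.e. either is NaN.
%0 = cmpf "uno", %a, %b : f32

// Sketch of the converted form for Vulkan Shader:
%nan_a = spv.IsNan %a : f32
%nan_b = spv.IsNan %b : f32
%1 = spv.LogicalOr %nan_a, %nan_b : i1
```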
-
Hanhan Wang authored
- Extend spirv::ConstantOp::getZero/One to handle float, vector of int, and vector of float.
- Refactor ZeroExtendI1Pattern to use getZero/One methods.
- Add one more test for lowering std.zexti which extends vector<4xi1> to vector<4xi64>.

Reviewed By: antiagainst
Differential Revision: https://reviews.llvm.org/D95120
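A hedged sketch of what the new test case's lowering could look like (op spellings and the splat constants are assumptions; the zero/one splats are what the extended getZero/One helpers would produce):

```mlir
// std input: zero-extend a vector of i1 to a vector of i64.
%0 = zexti %arg0 : vector<4xi1> to vector<4xi64>

// Sketch of the lowered form: select elementwise between splat 1s and 0s.
%zero = spv.constant dense<0> : vector<4xi64>
%one  = spv.constant dense<1> : vector<4xi64>
%1 = spv.Select %arg0, %one, %zero : vector<4xi1>, vector<4xi64>
```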
-
Hanhan Wang authored
`linalg.pad_tensor` is an operation that pads the `source` tensor with given `low` and `high` padding config.

Example 1:
```mlir
%pad_value = ... : f32
%1 = linalg.pad_tensor %0 low[1, 2] high[2, 3] {
^bb0(%arg0 : index, %arg1 : index):
  linalg.yield %pad_value : f32
} : tensor<?x?xf32> to tensor<?x?xf32>
```

Example 2:
```mlir
%pad_value = ... : f32
%1 = linalg.pad_tensor %arg0 low[2, %arg1, 3, 3] high[3, 3, %arg1, 2] {
^bb0(%arg2: index, %arg3: index, %arg4: index, %arg5: index):
  linalg.yield %pad_value : f32
} : tensor<1x2x2x?xf32> to tensor<6x?x?x?xf32>
```

Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D93704
-
- Jan 21, 2021
-
-
River Riddle authored
A cast-like operation is one that converts from a set of input types to a set of output types. The arity of the inputs may be from 0-N, whereas the arity of the outputs may be anything from 1-N. Cast-like operations are removable in cases where they produce a "no-op", i.e. when the input types and output types match 1-1. Differential Revision: https://reviews.llvm.org/D94831
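A minimal example of the removable case (hypothetical IR; `tensor_cast` stands in here as a representative cast-like op):

```mlir
// Input and output types match 1-1, so the cast is a "no-op" and can be
// folded away, replacing all uses of %0 with %arg0.
%0 = tensor_cast %arg0 : tensor<4xf32> to tensor<4xf32>
```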
-
- Jan 20, 2021
-
-
Nicolas Vasilache authored
This may simplify the composition of patterns but is otherwise NFC.
-
Nicolas Vasilache authored
Like SubView, SubTensor/SubTensorInsertOp are allowed to have rank-reducing/expanding semantics. In the case of SubTensorInsertOp, the rank of offsets/sizes/strides should be the rank of the destination tensor. Also, add a builder flavor for SubTensorOp to return a rank-reduced tensor. Differential Revision: https://reviews.llvm.org/D95076
-
Nicolas Vasilache authored
-
Aart Bik authored
Use cases with 16- or even 8-bit pointer/index structures have been identified. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D95015
-
Jackson Fellows authored
Add a constant folder for DivFOp. Analogous to existing folders for AddFOp, SubFOp, and MulFOp. Matches the behavior of existing LLVM constant folding (https://github.com/llvm/llvm-project/blob/999f5da6b3088fa4c0bb9d05b358d015ca74c71f/llvm/lib/IR/ConstantFold.cpp#L1432). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94939
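For instance (an illustrative fold; op spellings follow the std dialect of this era):

```mlir
%c4 = constant 4.0 : f32
%c2 = constant 2.0 : f32
%0 = divf %c4, %c2 : f32
// With the new folder this is computed at compile time:
// %0 = constant 2.0 : f32
```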
-
- Jan 19, 2021
-
-
Sean Silva authored
- DynamicTensorFromElementsOp
- TensorFromElements

Differential Revision: https://reviews.llvm.org/D94994
-
Nicolas Vasilache authored
In prehistorical times, AffineApplyOp was allowed to produce multiple values. This allowed the creation of intricate SSA use-def chains. AffineApplyNormalizer was originally introduced as a means of reusing the AffineMap::compose method to write SSA use-def chains. Unfortunately, symbols that were produced by an AffineApplyOp needed to be promoted to dims and reordered for the mathematical composition to be valid. Since then, single result AffineApplyOp became the law of the land but the original assumptions were not revisited. This revision revisits these assumptions and retires AffineApplyNormalizer. Differential Revision: https://reviews.llvm.org/D94920
-
Mehdi Amini authored
-
- Jan 16, 2021
-
-
Thomas Raoux authored
-
Thomas Raoux authored
This allows using this helper outside of the linalg canonicalization. Differential Revision: https://reviews.llvm.org/D94826
-
- Jan 15, 2021
-
-
MaheshRavishankar authored
The operation is an identity if the values yielded by the operation are the arguments of the basic block of that operation. Add this missing check. Differential Revision: https://reviews.llvm.org/D94819
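For example, the following op yields a captured value rather than its block argument, so it is not an identity and must not be canonicalized away (hypothetical IR illustrating the check):

```mlir
#map = affine_map<(d0) -> (d0)>
%cst = constant 0.0 : f32
// The body yields %cst, not the block argument %in, so uses of %0
// must not be replaced with %arg0.
%0 = linalg.generic {indexing_maps = [#map, #map],
                     iterator_types = ["parallel"]}
    ins(%arg0 : tensor<?xf32>) outs(%init : tensor<?xf32>) {
^bb0(%in : f32, %out : f32):
  linalg.yield %cst : f32
} -> tensor<?xf32>
```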
-
Alexander Belyaev authored
Differential Revision: https://reviews.llvm.org/D94764
-
Valentin Clement authored
Add builtin f80 and f128 following @schweitz's proposal: https://llvm.discourse.group/t/rfc-adding-better-support-for-higher-precision-floating-point/2526/5 Reviewed By: ftynse, rriddle Differential Revision: https://reviews.llvm.org/D94737
-
Aart Bik authored
This is a very minor improvement during iteration graph construction. If the first attempt considering the dimension order of all tensors fails, a second attempt is made using the constraints of sparse tensors only. Dense tensors prefer dimension order (locality) but provide random access if needed, enabling the compilation of more sparse kernels. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D94709
-
MaheshRavishankar authored
With the recent changes to linalg-on-tensors semantics, tiling works out of the box for generic operations. Add a test to verify that, along with some minor refactoring. Differential Revision: https://reviews.llvm.org/D93077
-
MaheshRavishankar authored
Add a canonicalization that replaces a use of the result of a linalg operation on tensors in a dim operation with one of the operands of the linalg operation instead. This allows the linalg op itself to be deleted when all its non-dim uses are removed (say through tiling, etc.). Differential Revision: https://reviews.llvm.org/D93076
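A sketch of the rewrite (hypothetical IR; since the result of a linalg op on tensors has the same shape as its `outs` operand, the dim can be taken from that operand):

```mlir
#map = affine_map<(d0) -> (d0)>
%c0 = constant 0 : index
%0 = linalg.generic {indexing_maps = [#map, #map],
                     iterator_types = ["parallel"]}
    ins(%arg0 : tensor<?xf32>) outs(%init : tensor<?xf32>) {
^bb0(%in : f32, %out : f32):
  linalg.yield %in : f32
} -> tensor<?xf32>
%d = dim %0, %c0 : tensor<?xf32>
// Canonicalizes so the dim no longer keeps the linalg op alive:
// %d = dim %init, %c0 : tensor<?xf32>
```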
-
- Jan 14, 2021
-
-
MaheshRavishankar authored
linalg.generic/indexed_generic operations on tensors whose body is just yielding the (non-induction variable) arguments of the operation can be canonicalized by replacing uses of the result with the corresponding arguments. Differential Revision: https://reviews.llvm.org/D94581
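A sketch of the pattern being canonicalized (hypothetical IR under assumed shapes):

```mlir
#map = affine_map<(d0) -> (d0)>
// The body just yields the input block argument, so every use of %0
// can be replaced with %arg0 and the op then removed.
%0 = linalg.generic {indexing_maps = [#map, #map],
                     iterator_types = ["parallel"]}
    ins(%arg0 : tensor<?xf32>) outs(%init : tensor<?xf32>) {
^bb0(%in : f32, %out : f32):
  linalg.yield %in : f32
} -> tensor<?xf32>
```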
-
- Jan 13, 2021
-
-
Aart Bik authored
Similar to the parallelization strategies, the vectorization strategies provide control over which loops should be vectorized. Unlike the parallelization strategies, only innermost loops are considered, but reductions are included, with control over whether to vectorize dense loops only or both dense and sparse loops. The vectorized loops are always guarded by a vector mask to avoid overrunning the iterations, but subsequent vector operation folding removes redundant masks and replaces the operations with more efficient counterparts. Similarly, we rely on subsequent loop optimizations to further optimize masking, e.g. using an unconditional full vector loop and a scalar cleanup loop. The current strategy already demonstrates a nice interaction between the sparse compiler and all prior optimizations that went into the vector dialect. Ongoing discussion at: https://llvm.discourse.group/t/mlir-support-for-sparse-tensors/2020/10 Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D94551
-
Tres Popp authored
This corrects the last two issues caught by tests when dialect conversion rollbacks occur. Differential Revision: https://reviews.llvm.org/D94623
-
David Blaikie authored
-
- Jan 12, 2021
-
-
Nicolas Vasilache authored
This revision uniformizes fusion APIs to allow passing OpOperand and OpResult, and adds a finer level of control over fusion. Differential Revision: https://reviews.llvm.org/D94493
-
Alex Zinenko authored
Continue the convergence between LLVM dialect and built-in types by using the built-in vector type whenever possible, that is, for fixed vectors of built-in integers and built-in floats. The LLVM dialect vector type is still in use for pointers, less frequent floating point types that do not have a built-in equivalent, and scalable vectors. However, the top-level `LLVMVectorType` class has been removed in favor of free functions capable of inspecting both built-in and LLVM dialect vector types: `LLVM::getVectorElementType`, `LLVM::getNumVectorElements` and `LLVM::getFixedVectorType`. Additional work is necessary to design and implement the extensions to built-in types so as to remove `LLVMFixedVectorType` entirely. Note that the default output format for the built-in vectors does not have whitespace around the `x` separator, e.g., `vector<4xf32>`, as opposed to the LLVM dialect vector type format that does, e.g., `!llvm.vec<4 x fp128>`. This required changing the FileCheck patterns in several tests. Reviewed By: mehdi_amini, silvas Differential Revision: https://reviews.llvm.org/D94405
-
Rob Suderman authored
The getDynOperands behavior is commonly used in a number of passes. Refactor it into a helper function to avoid code duplication. Differential Revision: https://reviews.llvm.org/D94340
-
- Jan 11, 2021
-
-
Aart Bik authored
This ensures the memref base + indices expression is well-formed. Reviewed By: ThomasRaoux, ftynse Differential Revision: https://reviews.llvm.org/D94441
-
MaheshRavishankar authored
When fusing tensor_reshape ops with generic/indexed_generic ops, new linalg.init_tensor operations were created for the `outs` of the fused op. While technically correct, it is better to just reshape the original `outs` operands and rely on the init_tensor -> tensor_reshape canonicalization to achieve the same effect. Differential Revision: https://reviews.llvm.org/D93774
-
Thomas Raoux authored
This allows more accurate modeling of the side effects and lets dead code elimination remove dead transfer ops. Differential Revision: https://reviews.llvm.org/D94318
-
MaheshRavishankar authored
Reshaping an init_tensor can be folded to an init_tensor op of the final type. Differential Revision: https://reviews.llvm.org/D93773
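For example (illustrative shapes and reassociation maps, in the tensor_reshape syntax of this era):

```mlir
%0 = linalg.init_tensor [6, 4] : tensor<6x4xf32>
%1 = linalg.tensor_reshape %0 [affine_map<(d0, d1, d2) -> (d0, d1)>,
                               affine_map<(d0, d1, d2) -> (d2)>]
    : tensor<6x4xf32> into tensor<2x3x4xf32>
// Folds to an init_tensor of the final type:
// %1 = linalg.init_tensor [2, 3, 4] : tensor<2x3x4xf32>
```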
-
Lei Zhang authored
Linalg ops are perfect loop nests. When materializing the concrete loop nest, the default order specified by the Linalg op's iterators may not be the best for further CodeGen: targets frequently need to plan the loop order in order to gain better data access. And different targets can have different preferences. So there should exist a way to control the order. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D91795
-
Christian Sigg authored
Do not cache gpu.async.token type so that the pass can be created before the GPU dialect is registered. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D94397
-
ergawy authored
This commit adds support for (de-)serializing SpecConstantOperation. One thing worth noting is that during deserialization we assign a fake ID to the ops enclosed inside SpecConstantOperation. We need to do this for the deserialization logic to properly update the ID-to-value map and to later reference the created value from the sibling `spv::YieldOp`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D93591
-
- Jan 10, 2021
-
-
Nicolas Vasilache authored
This assertion is an old remnant from earlier days when only affine functions existed. It is not the place of affine map composition to check orthogonal considerations such as what is allowed to be a symbol under the AffineScope trait.
-
- Jan 09, 2021
-
-
Aart Bik authored
This change makes the scatter/gather syntax more consistent with the syntax of all the other memory operations in the Vector dialect (order of types, use of [] for index, etc.). This will make the MLIR code easier to read. In addition, the pass_thru parameter of the gather has been made mandatory (there is very little benefit in using the implicit "undefined" values). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D94352
-
Lei Zhang authored
The dialect conversion framework was enhanced to handle type conversion automatically. OpConversionPattern already contains a pointer to the TypeConverter. There is no need to duplicate it in a separate subclass. This removes the only reason for a SPIRVOpLowering subclass. It adapts to use core infrastructure and simplifies the code. Also added a utility function to OpConversionPattern for getting TypeConverter as a certain subclass. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D94080
-