- Jan 22, 2021
-
-
MaheshRavishankar authored
Fusion of generic/indexed_generic operations with tensor_reshape by expansion is disabled when the latter just adds/removes unit dimensions, since expanding in that case only introduces unit-trip-count loops. Differential Revision: https://reviews.llvm.org/D94626
-
River Riddle authored
This revision adds support for using either operand or result types to anchor an optional group. It also removes the arbitrary restriction that type directives must refer to variables in the same group, which is overly limiting for a declarative format syntax. Fixes PR#48784 Differential Revision: https://reviews.llvm.org/D95109
-
Jacques Pienaar authored
Differential Revision: https://reviews.llvm.org/D95201
-
MaheshRavishankar authored
Differential Revision: https://reviews.llvm.org/D93086
-
MaheshRavishankar authored
Allow LinalgDependenceGraphElem to represent a dependence from a producer result to a consumer. With Linalg on tensors, the dependence between operations can be from the result of the producer to the consumer. This change just does an NFC refactoring of the LinalgDependenceGraphElem to allow representing both OpResult and OpOperand*. Differential Revision: https://reviews.llvm.org/D95208
-
Lei Zhang authored
spv.Ordered/spv.Unordered are meant for the OpenCL Kernel capability. For the Vulkan Shader capability, we should use spv.IsNan to check whether a number is NaN. Add a new pattern for converting `std.cmpf ord|uno` to spv.IsNan and bump the pattern converting to spv.Ordered/spv.Unordered to a higher benefit. The SPIR-V target environment will properly select between these two patterns. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95237
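For illustration only (not part of the commit message), a rough sketch of how an `ord` comparison could be expanded for Vulkan using `spv.IsNan`; the value names are hypothetical and the exact pattern in the patch may differ:
```mlir
// `ord` is true iff neither operand is NaN; `uno` would be the LogicalOr alone.
%lhs_nan = spv.IsNan %lhs : f32
%rhs_nan = spv.IsNan %rhs : f32
%any_nan = spv.LogicalOr %lhs_nan, %rhs_nan : i1
%ord     = spv.LogicalNot %any_nan : i1
```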
-
Lei Zhang authored
Previously we only auto-generated the availability for ops that directly instantiate `SPV_Op` and expected other subclasses of `SPV_Op` to define aggregated availability for all their ops. This is quite error-prone and we can miss capabilities for certain ops. It is also arguable whether we should have multiple levels of subclasses and try to deduplicate too much: having the availability directly on the op is quite explicit and clear, and a few extra lines of declarative code are fine. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D95236
-
Eugene Zhulenev authored
This PR only adds the coro intrinsics needed for the Async to LLVM lowering. Other intrinsics will be added as needed in follow-up PRs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95143
-
Hanhan Wang authored
- Fix argument names for subview and subtensor. - Fix a typo in a comment on subtensor's method. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D95211
-
Arjun P authored
[MLIR] Add support for extracting an integer sample point (if one exists) from an unbounded FlatAffineConstraints. With this, we have complete support for finding integer sample points in FlatAffineConstraints. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95047
-
Hanhan Wang authored
- Extend spirv::ConstantOp::getZero/One to handle float, vector of int, and vector of float. - Refactor ZeroExtendI1Pattern to use getZero/One methods. - Add one more test for lowering std.zexti which extends vector<4xi1> to vector<4xi64>. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D95120
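As a quick illustration (not taken from the patch; the function name is made up), the kind of vector `std.zexti` op the new test exercises. The comment sketches how the extended getZero/getOne helpers come into play on the SPIR-V side:
```mlir
func @zext_i1_vector(%cond : vector<4xi1>) -> vector<4xi64> {
  // Zero-extend a vector of i1 to a vector of i64; the SPIR-V lowering
  // selects between all-zeros and all-ones splat constants, which can now be
  // built with spirv::ConstantOp::getZero/getOne for vector types.
  %ext = zexti %cond : vector<4xi1> to vector<4xi64>
  return %ext : vector<4xi64>
}
```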
-
Hanhan Wang authored
`linalg.pad_tensor` is an operation that pads the `source` tensor with given `low` and `high` padding config.

Example 1:
```mlir
%pad_value = ... : f32
%1 = linalg.pad_tensor %0 low[1, 2] high[2, 3] {
  ^bb0(%arg0 : index, %arg1 : index):
    linalg.yield %pad_value : f32
} : tensor<?x?xf32> to tensor<?x?xf32>
```

Example 2:
```mlir
%pad_value = ... : f32
%1 = linalg.pad_tensor %arg0 low[2, %arg1, 3, 3] high[3, 3, %arg1, 2] {
  ^bb0(%arg2: index, %arg3: index, %arg4: index, %arg5: index):
    linalg.yield %pad_value : f32
} : tensor<1x2x2x?xf32> to tensor<6x?x?x?xf32>
```

Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D93704
-
Jacques Pienaar authored
Add factory to create streams for logging the reproducer. Allows for more general logging (beyond file) and logging the configuration/module separately (logged in order, configuration before module). Also enable querying filename of ToolOutputFile. Differential Revision: https://reviews.llvm.org/D94868
-
mikeurbach authored
This extracts the implementation of getType, setType, and getBody from FunctionSupport.h into the mlir::impl namespace and defines them generically in FunctionSupport.cpp. This allows them to be used elsewhere for any FunctionLike ops that use FunctionType for their type signature. Using the new helpers, FuncOpSignatureConversion is generalized to work with all such FunctionLike ops. Convenience helpers are added to configure the pattern for a given concrete FunctionLike op type. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95021
-
- Jan 21, 2021
-
-
Mehdi Amini authored
This includes some minor customization for FuncOp and ModuleOp. Differential Revision: https://reviews.llvm.org/D95022
-
Christian Sigg authored
There are cmake failures that I do not know how to fix. Differential Revision: https://reviews.llvm.org/D95162
-
Christian Sigg authored
Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D95123
-
MaheshRavishankar authored
Define OrderedOp and UnorderedOp instructions in SPIR-V and convert cmpf operations with the `ord` and `uno` tags to these instructions, respectively. Differential Revision: https://reviews.llvm.org/D95098
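Purely as an illustration (value names hypothetical, syntax approximate), roughly what the `ord`/`uno` comparisons map to with the new ops:
```mlir
// Ordered: true iff neither operand is NaN; Unordered: true iff either is NaN.
%is_ordered   = spv.Ordered %lhs, %rhs : f32
%is_unordered = spv.Unordered %lhs, %rhs : f32
```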
-
MaheshRavishankar authored
The SPIR-V spec uses OpSpecConstantOp. Using an inconsistent name makes the dialect generation scripts fail. Update to use the right operation name, and fix the auto generation scripts as well. Differential Revision: https://reviews.llvm.org/D95097
-
Frederik Gossen authored
Differential Revision: https://reviews.llvm.org/D95129
-
Frederik Gossen authored
Differential Revision: https://reviews.llvm.org/D95130
-
Alexander Belyaev authored
The `complex` dialect should be used instead. https://llvm.discourse.group/t/rfc-split-the-complex-dialect-from-std/2496/2 Differential Revision: https://reviews.llvm.org/D95077
-
River Riddle authored
I attempted to write a test case for this, but the situations in which the kind is used for RegionDirective and ResultsDirective have zero overlap, meaning that there isn't a situation in which sharing the kind creates a conflict. Differential Revision: https://reviews.llvm.org/D94988
-
mfehr authored
Having this function in a public scope is helpful to register dialects that are defined at runtime and thus need a runtime-defined TypeID. Also, a similar function in DialectRegistry, insert(TypeID, StringRef, ...), has a public scope. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95091
-
River Riddle authored
An `unrealized_conversion_cast` operation represents an unrealized conversion from one set of types to another and is used to enable the inter-mixing of different type systems. This operation should not be attributed any special representational or execution semantics, and is generally only intended to satisfy the temporary intermixing of type systems during the conversion of one type system to another. This operation was discussed in the following RFC (and ODM): https://llvm.discourse.group/t/open-meeting-1-14-dialect-conversion-and-type-conversion-the-question-of-cast-operations/ Differential Revision: https://reviews.llvm.org/D94832
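A hypothetical example of what such a cast can look like while two type systems coexist during conversion (the function name and `!foo.my_type` are made up for illustration):
```mlir
func @bridge(%value : !foo.my_type) -> i64 {
  // Temporarily bridge a not-yet-converted value into the target type system;
  // the cast is expected to fold away once both sides agree on types.
  %converted = unrealized_conversion_cast %value : !foo.my_type to i64
  return %converted : i64
}
```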
-
River Riddle authored
A cast-like operation is one that converts from a set of input types to a set of output types. The arity of the inputs may be anything from 0 to N, whereas the arity of the outputs may be anything from 1 to N. Cast-like operations are removable in cases where they produce a "no-op", i.e. when the input types and output types match 1-1. Differential Revision: https://reviews.llvm.org/D94831
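As a small illustrative sketch (not taken from the patch), the no-op case that can be removed generically, shown with a standard-dialect cast:
```mlir
// Source and result types match 1-1, so this cast is a no-op and can be
// folded away to just %arg.
%same = memref_cast %arg : memref<4xf32> to memref<4xf32>
```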
-
- Jan 20, 2021
-
-
Diego Caballero authored
This reverts commit 7dd19885. ASAN issue.
-
Aart Bik authored
Rationale: Since I made the argument that metadata helps with extra verification checks, I better actually do that ;-) Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D95072
-
Nicolas Vasilache authored
-
Eugene Zhulenev authored
A resumed coroutine can potentially deallocate the token/value/group and destroy the mutex before the std::unique_ptr destructor runs. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95037
-
Nicolas Vasilache authored
This may simplify the composition of patterns but is otherwise NFC.
-
Alexander Belyaev authored
-
Nicolas Vasilache authored
Like SubView, SubTensor/SubTensorInsertOp are allowed to have rank-reducing/expanding semantics. In the case of SubTensorInsertOp, the rank of offsets/sizes/strides should be the rank of the destination tensor. Also, add a builder flavor for SubTensorOp to return a rank-reduced tensor. Differential Revision: https://reviews.llvm.org/D95076
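For illustration (a hypothetical sketch, not taken from the patch), a rank-reducing `subtensor` that drops a unit leading dimension:
```mlir
func @rank_reducing(%t : tensor<8x16x4xf32>) -> tensor<16x4xf32> {
  // offsets [0, 0, 0], sizes [1, 16, 4], strides [1, 1, 1]: the unit-size
  // leading dimension is dropped, yielding a rank-reduced result tensor.
  %slice = subtensor %t[0, 0, 0] [1, 16, 4] [1, 1, 1]
    : tensor<8x16x4xf32> to tensor<16x4xf32>
  return %slice : tensor<16x4xf32>
}
```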
-
Nicolas Vasilache authored
-
Nicolas Vasilache authored
Also adds an isInput interface method.
-
Frederik Gossen authored
Differential Revision: https://reviews.llvm.org/D95041
-
Tobias Gysi authored
The patch adapts the rocm runtime wrapper due to subtle differences between the cuda and the rocm/hip runtime APIs. Reviewed By: csigg Differential Revision: https://reviews.llvm.org/D95027
-
Jacques Pienaar authored
-
Diego Caballero authored
This patch adds support for producer-consumer fusion scenarios with multiple producer stores to the AffineLoopFusion pass. The patch introduces some changes to the producer-consumer algorithm, including:
* For a given consumer loop, producer-consumer fusion iterates over its producer candidates until a fixed point is reached.
* Producer candidates are gathered beforehand for each iteration of the consumer loop and visited in reverse program order (not strictly guaranteed) to maximize the number of loops fused per iteration.
In general, these changes were needed to simplify the multi-store producer support and remove some of the workarounds that were introduced in the past to support more fusion cases under the single-store producer limitation. This patch also preserves the existing functionality of AffineLoopFusion with one minor change in behavior: producer-consumer fusion didn't fuse scenarios with escaping memrefs and multiple outgoing edges (from a single store). Multi-store producer scenarios will usually (always?) have multiple outgoing edges, so we couldn't fuse any with escaping memrefs, which would greatly limit the applicability of this new feature. Therefore, the patch enables fusion for these scenarios. Please see the modified tests for specific details. Reviewed By: andydavis1, bondhugula Differential Revision: https://reviews.llvm.org/D92876
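A minimal, hypothetical example (function, memref, and op names made up) of the kind of multi-store producer that the pass can now consider for producer-consumer fusion:
```mlir
func @multi_store_producer(%buf : memref<10xf32>, %c0 : f32, %c1 : f32) {
  // Producer nest: stores into %buf twice per iteration.
  affine.for %i = 0 to 10 {
    affine.store %c0, %buf[%i] : memref<10xf32>
    affine.store %c1, %buf[%i] : memref<10xf32>
  }
  // Consumer nest: reads the values produced above; with this patch the two
  // nests are fusion candidates despite the multiple producer stores.
  affine.for %i = 0 to 10 {
    %v = affine.load %buf[%i] : memref<10xf32>
    "test.use"(%v) : (f32) -> ()
  }
  return
}
```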
-
Christian Sigg authored
-