Commit 28555793 authored Jul 17, 2023 by Guray Ozen

[mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global`

This work introduce `cp.async.bulk.tensor.shared.cluster.global` in NVVM dialect that executes load using TMA.

Depends on D155056

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D155060

parent 960ab522

Show whitespace changes

Inline Side-by-side

Please to comment