Unverified Commit 108380da authored Nov 16, 2023 by Guray Ozen Committed by GitHub Nov 16, 2023

[mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global.multicast` (#72429)

This PR introduce `cp.async.bulk.tensor.shared.cluster.global.multicast`
Op in NVVM dialect. It loads data using TMA data from global memory to
shared memory of multiple CTAs in the cluster.

It resolves #72368

parent 25d0f9fc

Show whitespace changes

Inline Side-by-side

Please to comment