Skip to content
Unverified Commit 108380da authored by Guray Ozen's avatar Guray Ozen Committed by GitHub
Browse files

[mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global.multicast` (#72429)

This PR introduce `cp.async.bulk.tensor.shared.cluster.global.multicast`
Op in NVVM dialect. It loads data using TMA data from global memory to
shared memory of multiple CTAs in the cluster.

It resolves #72368
parent 25d0f9fc
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment