Commit 39cdefb5, authored by Guray Ozen, committed by GitHub

[mlir][nvvm] Add prefetch.tensormap (#67564)

This PR adds the `prefetch.tensormap` Op. It prefetches the cache line
containing the given TMA descriptor for subsequent use by the
`cp.async.bulk.tensor` instruction.
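
For illustration only, here is a minimal CUDA sketch of the underlying PTX
instruction this Op is meant to map to. It is not code from this PR: the
helper name `prefetch_tensormap` and the `TMAP_DEVICE` macro are hypothetical,
and the sketch assumes the generic-address form `prefetch.tensormap [addr];`,
which needs an sm_90 target. On the host, or on older architectures, the
helper compiles to a no-op.

```cuda
#include <cstdint>

// Hypothetical macro: lets the same helper build under nvcc and a plain C++ compiler.
#if defined(__CUDACC__)
#define TMAP_DEVICE __host__ __device__ __forceinline__
#else
#define TMAP_DEVICE inline
#endif

// Hypothetical helper: hints that the cache line holding the TMA descriptor
// at `desc` should be fetched before a later cp.async.bulk.tensor uses it.
TMAP_DEVICE void prefetch_tensormap(const void* desc) {
#if defined(__CUDA_ARCH__) && (__CUDA_ARCH__ >= 900)
  // Generic-address form of the PTX prefetch; the descriptor address is
  // passed as a 64-bit register operand.
  std::uint64_t addr = reinterpret_cast<std::uint64_t>(desc);
  asm volatile("prefetch.tensormap [%0];" : : "l"(addr) : "memory");
#else
  // Host pass or pre-sm_90 device pass: nothing to prefetch.
  (void)desc;
#endif
}
```

A kernel would typically call such a helper once, from a single thread, before
issuing its first tensor copy. The prefetch is only a performance hint and
does not change program results.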


https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-prefetch-prefetchu
parent f2898def