Unverified Commit 39cdefb5 authored Oct 17, 2023 by Guray Ozen Committed by GitHub Oct 17, 2023

[mlir][nvvm] Add prefetch.tensormap (#67564)

This PR adds `prefetch.tensormap` Op. It brings the cache line
containing the given tma descriptor for subsequent use by the
cp.async.bulk.tensor instruction.

https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-prefetch-prefetchu

parent f2898def

Show whitespace changes

Inline Side-by-side

Please to comment