[mlir][NVGPU] Support cache all (.ca) in nvgpu.device_async_copy
This patch adds support for cache all (.ca) in conversion from nvgpu-to-nvvm for inline asm `cp.async`. For sizes other than 16 bytes cp.async cache global is not allowed and cache all is required to generate a valid ptx. Differential revision: https://reviews.llvm.org/D148604 Authored-by:Manish Gupta <manigupta@google.com>
Loading
Please sign in to comment