[mlir][gpu] Handle async in gpu.launch_func lowering.
For the synchronous case, destroy the stream after synchronization. Sneak in a unrelated change to report why the gpu.wait conversion pattern didn't match. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D89933
Loading
Please sign in to comment