Skip to content
Unverified Commit 71bdd2c2 authored by Rohan Yadav's avatar Rohan Yadav Committed by GitHub
Browse files

mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin (#65779)

This commit adjusts the CUDA context management in the SerializeToCubin
pass. In particular, it uses the device 0 primary context instead of
creating a new CUDA context on each invocation of SerializeToCubin. This
yields very large improvements in compile time, especially if an
application (like a JIT compiler) is calling SerializeToCubin
repeatedly.

Differential Revision: https://reviews.llvm.org/D159487



Co-authored-by: default avatarRohan Yadav <rohany@cs.stanford.edu>
parent 6a2f68be
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment