Commit b8676da1 authored by Christian Sigg, committed by A. Unique TensorFlower

Outline GPU kernel function into a nested module.

Roll forward of commit 5684a124.

When outlining GPU kernels, put the kernel function inside a nested module. Then use a nested pipeline to generate the cubins, independently per kernel. In a final pass, move the cubins back to the parent module.
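
The three steps described above can be illustrated with a toy model in plain Python. This is only a sketch of the data flow, not the MLIR implementation: the `Module` class, the function names, the `cubin` attribute key, and `fake_compile` are all hypothetical stand-ins, and the real passes operate on MLIR operations rather than dictionaries. The commit message does not say whether the nested modules are kept after the final pass; the sketch leaves them in place.

```python
from dataclasses import dataclass, field


@dataclass
class Module:
    """Hypothetical stand-in for an IR module (not an MLIR class)."""
    name: str
    functions: dict = field(default_factory=dict)   # function name -> body
    kernels: set = field(default_factory=set)       # names of kernel functions
    nested: list = field(default_factory=list)      # nested Module objects
    attributes: dict = field(default_factory=dict)  # e.g. compiled blobs


def outline_kernels(parent):
    """Step 1: move each kernel function into its own nested module."""
    for name in sorted(parent.kernels):
        body = parent.functions.pop(name)
        parent.nested.append(
            Module(name=name + "_kernel", functions={name: body}, kernels={name})
        )
    parent.kernels.clear()


def compile_nested(parent, compile_fn):
    """Step 2: run the per-kernel pipeline on each nested module independently."""
    for module in parent.nested:
        (name, body), = module.functions.items()
        module.attributes["cubin"] = compile_fn(name, body)


def move_cubins_to_parent(parent):
    """Step 3: move each compiled blob from its nested module to the parent."""
    for module in parent.nested:
        (name,) = module.functions
        parent.attributes[name + "_cubin"] = module.attributes.pop("cubin")


def fake_compile(name, body):
    """Placeholder for the real cubin generation; returns dummy bytes."""
    return ("cubin:" + name).encode()


parent = Module(
    name="parent",
    functions={"main": "host code", "saxpy": "device code"},
    kernels={"saxpy"},
)
outline_kernels(parent)
compile_nested(parent, fake_compile)
move_cubins_to_parent(parent)
```

Because each nested module holds exactly one kernel in this model, step 2 never needs to look at the parent or at sibling kernels, which is the independence the commit message refers to.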

PiperOrigin-RevId: 270639748
parent c900d499