Skip to content
Commit f68ac464 authored by Christian Sigg's avatar Christian Sigg Committed by A. Unique TensorFlower
Browse files

Switch from shfl.bfly to shfl.down.

Both work for the current use case, but the latter allows implementing
prefix sums and is a little easier to understand for partial warps.

PiperOrigin-RevId: 285145287
parent 851a8516
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment