Commit 1364d268 authored by Jeroen Ketema

Implement mem_fence on ptx



PTX does not differentiate between read and write fences. Hence, these are
lowered to a mem_fence call. The mem_fence function compiles to the
“membar.cta” instruction, which commits all outstanding reads and writes
of a thread such that these become visible to all other threads in the same
CTA (i.e., work-group). The instruction does not differentiate between
global and local memory. Hence, the flags parameter is ignored, except
for deciding whether a “membar.cta” instruction should be issued at all.
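
The lowering described above can be sketched in plain C as a host-side model.
This is an illustration under stated assumptions, not the code from this
commit: the flag values, the counter membar_cta_count, and the stub
issue_membar_cta() are hypothetical stand-ins for the OpenCL flag definitions
and for whatever builtin the real device code uses to emit the fence.

```c
/* Stand-ins for the OpenCL fence flags; the values are assumed for illustration. */
typedef unsigned int cl_mem_fence_flags;
#define CLK_LOCAL_MEM_FENCE  0x1u
#define CLK_GLOBAL_MEM_FENCE 0x2u

/* Hypothetical counter modelling how many fence instructions were issued. */
static int membar_cta_count = 0;

/* Hypothetical stub standing in for the builtin that emits the fence. */
static void issue_membar_cta(void) { membar_cta_count++; }

/* The flags only decide whether a fence is issued at all; local and
 * global memory are not distinguished. */
void mem_fence(cl_mem_fence_flags flags) {
    if (flags & (CLK_LOCAL_MEM_FENCE | CLK_GLOBAL_MEM_FENCE))
        issue_membar_cta();
}

/* There is no separate read or write fence, so both forward to mem_fence. */
void read_mem_fence(cl_mem_fence_flags flags)  { mem_fence(flags); }
void write_mem_fence(cl_mem_fence_flags flags) { mem_fence(flags); }
```

In this model, a call with flags set to 0 issues no fence, while any
combination of the two flags issues exactly one.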

Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 315235
parent 492d7134