Skip to content
Unverified Commit 460840c0 authored by Joseph Huber's avatar Joseph Huber Committed by GitHub
Browse files

[OpenMP] Support 'omp_get_num_procs' on the device (#65501)

Summary:
The `omp_get_num_procs()` function should return the amount of
parallelism availible. On the GPU, this was not defined. We have elected
to define this function as the maximum amount of wavefronts / warps that
can be simultaneously resident on the device. For AMDGPU this is the
number of CUs multiplied byth CU's per wave. For NVPTX this is the
maximum threads per SM divided by the warp size and multiplied by the
number of SMs.
parent 6f387135
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment