Skip to content
  1. Nov 04, 2021
  2. Nov 03, 2021
  3. Oct 30, 2021
  4. Oct 29, 2021
  5. Oct 28, 2021
  6. Oct 26, 2021
  7. Oct 25, 2021
  8. Oct 23, 2021
  9. Oct 21, 2021
    • Jon Chesterfield's avatar
      [libomptarget][DeviceRTL] Generalise and simplify cmakelists · a602c2b5
      Jon Chesterfield authored
      Step towards building the DeviceRTL for amdgpu.
      
      Mostly replaces cuda-specific toolchain finding logic with the
      generic logic currently found in the amdgpu deviceRTL cmake. Also
      deletes dead code and changes the default to build on systems
      without cuda installed, as the library doesn't use cuda and the
      amdgpu-only systems generally won't have cuda installed.
      
      Reviewed By: Meinersbur
      
      Differential Revision: https://reviews.llvm.org/D111983
      a602c2b5
  10. Oct 19, 2021
  11. Oct 18, 2021
  12. Oct 16, 2021
    • Shilei Tian's avatar
      [OpenMP][deviceRTLs] Fix wrong return value of `__kmpc_is_spmd_exec_mode` · 2c941fa2
      Shilei Tian authored
      D110279 introduced a bug to the device runtime. In `__kmpc_parallel_51`, we detect
      whether we are already in parallel region by `__kmpc_parallel_level() > __kmpc_is_spmd_exec_mode()`.
      It is based on the assumption that:
      - In SPMD mode, parallel level is initialized to 1.
      - In generic mode, parallel level is initialized to 0.
      - `__kmpc_is_spmd_exec_mode` returns `1` for SPMD mode, 0 otherwise.
      
      Because the return value type of `__kmpc_is_spmd_exec_mode` is `int8_t`, there
      was an implicit cast from `bool` to `int8_t`. We can make sure it is either 0 or
      1 since C++14. In D110279, the return value is the result of an `and` operation,
      which is 2 in SPMD mode. This breaks the assumption in `__kmpc_parallel_51`.
      
      Reviewed By: carlo.bertolli, dpalermo
      
      Differential Revision: https://reviews.llvm.org/D111905
      2c941fa2
  13. Oct 09, 2021
  14. Oct 08, 2021
Loading