Skip to content
  1. Oct 26, 2021
  2. Oct 25, 2021
  3. Oct 23, 2021
  4. Oct 21, 2021
    • Jon Chesterfield's avatar
      [libomptarget][DeviceRTL] Generalise and simplify cmakelists · a602c2b5
      Jon Chesterfield authored
      Step towards building the DeviceRTL for amdgpu.
      
      Mostly replaces cuda-specific toolchain finding logic with the
      generic logic currently found in the amdgpu deviceRTL cmake. Also
      deletes dead code and changes the default to build on systems
      without cuda installed, as the library doesn't use cuda and the
      amdgpu-only systems generally won't have cuda installed.
      
      Reviewed By: Meinersbur
      
      Differential Revision: https://reviews.llvm.org/D111983
      a602c2b5
  5. Oct 19, 2021
  6. Oct 18, 2021
  7. Oct 16, 2021
    • Shilei Tian's avatar
      [OpenMP][deviceRTLs] Fix wrong return value of `__kmpc_is_spmd_exec_mode` · 2c941fa2
      Shilei Tian authored
      D110279 introduced a bug to the device runtime. In `__kmpc_parallel_51`, we detect
      whether we are already in parallel region by `__kmpc_parallel_level() > __kmpc_is_spmd_exec_mode()`.
      It is based on the assumption that:
      - In SPMD mode, parallel level is initialized to 1.
      - In generic mode, parallel level is initialized to 0.
      - `__kmpc_is_spmd_exec_mode` returns `1` for SPMD mode, 0 otherwise.
      
      Because the return value type of `__kmpc_is_spmd_exec_mode` is `int8_t`, there
      was an implicit cast from `bool` to `int8_t`. We can make sure it is either 0 or
      1 since C++14. In D110279, the return value is the result of an `and` operation,
      which is 2 in SPMD mode. This breaks the assumption in `__kmpc_parallel_51`.
      
      Reviewed By: carlo.bertolli, dpalermo
      
      Differential Revision: https://reviews.llvm.org/D111905
      2c941fa2
  8. Oct 09, 2021
  9. Oct 08, 2021
  10. Oct 07, 2021
  11. Oct 04, 2021
    • Michał Górny's avatar
      [openmp] [elf_common] Fix linking against LLVM dylib · 0873b9be
      Michał Górny authored
      The hand-rolled linking logic in elf_common does not account for
      the possibility of using LLVM dylib rather than a dozen static
      libraries.  Since it does not seem to be easily convertible
      to add_llvm_library, just hand-roll support for LLVM_LINK_LLVM_DYLIB.
      This is necessary to support stand-alone builds against installed LLVM.
      
      Differential Revision: https://reviews.llvm.org/D111038
      0873b9be
  12. Oct 01, 2021
  13. Sep 30, 2021
  14. Sep 29, 2021
    • Dhruva Chakrabarti's avatar
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its... · 62262702
      Dhruva Chakrabarti authored
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its contents must not be accessed.
      
      Fixes: SWDEV-275232 (With contributions from Ammar Elwazir, Laurent Morichetti, and Tony Tye)
      
      The current code is racy. After the packet is submitted, the GPU will increment the read index. If this wraps around before the memory is read from it'll refer to a signal from an unrelated packet. Change avoids reading from the packet post-submission.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D110679
      62262702
  15. Sep 27, 2021
  16. Sep 26, 2021
  17. Sep 23, 2021
    • Joseph Huber's avatar
      [OpenMP] Fix data-race in new device RTL · d83ca624
      Joseph Huber authored
      This patch fixes a data-race observed when using the new device runtime
      library. The Internal control variable for the parallel level is read in
      the `__kmpc_parallel_51` function while it could potentially be written
      by other threads. This causes data corruption and will cause
      nondetermistic behaviour in the runtime. This patch fixes this by adding
      an explicit synchronization before the region starts.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D110366
      d83ca624
  18. Sep 22, 2021
    • Shilei Tian's avatar
      [OpenMP][Offloading] Change `bool IsSPMD` to `int8_t Mode` in... · 423d34f7
      Shilei Tian authored
      [OpenMP][Offloading] Change `bool IsSPMD` to `int8_t Mode` in `__kmpc_target_init` and `__kmpc_target_deinit`
      
      This is a follow-up of D110029, which uses bitset to indicate execution mode. This patches makes the changes in the function call.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D110279
      423d34f7
    • Joseph Huber's avatar
      [OpenMP] Fix KeepAlive usage · 60a40cf3
      Joseph Huber authored
      Summary:
      Functions were called the wrong way around, this didn't keep the symbol
      alive.
      60a40cf3
    • Joseph Huber's avatar
      [OpenMP] Add function tracing debugging to device RTL · 277b681e
      Joseph Huber authored
      This patch adds support for an RAII struct that will print function
      traces when placed inside of a function declaration. Each successive
      call will increase the indentation to make it easier to visually
      inspect.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D110202
      277b681e
    • Shilei Tian's avatar
      [OpenMP][Offloading] Use bitset to indicate execution mode instead of value · ca999f71
      Shilei Tian authored
      The execution mode of a kernel is stored in a global variable, whose value means:
      - 0 - SPMD mode
      - 1 - indicates generic mode
      - 2 - SPMD mode execution with generic mode semantics
      
      We are going to add support for SIMD execution mode. It will be come with another
      execution mode, such as SIMD-generic mode. As a result, this value-based indicator
      is not flexible.
      
      This patch changes to bitset based solution to encode execution mode. Each
      position is:
      [0] - generic mode
      [1] - SPMD mode
      [2] - SIMD mode (will be added later)
      
      In this way, `0x1` is generic mode, `0x2` is SPMD mode, and `0x3` is SPMD mode
      execution with generic mode semantics. In the future after we add the support for
      SIMD mode, `0b1xx` will be in SIMD mode.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D110029
      ca999f71
Loading