Skip to content
  1. Feb 08, 2022
  2. Feb 04, 2022
    • Joseph Huber's avatar
      [OpenMP] Completely remove old device runtime · 034adaf5
      Joseph Huber authored
      This patch completely removes the old OpenMP device runtime. Previously,
      the old runtime had the prefix `libomptarget-new-` and the old runtime
      was simply called `libomptarget-`. This patch makes the formerly new
      runtime the only runtime available. The entire project has been deleted,
      and all references to the `libomptarget-new` runtime has been replaced
      with `libomptarget-`.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D118934
      034adaf5
  3. Feb 01, 2022
  4. Jan 19, 2022
  5. Jan 10, 2022
  6. Dec 17, 2021
  7. Dec 15, 2021
  8. Dec 10, 2021
    • Carlo Bertolli's avatar
      [OpenMP] Part 2 of At present, amdgpu plugin merges both asynchronous · 28309c54
      Carlo Bertolli authored
      and synchronous kernel launch implementations into a single
      synchronous version.  This patch prepares the plugin for asynchronous
      implementation by:
      
          Privatizing actual kernel launch code (valid in both cases) into
          an anonymous namespace base function (submitted at D115267)
      
          - Separating the control flow path of asynchronous and synchronous
            kernel launch functions** (this diff)
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D115273
      28309c54
  9. Dec 09, 2021
  10. Dec 08, 2021
  11. Dec 07, 2021
  12. Dec 06, 2021
  13. Nov 29, 2021
    • Matt Arsenault's avatar
      OpenMP: Correctly query location for amdgpu-arch · 935abeaa
      Matt Arsenault authored
      This was trying to figure out the build path for amdgpu-arch, and
      making assumptions about where it is which were not working on my
      system. Whether a standalone build or not, we should have a proper
      imported target to get the location from.
      935abeaa
  14. Nov 23, 2021
    • Jon Chesterfield's avatar
      [openmp][amdgpu] Make plugin robust to presence of explicit implicit arguments · ae5348a3
      Jon Chesterfield authored
      OpenMP (compiler) does not currently request any implicit kernel
      arguments. OpenMP (runtime) allocates and initialises a reasonable guess at
      the implicit kernel arguments anyway.
      
      This change makes the plugin check the number of explicit arguments, instead
      of all arguments, and puts the pointer to hostcall buffer in both the current
      location and at the offset expected when implicit arguments are added to the
      metadata by D113538.
      
      This is intended to keep things running while fixing the oversight in the
      compiler (in D113538). Once that patch lands, and a following one marks
      openmp kernels that use printf such that the backend emits an args element
      with the right type (instead of hidden_node), the over-allocation can be
      removed and the hardcoded 8*e+3 offset replaced with one read from the
      .offset of the corresponding metadata element.
      
      Reviewed By: estewart08
      
      Differential Revision: https://reviews.llvm.org/D114274
      ae5348a3
  15. Nov 19, 2021
  16. Oct 28, 2021
  17. Oct 23, 2021
  18. Oct 19, 2021
  19. Oct 09, 2021
  20. Oct 07, 2021
  21. Oct 01, 2021
  22. Sep 30, 2021
    • Jon Chesterfield's avatar
      [libomptarget] Apply D110029 to amdgpu · b75a7481
      Jon Chesterfield authored
      Use enum for execution mode.
      
      This is partly a port from ROCm and partly a port from D110029. Attempted to
      make the same choices as ROCm as far as comments etc go to reduce the merge
      conflicts.
      
      There is some cleanup warranted here - in particular I like the cuda patch
      factoring out the comparisons into named variables - but I'd like to leave
      that for a follow up patch, keeping this one minimal.
      
      Reviewed By: carlo.bertolli
      
      Differential Revision: https://reviews.llvm.org/D110845
      b75a7481
  23. Sep 29, 2021
    • Dhruva Chakrabarti's avatar
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its... · 62262702
      Dhruva Chakrabarti authored
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its contents must not be accessed.
      
      Fixes: SWDEV-275232 (With contributions from Ammar Elwazir, Laurent Morichetti, and Tony Tye)
      
      The current code is racy. After the packet is submitted, the GPU will increment the read index. If this wraps around before the memory is read from it'll refer to a signal from an unrelated packet. Change avoids reading from the packet post-submission.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D110679
      62262702
  24. Sep 27, 2021
  25. Sep 26, 2021
  26. Sep 09, 2021
    • Jon Chesterfield's avatar
      [libomptarget][amdgpu] Precisely manage hsa lifetime · 6760234e
      Jon Chesterfield authored
      The hsa library must be initialized before any calls into it and
      destructed after the last call into it. There have been a number of bugs in
      this area related to member variables which would like to use raii to manage
      resources acquired from hsa.
      
      This patch moves the init/shutdown of hsa into a class, such that when used as
      the first member variable (could be a base), the lifetime of other member
      variables are reliably scoped within it. This will allow other classes to use
      raii reliably when used as member variables within the global.
      
      Reviewed By: pdhaliwal
      
      Differential Revision: https://reviews.llvm.org/D109512
      6760234e
Loading