Skip to content
  1. Apr 07, 2022
    • Michael Kruse's avatar
      [libomptarget] Add device RTL to regression test dependencies. · 7fa7b0cb
      Michael Kruse authored
      In a clean build directory, `check-openmp` or `check-libomptarget` will fail because of missing device RTL .bc files. Ensure that the new targets new custom targets `omptarget.devicertl.nvptx` and `omptarget.devicertl.amdgpu` (corresponding to the plugin rtl targets `omptarget.rtl.cuda`, respectively `omptarget.rlt.amdgpu` ) are dependencies of the regression tests.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D123177
      7fa7b0cb
  2. Mar 17, 2022
  3. Mar 06, 2022
  4. Mar 03, 2022
  5. Feb 08, 2022
  6. Feb 04, 2022
    • Joseph Huber's avatar
      [OpenMP] Completely remove old device runtime · 034adaf5
      Joseph Huber authored
      This patch completely removes the old OpenMP device runtime. Previously,
      the old runtime had the prefix `libomptarget-new-` and the old runtime
      was simply called `libomptarget-`. This patch makes the formerly new
      runtime the only runtime available. The entire project has been deleted,
      and all references to the `libomptarget-new` runtime has been replaced
      with `libomptarget-`.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D118934
      034adaf5
  7. Feb 01, 2022
  8. Jan 19, 2022
  9. Jan 10, 2022
  10. Dec 17, 2021
  11. Dec 15, 2021
  12. Dec 10, 2021
    • Carlo Bertolli's avatar
      [OpenMP] Part 2 of At present, amdgpu plugin merges both asynchronous · 28309c54
      Carlo Bertolli authored
      and synchronous kernel launch implementations into a single
      synchronous version.  This patch prepares the plugin for asynchronous
      implementation by:
      
          Privatizing actual kernel launch code (valid in both cases) into
          an anonymous namespace base function (submitted at D115267)
      
          - Separating the control flow path of asynchronous and synchronous
            kernel launch functions** (this diff)
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D115273
      28309c54
  13. Dec 09, 2021
  14. Dec 08, 2021
  15. Dec 07, 2021
  16. Dec 06, 2021
  17. Nov 29, 2021
    • Matt Arsenault's avatar
      OpenMP: Correctly query location for amdgpu-arch · 935abeaa
      Matt Arsenault authored
      This was trying to figure out the build path for amdgpu-arch, and
      making assumptions about where it is which were not working on my
      system. Whether a standalone build or not, we should have a proper
      imported target to get the location from.
      935abeaa
  18. Nov 23, 2021
    • Jon Chesterfield's avatar
      [openmp][amdgpu] Make plugin robust to presence of explicit implicit arguments · ae5348a3
      Jon Chesterfield authored
      OpenMP (compiler) does not currently request any implicit kernel
      arguments. OpenMP (runtime) allocates and initialises a reasonable guess at
      the implicit kernel arguments anyway.
      
      This change makes the plugin check the number of explicit arguments, instead
      of all arguments, and puts the pointer to hostcall buffer in both the current
      location and at the offset expected when implicit arguments are added to the
      metadata by D113538.
      
      This is intended to keep things running while fixing the oversight in the
      compiler (in D113538). Once that patch lands, and a following one marks
      openmp kernels that use printf such that the backend emits an args element
      with the right type (instead of hidden_node), the over-allocation can be
      removed and the hardcoded 8*e+3 offset replaced with one read from the
      .offset of the corresponding metadata element.
      
      Reviewed By: estewart08
      
      Differential Revision: https://reviews.llvm.org/D114274
      ae5348a3
  19. Nov 19, 2021
  20. Oct 28, 2021
  21. Oct 23, 2021
  22. Oct 19, 2021
  23. Oct 09, 2021
  24. Oct 07, 2021
  25. Oct 01, 2021
  26. Sep 30, 2021
    • Jon Chesterfield's avatar
      [libomptarget] Apply D110029 to amdgpu · b75a7481
      Jon Chesterfield authored
      Use enum for execution mode.
      
      This is partly a port from ROCm and partly a port from D110029. Attempted to
      make the same choices as ROCm as far as comments etc go to reduce the merge
      conflicts.
      
      There is some cleanup warranted here - in particular I like the cuda patch
      factoring out the comparisons into named variables - but I'd like to leave
      that for a follow up patch, keeping this one minimal.
      
      Reviewed By: carlo.bertolli
      
      Differential Revision: https://reviews.llvm.org/D110845
      b75a7481
  27. Sep 29, 2021
    • Dhruva Chakrabarti's avatar
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its... · 62262702
      Dhruva Chakrabarti authored
      [libomptarget] [amdgpu] After a kernel dispatch packet is published, its contents must not be accessed.
      
      Fixes: SWDEV-275232 (With contributions from Ammar Elwazir, Laurent Morichetti, and Tony Tye)
      
      The current code is racy. After the packet is submitted, the GPU will increment the read index. If this wraps around before the memory is read from it'll refer to a signal from an unrelated packet. Change avoids reading from the packet post-submission.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D110679
      62262702
  28. Sep 27, 2021
Loading