Skip to content
  1. Feb 10, 2022
  2. Feb 09, 2022
    • Joseph Huber's avatar
      [Libomptarget] Increase stack size for bug49779 test · 9582f096
      Joseph Huber authored
      The 'bug49779.cpp' test has been failing recently. This is because the
      runtime is sufficiently complex when using nested parallelism without
      optimizations that the CUDA tools cannot statically determine the stack
      size. Because of this the kernel can exceed the thread stack size and
      crash. Work around this using the 'LIBOMPTARGET_STACK_SIZE' environment
      variable and add an FAQ entry for this situation.
      
      Fixes #53670
      
      Reviewed By: Meinersbur
      
      Differential Revision: https://reviews.llvm.org/D119357
      9582f096
  3. Feb 08, 2022
  4. Feb 07, 2022
    • Joseph Huber's avatar
      [Libomptarget] Replace Value RAII with default value · d28051c4
      Joseph Huber authored
      This patch replaces the ValueRAII pointer with a default 'nullptr'
      value. Previously this was initialized as a reference to an existing
      variable. The use of this variable caused overhead as the compiler could
      not look through the uses and determine that it was unused if 'Active'
      was not set. Because of this accesses to the variable would be left in
      the runtime once compiled.
      
      Fixes #53641
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D119187
      d28051c4
  5. Feb 04, 2022
    • Joseph Huber's avatar
      [OpenMP] Completely remove old device runtime · 034adaf5
      Joseph Huber authored
      This patch completely removes the old OpenMP device runtime. Previously,
      the old runtime had the prefix `libomptarget-new-` and the old runtime
      was simply called `libomptarget-`. This patch makes the formerly new
      runtime the only runtime available. The entire project has been deleted,
      and all references to the `libomptarget-new` runtime has been replaced
      with `libomptarget-`.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D118934
      034adaf5
    • Joseph Huber's avatar
      [Libomptarget] Remove AMDGPU XFAIL from test · b4be1821
      Joseph Huber authored
      Summary;
      This test should pass now with AMDGPU. Previously the symbols were
      hidden and would fail when read.
      b4be1821
  6. Feb 01, 2022
  7. Jan 31, 2022
  8. Jan 29, 2022
  9. Jan 28, 2022
  10. Jan 27, 2022
  11. Jan 26, 2022
  12. Jan 21, 2022
    • Joseph Huber's avatar
      [Libomptarget] Change visibility to hidden for device RTL · 26feef08
      Joseph Huber authored
      This patch changes the visibility for all construct in the new device
      RTL to be hidden by default. This is done after the changes introduced
      in D117806 changed the visibility from being hidden by default for all
      device compilations. This asserts that the visibility for the device
      runtime library will be hidden except for the internal environment
      variable. This is done to aid optimization and linking of the device
      library.
      
      Reviewed By: JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D117807
      26feef08
  13. Jan 20, 2022
    • Johannes Doerfert's avatar
      [OpenMP] Avoid costly shadow map traversals whenever possible · b0789a1b
      Johannes Doerfert authored
      In the OpenMC app we saw `omp target update` spending an awful lot of
      time in the shadow map traversal without ever doing any update there.
      There are two cases that allow us to avoid the traversal completely.
      The simplest thing is that small updates cannot (reasonably) contain
      an attached pointer part. The other case requires to track in the
      mapping table if an entry might contain an attached pointer as part.
      Given that we have a single location shadow map entries are created,
      the latter is actually fairly easy as well.
      
      Differential Revision: https://reviews.llvm.org/D113124
      b0789a1b
    • Johannes Doerfert's avatar
      [OpenMP] Introduce an environment variable to disable atomic map clauses · 1e447d03
      Johannes Doerfert authored
      Atomic handling of map clauses was introduced to comply with the OpenMP
      standard (see D104418). However, many apps won't need this feature which
      can be costly in certain situations. To allow for applications to
      opt-out we now introduce the `LIBOMPTARGET_MAP_FORCE_ATOMIC` environment
      flag that voids the atomicity guarantee of the standard for map clauses
      again, shifting the burden to the user.
      
      This patch also de-duplicates the code that introduces the events used
      to enforce atomicity as a cleanup.
      
      Differential Revision: https://reviews.llvm.org/D117627
      1e447d03
    • Joseph Huber's avatar
      [OpenMP] Expand short verisions of OpenMP offloading triples · 28d71860
      Joseph Huber authored
      The OpenMP offloading libraries are built with fixed triples and linked
      in during compile time. This would cause un-helpful errors if the user
      passed in the wrong expansion of the triple used for the bitcode
      library. because we only support these triples for OpenMP offloading we
      can normalize them to the full verion used in the bitcode library.
      
      Reviewed By: jdoerfert, JonChesterfield
      
      Differential Revision: https://reviews.llvm.org/D117634
      28d71860
  14. Jan 19, 2022
  15. Jan 18, 2022
  16. Jan 17, 2022
    • Joseph Huber's avatar
      [Libomptarget] Add `cold` to KeepAlive attributes · 4869a22d
      Joseph Huber authored
      This patch adds the `cold` attribute to the keepAlive functions in the
      RTL. This dummy function exists to keep certain RTL calls alive without
      them being optimized out, but it is never called and can be declared
      cold. This also helps some erroneous remarks being given on this
      function because it has weak linkage and cannot be made internal.
      
      Reviewed By: tianshilei1992
      
      Differential Revision: https://reviews.llvm.org/D117513
      4869a22d
  17. Jan 13, 2022
  18. Jan 10, 2022
Loading