Skip to content
  1. Sep 02, 2021
  2. Sep 01, 2021
    • Joel E. Denny's avatar
      [OpenMP] Use IsHostPtr where needed in rest of omptarget.cpp · 786a1406
      Joel E. Denny authored
      As started in D107925, this patch replaces the remaining occurrences
      of `UNIFIED_SHARED_MEMORY && TgtPtrBegin == HstPtrBegin` in
      `omptarget.cpp` with `IsHostPtr`.  The former condition is broken in
      the rare case that the device and host happen to use the same address
      for their mapped allocations.  I don't know how to write a test that's
      likely to reveal this case.
      
      Reviewed By: grokos
      
      Differential Revision: https://reviews.llvm.org/D107928
      786a1406
    • Joel E. Denny's avatar
      [OpenMP] Use IsHostPtr where needed for targetDataBegin · d11bab0b
      Joel E. Denny authored
      As discussed in D105990, without this patch, `targetDataBegin`
      determines whether to transfer data (as opposed to assuming it's in
      shared memory) using the condition `!UseUSM || HasCloseModifier`.
      However, this condition is broken if use of discrete memory was forced
      by `omp_target_associate_ptr`.  This patch extends
      `unified_shared_memory/associate_ptr.c` to reveal this case, and it
      fixes it using `!IsHostPtr` in `DeviceTy::getTargetPointer` to replace
      this condition.
      
      Reviewed By: grokos
      
      Differential Revision: https://reviews.llvm.org/D107927
      d11bab0b
    • Joel E. Denny's avatar
      [OpenMP][NFC] Eliminate CopyMember from targetDataEnd · fa6c2755
      Joel E. Denny authored
      This patch is based on comments in D105990.  It is NFC according to
      the following observations:
      
      1. `CopyMember` is computed as `!IsHostPtr && IsLast`.
      2. `DelEntry` is true only if `IsLast` is true.
      
      We apply those observations in order:
      
      ```
      if ((DelEntry || Always || CopyMember) && !IsHostPtr)
      
      if ((DelEntry || Always || IsLast) && !IsHostPtr)
      
      if ((Always || IsLast) && !IsHostPtr)
      ```
      
      Reviewed By: grokos
      
      Differential Revision: https://reviews.llvm.org/D107926
      fa6c2755
    • Joel E. Denny's avatar
      [OpenMP] Use IsHostPtr where needed for targetDataEnd · 8e4836b2
      Joel E. Denny authored
      As discussed in D105990, without this patch, `targetDataEnd`
      determines whether to transfer data or delete a device mapping (as
      opposed to assuming it's in shared memory) using two different
      conditions, each of which is broken for some cases:
      
      1. `!(UNIFIED_SHARED_MEMORY && TgtPtrBegin == HstPtrBegin)`: The
         broken case is rare: the device and host might happen to use the
         same address for their mapped allocations.  I don't know how to
         write a test that's likely to reveal this case, but this patch does
         fix it, as discussed below.
      2. `!UNIFIED_SHARED_MEMORY || HasCloseModifier`: There are at least
         two broken cases:
          1. The `close` modifier might have been specified on an `omp
            target enter data` but not the corresponding `omp target exit
            data`, which thus might falsely assume a mapping is in shared
            memory.  The test `unified_shared_memory/close_enter_exit.c`
            already has a missing deletion as a result, and this patch adds
            a check for that.  This patch also adds the new test
            `close_member.c` to reveal a missing transfer and deletion.
          2. Use of discrete memory might have been forced by
            `omp_target_associate_ptr`, as in the test
            `unified_shared_memory/api.c`.  In the current `targetDataEnd`
            implementation, this condition turns out not be used for this
            case: because the reference count is infinite, a transfer is
            possible only with an `always` modifier, and this condition is
            never used in that case.  To ensure it's never used for that
            case in the future, this patch adds the test
            `unified_shared_memory/associate_ptr.c`.
      
      Fortunately, `DeviceTy::getTgtPtrBegin` already has a solution: it
      reports whether the allocation was found in shared memory via the
      variable `IsHostPtr`.
      
      After this patch, `HasCloseModifier` is no longer used in
      `targetDataEnd`, and I wonder if the `close` modifier is ever useful
      on an `omp target data end`.
      
      Reviewed By: grokos
      
      Differential Revision: https://reviews.llvm.org/D107925
      8e4836b2
    • Jon Chesterfield's avatar
      Revert "[openmp] No longer use LIBRARY_PATH to find devicertl" · cef11996
      Jon Chesterfield authored
      This reverts commit 7a228f87.
      Failing test case under CI
      cef11996
    • Jon Chesterfield's avatar
      [openmp] No longer use LIBRARY_PATH to find devicertl · 7a228f87
      Jon Chesterfield authored
      Given D109057, change test runner to use the libomptarget-x-bc-path
      argument instead of the LIBRARY_PATH environment variable to find the device
      library.
      
      Also drop the use of LIBRARY_PATH environment variable as it is far
      too easy to pull in the device library from an unrelated toolchain by accident
      with the current setup. No loss in flexibility to developers as the clang
      commandline used here is still available.
      
      Reviewed By: jdoerfert
      
      Differential Revision: https://reviews.llvm.org/D109061
      7a228f87
    • Jon Chesterfield's avatar
      [libomptarget] Set runpath on libomptarget, use that to drop LD_LIBRARY_PATH from test runner · 718e5a98
      Jon Chesterfield authored
      Using rpath instead of LD_LIBRARY_PATH to find libomp.so and
      libomptarget.so lets one rerun the already built test executables without
      setting environment variables and removes the risk of the test runner picking
      up different libraries to the developer debugging the failure.
      
      rpath usually means runpath, which is not transitive, so set runpath on
      libomptarget itself so that it can find the plugins located next to it,
      spelled $ORIGIN. This provides sufficient functionality to drop D102043
      
      Reviewed By: tianshilei1992
      
      Differential Revision: https://reviews.llvm.org/D109071
      718e5a98
    • Jon Chesterfield's avatar
    • Vignesh Balasubramanian's avatar
      [OpenMP][OMPD] Implementation of OMPD debugging library - libompd. · b9a27908
      Vignesh Balasubramanian authored
      This is a continuation of the review: https://reviews.llvm.org/D100181
      Creates a new directory "libompd" under openmp.
      
      "TargetValue" provides operational access to the OpenMP runtime memory
       for OMPD APIs.
      With TargetValue, using "pointer" a user can do multiple operations
      from casting, dereferencing to accessing an element for structure.
      The member functions are designed to concatenate the operations that
      are needed to access values from structures.
      
      e.g., _a[6]->_b._c would read like :
      TValue(ctx, "_a").cast("A",2)
      	.getArrayElement(6).access("_b").cast("B").access("_c")
      For example:
      If you have a pointer "ThreadHandle" of a running program then you can
      access/retrieve "threadID" from the memory using TargetValue as below.
      
        TValue(context, thread_handle->th) /*__kmp_threads[t]->th*/
          .cast("kmp_base_info_t")
          .access("th_info") /*__kmp_threads[t]->th.th_info*/
          .cast("kmp_desc_t")
          .access("ds") /*__kmp_threads[t]->th.th_info.ds*/
          .cast("kmp_desc_base_t")
          .access("ds_thread") /*__kmp_threads[t]->th.th_info.ds.ds_thread*/
                      .cast("kmp_thread_t")
      .getRawValue(thread_id, 1);
      
      Reviewed By: @hbae
      Differential Revision: https://reviews.llvm.org/D100182
      b9a27908
    • Joel E. Denny's avatar
      1688b4cf
  3. Aug 31, 2021
    • Joel E. Denny's avatar
      [OpenMP][OpenACC] Implement `ompx_hold` map type modifier extension in runtime (2/2) · ec1ebcd3
      Joel E. Denny authored
      This patch implements OpenMP runtime support for an original OpenMP
      extension we have developed to support OpenACC: the `ompx_hold` map
      type modifier.  The previous patch in this series, D106509, implements
      Clang support and documents the new functionality in detail.
      
      Reviewed By: grokos
      
      Differential Revision: https://reviews.llvm.org/D106510
      ec1ebcd3
    • Joel E. Denny's avatar
      [OpenMP][OpenACC] Implement `ompx_hold` map type modifier extension in Clang (1/2) · 83ddfa0d
      Joel E. Denny authored
      This patch implements Clang support for an original OpenMP extension
      we have developed to support OpenACC: the `ompx_hold` map type
      modifier.  The next patch in this series, D106510, implements OpenMP
      runtime support.
      
      Consider the following example:
      
      ```
       #pragma omp target data map(ompx_hold, tofrom: x) // holds onto mapping of x
       {
         foo(); // might have map(delete: x)
         #pragma omp target map(present, alloc: x) // x is guaranteed to be present
         printf("%d\n", x);
       }
      ```
      
      The `ompx_hold` map type modifier above specifies that the `target
      data` directive holds onto the mapping for `x` throughout the
      associated region regardless of any `target exit data` directives
      executed during the call to `foo`.  Thus, the presence assertion for
      `x` at the enclosed `target` construct cannot fail.  (As usual, the
      standard OpenMP reference count for `x` must also reach zero before
      the data is unmapped.)
      
      Justification for inclusion in Clang and LLVM's OpenMP runtime:
      
      * The `ompx_hold` modifier supports OpenACC functionality (structured
        reference count) that cannot be achieved in standard OpenMP, as of
        5.1.
      * The runtime implementation for `ompx_hold` (next patch) will thus be
        used by Flang's OpenACC support.
      * The Clang implementation for `ompx_hold` (this patch) as well as the
        runtime implementation are required for the Clang OpenACC support
        being developed as part of the ECP Clacc project, which translates
        OpenACC to OpenMP at the directive AST level.  These patches are the
        first step in upstreaming OpenACC functionality from Clacc.
      * The Clang implementation for `ompx_hold` is also used by the tests
        in the runtime implementation.  That syntactic support makes the
        tests more readable than low-level runtime calls can.  Moreover,
        upstream Flang and Clang do not yet support OpenACC syntax
        sufficiently for writing the tests.
      * More generally, the Clang implementation enables a clean separation
        of concerns between OpenACC and OpenMP development in LLVM.  That
        is, LLVM's OpenMP developers can discuss, modify, and debug LLVM's
        extended OpenMP implementation and test suite without directly
        considering OpenACC's language and execution model, which can be
        handled by LLVM's OpenACC developers.
      * OpenMP users might find the `ompx_hold` modifier useful, as in the
        above example.
      
      See new documentation introduced by this patch in `openmp/docs` for
      more detail on the functionality of this extension and its
      relationship with OpenACC.  For example, it explains how the runtime
      must support two reference counts, as specified by OpenACC.
      
      Clang recognizes `ompx_hold` unless `-fno-openmp-extensions`, a new
      command-line option introduced by this patch, is specified.
      
      Reviewed By: ABataev, jdoerfert, protze.joachim, grokos
      
      Differential Revision: https://reviews.llvm.org/D106509
      83ddfa0d
    • Shilei Tian's avatar
      [OpenMP] Fix task wait doesn't work as expected in serialized team · 8442967f
      Shilei Tian authored
      As discussed in D107121, task wait doesn't work when a regular task T depends on
      a detached task or a hidden helper task T' in a serialized team. The root cause is,
      since the team is serialized, the last task will not be tracked by
      `td_incomplete_child_tasks`. When T' is finished, it first releases its
      dependences, and then decrements its parent counter. So far so good. For the thread
      that is running task wait, if at the moment it is still spinning and trying to
      execute tasks, it is fine because it can detect the new task and execute it.
      However, if it happends to finish the function `flag.execute_tasks(...)`, it will
      be broken because `td_incomplete_child_tasks` is 0 now.
      
      In this patch, we update the rule to track children tasks a little bit. If the
      task team encounters a proxy task or a hidden helper task, all following tasks
      will be tracked.
      
      Reviewed By: AndreyChurbanov
      
      Differential Revision: https://reviews.llvm.org/D107496
      8442967f
  4. Aug 30, 2021
  5. Aug 29, 2021
  6. Aug 28, 2021
  7. Aug 27, 2021
  8. Aug 26, 2021
  9. Aug 25, 2021
  10. Aug 24, 2021
  11. Aug 23, 2021
  12. Aug 20, 2021
  13. Aug 19, 2021
    • Jennifer Yu's avatar
      Add implicit map for a list item appears in a reduction clause. · c274b198
      Jennifer Yu authored
      A new rule is added in 5.0:
      If a list item appears in a reduction, lastprivate or linear clause
      on a combined target construct then it is treated as if it also appears
      in a map clause with a map-type of tofrom.
      
      Currently map clauses for all capture variables are added implicitly.
      But missing for list item of expression for array elements or array
      sections.
      
      The change is to add implicit map clause for array of elements used in
      reduction clause. Skip adding map clause if the expression is not
      mappable.
      Noted: For linear and lastprivate, since only variable name is
      accepted, the map has been added though capture variables.
      
      To do so:
      During the mappable checking, if error, ignore diagnose and skip
      adding implicit map clause.
      
      The changes:
      1> Add code to generate implicit map in ActOnOpenMPExecutableDirective,
         for omp 5.0 and up.
      2> Add extra default parameter NoDiagnose in ActOnOpenMPMapClause:
      Use that to skip error as well as skip adding implicit map during the
      mappable checking.
      
      Note: there are only tow places need to be check for NoDiagnose. Rest
      of them either the check is for < omp 5.0 or the error already generated for
      reduction clause.
      
      Differential Revision: https://reviews.llvm.org/D108132
      c274b198
    • Jon Chesterfield's avatar
    • Jon Chesterfield's avatar
Loading