Skip to content
  1. Apr 05, 2016
  2. Apr 04, 2016
    • Jonathan Peyton's avatar
      OMP_WAIT_POLICY changes · 50e8f18b
      Jonathan Peyton authored
      This change has OMP_WAIT_POLICY=active to mean that threads will busy-wait in
      spin loops and virtually never go to sleep. OMP_WAIT_POLICY=passive now means
      that threads will immediately go to sleep inside a spin loop. KMP_BLOCKTIME was
      the previous mechanism to specify this behavior via KMP_BLOCKTIME=0 or
      KMP_BLOCKTIME=infinite, but the standard OpenMP environment variable should
      also be able to specify this behavior.
      
      Differential Revision: http://reviews.llvm.org/D18577
      
      llvm-svn: 265339
      50e8f18b
  3. Mar 30, 2016
  4. Mar 29, 2016
  5. Mar 27, 2016
  6. Mar 24, 2016
  7. Mar 23, 2016
    • Jonathan Peyton's avatar
      Fix Visual Studio builds · b7d30cbc
      Jonathan Peyton authored
      Have Visual Studio use MemoryBarrier() instead of _mm_mfence() and remove
      __declspec align attribute from function parameters in kmp_atomic.h
      
      llvm-svn: 264166
      b7d30cbc
  8. Mar 21, 2016
  9. Mar 16, 2016
    • Jonathan Peyton's avatar
      [CMake] Fix Windows build problem for CMake versions < 3.3 · 8a46c067
      Jonathan Peyton authored
      Building libomp using CMake versions < 3.3 caused a link time error.  These
      errors occurred because when assembling z_Windows_NT-586_asm.asm, the
      definitions: OMPT_SUPPORT, _M_AMD64|_M_IA32 weren't defined on the command line.
      To fix the problem, the COMPILE_FLAGS property for the assembly file is appended
      to instead of the COMPILE_DEFINITIONS property being set.  For whatever reason, the
      COMPILE_DEFINITIONS property doesn't pick up the definitions for assembly files
      for the older CMake versions.
      
      llvm-svn: 263651
      8a46c067
  10. Mar 15, 2016
  11. Mar 12, 2016
    • Samuel Antao's avatar
      Initialize two variables in kmp_tasking. · 11e4c539
      Samuel Antao authored
      Summary:
      Two initialized local variables are causing clang to produce warnings:
      
      ```
      ./src/projects/openmp/runtime/src/kmp_tasking.c:3019:5: error: variable 'num_tasks' is used uninitialized whenever switch default is taken [-Werror,-Wsometimes-uninitialized]
          default:
          ^~~~~~~
      ./src/projects/openmp/runtime/src/kmp_tasking.c:3027:21: note: uninitialized use occurs here
          for( i = 0; i < num_tasks; ++i ) {
                          ^~~~~~~~~
      ./src/projects/openmp/runtime/src/kmp_tasking.c:2968:28: note: initialize the variable 'num_tasks' to silence this warning
          kmp_uint64 i, num_tasks, extras;
                                 ^
                                  = 0
      ./src/projects/openmp/runtime/src/kmp_tasking.c:3019:5: error: variable 'extras' is used uninitialized whenever switch default is taken [-Werror,-Wsometimes-uninitialized]
          default:
          ^~~~~~~
      ./src/projects/openmp/runtime/src/kmp_tasking.c:3022:52: note: uninitialized use occurs here
          KMP_DEBUG_ASSERT(tc == num_tasks * grainsize + extras);
                                                         ^~~~~~
      ./src/projects/openmp/runtime/src/kmp_debug.h:62:60: note: expanded from macro 'KMP_DEBUG_ASSERT'
              #define KMP_DEBUG_ASSERT( cond )       KMP_ASSERT( cond )
                                                                 ^
      ./src/projects/openmp/runtime/src/kmp_debug.h:60:51: note: expanded from macro 'KMP_ASSERT'
              #define KMP_ASSERT( cond )             ( (cond) ? 0 : __kmp_debug_assert( #cond, __FILE__, __LINE__ ) )
                                                        ^
      ./src/projects/openmp/runtime/src/kmp_tasking.c:2968:36: note: initialize the variable 'extras' to silence this warning
          kmp_uint64 i, num_tasks, extras;
                                         ^
                                          = 0
      2 errors generated.
      ```
      
      This patch initializes these two variables.
      
      Reviewers: tlwilmar, jlpeyton
      
      Subscribers: tlwilmar, openmp-commits
      
      Differential Revision: http://reviews.llvm.org/D17909
      
      llvm-svn: 263316
      11e4c539
  12. Mar 11, 2016
  13. Mar 03, 2016
  14. Mar 02, 2016
    • Jonathan Peyton's avatar
      Add new OpenMP 4.5 taskloop construct feature · 283a215c
      Jonathan Peyton authored
      From the standard: The taskloop construct specifies that the iterations of one
      or more associated loops will be executed in parallel using OpenMP tasks. The
      iterations are distributed across tasks created by the construct and scheduled
      to be executed.
      
      This initial implementation uses a simple linear tasks distribution algorithm.
      Later we can add other algorithms to speedup generation of huge number of tasks
      (i.e., tree-like tasks generation should be faster).
      
      This needs to be put into the OpenMP runtime library in order for the
      compiler team to develop the compiler side of the implementation.
      
      Differential Revision: http://reviews.llvm.org/D17404
      
      llvm-svn: 262535
      283a215c
    • Jonathan Peyton's avatar
      Add new OpenMP 4.5 doacross loop nest feature · 71909c57
      Jonathan Peyton authored
      From the standard: A doacross loop nest is a loop nest that has cross-iteration
      dependence. An iteration is dependent on one or more lexicographically earlier
      iterations. The ordered clause parameter on a loop directive identifies the
      loop(s) associated with the doacross loop nest.
      
      The init/fini routines allocate/free doacross buffer(s) for each loop for each
      thread.  The wait routine waits for a flag designated by the dependence vector.
      The post routine sets the flag designated by current iteration vector.  We use
      a similar technique of shared buffer indices that covers up to 7 nowait loops
      executed simultaneously by different threads (number 7 has no real meaning,
      just heuristic value).  Also, the size of structures are kept intact via
      reducing dummy arrays.
      
      This needs to be put into the OpenMP runtime library in order for the compiler
      team to develop the compiler side of the implementation.
      
      Differential Revision: http://reviews.llvm.org/D17399
      
      llvm-svn: 262532
      71909c57
  15. Feb 25, 2016
    • Jonathan Peyton's avatar
      Add new OpenMP 4.5 affinity API · 2f7c077b
      Jonathan Peyton authored
      This change introduces the new OpenMP 4.5 affinity api surrounding
      OpenMP Places. There are six new entry points:
      
      Typically called in serial region:
       * omp_get_num_places - returns the number of places available to the execution
             environment in the place list.
       * omp_get_place_num_procs - returns the number of processors available to the
             execution environment in the specified place.
       * omp_get_place_proc_ids - returns the numerical identifiers of the processors
             available to the execution environment in the specified place.
      
      Typically called inside parallel region:
       * omp_get_place_num - returns the place number of the place to which the
             encountering thread is bound.
       * omp_get_partition_num_places - returns the number of places in the place
             partition of the innermost implicit task.
       * omp_get_partition_place_nums - returns the list of place numbers
             corresponding to the places in the place-var ICV of the innermost
             implicit task.
      
      Differential Revision: http://reviews.llvm.org/D17417
      
      llvm-svn: 261915
      2f7c077b
    • Jonathan Peyton's avatar
      Add initial support for OpenMP 4.5 task priority feature · 2851072d
      Jonathan Peyton authored
      The maximum task priority value is read from envirable: OMP_MAX_TASK_PRIORITY.
      But as of now, nothing is done with it.  We just handle the environment variable
      and add the new api: omp_get_max_task_priority() which returns that value or
      zero if it is not set.
      
      Differential Revision: http://reviews.llvm.org/D17411
      
      llvm-svn: 261908
      2851072d
    • Jonathan Peyton's avatar
      dd new OpenMP 4.5 schedule clause modifiers (monotonic/non-monotonic) feature · ea0fe1df
      Jonathan Peyton authored
      The monotonic/non-monotonic flags are sent to the runtime via the sched_type by
      setting the 30th (non-monotonic) or 29th (monotonic) bit in the sched_type.
      Macros are added to probe if monotonic or non-monotonic is specified
      (SCHEDULE_HAS_[NON]MONOTONIC & SCHEDULE_HAS_NO_MODIFIERS)
      and also to to get the base sched_type (SCHEDULE_WITHOUT_MODIFIERS)
      
      Currently, nothing is done with the modifiers.
      
      Also, this patch adds some comments on the use of the enumerations in at least
       one place where it is subtle.
      
      Differential Revision: http://reviews.llvm.org/D17406
      
      llvm-svn: 261906
      ea0fe1df
  16. Feb 18, 2016
  17. Feb 12, 2016
    • Jonas Hahnfeld's avatar
      [OMPT] Frame information for openmp taskwait · 867aa20b
      Jonas Hahnfeld authored
      For pragma omp taskwait the runtime is called from the task context.
      Therefore, the reentry frame information should be updated.
      
      The information should be available for both taskwait event calls; therefore,
      set before the first event and reset after the last event.
      
      Patch by Joachim Protze
      Differential Revision: http://reviews.llvm.org/D17145
      
      llvm-svn: 260674
      867aa20b
    • Jonathan Peyton's avatar
      Fix incorrect task_team in __kmp_give_task · 134f90d5
      Jonathan Peyton authored
      When a target task finishes and it tries to access the th_task_team from the
      threads in the team where it was created, th_task_team can be NULL or point to
      a different place when that thread started a nested region that is still
      running. Finding the exact task_team that the threads were using is difficult
      as it would require to unwind the task_state_memo_stack. So a new field was added
      in the taskdata structure to point to the active task_team when the task was
      created.
      
      llvm-svn: 260615
      134f90d5
  18. Feb 11, 2016
  19. Feb 09, 2016
  20. Feb 04, 2016
    • Jonathan Peyton's avatar
      Add LIBOMP_ENABLE_SHARED option for CMake · fd74f900
      Jonathan Peyton authored
      When building executables for Cray supercomputers, statically-linked executables
      are preferred. This patch makes it possible to build the OpenMP runtime as an
      archive for building statically-linked executables.  The patch adds the flag
      LIBOMP_ENABLE_SHARED, which defaults to true. When true, a build of the OpenMP
      runtime yields dynamic libraries. When false, a build of the OpenMP runtime
      yields static libraries. There is no setting that allows both kinds of libraries
      to be built.
      
      Patch by John Mellor-Crummey
      
      Differential Revision: http://reviews.llvm.org/D16525
      
      llvm-svn: 259817
      fd74f900
  21. Jan 29, 2016
    • Jonathan Peyton's avatar
      Fix task dependency performance problem · 7d45451a
      Jonathan Peyton authored
      In: http://lists.llvm.org/pipermail/openmp-dev/2015-August/000858.html, a
      performance issue was found with libomp's task dependencies.  The task
      dependencies hash table has an issue with collisions. The current table size is
      a power of two. This combined with the current hash function causes a large
      number of collisions to occurr. Also, the current size (64) is too small for
      larger applications so the table size is increased.
      
      This patch creates a two level hash table approach for task dependencies. The
      implicit task is considered the "master" or "top-level" task which has a large
      static sized hash table (997), and nested tasks will have smaller hash
      tables (97). Prime numbers were chosen to help reduce collisions.
      
      Differential Revision: http://reviews.llvm.org/D16640
      
      llvm-svn: 259113
      7d45451a
  22. Jan 28, 2016
    • Jonas Hahnfeld's avatar
      [OMPT] Add support for ompt_event_task_dependences and ompt_event_task_dependence_pair · 39b68624
      Jonas Hahnfeld authored
      The attached patch adds support for ompt_event_task_dependences and
      ompt_event_task_dependence_pair events from the OMPT specification [1]. These
      events only apply to OpenMP 4.0 and 4.1 (aka 4.5) because task dependencies
      were introduced in 4.0.
      
      With respect to the changes:
      
      ompt_event_task_dependences
      According to the specification, this event is raised after the task has been
      created, thefore this event needs to be raised after ompt_event_task_begin
      (in __kmp_task_start). However, the dependencies are known at
      __kmpc_omp_task_with_deps which occurs before __kmp_task_start. My modifications
      extend the ompt_task_info_t struct in order to store the dependencies of the
      task when _kmpc_omp_task_with_deps occurs and then they are emitted in
      __kmp_task_start just after raising the ompt_event_task_begin. The deps field
      is allocated and valid until the event is raised and it is freed and set
      to null afterwards.
      
      ompt_event_task_dependence_pair
      The processing of the dependences (i.e. checking whenever a dependence is
      already satisfied) is done within __kmp_process_deps. That function checks
      every dependence and calls the __kmp_track_dependence routine which gives some
      support for graphical output. I used that routine to emit the dependence pair
      but I also needed to know the sink_task. Despite the fact that the code within
      KMP_SUPPORT_GRAPH_OUTPUT refers to task_sink it may be null because
      sink->dn.task (there's a comment regarding this) and in fact it does not point
      to a proper pointer value because the value is set in node->dn.task = task;
      after the __kmp_process_deps calls in __kmp_check_deps. I have extended the
      __kmp_process_deps and __kmp_track_dependence parameter list to receive the
      sink_task.
      
      [1] https://github.com/OpenMPToolsInterface/OMPT-Technical-Report/blob/target/ompt-tr.pdf
      
      Patch by Harald Servat
      Differential Revision: http://reviews.llvm.org/D14746
      
      llvm-svn: 259038
      39b68624
    • Jonas Hahnfeld's avatar
      [OMPT] Avoid SEGV when a worker thread needs its parallel id behind the barrier · dbf627db
      Jonas Hahnfeld authored
      When the code behind the barrier is executed, the master thread may have
      already resumed execution. That's why we cannot safely assume that *pteam
      is not yet freed.
      
      This has been introduced by r258866.
      
      llvm-svn: 259037
      dbf627db
    • Jonas Hahnfeld's avatar
      [OMPT] Workaround clang failing with 'declare target' · bba248c3
      Jonas Hahnfeld authored
      Current clang trunk reports _OPENMP to be 201307 = OpenMP 4.0. It doesn't
      recognize '#pragma omp declare target' though (patch still pending) and
      therefore fails compilation.
      
      Differential Revision: http://reviews.llvm.org/D16631
      
      llvm-svn: 259026
      bba248c3
  23. Jan 27, 2016
Loading