Skip to content
  1. Mar 02, 2016
    • Jonathan Peyton's avatar
      Add new OpenMP 4.5 doacross loop nest feature · 71909c57
      Jonathan Peyton authored
      From the standard: A doacross loop nest is a loop nest that has cross-iteration
      dependence. An iteration is dependent on one or more lexicographically earlier
      iterations. The ordered clause parameter on a loop directive identifies the
      loop(s) associated with the doacross loop nest.
      
      The init/fini routines allocate/free doacross buffer(s) for each loop for each
      thread.  The wait routine waits for a flag designated by the dependence vector.
      The post routine sets the flag designated by current iteration vector.  We use
      a similar technique of shared buffer indices that covers up to 7 nowait loops
      executed simultaneously by different threads (number 7 has no real meaning,
      just heuristic value).  Also, the size of structures are kept intact via
      reducing dummy arrays.
      
      This needs to be put into the OpenMP runtime library in order for the compiler
      team to develop the compiler side of the implementation.
      
      Differential Revision: http://reviews.llvm.org/D17399
      
      llvm-svn: 262532
      71909c57
  2. Feb 25, 2016
    • Jonathan Peyton's avatar
      Add new OpenMP 4.5 affinity API · 2f7c077b
      Jonathan Peyton authored
      This change introduces the new OpenMP 4.5 affinity api surrounding
      OpenMP Places. There are six new entry points:
      
      Typically called in serial region:
       * omp_get_num_places - returns the number of places available to the execution
             environment in the place list.
       * omp_get_place_num_procs - returns the number of processors available to the
             execution environment in the specified place.
       * omp_get_place_proc_ids - returns the numerical identifiers of the processors
             available to the execution environment in the specified place.
      
      Typically called inside parallel region:
       * omp_get_place_num - returns the place number of the place to which the
             encountering thread is bound.
       * omp_get_partition_num_places - returns the number of places in the place
             partition of the innermost implicit task.
       * omp_get_partition_place_nums - returns the list of place numbers
             corresponding to the places in the place-var ICV of the innermost
             implicit task.
      
      Differential Revision: http://reviews.llvm.org/D17417
      
      llvm-svn: 261915
      2f7c077b
    • Jonathan Peyton's avatar
      Add initial support for OpenMP 4.5 task priority feature · 2851072d
      Jonathan Peyton authored
      The maximum task priority value is read from envirable: OMP_MAX_TASK_PRIORITY.
      But as of now, nothing is done with it.  We just handle the environment variable
      and add the new api: omp_get_max_task_priority() which returns that value or
      zero if it is not set.
      
      Differential Revision: http://reviews.llvm.org/D17411
      
      llvm-svn: 261908
      2851072d
    • Jonathan Peyton's avatar
      dd new OpenMP 4.5 schedule clause modifiers (monotonic/non-monotonic) feature · ea0fe1df
      Jonathan Peyton authored
      The monotonic/non-monotonic flags are sent to the runtime via the sched_type by
      setting the 30th (non-monotonic) or 29th (monotonic) bit in the sched_type.
      Macros are added to probe if monotonic or non-monotonic is specified
      (SCHEDULE_HAS_[NON]MONOTONIC & SCHEDULE_HAS_NO_MODIFIERS)
      and also to to get the base sched_type (SCHEDULE_WITHOUT_MODIFIERS)
      
      Currently, nothing is done with the modifiers.
      
      Also, this patch adds some comments on the use of the enumerations in at least
       one place where it is subtle.
      
      Differential Revision: http://reviews.llvm.org/D17406
      
      llvm-svn: 261906
      ea0fe1df
  3. Feb 18, 2016
  4. Feb 12, 2016
    • Jonas Hahnfeld's avatar
      [OMPT] Frame information for openmp taskwait · 867aa20b
      Jonas Hahnfeld authored
      For pragma omp taskwait the runtime is called from the task context.
      Therefore, the reentry frame information should be updated.
      
      The information should be available for both taskwait event calls; therefore,
      set before the first event and reset after the last event.
      
      Patch by Joachim Protze
      Differential Revision: http://reviews.llvm.org/D17145
      
      llvm-svn: 260674
      867aa20b
    • Jonathan Peyton's avatar
      Fix incorrect task_team in __kmp_give_task · 134f90d5
      Jonathan Peyton authored
      When a target task finishes and it tries to access the th_task_team from the
      threads in the team where it was created, th_task_team can be NULL or point to
      a different place when that thread started a nested region that is still
      running. Finding the exact task_team that the threads were using is difficult
      as it would require to unwind the task_state_memo_stack. So a new field was added
      in the taskdata structure to point to the active task_team when the task was
      created.
      
      llvm-svn: 260615
      134f90d5
  5. Feb 11, 2016
  6. Feb 09, 2016
  7. Feb 05, 2016
  8. Feb 04, 2016
    • Jonathan Peyton's avatar
      Add LIBOMP_ENABLE_SHARED option for CMake · fd74f900
      Jonathan Peyton authored
      When building executables for Cray supercomputers, statically-linked executables
      are preferred. This patch makes it possible to build the OpenMP runtime as an
      archive for building statically-linked executables.  The patch adds the flag
      LIBOMP_ENABLE_SHARED, which defaults to true. When true, a build of the OpenMP
      runtime yields dynamic libraries. When false, a build of the OpenMP runtime
      yields static libraries. There is no setting that allows both kinds of libraries
      to be built.
      
      Patch by John Mellor-Crummey
      
      Differential Revision: http://reviews.llvm.org/D16525
      
      llvm-svn: 259817
      fd74f900
  9. Jan 29, 2016
    • Jonathan Peyton's avatar
      Fix task dependency performance problem · 7d45451a
      Jonathan Peyton authored
      In: http://lists.llvm.org/pipermail/openmp-dev/2015-August/000858.html, a
      performance issue was found with libomp's task dependencies.  The task
      dependencies hash table has an issue with collisions. The current table size is
      a power of two. This combined with the current hash function causes a large
      number of collisions to occurr. Also, the current size (64) is too small for
      larger applications so the table size is increased.
      
      This patch creates a two level hash table approach for task dependencies. The
      implicit task is considered the "master" or "top-level" task which has a large
      static sized hash table (997), and nested tasks will have smaller hash
      tables (97). Prime numbers were chosen to help reduce collisions.
      
      Differential Revision: http://reviews.llvm.org/D16640
      
      llvm-svn: 259113
      7d45451a
  10. Jan 28, 2016
    • Jonas Hahnfeld's avatar
      [OMPT] Add support for ompt_event_task_dependences and ompt_event_task_dependence_pair · 39b68624
      Jonas Hahnfeld authored
      The attached patch adds support for ompt_event_task_dependences and
      ompt_event_task_dependence_pair events from the OMPT specification [1]. These
      events only apply to OpenMP 4.0 and 4.1 (aka 4.5) because task dependencies
      were introduced in 4.0.
      
      With respect to the changes:
      
      ompt_event_task_dependences
      According to the specification, this event is raised after the task has been
      created, thefore this event needs to be raised after ompt_event_task_begin
      (in __kmp_task_start). However, the dependencies are known at
      __kmpc_omp_task_with_deps which occurs before __kmp_task_start. My modifications
      extend the ompt_task_info_t struct in order to store the dependencies of the
      task when _kmpc_omp_task_with_deps occurs and then they are emitted in
      __kmp_task_start just after raising the ompt_event_task_begin. The deps field
      is allocated and valid until the event is raised and it is freed and set
      to null afterwards.
      
      ompt_event_task_dependence_pair
      The processing of the dependences (i.e. checking whenever a dependence is
      already satisfied) is done within __kmp_process_deps. That function checks
      every dependence and calls the __kmp_track_dependence routine which gives some
      support for graphical output. I used that routine to emit the dependence pair
      but I also needed to know the sink_task. Despite the fact that the code within
      KMP_SUPPORT_GRAPH_OUTPUT refers to task_sink it may be null because
      sink->dn.task (there's a comment regarding this) and in fact it does not point
      to a proper pointer value because the value is set in node->dn.task = task;
      after the __kmp_process_deps calls in __kmp_check_deps. I have extended the
      __kmp_process_deps and __kmp_track_dependence parameter list to receive the
      sink_task.
      
      [1] https://github.com/OpenMPToolsInterface/OMPT-Technical-Report/blob/target/ompt-tr.pdf
      
      Patch by Harald Servat
      Differential Revision: http://reviews.llvm.org/D14746
      
      llvm-svn: 259038
      39b68624
    • Jonas Hahnfeld's avatar
      [OMPT] Avoid SEGV when a worker thread needs its parallel id behind the barrier · dbf627db
      Jonas Hahnfeld authored
      When the code behind the barrier is executed, the master thread may have
      already resumed execution. That's why we cannot safely assume that *pteam
      is not yet freed.
      
      This has been introduced by r258866.
      
      llvm-svn: 259037
      dbf627db
    • Jonas Hahnfeld's avatar
      [OMPT] Workaround clang failing with 'declare target' · bba248c3
      Jonas Hahnfeld authored
      Current clang trunk reports _OPENMP to be 201307 = OpenMP 4.0. It doesn't
      recognize '#pragma omp declare target' though (patch still pending) and
      therefore fails compilation.
      
      Differential Revision: http://reviews.llvm.org/D16631
      
      llvm-svn: 259026
      bba248c3
  11. Jan 27, 2016
  12. Jan 26, 2016
  13. Jan 25, 2016
  14. Jan 22, 2016
    • Jonathan Peyton's avatar
      Add missing cleanup code for cached indirect lock pool. · 3bd88d4c
      Jonathan Peyton authored
      This change fixes one issue reported at https://llvm.org/bugs/show_bug.cgi?id=26184
      There was missing cleanup code for the cached indirect lock pool. The change
      will fix the reported case where it tries to initialize a lock after runtime
      cleanup/reinitialization, but it is still possible that the user program runs
      into another problem because most test programs have a call to __kmpc_set_lock
      after cleanup/reinitialization without calling __kmpc_init_lock causing a crash/hang.
      
      llvm-svn: 258528
      3bd88d4c
  15. Jan 19, 2016
  16. Jan 15, 2016
    • Hans Wennborg's avatar
      Don't use __DATE__ or __TIME__; it breaks release builds (PR26145) · 59162da0
      Hans Wennborg authored
      The release builds are configured to be reproducible, so that the
      binaries compare equal between bootstrap iterations. The OpenMP
      run-time build was failing like this:
      
      runtime/src/kmp_version.c:108:79: error: expansion of date or time macro is not reproducible [-Werror,-Wdate-time]
      char const __kmp_version_build_time[]     = KMP_VERSION_PREFIX "build time: " __DATE__ " " __TIME__;
      
      Figuring as the build currently doesn't set LIBOMP_DATE, it's probably
      OK to skip setting the build time here too.
      
      llvm-svn: 257833
      59162da0
  17. Jan 12, 2016
    • Jonathan Peyton's avatar
      New API for restoring current thread's affinity to init affinity of application · 3076fa4c
      Jonathan Peyton authored
      This new API, int kmp_set_thread_affinity_mask_initial(), is available for use
      by other parallel runtime libraries inside a possibly OpenMP-registered thread.
      This entry point restores the current thread's affinity mask to the affinity
      mask of the application when it first began. If -1 is returned it can be assumed
      that either the thread hasn't called affinity initialization or that the thread
      isn't registered with the OpenMP library. If 0 is returned then, then the call
      was successful. Any return value greater than zero indicates an error occurred
      when setting affinity.
      
      Differential Revision: http://reviews.llvm.org/D15867
      
      llvm-svn: 257489
      3076fa4c
  18. Jan 11, 2016
  19. Jan 05, 2016
  20. Jan 04, 2016
  21. Dec 27, 2015
  22. Dec 23, 2015
    • Jonathan Peyton's avatar
      Fix build error: OMPT_SUPPORT=true was not tested after hinted lock changes · 2c295c4e
      Jonathan Peyton authored
      Recent changes to support dynamic locks didn't consider the code compiled when
      OMPT_SUPPORT=true. As a result, the OMPT support was broken by recent changes
      to nested locks to support dynamic locks. For OMPT to work with dynamic locks,
      they need to provide a return code indicating whether a nested lock acquisition
      was the first or not.
      
      This patch moves the OMPT support for nested locks into the #else case when
      DYNAMIC locks were not used. New support is needed for dynamic locks. This patch
      fixes the build and leaves a placeholder where the missing OMPT callbacks can be
      added either the author of the OMPT support for locks, or the dynamic
      locking support.
      
      Patch by John Mellor-Crummey
      
      Differential Revision: http://reviews.llvm.org/D15656
      
      llvm-svn: 256314
      2c295c4e
  23. Dec 19, 2015
  24. Dec 18, 2015
    • Jonathan Peyton's avatar
      [STATS] Have CMake do real check for stats functionality · b9e83260
      Jonathan Peyton authored
      This change allows clang to build the stats library for every architecture
      which supports __builtin_readcyclecounter().  CMake also checks for all
      necessary features for stats and will error out if the platform does not
      support it.
      
      Patch by Hal Finkel and Johnny Peyton
      
      llvm-svn: 256002
      b9e83260
  25. Dec 17, 2015
Loading