- Dec 06, 2021
-
-
Jon Chesterfield authored
Reviewed By: pdhaliwal Differential Revision: https://reviews.llvm.org/D114891
-
Jon Chesterfield authored
These tests tend to hang or crash on hardware that doesn't support USM. Disabling them helps diagnose other issues. To safely enable we require a means of testing whether USM is expected to work. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D115144
-
- Dec 04, 2021
-
-
Matt Arsenault authored
729bf9b2 should have fixed these
-
- Dec 02, 2021
-
-
Ron Lieberman authored
-
Ron Lieberman authored
-
Jon Chesterfield authored
-
- Dec 01, 2021
-
-
Kazushi (Jam) Marukawa authored
When OpenMP is compiled as a part runtimes for multiple targets, openmp is compiled under build/runtimes/runtimes-arch-unknown-linux-gnu-bins directory. Old implementation treats this directory name as errors. This patch adds a guard like "[Uu]known[^-]". Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D114346
-
- Nov 30, 2021
-
-
Jonathan Peyton authored
Add documentation for the environment variables for libomp Differential Revision: https://reviews.llvm.org/D114269
-
Jon Chesterfield authored
-
- Nov 29, 2021
-
-
Matt Arsenault authored
This was trying to figure out the build path for amdgpu-arch, and making assumptions about where it is which were not working on my system. Whether a standalone build or not, we should have a proper imported target to get the location from.
-
- Nov 23, 2021
-
-
Jon Chesterfield authored
OpenMP (compiler) does not currently request any implicit kernel arguments. OpenMP (runtime) allocates and initialises a reasonable guess at the implicit kernel arguments anyway. This change makes the plugin check the number of explicit arguments, instead of all arguments, and puts the pointer to hostcall buffer in both the current location and at the offset expected when implicit arguments are added to the metadata by D113538. This is intended to keep things running while fixing the oversight in the compiler (in D113538). Once that patch lands, and a following one marks openmp kernels that use printf such that the backend emits an args element with the right type (instead of hidden_node), the over-allocation can be removed and the hardcoded 8*e+3 offset replaced with one read from the .offset of the corresponding metadata element. Reviewed By: estewart08 Differential Revision: https://reviews.llvm.org/D114274
-
- Nov 20, 2021
-
-
Joseph Huber authored
A function with no definition was left in the old runtime, causing linker errors when trying to compile. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D114264
-
- Nov 19, 2021
-
-
Jon Chesterfield authored
Removes a +x/-x pair on the only store/load of a variable and deletes some nearby dead code. Also reduces the size of the implicit struct to reflect the code currently emitted by clang. Differential Revision: https://reviews.llvm.org/D114270
-
Jon Chesterfield authored
-
Alexey Bataev authored
Eachempati. This patch adds clang (parsing, sema, serialization, codegen) support for the 'depend' clause on the 'taskwait' directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D113540
-
- Nov 17, 2021
-
-
Peyton, Jonathan L authored
Teach the HWLOC topology method how to detect Atom and Core types so hybrid CPUs are properly detected and represented when using the HWLOC topology method. Differential Revision: https://reviews.llvm.org/D112270
-
Peyton, Jonathan L authored
The current implementation of Windows Processor Groups has a separate topology method to handle them. This patch deprecates that specific method and uses the regular CPUID topology method by default and inserts the Windows Processor Group objects in the topology manually. Notes: * The preference for processor groups is lowered to a value less than socket so that the user will see sockets in the KMP_AFFINITY=verbose output instead of processor groups when sockets=processor groups. * The topology's capacity is modified to handle additional topology layers without the need for reallocation. * If a user asks for a granularity setting that is "above" the processor group layer, then the granularity is adjusted "down" to the processor group since this is the coarsest layer available for threads. Differential Revision: https://reviews.llvm.org/D112273
-
Peyton, Jonathan L authored
If some CPUs are offline, then make sure they are not included in the fullMask even if norespect is given to KMP_AFFINITY. Differential Revision: https://reviews.llvm.org/D112274
-
Peyton, Jonathan L authored
Remove restriction forcing users to specify the KMP_HW_SUBSET value in topology order. This patch sorts the user KMP_HW_SUBSET value before trying to apply it. For example: 1s,4c,2t is equivalent to 2t,1s,4c Differential Revision: https://reviews.llvm.org/D112027
-
Jonathan Peyton authored
-
Martin Storsjö authored
There's an attempt to upstream this change in https://github.com/intel/ittapi/pull/25 too. Differential Revision: https://reviews.llvm.org/D114069
-
- Nov 16, 2021
-
-
Joseph Huber authored
The RAII class used for debugging RTL entry used a shared variable to keep track of the current depth. This used a global initializer, which isn't supported on AMDGPU. This patch removes the initializer and instead sets it to zero when the state is initialized in the runtime. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D113963
-
- Nov 15, 2021
-
-
Shao-Ce SUN authored
-
- Nov 12, 2021
-
-
Nawrin Sultana authored
Differential Revision:https://reviews.llvm.org/D113012
-
Joel E. Denny authored
Fixes what's left of https://bugs.llvm.org/show_bug.cgi?id=51781. Reviewed By: jdoerfert, JonChesterfield, tianshilei1992 Differential Revision: https://reviews.llvm.org/D113602
-
- Nov 11, 2021
-
-
Bran Hagger authored
Differential Revision: https://reviews.llvm.org/D110354
-
- Nov 10, 2021
-
-
Jon Chesterfield authored
Extension of D112504. Lower amdgpu printf to `__llvm_omp_vprintf` which takes the same const char*, void* arguments as cuda vprintf and also passes the size of the void* alloca which will be needed by a non-stub implementation of `__llvm_omp_vprintf` for amdgpu. This removes the amdgpu link error on any printf in a target region in favour of silently compiling code that doesn't print anything to stdout. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112680
-
Joachim Protze authored
Reverting initial OMPT for target implementation in favor of a different implementation. This reverts commit 3bc8ce5d.
-
- Nov 09, 2021
-
-
Atmn Patel authored
The existing CGOpenMPRuntimeAMDGCN and CGOpenMPRuntimeNVPTX classes are just code bloat. By removing them, the codebase gets a bit cleaner. Reviewed By: jdoerfert, JonChesterfield, tianshilei1992 Differential Revision: https://reviews.llvm.org/D113421
-
Jonathan Peyton authored
Have standalone builds define uppercase_CMAKE_BUILD_TYPE and use it. llvm/CMakeLists.txt defines uppercase_CMAKE_BUILD_TYPE for regular LLVM builds with OpenMP enabled. Differential Revision: https://reviews.llvm.org/D112951
-
Atmn Patel authored
This reverts commit 81a7cad2.
-
Atmn Patel authored
The existing CGOpenMPRuntimeAMDGCN and CGOpenMPRuntimeNVPTX classes are just code bloat. By removing them, the codebase gets a bit cleaner. Reviewed By: jdoerfert, JonChesterfield, tianshilei1992 Differential Revision: https://reviews.llvm.org/D113421
-
Vyacheslav Zakharin authored
Differential Revision: https://reviews.llvm.org/D110193
-
- Nov 08, 2021
-
-
Jon Chesterfield authored
This reverts commit db81d8f6.
-
Jon Chesterfield authored
This reverts commit c499d690.
-
Jon Chesterfield authored
-
Jon Chesterfield authored
Extension of D112504. Lower amdgpu printf to `__llvm_omp_vprintf` which takes the same const char*, void* arguments as cuda vprintf and also passes the size of the void* alloca which will be needed by a non-stub implementation of `__llvm_omp_vprintf` for amdgpu. This removes the amdgpu link error on any printf in a target region in favour of silently compiling code that doesn't print anything to stdout. The exact set of changes to check-openmp probably needs revision before commit Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112680
-
Quinn Pham authored
[NFC] As part of using inclusive language within the llvm project, this patch replaces master with main when referring to `.chm` files. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D113299
-
@t-msn authored
It is better to set all barrier patterns to use "dist" when at least one environment variable specifies "dist". Otherwise if only one environment is set to "dist" and others left blank inadvertently, it would result in mixing dist barrier with default hyper barrier pattern. Differential Revision: https://reviews.llvm.org/D112597
-
- Nov 05, 2021
-
-
Jon Chesterfield authored
LLVM docs at https://llvm.org/docs/CMake.html#quick-start state 3.13.4 Reviewed By: atmnpatel Differential Revision: https://reviews.llvm.org/D113271
-