- Jan 21, 2022
-
-
Joseph Huber authored
This patch changes the visibility for all construct in the new device RTL to be hidden by default. This is done after the changes introduced in D117806 changed the visibility from being hidden by default for all device compilations. This asserts that the visibility for the device runtime library will be hidden except for the internal environment variable. This is done to aid optimization and linking of the device library. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D117807
-
Joseph Huber authored
This patch changes the special-case handling of visibility when compiling for an OpenMP target offloading device. This was orignally added as a precaution against the bug encountered in PR41826 when symbols in the device were being preempted by shared library symbols. This should instead be done by making the visibility protected by default. With protected visibility we are asserting that the symbols on the device will never be preempted or preempt another symbol pending a shared library load. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D117806
-
Jonas Devlieghere authored
Instrument the SB API with signposts on Darwin. This gives us a time profile on whose behalf LLDB spends time (particularly when run via the SBAPI from an IDE). Differential revision: https://reviews.llvm.org/D117632
-
Jonas Devlieghere authored
Remove the last remaining references to the reproducers from the instrumentation. This patch renames the relevant files and macros. Differential revision: https://reviews.llvm.org/D117712
-
Jonas Devlieghere authored
I revived lldb-instr to update the macros for D117712. I think the new macros are simple enough that we add them by hand, but this tool can do it automatically for you. Differential revision: https://reviews.llvm.org/D117748
-
Hsiangkai Wang authored
Originally, hasRVVFrameObject() will scan all the stack objects to check whether if there is any scalable vector object on the stack or not. However, it causes errors in the register allocator. In issue 53016, it returns false before RA because there is no RVV stack objects. After RA, it returns true because there are spilling slots for RVV values during RA. The compiler will not reserve BP during register allocation and generate BP access in the PEI pass due to the inconsistent behavior of the function. The function is changed to use hasStdExtV() as the return value. It is not precise, but it can make the register allocation correct. Refer to https://github.com/llvm/llvm-project/issues/53016. Differential Revision: https://reviews.llvm.org/D117663
-
Peter Klausler authored
Implements a near-universal extension in which NAMELIST input will skip over unrelated namelist groups in the input stream until the group with the requested name appears. Differential Revision: https://reviews.llvm.org/D117843
-
Peter Klausler authored
Array subscripts and substring limits in NAMELIST input are allowed to bear an explicit plus sign. Differential Revision: https://reviews.llvm.org/D117818
-
Craig Topper authored
This is needed to properly limit fractional LMULs for Zve32. Add new RUN Zve32 RUN lines to the existing tests for the -riscv-v-fixed-length-vector-elen-max command line option.
-
Pawe Bylica authored
Simplify bswap(x) to shl(x) or lshr(x) if x has exactly one "active byte", i.e. all active bits are contained in boundaries of a single byte of x. https://alive2.llvm.org/ce/z/nvbbU5 https://alive2.llvm.org/ce/z/KiiL3J Reviewed By: spatel, craig.topper, lebedev.ri Differential Revision: https://reviews.llvm.org/D117680
-
Johannes Doerfert authored
The old method to avoid unconstrained expansion of the constant range in a loop did not work as soon as there were multiple instructions in between the phi and its input. We now take a generic approach and limit the number of updates as a fallback. The old method is kept as it catches "the common case" early.
-
Johannes Doerfert authored
-
Fabian Wolff authored
Fixes PR#52435. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D113507
-
Dave Airlie authored
This wraps the image and rw images usages in the correct macros Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D107539
-
Peter Klausler authored
The unit number passed to a FLUSH statement is not required to be a valid open unit; nothing happens (esp. not the creation of an empty fort.n file) in this case. Differential Revision: https://reviews.llvm.org/D117819
-
Peter Klausler authored
The keyword value was misspelled in the runtime. Differential Revision: https://reviews.llvm.org/D117816
-
Clint Caywood authored
This is an optimization that using a single CPU instruction on supported architectures (amd64 and aarch64, but possibly others) to replace what was previously an iterative look-up-table algorithm. Originally I suggested using inline assembly for this in https://reviews.llvm.org/D117584. Reviewed By: lntue, sivachandra Differential Revision: https://reviews.llvm.org/D117684
-
Sam Clegg authored
Unresolved symbols are not currently reported when building with `-shared` or `-pie` so setting unresolvedSymbols doesn't have any effect. Differential Revision: https://reviews.llvm.org/D117737
-
Paweł Bylica authored
-
CJ Johnson authored
[clang-tidy] Update bugprone-stringview-nullptr to consistently prefer the empty string when passing arguments to constructors/functions Previously, function(nullptr) would have been fixed with function({}). This unfortunately can change overload resolution and even become ambiguous. T(nullptr) was already being fixed with T(""), so this change just brings function calls in line with that. Differential Revision: https://reviews.llvm.org/D117840
-
Craig Topper authored
All code should use one of the cleaner named hasVInstructions* functions. Fix the two uses that weren't and delete the methods so no new uses can be created.
-
Siva Chandra Reddy authored
Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D117838
-
- Jan 20, 2022
-
-
Craig Topper authored
RISCV only has a unary shuffle that requires places indices in a register. For interleaving two vectors this means we need at least two vrgathers and a vmerge to do a shuffle of two vectors. This patch teaches shuffle lowering to use a widening addu followed by a widening vmaccu to implement the interleave. First we extract the low half of both V1 and V2. Then we implement (zext(V1) + zext(V2)) + (zext(V2) * zext(2^eltbits - 1)) which simplifies to (zext(V1) + zext(V2) * 2^eltbits). This further simplifies to (zext(V1) + zext(V2) << eltbits). Then we bitcast the result back to the original type splitting the wide elements in half. We can only do this if we have a type with wider elements available. Because we're using extends we also have to be careful with fractional lmuls. Floating point types are supported by bitcasting to/from integer. The tests test a varied combination of LMULs split across VLEN>=128 and VLEN>=512 tests. There a few tests with shuffle indices commuted as well as tests for undef indices. There's one test for a vXi64/vXf64 vector which we can't optimize, but verifies we don't crash. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D117743
-
Rob Suderman authored
Right shift can occur that is a 32-bit right shift. This is undefined behavior. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D117732
-
owenca authored
Differential Revision: https://reviews.llvm.org/D117759
-
Nathan James authored
-
Michael Kruse authored
Follow-up on D117226 for applyStaticWorkshareLoop and applyDynamicWorkshareLoop checking for conflicting InertPoints via an assert. There is no in-tree code that violates this assertion, hence nothing changes.
-
Philip Reames authored
Certain bots (e.g. sanitizer-x86_64-linux-android) appear to be running with strict c++98 flags which disallow ; at global scope.
-
Stanislav Mekhanoshin authored
Compares, v_cndmask_b32, and v_readfirstlane_b32 use EXEC in a way which modifies the result. This implicit EXEC use shall not be ignored for the purposes of instruction moves. Differential Revision: https://reviews.llvm.org/D117814
-
Philip Reames authored
-
Adrian Prantl authored
This patch works around what looks like a bug in Clang itself. The error on the bot is: https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/40466/consoleText In module 'LLVM_Utils' imported from /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/lldb/source/Plugins/ScriptInterpreter/Python/lldb-python.h:18: /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/include/llvm/Support/Error.h:720:3: error: 'llvm::Expected<bool>::(anonymous)' from module 'LLVM_Utils.Support.Error' is not present in definition of 'llvm::Expected<bool>' in module 'LLVM_Utils.Support.Error' union { ^ /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/include/llvm/Support/Error.h:720:3: note: declaration of '' does not match /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/include/llvm/Support/Error.h:720:3: note: declaration of '' does not match 1 error generated. The intention is to revert this as soon as a proper fix has been identified! rdar://87845391
-
John Ericson authored
We previously had a few varied definitions of this floating around. I made the one installed with LLVM handle all the cases, and then made the others use it. This issue was reported to me in https://reviews.llvm.org/D116521#3248117 as D116521 made clang and llvm use the common cmake utils. Reviewed By: sebastian-ne, phosek, #libunwind, #libc, #libc_abi, ldionne Differential Revision: https://reviews.llvm.org/D117537
-
John Ericson authored
This gives us the option of using CMake modules from LLVM, and other things. We will use that to deduplicate code later. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D117815
-
Tue Ly authored
Add to log2f 2 more exceptional cases got when not using fma for polyeval. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D117812
-
River Riddle authored
Interfaces in Interfaces/ should not depend on any dialects, and this include is unnecessary anyways.
-
Philip Reames authored
-
Sanjay Patel authored
We already have the related folds for zext-of-bool, so it should make things more consistent to have this transform to select for sext-of-bool too: https://alive2.llvm.org/ce/z/YikdfA Fixes #53319
-
Sanjay Patel authored
-
Craig Topper authored
No instructions should be using these. Everything should use HasVInstructions* Predicates. Remove them so that they can't be used by accident.
-
Krzysztof Drewniak authored
- Set the DEBUG_TYPE of SerializeToBlob to serialize-to-blob - Add debug output to print the assembly or PTX for GPU modules before they are assembled and linked Note that, as SerializeToBlob is a superclass of SerializeToCubin and SerializeToHsaco, --debug-only=serialize-to-blom will dump the intermediate compiler result for both of these passes. In addition, if LLVM options such as --stop-after are used to control the GPU kernel compilation process, the debug output will contain the appropriate intermediate IR. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D117519
-