- Aug 12, 2021
-
-
Craig Topper authored
-Add Z for the B extension subextensions. -Don't mention I along with B or its sub extensions. This is based on comments in D107817. Differential Revision: https://reviews.llvm.org/D107992
-
Andrzej Warzynski authored
https://reviews.llvm.org/D106137 added support for plugins in Flang. The CMake configuration for plugins requires some LLVM variables that are not available in out-of-tree builds (e.g. `LLVM_SHLIB_OUTPUT_INTDIR`). This has caused the out-of-tree BuildBot worker to start failing: * https://lab.llvm.org/buildbot/#/builders/175 This patch effectively disables plugins in out-of-tree builds and fixes the configuration error. In order to support plugins in out-of-tree builds, we would have to find a way to access the missing CMake variables from LLVM. This could be implemented at a later time. Differential Revision: https://reviews.llvm.org/D107973
-
Sam McCall authored
This adds a method to Attr to get at the documentation programmatically. Differential Revision: https://reviews.llvm.org/D107703
-
Vitaly Buka authored
In some binaries, built with clang/lld, libunwind crashes with "unsupported x86_64 register" for regNum == 16: Differential Revision: https://reviews.llvm.org/D107919
-
Michael Kruse authored
-
Bill Wendling authored
-
Bill Wendling authored
A DiffConsumer object may be reused, but we'd like to reset it before the next use. No functionality change intended. Differential Revision: https://reviews.llvm.org/D107985
-
Aart Bik authored
Using the python API to easily set up sparse kernels, this test exhaustively builds, compilers, and runs SpMM for all annotations on a sparse tensor, making sure every version generates the correct result. This test also illustrates using the python API to set up a sparse kernel and sparse compilation. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D107943
-
Vyacheslav Zakharin authored
It is a NFC to enumerate types of LLVMOMPOFFLOAD ELF notes. Differential Revision: https://reviews.llvm.org/D99612
-
Ivan Zhechev authored
Flang uses positional arguments for `messages::say()`, such as "%1$s" which is only supported in MS Compilers with the `_*printf_p` form of the function. This uses a conditional macro to convert the existing `vsnprintf` used to the one needed in MS-World. 7 tests in D107575 rely on this change. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D107654
-
Florian Hahn authored
This reverts the revert 28c04794. The failing MLIR test that caused the revert should be fixed in this version. Also includes a PPC test fix previously in 1f87c7c4.
-
Louis Dionne authored
Since we officially don't support several older compilers now, we can drop a lot of the markup in the test suite. This helps keep the test suite simple and makes sure that UNSUPPORTED annotations don't rot. This is the first patch of a series that will remove annotations for compilers that are now unsupported. Differential Revision: https://reviews.llvm.org/D107787
-
LLVM GN Syncbot authored
-
Kiran Chandramohan authored
std::clock_t can be an unsigned value on some platforms like MacOS and therefore needs a cast when initializing an std::clock_t value with -1. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D107972
-
Craig Topper authored
DAGCombiner::visitStore can call GetDemandedBits which will remove upper bits from immediates. The upper bits are important for good materialization of negative constants on RISCV. GetDemandedBits is a different mechanism than SimplifyDemandedBits so TargetShrinkDemandedConstant can't block it. As far as I know this behavior is unique to stores. I think we can fix this in isel using a concept similar to D107658. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D107860
-
zoecarver authored
Differential Revision: https://reviews.llvm.org/D107920
-
Craig Topper authored
[RISCV] Teach vsetvli insertion pass that it doesn't need to insert vsetvli for unit-stride or strided loads/stores in some cases. For unit-stride and strided load/stores we set the SEW operand of the pseudo instruction equal the EEW in the opcode. The LMUL of the pseudo instruction is the LMUL we want. These instructions calculate EMUL=(EEW/SEW) * LMUL. We can use this to avoid changing vtype if the SEW/LMUL of the previous vtype matches the EEW/EMUL ratio we need for the instruction. Due to how the global analysis works, we can only do this optimization when the previous vsetvli was produced in the block containing the store. We need to know in the first phase if the vsetvli will be inserted so we can propagate information to the successors in the second phase correctly. This means we can't depend on predecessors. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D106601
-
Craig Topper authored
-
Craig Topper authored
Instead of using scalar size divided by 8 for segment loads, get the alignment from clang's type system. Make vleff match for consistency. Also replace uses of getPointerElementType() which will be removed as part of the OpaquePtr changes. Reviewed By: HsiangKai Differential Revision: https://reviews.llvm.org/D106738
-
Roman Lebedev authored
We really shouldn't deal with a conditional branch that can be trivially constant-folded into an unconditional branch. Indeed, barring failure to trigger BB reprocessing, that should be true, so let's assert as much, and hope the assertion never fires. If it does, we have a bug to fix.
-
Roman Lebedev authored
Mainly, i want to add an assertion that `SimplifyCFGOpt::simplifyCondBranch()` doesn't get asked to deal with non-unconditional branches, and if i do that, then said assertion fires on existing tests, and this is what prevents it from firing.
-
Kazu Hirata authored
The function was introduced without a use on Sep 16, 2011 in commit 5acab501.
-
Jon Chesterfield authored
This reverts commit b6113548.
-
Sanjay Patel authored
This is a direct translation of the select folds added with D53033 / D53036 and another step towards canonicalization using the intrinsics (see D98152).
-
Sanjay Patel authored
-
peter klausler authored
Recent work in runtime assignments failed an assertion in fir-dev while running tests (flang/test/Semantics/defined-ops.f90). This test didn't fail in llvm-project/main because only the "new" Arm driver is used now, and that only builds runtime derived type information tables when some debug dumping options are enabled. So add a reproducing test case to another test that is run with -fdebug-dump-symbols, and fix the crash by emitting special procedure binding information only for type-bound generic ASSIGNMENT(=) bindings that are relevant to the runtime support library for use in intrinsic assignment of derived types. Differential Revision: https://reviews.llvm.org/D107918
-
Fangrui Song authored
New ports in glibc typically don't define ELF_INITFINI, so DT_INIT/DT_FINI support is disabled. (rhel ppc64le likely patches their glibc this way as well.) musl can disable DT_INIT/DT_FINI via -DNO_LEGACY_INITFINI. So we cannot guarantee ctor()/dtor() will be printed.
-
Jon Chesterfield authored
Fixes miscompile of calls into ocml. Bug 51445. The stack variable `double __tmp` is moved to dynamically allocated shared memory by CGOpenMPRuntimeGPU. This is usually fine, but when the variable is passed to a function that is explicitly annotated address_space(5) then allocating the variable off-stack leads to a miscompile in the back end, which cannot decide to move the variable back to the stack from shared. This could be fixed by removing the AS(5) annotation from the math library or by explicitly marking the variables as thread_mem_alloc. The cast to AS(5) is still a no-op once IR is reached. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D107971
-
maekawatoshiki authored
Currently, LNICM pass does not support sinking instructions out of loop nest. This patch enables LNICM to sink down as many instructions to the exit block of outermost loop as possible. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D107219
-
Aaron Ballman authored
This is a fix for commit d2c5cbc3
-
Hongtao Yu authored
Previoulsy debug-info-for-profiling and pseudo-probe-for-profiling are mutual exclusive because they compete the dwarf discrimnator for callsites on the IR. This changes allows to use the two switches together. The side effect is that callsite discriminators will be taken by pseudo probe, while discriminators for other instructions are still available for AutoFDO use. This is less than ideal, however, it still allows us a chance to smoothly transition from AutoFDO to CSSPGO, by collecting both profiles from a CSSPGO binary. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D107876
-
Sanjay Patel authored
This was just added with 6de1dbbd , and I missed pulling the extra arg from the final revision.
-
Johannes Doerfert authored
If we cast a function at the call site it is hard(er) to get the rewrite correct, let's not attempt it for now. Fixes PR51448.
-
Johannes Doerfert authored
-
Johannes Doerfert authored
See D106309. Differential Revision: https://reviews.llvm.org/D107906
-
David Green authored
Given a constant operand, the MVE and DAGCombine combines could fight, each redistributing in the opposite order. Add a guard to the MVE vecreduce distribution to prevent that.
-
LLVM GN Syncbot authored
-
Florin Iucha authored
Add a check for enforcing minimum length for variable names. A default minimum length of three characters is applied to regular variables (including function parameters). Loop counters and exception variables have a minimum of two characters. Additionally, the 'i', 'j' and 'k' are accepted as legacy values. All three sizes, as well as the list of accepted legacy loop counter names are configurable.
-
Sanjay Patel authored
This diff was accidentally committed with: 1b5a1958
-