- Feb 22, 2021
-
-
Louis Dionne authored
The implementation of tuple's constructors and assignment operators currently diverges from the way the Standard specifies them, which leads to subtle cases where the behavior is not as specified. In particular, a class derived from a tuple-like type (e.g. pair) can't be assigned to a tuple with corresponding members, when it should. This commit re-implements the assignment operators (BUT NOT THE CONSTRUCTORS) in a way much closer to the specification to get rid of this bug. Most of the tests have been stolen from Eric's patch https://reviews.llvm.org/D27606. As a fly-by improvement, tests for noexcept correctness have been added to all overloads of operator=. We should tackle the same issue for the tuple constructors in a future patch - I'm just trying to make progress on fixing this long-standing bug. PR17550 rdar://15837420 Differential Revision: https://reviews.llvm.org/D50106
-
Heejin Ahn authored
- Fix `preds` comments - Delete nonexistent attributes in instructions (They used to exist in clang-generated files, but I removed most of them to make the tests tidy. We have only `nounwind` and `noreturn` left here.) - Add missing `Function Attrs` comments in function declarations None of these affect test function semantics or test results for now. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D97179
-
Jez Ng authored
-
Nathan James authored
Prevent warning when the values are initialized using fields that will be initialized later or VarDecls defined in the constructors body. Both of these cases can't be safely fixed. Also improve logic of finding where to insert member initializers, previously it could be confused by in class member initializers. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D97132
-
Geoffrey Martin-Noble authored
Followup to https://reviews.llvm.org/D97006 which broke the shared libs build because of a missing dependency. Differential Revision: https://reviews.llvm.org/D97213
-
Peyton, Jonathan L authored
This code alleviates some pathological loop parameters (lower, upper, stride) within calculations involved in the static loop code. It bounds the chunk size to the trip count if it is greater than the trip count and also minimizes problematic code for when trip count < nth. Differential Revision: https://reviews.llvm.org/D96426
-
Vivek authored
The current implementation of tilePerfectlyNested utility doesn't handle the non-unit step size. We have added support to perform tiling correctly even if the step size of the loop to be tiled is non-unit. Fixes https://bugs.llvm.org/show_bug.cgi?id=49188. Differential Revision: https://reviews.llvm.org/D97037
-
Peyton, Jonathan L authored
Only attempt shutdown if lpReserved is NULL. The Windows documentation states: When handling DLL_PROCESS_DETACH, a DLL should free resources such as heap memory only if the DLL is being unloaded dynamically (the lpReserved parameter is NULL). If the process is terminating (the lpReserved parameter is non-NULL), all threads in the process except the current thread either have exited already or have been explicitly terminated by a call to the ExitProcess function, which might leave some process resources such as heaps in an inconsistent state. In this case, it is not safe for the DLL to clean up the resources. Instead, the DLL should allow the operating system to reclaim the memory. Differential Revision: https://reviews.llvm.org/D96750
-
Peyton, Jonathan L authored
This patch limits the number of dispatch buffers (used for loop worksharing construct) to between 1 and 4096. Differential Revision: https://reviews.llvm.org/D96749
-
Petr Hosek authored
This reverts commits: 5ca21175 97184ab9 The instrprof-gc-sections.c is failing on AArch64 LLD bot.
-
Geoffrey Martin-Noble authored
This patch adds Linalg named ops for various types of integer matmuls. Due to limitations in the tc spec/linalg-ods-gen ops cannot be type polymorphic, so this instead creates new ops (improvements to the methods for defining Linalg named ops are underway with a prototype at https://github.com/stellaraccident/mlir-linalgpy). To avoid the necessity of directly referencing these many new ops, this adds additional methods to ContractionOpInterface to allow classifying types of operations based on their indexing maps. Reviewed By: nicolasvasilache, mravishankar Differential Revision: https://reviews.llvm.org/D97006
-
Benjamin Kramer authored
operands[2] can be nullptr here. I'm not able to build a lit test for this because of the commutative reordering of operands. It's possible to trigger this with a createOrFold<BroadcastOp> though. Differential Revision: https://reviews.llvm.org/D97206
-
Peyton, Jonathan L authored
Differential Revision: https://reviews.llvm.org/D96748
-
Vy Nguyen authored
Reland 1a0afcf5 https://reviews.llvm.org/D95913 New change: fix UB bug caused by copying empty path/name. (since the executable does not have a name)
-
Simon Pilgrim authored
-
Leonard Chan authored
This is necessary for compilation with [thin]lto. Differential Revision: https://reviews.llvm.org/D96170
-
Jessica Paquette authored
Similar to how we emit G_ASSERT_ZEXT when we have CCValAssign::LocInfo::ZExt. This will allow us to combine away some redundant sign extends. Example: https://godbolt.org/z/cTbKvr Differential Revision: https://reviews.llvm.org/D96915
-
Renaud-K authored
https://reviews.llvm.org/D97093
-
Florian Hahn authored
ICMP_NE predicates cannot be directly represented as constraint. But we can use ICMP_UGT instead ICMP_NE for %x != 0. See https://alive2.llvm.org/ce/z/XlLCsW
-
Craig Topper authored
A previous patch moved the index versions. This moves the rest. I also removed the custom lowering for VLEFF since we can now do everything directly in the isel handling. I had to update getLMUL to handle mask registers to index the pseudo table correctly for VLE1/VSE1. This is good for another 15K reduction in llc size. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D97097
-
Vinayaka Bandishti authored
[MLIR][affine] Prevent fusion when ops with memory effect free are present between producer and consumer This commit fixes a bug in affine fusion pipeline where an incorrect fusion is performed despite a dealloc op is present between a producer and a consumer. This is done by creating a node for dealloc op in the MDG. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D97032
-
Joachim Protze authored
When compiling with ccache, compiler commands get split into smaller steps and clang's default -Wunused-command-line-argument complains about unused include directory arguments. In combination -Werror, compilation aborts. If CMAKE_C_FLAGS contains -Wno-unused-command-line-argument or -Wno-error=unused-command-line-argument, the latter flag is passed into the build script. This is a re-commit. The previous version was reverted because of failing tests. Differential Revision: https://reviews.llvm.org/D96762
-
Nikita Popov authored
If the call is readnone, then there may not be any MemoryAccess associated with the call. Bail out in that case. This fixes the issue reported at https://reviews.llvm.org/D94376#2578312.
-
Nathan James authored
If a static assert has a message as the right side of an and condition, suggest a fix it of replacing the '&&' to ','. `static_assert(cond && "Failed Cond")` -> `static_assert(cond, "Failed cond")` This use case comes up when lazily replacing asserts with static asserts. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D89065
-
Nikita Popov authored
When cloning instructions during jump threading, also clone and adapt any declared scopes. This is primarily important when threading loop exits, because we'll end up with two dominating scope declarations in that case (at least after additional loop rotation). This addresses a loose thread from https://reviews.llvm.org/rG2556b413a7b8#975012. Differential Revision: https://reviews.llvm.org/D97154
-
Fangrui Song authored
Differential Revision: https://reviews.llvm.org/D97161
-
Jez Ng authored
-
Jez Ng authored
dyld on iOS will complain if the LIB64 bit is set. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D96565
-
Florian Hahn authored
-
Tim Keith authored
Add -J to the f18 driver for compatibility with gfortran. Add -module-dir for compatibility with the new flang driver. They both set the output directory for .mod files and add the directory to the search list. -module still only does the former. Clean up the new driver test to match. Differential Revision: https://reviews.llvm.org/D97164
-
Simon Pilgrim authored
Use APInt::uge() direct instead of getLimitedValue(). Use KnownBits::getMinValue() to make the bounds check more obvious.
-
Florian Hahn authored
VPBlendRecipes with single incoming (value, mask) pair are no-ops. Use the incoming value directly.
-
Shilei Tian authored
In current implementation of `deviceRTLs`, we're using some functions that are CUDA version dependent (if CUDA_VERSION < 9, it is one; otheriwse, it is another one). As a result, we have to compile one bitcode library for each CUDA version supported. A worse problem is forward compatibility. If a new CUDA version is released, we have to update CMake file as well. CUDA 9.2 has been released for three years. Instead of using various weird tricks to make `deviceRTLs` work with different CUDA versions and still have forward compatibility, we can simply drop support for CUDA 9.1 or lower version. It has at least two benifits: - We don't need to generate bitcode libraries for each CUDA version; - Clang driver doesn't need to search for the bitcode lib based on CUDA version. We can claim that starting from LLVM 12, OpenMP offloading on NVPTX target requires CUDA 9.2+. Reviewed By: jdoerfert, JonChesterfield Differential Revision: https://reviews.llvm.org/D97003
-
Nathan James authored
Swapping the order of Init and MemberOrEllipsisLocation removes 8 bytes (20%) of padding on 64bit builds. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D97191
-
Sanjay Patel authored
-
Anastasia Stulova authored
After updating clang driver to include standard OpenCL headers implicitly, the output being checked in the test does not match because the implicit header contains other pragmas. The test does not aim to use the header and therefore it has to be updated passing '-cl-no-stdinc' command-line flag. This fixes failing bots.
-
Mikhail Goncharov authored
This reverts commit ba1d9546.
-
Mikhail Goncharov authored
This reverts commit 6984e0d4. While change by itself seems to be consistent with nullPointerConstant docs of not matching "int i = 0;" but it's not clear why it's wrong and 9148302a author just forgot to update the doc.
-
Anastasia Stulova authored
This change enables the builtin function declarations in clang driver by default using the Tablegen solution along with the implicit include of 'opencl-c-base.h' header. A new flag '-cl-no-stdinc' disabling all default declarations and header includes is added. If any other mechanisms were used to include the declarations (e.g. with -Xclang -finclude-default-header) and the new default approach is not sufficient the, `-cl-no-stdinc` flag has to be used with clang to activate the old behavior. Tags: #clang Differential Revision: https://reviews.llvm.org/D96515
-
Simon Pilgrim authored
-