- Nov 30, 2020
-
-
George authored
`bool` is pretty well supported by now in C, and using it in place of `int` is not only more semantically accurate, but also improves automatic bindings for languages like Swift. There is more discussion here: https://llvm.discourse.group/t/adding-mlirbool-to-c-bindings/2280/5 Reviewed By: ftynse, mehdi_amini Differential Revision: https://reviews.llvm.org/D92193
-
Nathan Ridge authored
Fixes https://bugs.llvm.org/show_bug.cgi?id=47809 Differential Revision: https://reviews.llvm.org/D92272
-
- Nov 29, 2020
-
-
Fangrui Song authored
For --gc-sections, SmallVector<InputSection *, 256> -> SmallVector<InputSection *, 0> because the code bloat (1296 bytes) is not worthwhile (the saved reallocation is negligible). For OutputSection::compressedData, N=1 is useless (for a compressed .debug_*, the size is always larger than 1).
-
Nikita Popov authored
Information for pointer size/alignment/etc is queried a lot, but the binary search based implementation makes this fairly slow. Add an explicit check for address space zero and skip the search in that case -- we need to specially handle the zero address space anyway, as it serves as the fallback for all address spaces that were not explicitly defined. I initially wanted to simply replace the binary search with a linear search, which would handle both address space zero and the general case efficiently, but I was not sure whether there are any degenerate targets that use more than a handful of declared address spaces (in-tree, even AMDGPU only declares six).
-
Fangrui Song authored
The function took 1% (161MiB clang) to 1.7% (an 4.9GiB executable) time.
-
Craig Topper authored
[RISCV] Change predicate on InstAliases for GORCI/GREVI/SHFLI/UNSHFLI to HasStdExtZbp instead of HasStdExtZbbOrZbp. This matches the predicate on the instructions. Though I think some specific encodings are valid in Zbb, but not all of them.
-
Jacques Pienaar authored
Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). This commit reverts d9da4c3e. Fixes missing headers (don't know how that was working locally). Differential Revision: https://reviews.llvm.org/D91672
-
Fangrui Song authored
-
Florian Hahn authored
-
George authored
This mirrors the underlying C++ api. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D92252
-
Joachim Protze authored
The test would fail for gcc, when built with debug flag.
-
Joachim Protze authored
The test had a chance to finish the first task before the second task is created. In this case, the dependences-pair event would not trigger.
-
Florian Hahn authored
Interleave groups also depend on the values they store. Manage the stored values as VPUser operands. This is currently a NFC, but is required to allow VPlan transforms and to manage generated vector values exclusively in VPTransformState.
-
Sanjay Patel authored
As suggested in D92247 (and independent of whatever we decide to do there), this code is confusing as-is. Hopefully, this is at least mildly better. We might be able to do better still, but we have a function called "removePredecessor" with this behavior: "Note that this function does not actually remove the predecessor." (!)
-
Sanjay Patel authored
As noted in D92247 (and independent of that patch): http://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments "Don’t duplicate the documentation comment in the header file and in the implementation file. Put the documentation comments for public APIs into the header file."
-
Harald van Dijk authored
x86-64 ILP32 mode (x32) uses 32-bit size_t, so share the code with ix86 to zero out padding bits, not with x86-64 LP64 mode. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D91349
-
Sam McCall authored
-
Sam McCall authored
This cache went away in 73fdd998 This time, the cache is periodically validated against disk, so config is still mostly "live". The per-file cache reuses FileCache, but the tree-of-file-caches is duplicated from ConfigProvider. .clangd, .clang-tidy, .clang-format, and compile_commands.json all have this pattern, we should extract it at some point. TODO for now though. Differential Revision: https://reviews.llvm.org/D92133
-
Juneyoung Lee authored
This patch folds more operations to poison. Alive2 proof: https://alive2.llvm.org/ce/z/mxcb9G (it does not contain tests about div/rem because they fold to poison when raising UB) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92270
-
Juneyoung Lee authored
-
Nikita Popov authored
For recursive phis, we skip the recursive operands and check that the remaining operands are NoAlias with an unknown size. Currently, this is limited to inbounds GEPs with positive offsets, to guarantee that the recursion only ever increases the pointer. Make this more general by only requiring that the underlying object of the phi operand is the phi itself, i.e. it it based on itself in some way. To compensate, we need to use a beforeOrAfterPointer() location size, as we no longer have the guarantee that the pointer is strictly increasing. This allows us to handle some additional cases like negative geps, geps with dynamic offsets or geps that aren't inbounds. Differential Revision: https://reviews.llvm.org/D91914
-
Mehdi Amini authored
This reverts commit 6dd9596b. Build is broken.
-
Jacques Pienaar authored
Op with mapping from ops to corresponding shape functions for those op in the library and mechanism to associate shape functions to functions. The mapping of operand to shape function is kept separate from the shape functions themselves as the operation is associated to the shape function and not vice versa, and one could have a common library of shape functions that can be used in different contexts. Use fully qualified names and require a name for shape fn lib ops for now and an explicit print/parse (based around the generated one & GPU module op ones). Differential Revision: https://reviews.llvm.org/D91672
-
Dimitry Andric authored
This retrieves CPU affinity via FreeBSD's cpuset(2) API, and makes LLVM respect affinity settings configured by the user via the cpuset(1) command. In particular, this allows to reduce the number of threads used on machines with high core counts, which can interact badly with parallelized build systems. This is particularly noticable with lld, which spawns lots of threads even for linking e.g. hello_world! This fix is related to PR48193, but does not adress the more fundamental problem, which is that LLVM by default grabs as many CPUs and/or threads as possible. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92271
-
- Nov 28, 2020
-
-
LemonBoy authored
I took the "Permitted"/"Not Permitted" combo from the `Tag_ARM_ISA_use` case (GNU tools print "Yes"). Reviewed By: compnerd, MaskRay, simon_tatham Differential Revision: https://reviews.llvm.org/D90305
-
Mark de Wever authored
This should fix running the libc++ unit tests. They failed due to the changes introduced in 67c88e.
-
Nikita Popov authored
While we determine that (phi1, phi2) is noalias, we don't determine that (gep phi1 + 1, gep phi2 + 1) are also noalias.
-
Harald van Dijk authored
This ensures that failures show up in regular builds, rather than only when expensive checks are enabled. Differential Revision: https://reviews.llvm.org/D91339
-
Harald van Dijk authored
The build bots caught two additional pre-existing problems exposed by the test change part of my change https://reviews.llvm.org/D91339, when expensive checks are enabled. https://reviews.llvm.org/D91924 fixes one of them, this fixes the other. FixupSetCC will change code in the form of %setcc = SETCCr ... %ext1 = MOVZX32rr8 %setcc to %zero = MOV32r0 %setcc = SETCCr ... %ext2 = INSERT_SUBREG %zero, %setcc, %subreg.sub_8bit and replace uses of %ext1 with %ext2. The register class for %ext2 did not take into account any constraints on %ext1, which may have been required by its uses. This change ensures that the original constraints are honoured, by instead of creating a new %ext2 register, reusing %ext1 and further constraining it as needed. This requires a slight reorganisation to account for the fact that it is possible for the constraining to fail, in which case no changes should be made. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D91933
-
Juneyoung Lee authored
This patch updates ConstantFold, so operations are folded into poison if possible. <alive2 proofs> casts: https://alive2.llvm.org/ce/z/WSj7rw binary operations (arithmetic): https://alive2.llvm.org/ce/z/_7dEyJ binary operations (bitwise): https://alive2.llvm.org/ce/z/cezjVN vector/aggregate operations: https://alive2.llvm.org/ce/z/BQ7hWz unary ops: https://alive2.llvm.org/ce/z/yBRs4q other ops: https://alive2.llvm.org/ce/z/iXbcFD Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92203
-
Harald van Dijk authored
The build bots caught two additional pre-existing problems exposed by the test change part of my change https://reviews.llvm.org/D91339, when expensive checks are enabled. This fixes one of them. X86 has CALL64r and CALL32r opcodes, where CALL64r takes a 64-bit register, and CALL32r takes a 32-bit register. CALL64r can only be used in 64-bit mode, CALL32r can only be used in 32-bit mode. LLVM would assume that after picking the appropriate CALLr opcode, a pointer-sized register would be a valid operand, but in x32 mode, a 64-bit mode, pointers are 32 bits. In this mode, it is invalid to directly pass a pointer to CALL64r, it needs to be extended to 64 bits first. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D91924
-
Paul C. Anagnostopoulos authored
Add one more timer to DAGISelEmitter to test the option. Differential Revision: https://reviews.llvm.org/D92146
-
Mark de Wever authored
Implements P1645: constexpr for <numeric> algorithms Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D90569
-
Kirill Bobyrev authored
Makes it easier to diagnose remote index issues with --debug-origins flag. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D92202
-
Christian Sigg authored
Differential Revision: https://reviews.llvm.org/D92265
-
Christian Sigg authored
Reviewed By: herhut, ftynse Differential Revision: https://reviews.llvm.org/D92111
-
Nikita Popov authored
The size requirement on V2 was present because it was not clear whether an unknown size would allow an access before the start of V2, which could then overlap. This is clarified since D91649: In this part of BasicAA, all accesses can occur only after the base pointer, even if they have unknown size. This makes the positive and negative offset cases symmetric. Differential Revision: https://reviews.llvm.org/D91482
-
Craig Topper authored
[RISCV] Add tests for existing (rotr (bswap X), (i32 16))->grevi pattern for RV32. Extend same pattern to rotl and GREVIW. Not sure why bswap was treated specially. This also applies to bitreverse or generic grevi. We can improve this in future patches. For now I just wanted to get the consistency and the test coverage as I plan to make some other changes around bswap.
-
LLVM GN Syncbot authored
-
Andrew Litteken authored
Reverting commit due to address sanitizer errors. > Extracting the similar regions is the first step in the IROutliner. > > Using the IRSimilarityIdentifier, we collect the SimilarityGroups and > sort them by how many instructions will be removed. Each > IRSimilarityCandidate is used to define an OutlinableRegion. Each > region is ordered by their occurrence in the Module and the regions that > are not compatible with previously outlined regions are discarded. > > Each region is then extracted with the CodeExtractor into its own > function. > > We test that correctly extract in: > test/Transforms/IROutliner/extraction.ll > test/Transforms/IROutliner/address-taken.ll > test/Transforms/IROutliner/outlining-same-globals.ll > test/Transforms/IROutliner/outlining-same-constants.ll > test/Transforms/IROutliner/outlining-different-structure.ll > > Reviewers: paquette, jroelofs, yroux > > Differential Revision: https://reviews.llvm.org/D86975 This reverts commit bf899e89.
-