- Oct 07, 2021
-
-
Jon Chesterfield authored
Follow on to D110006, related to D110957 Where implementations have diverged this resolves to match the new DeviceRTL - replaces definitions of this struct in deviceRTL and plugins with include - changes the dynamic_shared_size field from D110006 to 32 bits - handles stdint being unavailable in DeviceRTL - adds a zero initializer for the field to amdgpu - moves the extern declaration for deviceRTL to target_interface (omptarget.h is more natural, but doesn't work due to include order with debug.h) - Renames the fields everywhere to match the LLVM format used in DeviceRTL - Makes debug_level uint32_t everywhere (previously sometimes int32_t) Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D111069
-
Balázs Kéri authored
The code of `ASTImporter::Import(const Attr *)` was repetitive, it is now simplified. (There is still room for improvement but probably only after big changes.) Reviewed By: martong, steakhal Differential Revision: https://reviews.llvm.org/D110810
-
David Green authored
-
David Spickett authored
If you don't have ptrace permissions this test will fail to run silently, this adds a check for that and anything else that might do similar things. The output will now be: ``` FAILED test program did not run correctly, check gdb warnings /usr/bin/gdb: warning: Couldn't determine a path for the index cache directory. No symbol table is loaded. Use the "file" command. warning: Error disabling address space randomization: Operation not permitted warning: Could not trace the inferior process. warning: ptrace: Operation not permitted error: command failed with exit status: 255 ``` We already have a feature to check for a compatible python enabled gdb, so I think it's reasonable to check for this at test runtime. Note that this is different to the catch all at the end of the test script. That would be a case where you can trace but something else made it stop mid way that wasn't our test breakpoints. Reviewed By: saugustine Differential Revision: https://reviews.llvm.org/D110936
-
Pavel Labath authored
The previous version of the patch did not update the definitions in conditionally compiled code. This patch includes changes to ARC and windows targets. Original commit message was: These were added to support some mips registers on linux, but linux mips support has now been removed due. They are still referenced in the freebds mips implementation, but the completeness of that implementation is also unknown. All other architectures just set these fields to zero, which is a cause of significant bloat in our register info definitions. Arm also has registers with variable sizes, but they were implemented in a more gdb-compatible fashion and don't use this feature. Differential Revision: https://reviews.llvm.org/D110914
-
Jay Foad authored
Fix the calculation of ReplacedAllUntiedUses when any of the tied defs are early-clobber. The effect of this is to fix the placement of kill flags on an instruction like this (from @f2 in test/CodeGen/SystemZ/asm-18.ll): INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], killed %4:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit After TwoAddressInstruction without this patch: %3:grh32bit = COPY killed %4:grh32bit INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], %3:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit Note that the COPY kills %4, even though there is a later use of %4 in the INLINEASM. This fails machine verification if you force it to run after TwoAddressInstruction (currently it is disabled for other reasons). After TwoAddressInstruction with this patch: %3:grh32bit = COPY %4:grh32bit INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], %3:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit Differential Revision: https://reviews.llvm.org/D110848
-
Jay Foad authored
-
Adrian Kuegel authored
ConstShapeOp has a constant shape, so its type can always be static. We still allow it to have ShapeType though. Differential Revision: https://reviews.llvm.org/D111139
-
Cullen Rhodes authored
Changes in architecture revision 00eac1: * Renamed to PSEL. * Copies whole source register. * Element type suffix removed from destination. * Element index no longer optional and '#' prefix has been removed. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-09 Depends on D111212. Reviewed By: kmclaughlin Differential Revision: https://reviews.llvm.org/D111213
-
Cullen Rhodes authored
Changes in architecture revision 00eac1: * Tile slice index offset no longer prefixed with '#'. * The syntax for 128-bit (.Q) ZA tile slice accesses must now include an explicit zero index. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-09 Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D111212
-
Florian Hahn authored
This makes the tests a bit more robust with respect to small changes in the value numbering.
-
David Spickett authored
When the locale is not some UTF-8 these tests fail. (different results for python2 linked gdbs vs. python3 but same issue) Setting the locale just for the test works around this. By default Ubuntu comes with just C.UTF-8. I've chosen to use en_US.UTF-8 instead given that my Mac doesn't have the former and there's a slim chance this test might run there. This also enables the u16string tests which are now passing. Reviewed By: #libc, ldionne, saugustine Differential Revision: https://reviews.llvm.org/D111138
-
David Spickett authored
Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D111235
-
Tobias Gysi authored
Update OpDSL to support unsigned integers by adding unsigned min/max/cast signatures. Add tests in OpDSL and on the C++ side to verify the proper signed and unsigned operations are emitted. The patch addresses an issue brought up in https://reviews.llvm.org/D111170. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D111230
-
Saiyedul Islam authored
Fixes 4c411708.
-
Mikael Holmen authored
-
Saiyedul Islam authored
An archive containing device code object files can be passed to clang command line for linking. For each given offload target it creates a device specific archives which is either passed to llvm-link if the target is amdgpu, or to clang-nvlink-wrapper if the target is nvptx. -L/-l flags are used to specify these fat archives on the command line. E.g. clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp -L. -lmylib It currently doesn't support linking an archive directly, like: clang++ -fopenmp -fopenmp-targets=nvptx64 main.cpp libmylib.a Linking with x86 offload also does not work. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D105191
-
hsmahesha authored
which is essentially required as a pre-commit for https://reviews.llvm.org/D110257. Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D110676
-
Jason Molenda authored
A Mach-O corefile has an array of memory segments, representing the memory of the process at the point of the capture. Each segment has a virtual address + size, and a file offset + size. The file size may be less than the virtual address size, indicating that the memory was unavailable. When ProcessMachCore::DoLoadCore scans this array of memory segments, it builds up a table to translate virtual addresses to file offsets, for memory read requests. This lookup table combines contiguous memory segments into a single entry, to reduce the number of entries (some corefile writers will emit a separate segement for each virtual meory page). This contiguous check wasn't taking into account a segment that isn't present in the corefile, e.g. filesize==0, and every contiguous memory segment after that point would result in lldb reading the wrong offset of the file because it didn't account for this. I'd like to have an error message when someone tries to read memory from one of these segments, instead of returning all zeroes, so this patch intentionally leaves these out of the vmaddr -> fileoff table (and avoids combining them with segments that actually do exist in the corefile). I'm a little unsure of writing a test for this one; I'd have to do a yaml2obj of a corefile with the problem, or add an internal mode to the Mach-O process save-core where it could write a filesize==0 segment while it was writing one. rdar://83382487
-
Jason Molenda authored
When BridgeOS and DriverKit are added to llvm Triple.
-
Uday Bondhugula authored
Add folding rule for std.or op when an operand has all bits set. or(x, <all bits set>) -> <all bits set> Differential Revision: https://reviews.llvm.org/D111206
-
Itay Bookstein authored
To better reflect the meaning of the now-disambiguated {GlobalValue, GlobalAlias}::getBaseObject after breaking off GlobalIFunc::getResolverFunction (D109792), the function is renamed to getAliaseeObject.
-
Artem Belevich authored
This should fix bot failures on PPC and windows.
-
Mogball authored
Missing function defs causes errors on some build configs.
-
David Blaikie authored
This reverts c7f16ab3 / r109694 - which suggested this was done to improve consistency with the gdb test suite. Possible that at the time GCC did not canonicalize integer types, and so matching types was important for cross-compiler validity, or that it was only a case of over-constrained test cases that printed out/tested the exact names of integer types. In any case neither issue seems to exist today based on my limited testing - both gdb and lldb canonicalize integer types (in a way that happens to match Clang's preferred naming, incidentally) and so never print the original text name produced in the DWARF by GCC or Clang. This canonicalization appears to be in `integer_types_same_name_p` for GDB and in `TypeSystemClang::GetBasicTypeEnumeration` for lldb. (I tested this with one translation unit defining 3 variables - `long`, `long (*)()`, and `int (*)()`, and another translation unit that had main, and a function that took `long (*)()` as a parameter - then compiled them with mismatched compilers (either GCC+Clang, or Clang+(Clang with this patch applied)) and no matter the combination, despite the debug info for one CU naming the type "long int" and the other naming it "long", both debuggers printed out the name as "long" and were able to correctly perform overload resolution and pass the `long int (*)()` variable to the `long (*)()` function parameter) Did find one hiccup, identified by the lldb test suite - that CodeView was relying on these names to map them to builtin types in that format. So added some handling for that in LLVM. (these could be split out into separate patches, but seems small enough to not warrant it - will do that if there ends up needing any reverti/revisiting) Differential Revision: https://reviews.llvm.org/D110455
-
Kuba Mracek authored
[GlobalDCE] In VFE, replace the whole 'sub' expression of unused relative-pointer-based vtable slots Differential Revision: https://reviews.llvm.org/D109114
-
Stella Laurenzo authored
This was causing a subsequent assert/crash when a type converter failed to convert a block argument. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D110985
-
LLVM GN Syncbot authored
-
Artem Belevich authored
The patch implements header-only support for testure lookups. The patch has been tested on a source file with all possible combinations of argument types supported by CUDA headers, compiled and verified that the generated instructions and their parameters match the code generated by NVCC. Unfortunately, compiling texture code requires CUDA headers and can't be tested in clang itself. The test will need to be added to the test-suite later. While generated code compiles and seems to match NVCC, I do not have any code that uses textures that I could test correctness of the implementation. Hence the experimental status. Differential Revision: https://reviews.llvm.org/D110089
-
Arthur Eubanks authored
Fixes some bots where the two differ.
-
Richard Smith authored
declaration. Names starting with an underscore are reserved at the global scope, so cannot be used as the name of an extern "C" symbol in any scope because such usages conflict with a name at global scope.
-
Richard Smith authored
Also do not warn on `#define _foo` or `#undef _foo`. Only global scope names starting with _[a-z] are reserved, not the use of such an identifier in any other context.
-
Richard Smith authored
constexpr function body.
-
Philip Reames authored
When checking to see if we can apply IR flags to a SCEV, we need to identify a bound on the defining scope of the SCEV to be produced. We'd previously added support for a couple SCEVExpr types which trivially imply bounds, but hadn't handled types such as umax where the bounds come from the bounds of the operands. This does the obvious thing, and recurses through operands searching for a tighter bound on the defining scope. I'm honestly surprised by how little this seems to mater on existing tests, but it's worth doing for completeness sake alone. Differential Revision: https://reviews.llvm.org/D111191
-
Leonard Chan authored
This reverts commit c52d60ec. The failing scudo test came up again.
-
- Oct 06, 2021
-
-
Adrian Prantl authored
-
Adrian Prantl authored
-
Adrian Prantl authored
This has the nice side-effect that it can actually store the quadruple version numbers that Apple's tools are using nowadays. rdar://82982162 Differential Revision: https://reviews.llvm.org/D111200
-
Craig Topper authored
Without popcnt we had a special case for using the parity flag from a single test i8 test instruction if only bits 7:0 could be non-zero. That special case is still useful when we have popcnt. To reach this special case, we enable custom lowering of parity for i16/i32/i64 even when popcnt is enabled. The check for POPCNT being enabled is now after the special case in LowerPARITY. Fixes PR52093 Differential Revision: https://reviews.llvm.org/D111249
-
Jennifer Yu authored
adjustMemberOfForLambdaCaptures. The problem is happening when user passes lambda function with reference type in the map clause. The natural of the problem when processing generateInfoForCapture, the BasePointer is generated with new load for a lambda variable with reference type. It is not expected in adjustMemberOfForLambdaCaptures. One way to fix this is to skipping call to generateInfoForCapture for map(to:lambda). The map info will be generated later in the call to generateDefaultMapInfo samiler as firsprivate clase. This to fix https://bugs.llvm.org/show_bug.cgi?id=52071 Differential Revision:https://reviews.llvm.org/D111115
-