- Nov 19, 2020
-
Lei Zhang authored
This commit starts a new pass and patterns for converting Linalg named ops to generic ops. This enables us to leverage the flexibility of generic ops during transformations. Right now only linalg.conv is supported; others will be added when useful. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D91357
-
Arthur O'Dwyer authored
(1) Add _VSTD:: qualification to __swap_allocator. (2) Add _VSTD:: qualification consistently to __to_address. (3) Add some more missing _VSTD:: to <vector>, with a regression test. This part is cleanup after d9a4f936. Note that a vector whose allocator actually runs afoul of any of these ADL calls will likely also run afoul of simple things like `v1 == v2` (which is also an ADL call). But, still, libc++ should be consistent in qualifying function calls wherever possible. Relevant blog post: https://quuxplusone.github.io/blog/2019/09/26/uglification-doesnt-stop-adl/ Differential Revision: https://reviews.llvm.org/D91708
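A minimal sketch of the ADL hazard this qualification avoids, using illustrative names rather than the real libc++ helpers (here _VSTD stands for libc++'s std-qualifying macro): an unqualified call to an "uglified" helper can still be hijacked by argument-dependent lookup when the argument's type lives in a user namespace that declares a function with the same name.

```c++
#include <iostream>

namespace lib {
  template <class T> void __do_work(T) { std::cout << "lib\n"; }
  template <class T> void process(T x) {
    __do_work(x);          // unqualified: ADL may pick the user's overload
    // lib::__do_work(x);  // qualified: always the library's own helper
  }
}

namespace user {
  struct FancyPtr {};
  void __do_work(FancyPtr) { std::cout << "user\n"; } // found via ADL
}

int main() { lib::process(user::FancyPtr{}); } // prints "user"
```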
-
Sven van Haastregt authored
Support/Compiler.h is included by c files (e.g. regcomp.c) where __cplusplus is not defined at all. Avoid evaluating the undefined macro for such files.
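A minimal sketch of the guard pattern described; the macro name and version check here are placeholders, not the actual contents of Support/Compiler.h:

```c++
// Only evaluate __cplusplus when it is actually defined, so that C
// translation units (e.g. regcomp.c) do not test an undefined macro.
#if defined(__cplusplus) && __cplusplus >= 201402L
#define LLVM_SOME_CXX14_FEATURE 1
#else
#define LLVM_SOME_CXX14_FEATURE 0
#endif
```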
-
Florian Hahn authored
-
Simon Pilgrim authored
-
Simon Pilgrim authored
[ValueTracking] computeKnownBitsFromShiftOperator - move shift amount analysis to top of the function. NFCI. These are all lightweight to compute and help avoid issues with Known being used to hold both the shift amount and then the shifted result. Minor cleanup for D90479.
-
David Green authored
This was already something that was handled by one of the "else" branches in maybeLoweredToCall, so this patch is an NFC but makes it explicit and adds a test. We may in the future want to support this under certain situations, but for the moment we just don't try to create low overhead loops with inline asm in them. Differential Revision: https://reviews.llvm.org/D91257
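A minimal sketch of the shape of loop this affects, assuming an Arm target where low overhead (hardware) loops apply: a loop whose body contains inline assembly is now explicitly left alone.

```c++
// The inline asm in the body may end up lowered to a call, so this loop
// is not converted into a low overhead loop.
void scale(int *p, int n) {
  for (int i = 0; i < n; ++i) {
    asm volatile("" ::: "memory");
    p[i] = i * 2;
  }
}
```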
-
Kirill Bobyrev authored
Address Sanitizer crashes on large allocations:

```c++
// Try to crash rather than hang on large allocation.
ScopedMemoryLimit MemLimit(1000 * 1024 * 1024); // 1GB
```
-
Michał Górny authored
Translate between abridged and full ftag values in order to expose the latter in the gdb-remote protocol while the former are used by FXSAVE/XSAVE... This matches the gdb behavior. Differential Revision: https://reviews.llvm.org/D91504
-
Michał Górny authored
The FXSAVE/XSAVE data can have two different layouts on x86_64. When called as FXSAVE/XSAVE..., the Instruction Pointer and Address Pointer registers are reported using a 16-bit segment identifier and a 32-bit offset. When called as FXSAVE64/XSAVE64..., they are reported using complete 64-bit offsets instead. LLDB has historically followed GDB and unconditionally assumed the 32-bit layout, with the slight modification of possibly using a 32-bit segment register (i.e. extending the register into the reserved 16 upper bits). When the underlying operating system used FXSAVE64/XSAVE64..., the pointer was split into two halves, with the upper half reported as the segment register. While reconstructing the full address was possible on the user end (and e.g. the FPU register tests did that), it certainly was not the most convenient option. Introduce two additional 'fip' and 'fdp' registers that overlap with 'fiseg'/'fioff' and 'foseg'/'foff' respectively, and report the complete 64-bit address. Differential Revision: https://reviews.llvm.org/D91497
-
Simon Pilgrim authored
[X86][AVX] Only share broadcasts of different widths from the same SDValue of the same SDNode (PR48215) D57663 allowed us to reuse broadcasts of the same scalar value by extracting low subvectors from the widest type. Unfortunately we weren't ensuring the broadcasts were from the same SDValue, just the same SDNode - which failed on multiple-value nodes like ISD::SDIVREM. FYI: I intend to request this be merged into the 11.x release branch. Differential Revision: https://reviews.llvm.org/D91709
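A minimal sketch of the distinction, assuming the usual SelectionDAG types: an SDValue names both a node and which of its results is meant, so comparing only the SDNode is too weak for multi-result nodes such as ISD::SDIVREM.

```c++
#include "llvm/CodeGen/SelectionDAGNodes.h"

// Comparing SDValues checks both the node and the result number.
bool sameBroadcastSource(llvm::SDValue A, llvm::SDValue B) {
  return A == B; // node *and* result index must match
}

// Comparing only the SDNode ignores which result is used; for a
// multi-result node this can conflate distinct values.
bool sameNodeOnly(llvm::SDValue A, llvm::SDValue B) {
  return A.getNode() == B.getNode();
}
```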
-
Joe Ellis authored
This patch allows C-style casting between fixed-size and scalable vectors. This kind of cast was previously blocked by the compiler, but it should be allowed. Differential Revision: https://reviews.llvm.org/D91262
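A minimal sketch of such a cast, assuming an AArch64 SVE target compiled with -msve-vector-bits=512; the flag and exact types are assumptions for illustration, not taken from the patch:

```c++
#include <arm_sve.h>

// A 512-bit fixed-size GNU vector, matching -msve-vector-bits=512.
typedef int32_t gnu_int32_t __attribute__((vector_size(64)));

// C-style casts between the fixed-size and scalable vector types,
// which the compiler previously rejected.
svint32_t to_scalable(gnu_int32_t v) { return (svint32_t)v; }
gnu_int32_t to_fixed(svint32_t v)    { return (gnu_int32_t)v; }
```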
-
Simon Moll authored
The assertion that vector widths are <= 256 elements was hard-wired in the LV code. E.g., VE allows for vectors of up to 512 elements. Test against the TTI vector register bit width instead - this is an NFC for non-asserting builds. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D91518
-
Gabriel Hjort Åkerlund authored
Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D91519
-
Florian Hahn authored
In some cases, the values passed to `asm sideeffect` calls cannot be mapped directly to simple MVTs. Currently, we crash in the backend if that happens. An example can be found in the @test_vector_too_large_r_m test case, where we pass <9 x float> vectors. In practice, this can happen in cases like the simple C++ example below.

```c++
using vec = float __attribute__((ext_vector_type(9)));
void f1(vec m) {
  asm volatile("" : "+r,m"(m) : : "memory");
}
```

One case that uses "+r,m" constraints for arbitrary data types in practice is google-benchmark's DoNotOptimize. This patch updates visitInlineAsm so that it uses MVT::Other for constraints with complex VTs. It looks like the rest of the backend correctly deals with that and properly legalizes the type. And we still report an error if there are no registers to satisfy the constraint. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D91710
-
Max Kazantsev authored
-
Max Kazantsev authored
-
Balázs Kéri authored
Do not warn for "pointer to aggregate" in a `sizeof(A) / sizeof(A[0])` expression if `A` is an array of pointers. This is the usual way of calculating the array length even if the array is of pointers. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D91543
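A minimal sketch of the pattern the checker now accepts (variable names are illustrative): computing the element count of an array whose elements happen to be pointers.

```c++
#include <cstddef>

const char *names[] = {"alpha", "beta", "gamma"};
// The usual array-length idiom; no longer flagged even though the
// elements are pointers.
const size_t count = sizeof(names) / sizeof(names[0]); // 3
```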
-
Ji Kim authored
For intrinsics with multiple returns where one or more operands are overloaded, the overloaded type is inferred from the corresponding field of the resulting struct, instead of accessing the result directly. As such, the hasResult parameter of LLVM_IntrOpBase (and derived classes) is replaced with numResults. The TableGen backend for intrinsics is also updated to populate this field with the total number of results. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D91680
-
Simon Moll authored
This defines the vec_broadcast SDNode along with lowering and isel code. We also remove unused type mappings for the vector register classes (all vector MVTs that are not used in the ISA go away). We will implement support for short vectors later by intercepting nodes with illegal vector EVTs before LLVM has had a chance to widen them. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91646
-
Sam Clegg authored
Differential Revision: https://reviews.llvm.org/D91769
-
Andrew Wei authored
Some nested loops may share the same ExitingBB, so after we finish FoldExit, we need to notify the outer loop and SCEV to drop any stored trip count. Patched by: guopeilin Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D91325
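A minimal sketch of the kind of invalidation described, assuming the usual ScalarEvolution API (this is not the patch's actual code): after folding an exit shared with an enclosing loop, cached trip counts for that loop must be dropped as well.

```c++
#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/ScalarEvolution.h"

void invalidateAfterFoldExit(llvm::Loop &L, llvm::ScalarEvolution &SE) {
  // Drop cached trip counts for the folded loop and for any parent
  // loop that shared the exiting block.
  if (llvm::Loop *Parent = L.getParentLoop())
    SE.forgetLoop(Parent);
  SE.forgetLoop(&L);
}
```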
-
Kadir Cetinkaya authored
-
Qiu Chaofan authored
According to the ELF v2 ABI, both IEEE 128-bit and IBM extended floating point variables should be quad-word (16 bytes) aligned. Previously, only vector types were considered quad-word aligned on PowerPC. This patch fixes the handling of IEEE 128-bit float arguments in va_arg cases. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D91596
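A minimal sketch of a va_arg case the alignment fix applies to, assuming a PowerPC64 ELFv2 target where __float128 (IEEE 128-bit) support is enabled:

```c++
#include <cstdarg>

// Reads the first variadic argument as an IEEE 128-bit float; va_arg must
// honor the 16-byte alignment of the in-memory argument.
__float128 first_f128(int n, ...) {
  va_list ap;
  va_start(ap, n);
  __float128 v = va_arg(ap, __float128);
  va_end(ap);
  return v;
}
```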
-
Siva Chandra Reddy authored
Targeted tests have been added. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D91752
-
Mircea Trofin authored
The lookup logic is also reusable. Also refactored the API to return the loaded vector - this makes it clearer what state it is in when an error occurs (as it won't be returned in that case). Differential Revision: https://reviews.llvm.org/D91759
-
Kazu Hirata authored
-
Mircea Trofin authored
It's generic for the 'development mode', not specific to the inliner case. Differential Revision: https://reviews.llvm.org/D91751
-
Craig Topper authored
Differential Revision: https://reviews.llvm.org/D91730
-
River Riddle authored
This prevents potential problems that occur when multiple pass managers register crash recovery contexts.
-
River Riddle authored
This allows for operations that exclusively affect symbol operations to better describe their side effects. Differential Revision: https://reviews.llvm.org/D91581
-
Kai Luo authored
-
Walter Erquinigo authored
Depends on D90490. The stop command is simple and invokes the new method Trace::StopTracingThread(thread). On the other hand, the start command works by delegating its implementation to a CommandObject provided by the Trace plugin. This is necessary because each trace plugin needs different options for this command. There's even the chance that a Trace plugin can't support live tracing, but instead supports offline decoding and analysis, which means that "thread trace dump instructions" works but "thread trace start" doesn't. Because of this and a few other reasons, it's better to have each plugin provide this implementation. Besides, I'm using the GetSupportedTraceType method introduced in D90490 to quickly infer which trace plug-in works for the current process. As an implementation note, I moved CommandObjectIterateOverThreads to its header so that I can use it from the IntelPT plugin. Besides, the actual start and stop logic for intel-pt is not part of this diff. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D90729
-
Chris Kennelly authored
This allows for matching the constructors std::string has in common with std::string_view. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D91015
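A minimal sketch of what "constructors in common" refers to (the buffer and values are illustrative): overload forms that both std::string and std::string_view provide.

```c++
#include <string>
#include <string_view>

void ctor_examples() {
  const char *buf = "hello world";
  std::string s1(buf);         // string(const char*)
  std::string s2(buf, 5);      // string(const char*, size_t)
  std::string_view v1(buf);    // string_view(const char*)
  std::string_view v2(buf, 5); // string_view(const char*, size_t)
}
```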
-
Duncan P. N. Exon Smith authored
Support: Avoid SmallVector::assign with a range from to-be-replaced vector in Windows GetExecutableName. This code wasn't valid, and 5abf76fb started asserting. This is a speculative fix since I don't have a Windows machine handy.
-
Duncan P. N. Exon Smith authored
2c196bbc asserted that `SmallVector::push_back` doesn't invalidate the parameter when it needs to grow. Do the same for `resize`, `append`, `assign`, `insert`, and `emplace_back`. Differential Revision: https://reviews.llvm.org/D91744
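A minimal sketch of the self-reference hazard these assertions are meant to catch (the exact failure mode shown is an assumption, not taken from the patch): passing a reference into the same vector to an operation that may grow, and therefore reallocate, it.

```c++
#include "llvm/ADT/SmallVector.h"

void example() {
  llvm::SmallVector<int, 2> V = {1, 2}; // inline storage is already full
  // Appending forces a grow; V[0] refers into the old storage and may
  // dangle mid-call, which the new assertions are intended to flag.
  V.append(3, V[0]);
}
```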
-
Aart Bik authored
Rationale: Make sure preconditions are tested already during verification. Currently, the only way a sparse rewriting rule can fail is if (1) the linalg op does not have sparse annotations, or (2) a yet-to-be-handled operation is encountered inside the op. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91748
-
snek authored
Patch by snek Reviewed By: aheejin Differential Revision: https://reviews.llvm.org/D90978
-
Moritz Sichert authored
Reviewed By: csigg, dblaikie Differential Revision: https://reviews.llvm.org/D91183
-
Evgenii Stepanov authored
HwasanThreadList::DontNeedThread clobbers Thread::next_, breaking the freelist. As a result, only the top of the freelist ever gets reused, and the rest of it is lost. Since the Thread object with its associated ring buffer is only 8Kb, this is typically only noticeable in long running processes, such as fuzzers. Fix the problem by switching from an intrusive linked list to a vector. Differential Revision: https://reviews.llvm.org/D91392
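A minimal sketch of the shape of the fix, with illustrative code rather than the actual HWASan implementation: a non-intrusive vector freelist has no next_ pointer to clobber, so every released Thread can be reused.

```c++
#include <vector>

struct Thread; // stand-in for the real hwasan Thread

std::vector<Thread *> FreeList;

void ReleaseThread(Thread *T) { FreeList.push_back(T); }

Thread *ReuseThread() {
  if (FreeList.empty())
    return nullptr;
  Thread *T = FreeList.back();
  FreeList.pop_back();
  return T;
}
```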
-