Commits · 6a089ce0e40abbe4e0f26f05540e3caa60d98a29 · Lorenzo Albano / LLVM bpEVL

Oct 05, 2020

[AMDGPU] Use tablegen for argument indices · 6a089ce0

Sebastian Neubauer authored Sep 30, 2020

Use tablegen generic tables to get the index of image intrinsic
arguments.
Before, the computation of which image intrinsic argument is at which
index was scattered in a few places, tablegen, the SDag instruction
selection and GlobalISel. This patch changes that, so only tablegen
contains code to compute indices and the ImageDimIntrinsicInfo table
provides these information.

Differential Revision: https://reviews.llvm.org/D86270

6a089ce0

[mlir] Fix SubViewOp doc in .td · d52211e3
Nicolas Vasilache authored Oct 05, 2020

d52211e3

[VE] Support register and frame-index pair correctly · 5b5e78a4

Kazushi (Jam) Marukawa authored Sep 21, 2020

Support register and frame-index pair correctly as operands of
generic load/store instrucitons, e.g. LD1BZXrri, STLrri, and etc.
Add regression tests also.

Differential Revision: https://reviews.llvm.org/D88779

5b5e78a4

Promote transpose from linalg to standard dialect · 6e2b267d

Benjamin Kramer authored Sep 30, 2020

While affine maps are part of the builtin memref type, there is very
limited support for manipulating them in the standard dialect. Add
transpose to the set of ops to complement the existing view/subview ops.
This is a metadata transformation that encodes the transpose into the
strides of a memref.

I'm planning to use this when lowering operations on strided memrefs,
using the transpose to remove the stride without adding a dependency on
linalg dialect.

Differential Revision: https://reviews.llvm.org/D88651

6e2b267d

[AMDGPU] Make bfe patterns divergence-aware · 16778b19

Jay Foad authored Sep 25, 2020

This tends to increase code size but more importantly it reduces vgpr
usage, and could avoid costly readfirstlanes if the result needs to be
in an sgpr.

Differential Revision: https://reviews.llvm.org/D88580

16778b19

[AMDGPU] Split R600 and GCN bfe patterns · 0d5989bb

Jay Foad authored Sep 25, 2020

This is in preparation for making the GCN patterns divergence-aware.
NFC.

Differential Revision: https://reviews.llvm.org/D88579

0d5989bb

[TableGen][GlobalISel] add handling of nested *_SUBREG · 64b879ae

Gabriel Hjort Åkerlund authored Oct 05, 2020

When nesting INSERT_SUBREG and EXTRACT_SUBREG, GlobalISelEmitter would
fail to find the register class of the nested node. This patch fixes
that for registers with subregs.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D88487

64b879ae

[AST][RecoveryExpr] Popagate the error-bit from a VarDecl's initializer to DeclRefExpr. · 3423d5c9

Haojian Wu authored Oct 05, 2020

The error-bit was missing, if a DeclRefExpr (which refers to a VarDecl
with a contains-errors initializer).

It could cause different violations in clang -- the DeclRefExpr is value-dependent,
but not contains-errors, `ABC<DeclRefExpr>` could produce a non-error
and non-dependent type in non-template context, which will lead to
crashes in constexpr evaluation.

Reviewed By: sammccall

Differential Revision: https://reviews.llvm.org/D86048

3423d5c9

[DebugInfo] Improve dbg preservation in LSR. · a3caf7f6

Markus Lavin authored Oct 05, 2020

Use SCEV to salvage additional @llvm.dbg.value that have turned into
referencing undef after transformation (and traditional
salvageDebugInfo). Before transformation compute SCEV for each
@llvm.dbg.value in the loop body and store it (along side its current
DIExpression). After transformation update those @llvm.dbg.value now
referencing undef by comparing its stored SCEV to the SCEV of the
current loop-header PHI-nodes. Allow match with offset by inserting
compensation code in the DIExpression.

Fixes : PR38815

Differential Revision: https://reviews.llvm.org/D87494

a3caf7f6

[RISCV][ASAN] mark asan as supported for RISCV64 and enable tests · cf4aa683

Alexey Baturo authored Oct 04, 2020

[11/11] patch series to port ASAN for riscv64

These changes allow using ASAN on RISCV64 architecture.
The majority of existing tests are passing. With few exceptions (see below).
Tests we run on qemu and on "HiFive Unleashed" board.

Tests run:

```
Asan-riscv64-inline-Test  - pass
Asan-riscv64-inline-Noinst-Test  - pass
Asan-riscv64-calls-Noinst-Test  - pass
Asan-riscv64-calls-Test  - pass
```

Lit tests:

```
RISCV64LinuxConfig (282 supported, few failures)
RISCV64LinuxDynamicConfig (289 supported, few failures)
```

Lit failures:

```
TestCases/malloc_context_size.cpp - asan works, but backtrace misses some calls
TestCases/Linux/malloc_delete_mismatch.cpp - asan works, but backtrace misses some calls
TestCases/Linux/static_tls.cpp - "Can't guess glibc version" (under debugging)
TestCases/asan_and_llvm_coverage_test.cpp - missing libclang_rt.profile-riscv64.a
```

These failures are under debugging currently and shall be addressed in a
subsequent commits.

Depends On D87581

Reviewed By: eugenis, vitalybuka

Differential Revision: https://reviews.llvm.org/D87582

cf4aa683

[llvm] Rename DwarfFile to DWARFFile to fix ODR violation (NFC) · a58b20e5

Jonas Devlieghere authored Oct 04, 2020

Rename the DwarfFile class in DWARFLinker to DWARFFile. This is
consistent with the other DWARF classes and avoids a ODR violation with
the DwarfFile class in AsmPrinter.

a58b20e5

[lldb] [test/Register] Attempt to fix x86-fp-read.test on Darwin · e8beb698

Michał Górny authored Oct 04, 2020

Darwin seems to use stmmN instead of stN. Use a regex to accept both.

Also try to actually clear st(7).

Differential revision: https://reviews.llvm.org/D88795

e8beb698

[X86] MWAITX_SAVE_RBX should not have EBX as an implicit use. · b1802611

Craig Topper authored Oct 04, 2020

RBX was copied to a virtual register before this instruction
was created. And the EBX input for the final MWAITX is still
in a virtual register. So EBX isn't read by this pseudo.

b1802611

llvm-dwarfdump: Don't try to parse rnglist tables when dumping CUs · 6d0be74a

David Blaikie authored Oct 04, 2020

It's not possible to do this in complete generality - a CU using a
sec_offset DW_AT_ranges has no way of knowing where its rnglists
contribution starts, so should not attempt to parse any full rnglist
table/header to do so. And even using FORM_rnglistx there's no need to
parse the header - the offset can be computed using the CU's DWARF
format (32 or 64) to compute offset entry sizes, and then the list
parsed at that offset without ever trying to find a rnglist contribution
header immediately prior to the rnglists_base.

6d0be74a

[HIP] Fix -fgpu-allow-device-init option · e372c1d7

Yaxun (Sam) Liu authored Sep 29, 2020

The option needs to be passed to both host and device compilation.

Differential Revision: https://reviews.llvm.org/D88550

e372c1d7

[HIP] Fix default output file for -E · 5b551b79

Yaxun (Sam) Liu authored Oct 02, 2020

By convention the default output file for -E is "-" (stdout).
This is expected by tools like ccache, which uses output
of -E to determine if a file and its dependence has changed.

Currently clang does not use stdout as default output file for -E
for HIP, which causes ccache not working.

This patch fixes that.

Differential Revision: https://reviews.llvm.org/D88730

5b551b79

Recommit "[HIP] Add option --gpu-instrument-lib=" · 9756a402
Yaxun (Sam) Liu authored Oct 04, 2020
```
recommit 64f7790e after
fixing hip-device-libs.hip.
```
9756a402
Revert "[HIP] Add option --gpu-instrument-lib=" · fef0ebbc
Yaxun (Sam) Liu authored Oct 04, 2020
```
This reverts commit 64f7790e due
to regression in hip-device-libs.hip.
```
fef0ebbc

[HIP] Add option --gpu-instrument-lib= · 64f7790e

Yaxun (Sam) Liu authored Sep 30, 2020

Add an option --gpu-instrument-lib= to allow users to specify
an instrument device library. This is for supporting -finstrument
in device code for debugging/profiling tools.

Differential Revision: https://reviews.llvm.org/D88557

64f7790e

llvm-dwarfdump: Add support for DW_RLE_startx_endx · 92c45e4e
David Blaikie authored Oct 04, 2020

92c45e4e

[X86] Remove MWAITX_SAVE_EBX pseudo instruction. Always save/restore the full... · 4b38ceb0

Craig Topper authored Oct 04, 2020

[X86] Remove MWAITX_SAVE_EBX pseudo instruction. Always save/restore the full %rbx register even in gnux32.

ebx/rbx only needs to be saved when 64-bit registers are supported
anyway. It should be fine to save/restore the whole rbx register
even in gnux32 where the base is technically just ebx.

This matches what we do for cmpxchg16b where rbx is saved/restored
regardless of gnux32.

4b38ceb0

llvm-dwarfdump: Print addresses in debug_line to the parsed address size · 628a3194
David Blaikie authored Oct 04, 2020

628a3194

[NewPM] collapsing nested pass mangers of the same type · 2c94d88e

Yuanfang Chen authored Oct 02, 2020

This is one of the reason for extra invalidations in D84959. In
practice, I don't think we have use cases needing this. This simplifies
the pipeline a bit and prune corner cases when considering
invalidations.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D85676

2c94d88e

[NFCI] Remove unnecessary trailing undef in RuntimeLibcalls.def · 83cc498c
Yuanfang Chen authored Oct 02, 2020
```
All uses of the file undef the macro already.
```
83cc498c
llvm-dwarfdump: Dump address forms in their encoded length rather than always in 64 bits · ea83e0b1
David Blaikie authored Oct 04, 2020
```
Few places did this already - refactor them all into a common helper.
```
ea83e0b1

[DomTree] findNearestCommonDominator: assert the nodes are in tree · 1065f343

Fangrui Song authored Oct 04, 2020

i.e. they cannot be unreachable from the entry (which usually indicate usage errors).
This change allows the removal of some nullptr checks.

Reviewed By: kuhar

Differential Revision: https://reviews.llvm.org/D88758

1065f343

[X86] Correct the implicit defs/uses for the MWAITX pseudo instructions. · 952dfd76

Craig Topper authored Oct 04, 2020

MWAITX doesn't touch EFLAGS so no pseudos should def EFLAGS.

The SAVE_EBX/RBX pseudos only needs to def the EBX register that
the expansion overwrites. The EAX and ECX registers are only read.

The pseudo emitted during isel that is used by the custom inserter
shouldn't have any implicit defs or uses since everything is in
vregs.

952dfd76

[X86] Remove usesCustomInserter from MWAITX_SAVE_EBX and MWAITX_SAVE_RBX. NFC · 0db97234
Craig Topper authored Oct 04, 2020
```
These are now emitted by a CustomInserter rather than using a custom
inserter themselves.
```
0db97234

Revert "[RFC] Factor out repetitive cmake patterns for llvm-style projects" · b0dce6b3

Stephen Neuendorffer authored Oct 04, 2020

This reverts commit e9b87f43.

There are issues with macros generating macros without an obvious simple fix
so I'm going to revert this and try something different.

b0dce6b3

Oct 04, 2020

[Coroutines][NewPM] Fix coroutine tests under new pass manager · 37010d4d

Arthur Eubanks authored Sep 14, 2020

Some new function parameter attributes are derived under NPM.

Reviewed By: rjmccall

Differential Revision: https://reviews.llvm.org/D88760

37010d4d

[NFC][SCEV] Add a test with some patterns where we could treat... · 80ac6da9
Roman Lebedev authored Oct 04, 2020
```
[NFC][SCEV] Add a test with some patterns where we could treat inttoptr/ptrtoint as semi-transparent
```
80ac6da9

llvm-dwarfdump: Skip tombstoned address ranges · 8036cf7f

David Blaikie authored Oct 04, 2020

Make the dumper & API a bit more informative by using the new tombstone
addresses to filter out or otherwise render more explicitly dead code
ranges.

8036cf7f

[MemCpyOpt] Add tests for call slot optimization with GEPs (NFC) · 8aaa7313
Nikita Popov authored Oct 04, 2020

8aaa7313

Implement callee/caller type checking for llvm.call · f05173d0

Mehdi Amini authored Oct 03, 2020

This aligns the behavior with the standard call as well as the LLVM verifier.

Reviewed By: ftynse, dcaballe

Differential Revision: https://reviews.llvm.org/D88362

f05173d0

[MemCpyOpt] Don't use array allocas in tests (NFC) · 22664a32

Nikita Popov authored Oct 04, 2020

Apparently querying dereferenceability of array allocations is
being intentionally penalized (https://reviews.llvm.org/D41398),
so avoid using them in tests.

22664a32

[X86] Remove an accidentally added file. NFC. · b4288f27
Martin Storsjö authored Oct 04, 2020
```
This file seems to have been accidentally added as part of commit
413577a8.
```
b4288f27
[SDA] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds · c36d441b
Fangrui Song authored Oct 04, 2020

c36d441b
[gn build] Port 6c6cd5f8 · 955b926b
LLVM GN Syncbot authored Oct 04, 2020

955b926b

[X86] Sync AESENC/DEC Key Locker builtins with gcc. · a02b449b

Craig Topper authored Oct 04, 2020

For the wide builtins, pass a single input and output pointer to
the builtins. Emit the GEPs and input loads from CGBuiltin.

a02b449b

[X86] Synchronize the encodekey builtins with gcc. Don't assume void* is 16 byte aligned. · 230c57b0

Craig Topper authored Oct 04, 2020

We were taking multiple pointer arguments in the builtin.
gcc accepts a single void*.

The cast from void* to _m128i* caused the IR generation to assume
the pointer was aligned.

Instead make the builtin take a single void*, emit i8* GEPs to
adjust then cast to <2 x i64>* and perform a store with align of 1.

230c57b0