Commits · cbd93cee9bf014402a7405479ba21f6f3340a126 · Lorenzo Albano / LLVM bpEVL

May 12, 2021

Revert "[PowerPC] [Clang] Enable float128 feature on VSX targets" · cbd93cee

Qiu Chaofan authored May 12, 2021

This commit brought build break in some f128 related tests. But that's
not the root cause. There exists some differences between Clang and
GCC's definition for 128-bit float types on PPC, so macros/functions in
glibc may not work with clang -mfloat128 well. We need to handle this
carefully and reland it.

cbd93cee

[ARM] Prevent spilling between ldrex/strex pairs · 34c098b7

Tomas Matheson authored May 11, 2021

Based on the same for AArch64: 4751cadc

At -O0, the fast register allocator may insert spills between the ldrex and
strex instructions inserted by AtomicExpandPass when expanding atomicrmw
instructions in LL/SC loops. To avoid this, expand to cmpxchg loops and
therefore expand the cmpxchg pseudos after register allocation.

Required a tweak to ARMExpandPseudo::ExpandCMP_SWAP to use the 4-byte encoding
of UXT, since the pseudo instruction can be allocated a high register (R8-R15)
which the 2-byte encoding doesn't support. However, the 4-byte encodings
are not present for ARM v8-M Baseline. To enable this, two new pseudos are
added for Thumb which are only valid for v8mbase, tCMP_SWAP_8 and
tCMP_SWAP_16.

The previously committed attempt in D101164 had to be reverted due to runtime
failures in the test suites. Rather than spending time fixing that
implementation (adding another implementation of atomic operations and more
divergence between backends) I have chosen to follow the approach taken in
D101163.

Differential Revision: https://reviews.llvm.org/D101898

Depends on D101912

34c098b7

[ARM] Precommit test for D101898 · edf9d882
Tomas Matheson authored May 05, 2021
```
Differential Revision: https://reviews.llvm.org/D101912
```
edf9d882

Fixed llvm-objcopy to add correct symbol table for ELF with program headers. · d8e65585

Alex Orlov authored May 12, 2021

This fixes the following bugs:
https://bugs.llvm.org/show_bug.cgi?id=43935

Reviewed By: jhenderson

Differential Revision: https://reviews.llvm.org/D102258

d8e65585

[NFC][llvm-dwarfdump] Avoid passing std::string by value in collectStatsForDie() · 44642505
Djordje Todorovic authored May 11, 2021

44642505

[libc] Simplifies multi implementations · 6351993d

Guillaume Chatelet authored May 12, 2021

This is a roll forward of D101895 with two additional fixes:

Original Patch description:
> This is a follow up on D101524 which:
>
> - simplifies cpu features detection and usage,
> - flattens target dependent optimizations so it's obvious which implementations are generated,
> - provides an implementation targeting the host (march/mtune=native) for the mem* functions,
> - makes sure all implementations are unittested (provided the host can run them).

Additional fixes:
 - Fix uninitialized ALL_CPU_FEATURES
 - Use non pseudo microarch as it is only supported from Clang 12 on

Differential Revision: https://reviews.llvm.org/D102233

6351993d

scudo: fix CheckFailed-related build breakage · 8aa7f284

Dmitry Vyukov authored May 12, 2021

I was running:

$ ninja check-sanitizer check-msan check-asan \
  check-tsan check-lsan check-ubsan check-cfi \
  check-profile check-memprof check-xray check-hwasan

but missed check-scudo...

Differential Revision: https://reviews.llvm.org/D102314

8aa7f284

[MLIR] Enable conversion from llvm::SMLoc to mlir::Location with OpAsmParser. · 27b2bd76

Ulysse Beaugnon authored May 12, 2021

DialectAsmParser already allows converting an llvm::SMLoc location to a
mlir::Location location. This commit adds the same functionality to OpAsmParser.
Implementation is copied from DialectAsmParser.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D102165

27b2bd76

[mlir] Support alignment in LLVM dialect GlobalOp · 9a0ea599

Dumitru Potop authored May 12, 2021

First step in adding alignment as an attribute to MLIR global definitions. Alignment can be specified for global objects in LLVM IR. It can also be specified as a named attribute in the LLVMIR dialect of MLIR. However, this attribute has no standing and is discarded during translation from MLIR to LLVM IR. This patch does two things: First, it adds the attribute to the syntax of the llvm.mlir.global operation, and by doing this it also adds accessors and verifications. The syntax is "align=XX" (with XX being an integer), placed right after the value of the operation. Second, it allows transforming this operation to and from LLVM IR. It is checked whether the value is an integer power of 2.

Reviewed By: ftynse, mehdi_amini

Differential Revision: https://reviews.llvm.org/D101492

9a0ea599

tsan: fix syscall test on aarch64 · 1dc83871

Dmitry Vyukov authored May 12, 2021

Add missing includes and use SYS_pipe2 instead of SYS_pipe
as it's not present on some arches.

Differential Revision: https://reviews.llvm.org/D102311

1dc83871

[COFF] Fix ARM and ARM64 REL32 relocations to be relative to the end of the relocation · 382c505d

Martin Storsjö authored May 11, 2021

This matches how they are defined on X86.

This should fix the relative lookup tables pass for COFF, allowing
it to be reenabled.

Differential Revision: https://reviews.llvm.org/D102217

382c505d

sanitizer_common: deduplicate CheckFailed · 2721e27c

Dmitry Vyukov authored May 11, 2021

We have some significant amount of duplication around
CheckFailed functionality. Each sanitizer copy-pasted
a chunk of code. Some got random improvements like
dealing with recursive failures better. These improvements
could benefit all sanitizers, but they don't.

Deduplicate CheckFailed logic across sanitizers and let each
sanitizer only print the current stack trace.
I've tried to dedup stack printing as well,
but this got me into cmake hell. So let's keep this part
duplicated in each sanitizer for now.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102221

2721e27c

[PowerPC] [Clang] Enable float128 feature on VSX targets · febbe4b5
Qiu Chaofan authored May 12, 2021
```
Reviewed By: nemanjai, steven.zhang

Differential Revision: https://reviews.llvm.org/D92815
```
febbe4b5

[libcxx][test] Split more debug mode tests · f8306647

Kristina Bessonova authored May 10, 2021

Split a few more debug mode tests missed in D100592.

Differential Revision: https://reviews.llvm.org/D102194

f8306647

sanitizer_common: don't write into .rodata · 23596fec

Dmitry Vyukov authored May 10, 2021

setlocale interceptor imitates a write into result,
which may be located in .rodata section.
This is the only interceptor that tries to do this and
I think the intention was to initialize the range for msan.
So do that instead. Writing into .rodata shouldn't happen
(without crashing later on the actual write) and this
traps on my local tsan experiments.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102161

23596fec

[symbolizer] Fix leak after D96883 · 85a96d82
Vitaly Buka authored May 11, 2021

85a96d82

sanitizer_common: fix SIG_DFL warning · 53558ed8

Dmitry Vyukov authored May 10, 2021

Currently we have:

sanitizer_posix_libcdep.cpp:146:27: warning: cast between incompatible
  function types from ‘__sighandler_t’ {aka ‘void (*)(int)’} to ‘sa_sigaction_t’
  146 |     sigact.sa_sigaction = (sa_sigaction_t)SIG_DFL;

We don't set SA_SIGINFO, so we need to assign to sa_handler.
And SIG_DFL is meant for sa_handler, so this gets rid of both
compiler warning, type cast and potential runtime misbehavior.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102162

53558ed8

tsan: declare annotations in test.h · 8214764f

Dmitry Vyukov authored May 10, 2021

We already declare subset of annotations in test.h.
But some are duplicated and declared in tests.
Move all annotation declarations to test.h.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102152

8214764f

[VectorComine] Restrict single-element-store index to inbounds constant · 6d2df181

Qiu Chaofan authored May 12, 2021

Vector single element update optimization is landed in 2db4979c. But the
scope needs restriction. This patch restricts the index to inbounds and
vector must be fixed sized. In future, we may use value tracking to
relax constant restrictions.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D102146

6d2df181

tsan: mark sigwait as blocking · 5dad3d1b

Dmitry Vyukov authored May 07, 2021

Add a test case reported in:
https://github.com/google/sanitizers/issues/1401
and fix it.
The code assumes sigwait will process other signals.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102057

5dad3d1b

tsan: add a simple syscall test · 04b2ada5

Dmitry Vyukov authored May 11, 2021

Add a simple test that uses syscall annotations.
Just to ensure at least basic functionality works.
Also factor out annotated syscall wrappers into a separate
header file as they may be useful for future tests.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D102223

04b2ada5

[mlir][AsmPrinter] Remove recursion while SSA naming · f653313d

Chia-hung Duan authored May 12, 2021

Address the TODO of removing recursion while SSA naming.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D102226

f653313d

[NFC][msan] Move setlocale test into sanitizer_common · 7d101e0f
Vitaly Buka authored May 11, 2021

7d101e0f

[LoopInterchange] Handle lcssa PHIs with multiple predecessors · 3f8be15f

Congzhe Cao authored May 11, 2021

This is a bugfix in the transformation phase.

If the original outer loop header branches to both the inner loop
(header) and the outer loop latch, and if there is an lcssa PHI
node outside the loop nest, then after interchange the new outer latch
will have an lcssa PHI node inserted which has two predecessors, i.e.,
the original outer header and the original outer latch. Currently
the transformation assumes it has only one predecessor (the original
outer latch) and crashes, since the inserted lcssa PHI node does
not take both predecessors as incoming BBs.

Reviewed By: Whitney

Differential Revision: https://reviews.llvm.org/D100792

3f8be15f

Removing test... · 10c309ad

Jim Ingham authored May 11, 2021

Actually, I don't think this test is going to be stable enough
to be worthwhile.  Let me see if I can think of a better way to
test this.

10c309ad

AMDGPU: Fix SILoadStoreOptimizer for gfx90a · cc79aace

Matt Arsenault authored May 10, 2021

This was hardcoding the register class to use for the newly created
pointer registers, violating the aligned VGPR requirement.

cc79aace

This test is failing on Linux, skip while I investigate. · 0f2eb7e6

Jim Ingham authored May 11, 2021

The gdb-remote tests are a bit artificial, depending on
Python threading, and sleeps.  So I'm not 100% surprised it doesn't
work straight up on another XSsystem.

0f2eb7e6

[lld][WebAssembly] Fix for string merging + negative addends · 19cedd3c

Sam Clegg authored May 11, 2021

Don't include the relocation addend when calculating the
virtual address of a symbol.  Instead just pass the symbol's
offset and add the addend afterwards.

Without this fix we hit the `offset is outside the section`
error in MergeInputSegment::getSegmentPiece.

This fixes a real world error we were are seeing in emscripten.

Differential Revision: https://reviews.llvm.org/D102271

19cedd3c

Revert "Fix bad mangling of <data-member-prefix> for a closure in the... · bb726383

Richard Smith authored May 11, 2021

Revert "Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope."

This reverts commit 697ac15a, for which
review was not complete. That change was accidentally pushed when
an unrelated change was pushed.

bb726383

Add test for PR50039. · 3978333b

Richard Smith authored May 11, 2021

I believe Clang's behavior is correct according to the standard here,
but this is an unusual situation for which we had no test coverage, so
I'm adding some.

3978333b

Fix bad mangling of <data-member-prefix> for a closure in the initializer of a... · 697ac15a

Richard Smith authored May 05, 2021

Fix bad mangling of <data-member-prefix> for a closure in the initializer of a variable at global namespace scope.

This implements the direction proposed in
https://github.com/itanium-cxx-abi/cxx-abi/pull/126.

Differential Revision: https://reviews.llvm.org/D101968

697ac15a

GlobalISel: Don't hardcode varargs=false in resultsCompatible · 6f5ddf67
Matt Arsenault authored May 05, 2021

6f5ddf67
AMDGPU: Fix assert on constant load from addrspacecasted pointer · a15ed701
Matt Arsenault authored May 11, 2021
```
This was trying to create a bitcast between different address spaces.
```
a15ed701
GlobalISel: Make constant fields const · 6ecbdb76
Matt Arsenault authored May 11, 2021

6ecbdb76

GlobalISel: Split ValueHandler into assignment and emission classes · 24e2e5df

Matt Arsenault authored May 04, 2021

Currently the ValueHandler handles both selecting the type and
location for arguments, as well as inserting instructions needed to
handle them. Split this so that the determination of the argument
handling is independent of the function state. Currently the checks
for tail call compatibility do not follow the full assignment logic,
so it misses cases where arguments require nontrivial legalization.

This should help avoid targets ending up in a buggy state where the
argument evaluation may change in different contexts.

24e2e5df

GlobalISel: Move AArch64 AssignFnVarArg to base class · 2bdfcf0c

Matt Arsenault authored May 04, 2021

We can handle the distinction easily enough in the generic code, and
this makes it easier to abstract the selection of type/location from
the code to insert code.

2bdfcf0c

Revert "[GVN] Clobber partially aliased loads." · fec29459

Jordan Rupprecht authored May 11, 2021

This reverts commit 6c570442.

It causes assertion errors due to widening atomic loads, and potentially causes miscompile elsewhere too. Repro, also posted to D95543:

```
$ cat repro.ll
; ModuleID = 'repro.ll'
source_filename = "repro.ll"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

%struct.widget = type { i32 }
%struct.baz = type { i32, %struct.snork }
%struct.snork = type { %struct.spam }
%struct.spam = type { i32, i32 }

@global = external local_unnamed_addr global %struct.widget, align 4
@global.1 = external local_unnamed_addr global i8, align 1
@global.2 = external local_unnamed_addr global i32, align 4

define void @zot(%struct.baz* %arg) local_unnamed_addr align 2 {
bb:
  %tmp = getelementptr inbounds %struct.baz, %struct.baz* %arg, i64 0, i32 1
  %tmp1 = bitcast %struct.snork* %tmp to i64*
  %tmp2 = load i64, i64* %tmp1, align 4
  %tmp3 = getelementptr inbounds %struct.baz, %struct.baz* %arg, i64 0, i32 1, i32 0, i32 1
  %tmp4 = icmp ugt i64 %tmp2, 4294967295
  br label %bb5

bb5:                                              ; preds = %bb14, %bb
  %tmp6 = load i32, i32* %tmp3, align 4
  %tmp7 = icmp ne i32 %tmp6, 0
  %tmp8 = select i1 %tmp7, i1 %tmp4, i1 false
  %tmp9 = zext i1 %tmp8 to i8
  store i8 %tmp9, i8* @global.1, align 1
  %tmp10 = load i32, i32* @global.2, align 4
  switch i32 %tmp10, label %bb11 [
    i32 1, label %bb12
    i32 2, label %bb12
  ]

bb11:                                             ; preds = %bb5
  br label %bb14

bb12:                                             ; preds = %bb5, %bb5
  %tmp13 = load atomic i32, i32* getelementptr inbounds (%struct.widget, %struct.widget* @global, i64 0, i32 0) acquire, align 4
  br label %bb14

bb14:                                             ; preds = %bb12, %bb11
  br label %bb5
}
$ opt -O2 repro.ll -disable-output
opt: /home/rupprecht/src/llvm-project/llvm/lib/Transforms/Utils/VNCoercion.cpp:496: llvm::Value *llvm::VNCoercion::getLoadValueForLoad(llvm::LoadInst *, unsigned int, llvm::Type *, llvm::Instruction *, const llvm::DataLayout &): Assertion `SrcVal->isSimple() && "Cannot widen volatile/atomic load!"' failed.
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace.
Stack dump:
0.      Program arguments: /home/rupprecht/dev/opt -O2 repro.ll -disable-output
...
```

fec29459

[JITLink] Fix bogus format string. · d63860a0
Lang Hames authored May 11, 2021

d63860a0

[clang][Fuchsia] Introduce compat multilibs · 5cb17728

Leonard Chan authored May 06, 2021

These are GCC-compatible multilibs that use the generic Itanium C++ ABI
instead of the Fuchsia C++ ABI.

Differential Revision: https://reviews.llvm.org/D102030

5cb17728

[LoopInterchange] Fix legality for triangular loops · 40e3aa39

Congzhe Cao authored May 11, 2021

This is a bug fix in legality check.

When we encounter triangular loops such as the following form:
    for (int i = 0; i < m; i++)
      for (int j = 0; j < i; j++), or

    for (int i = 0; i < m; i++)
      for (int j = 0; j*i < n; j++),

we should not perform interchange since the number of executions
of the loop body will be different before and after interchange,
resulting in incorrect results.

Reviewed By: bmahjour

Differential Revision: https://reviews.llvm.org/D101305

40e3aa39