- Mar 04, 2022
-
Benjamin Kramer authored
-
Snehasish Kumar authored
This patch filters out callstack frames which can't be symbolized or which belong to the runtime. Symbolization may not be possible if debug information is unavailable or if the addresses are from a shared library. For now we only support optimization of the main binary, which is statically linked to the compiler runtime. Differential Revision: https://reviews.llvm.org/D120860
-
Tue Ly authored
Remove the redundant header FPUtil/FEnvUtils.h, use FPUtil/FEnvImpl.h header instead. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D120965
-
Jakub Chlanda authored
This patch fixes a bug introduced in D117887. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D120991
-
Johannes Doerfert authored
The custom state machine had a check for surplus threads that filtered out the main thread if the kernel was executed by a single warp only. We now first check for the main thread, then for surplus threads, so the former is no longer filtered out. Fixes #54214. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D121011
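The ordering described above can be sketched at the source level (all names and values here are invented for illustration, not the actual libomptarget code):

```cpp
#include <cassert>

// Hypothetical sketch of the fix: classify a GPU thread's role. The bug
// was testing for surplus threads *before* testing for the main thread;
// with a single warp the main thread also satisfied the surplus test and
// was filtered out. Checking for the main thread first avoids that.
enum class Role { Main, Worker, Surplus };

Role classifyThread(int Tid, int MainTid, int NumWorkers) {
  if (Tid == MainTid)    // check for the main thread first (the fix)
    return Role::Main;
  if (Tid >= NumWorkers) // only then filter surplus threads
    return Role::Surplus;
  return Role::Worker;
}
```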
-
Roman Lebedev authored
Revert "[SROA] Maintain shadow/backing alloca when some slices are noncapturing read-only calls to allow alloca partitioning/promotion" Bots are reporting that the assertion about only expecting loads is wrong. This reverts commit 703240c7.
-
Ellis Hoag authored
This fixes a warning about comparing mismatched types. Since `mmap()` already returns a `void *` use that as the pointer type for comparison. Reviewed By: kyulee, zequanwu Differential Revision: https://reviews.llvm.org/D120945
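A minimal sketch of the pattern the fix uses, assuming a POSIX platform (the function name is invented, not the profiler runtime's actual code):

```cpp
#include <sys/mman.h>
#include <cstddef>

// Keep the mmap() result as void * and compare it against MAP_FAILED at
// that type, instead of first casting to a typed pointer and comparing
// mismatched pointer types (which is what triggered the warning).
void *allocPage(size_t Size) {
  void *Mem = mmap(nullptr, Size, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
  if (Mem == MAP_FAILED) // both sides are void *, no type mismatch
    return nullptr;
  return Mem;
}
```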
-
Siva Chandra Reddy authored
The idea is that other parts of the libc which require thread/lock support will be able to use this platform-independent setup. With this change, only the linux implementation of a mutex type has been moved to the new library. Because of this, there is some duplication between the new library and src/threads/linux. A follow up change will move all of src/threads/linux to the new library. The duplication should be eliminated with that move. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D120795
-
Arthur O'Dwyer authored
This is slightly more user-visible than D119894, because the user is expected to touch `__wrap_iter` directly. But the affected ctors are non-public, so the user was never expected to be actually calling them. And I didn't intentionally omit this from D119894; I just didn't think of it. Differential Revision: https://reviews.llvm.org/D120937
-
Tue Ly authored
-
Joseph Huber authored
Libomptarget uses some shared variables to track certain internal state in the runtime. This causes problems when we have code that contains no OpenMP kernels. These variables are normally initialized upon kernel entry, but if there are no kernels we will see no initialization. Currently we load the runtime into each source file when not running in LTO mode, so these variables will be erroneously considered undefined or dead and removed, causing miscompiles. This patch temporarily works around the most obvious case, but others still exhibit this problem. We will need to fix this more soundly later. Fixes #54208. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D121007
-
Roman Lebedev authored
[SROA] Maintain shadow/backing alloca when some slices are noncapturing read-only calls to allow alloca partitioning/promotion This is inspired by the original variant of D109749 by Graham Hunter, but is a more general version. Roughly, instead of promoting the alloca, we call it a shadow/backing alloca, go through all its slices, clone(!) the instructions that operated on it, but make them operate on the cloned alloca, and promote the cloned alloca instead. This keeps the shadow/backing alloca and all the original instructions around, which results in said shadow/backing alloca being a perfect mirror/representation of the promoted alloca's content, so calls that take the alloca as an argument (non-capturingly!) can be supported. For now, we require that the calls also don't modify the alloca's content, but that is only to simplify the initial implementation, and that will be supported in a follow-up. Overall, this leads to *smaller* codesize: https://llvm-compile-time-tracker.com/compare.php?from=a8b4f5bbab62091835205f3d648902432a4a5b58&to=aeae054055b125b011c1122f82c86457e159436f&stat=size-total and is roughly neutral compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=a8b4f5bbab62091835205f3d648902432a4a5b58&to=aeae054055b125b011c1122f82c86457e159436f&stat=instructions Reviewed By: djtodoro Differential Revision: https://reviews.llvm.org/D113520
-
Arthur O'Dwyer authored
Fixes #49188. Differential Revision: https://reviews.llvm.org/D119184
-
Arthur O'Dwyer authored
Reviewed as part of D119184.
-
Simon Pilgrim authored
If the shift amount has been zero-extended, peek through the extension, as this might help us further canonicalize the shift amount. Fixes regression mentioned in rG147cfcbef1255ba2b4875b76708dab1a685085f5
-
Colin Cross authored
Pass LIBCXX_HAS_PTHREAD_LIB, LIBCXX_HAS_RT_LIB and LIBCXXABI_HAS_PTHREAD_LIB through to the custom libc++ builds so that libfuzzer doesn't end up with a .deplibs section that links against those libraries when the variables are set to false. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D120946
-
Siva Chandra Reddy authored
With modern architectures having a thread pointer and languages supporting thread locals, there is no reason to use a function intermediary to access the thread-local errno value. The entrypoint corresponding to errno has been replaced with an object library, as there is no formal entrypoint for errno anymore. Reviewed By: jeffbailey, michaelrj Differential Revision: https://reviews.llvm.org/D120920
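The idea can be sketched with standard C++ thread-local storage (`my_errno` is an illustrative stand-in, not libc's actual errno definition):

```cpp
#include <thread>

// With thread_local storage, each thread gets its own copy of the
// variable and can read or write it directly -- no accessor function
// call is needed per access.
thread_local int my_errno = 0; // stand-in for the real errno object

int lastErrorInThisThread() { return my_errno; }
```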
-
LLVM GN Syncbot authored
-
LLVM GN Syncbot authored
-
Yitzhak Mandelbaum authored
Adds `MatchSwitch`, a library for simplifying the implementation of transfer functions. `MatchSwitch` supports constructing a "switch" statement, where each case of the switch is defined by an AST matcher. The cases are considered in order, like pattern matching in functional languages. Differential Revision: https://reviews.llvm.org/D120900
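The pattern can be illustrated with a toy version (all names and signatures below are invented; this is not the actual clang dataflow API, and the "matchers" here are plain predicates rather than AST matchers):

```cpp
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Toy model of a matcher-driven switch: cases pair a predicate with an
// action and are tried in order, so the first matching case wins -- like
// pattern matching in functional languages.
template <typename Node, typename Result>
class ToyMatchSwitch {
  std::vector<std::pair<std::function<bool(const Node &)>,
                        std::function<Result(const Node &)>>> Cases;

public:
  ToyMatchSwitch &caseOf(std::function<bool(const Node &)> Matcher,
                         std::function<Result(const Node &)> Action) {
    Cases.emplace_back(std::move(Matcher), std::move(Action));
    return *this;
  }
  Result operator()(const Node &N, Result Default) const {
    for (const auto &C : Cases)
      if (C.first(N))
        return C.second(N); // first matching case wins
    return Default;
  }
};
```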
-
Krzysztof Drewniak authored
Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D120943
-
Yitzhak Mandelbaum authored
This patch adds a simple lattice used to collect source locations. An intended application is to track errors found in code during an analysis. Differential Revision: https://reviews.llvm.org/D120890
-
William S. Moses authored
This patch extends the existing if combining canonicalization to also handle the case where a value returned by the first if is used within the body of the second if. This patch also extends if combining to support ifs whose conditions are logical negations of each other. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120924
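A source-level picture of the combining (illustrative only; the actual canonicalization operates on MLIR if ops, not C++): two consecutive ifs whose conditions are logical negations of each other behave like a single if/else, even when the second if uses a value the first one yields.

```cpp
// Before combining: two separate ifs with negated conditions.
int separateIfs(bool C) {
  int X = 0;
  if (C)
    X = 10;    // the first if yields a value...
  if (!C)
    X = 20;
  else
    X = X + 1; // ...which is used inside the second if
  return X;
}

// After combining: a single if/else with the bodies merged.
int combinedIf(bool C) {
  return C ? 10 + 1 : 20;
}
```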
-
Jeremy Morse authored
When lowering LLVM-IR in instruction-referencing mode, if a value is defined by a COPY, we try to follow the register definitions back to where the value was defined and build an instruction reference to that instruction. In a few scenarios (such as arguments), this isn't possible. I added some assertions to catch cases that weren't explicitly whitelisted. Over the course of a few months, several more scenarios have cropped up; the latest is the llvm.read_register intrinsic, which lets LLVM-IR read an arbitrary register at any point. In the face of this, there's little point in validating whether debug-info reads a register in an expected scenario. Thus: this patch just deletes those assertions, and adds a regression test to check that something is done with the llvm.read_register intrinsic. Fixes #54190 Differential Revision: https://reviews.llvm.org/D121001
-
William S. Moses authored
Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120841
-
Simon Pilgrim authored
[X86] LowerShiftByScalarVariable - find splat patterns with getSplatSourceVector instead of getSplatValue This completes the removal of uses of SelectionDAG::getSplatValue started in D119090 - by avoiding extracting the splatted element we make it a lot easier to zero-extend the bottom 64-bits of the shift amount and fixes issues we had on 32-bit targets where i64 isn't legal. I've removed the old version of getTargetVShiftNode that took the scalar shift amount argument and LowerRotate can finally efficiently handle vXi16 rotates-by-scalar (using the same code as general funnel-shifts). The only regression we see is in the X86-AVX2 PR52719 test case in vector-shift-ashr-256.ll - this is now hitting the same problem as the X86-AVX1 case (failure to simplify a multi-use X86ISD::VBROADCAST_LOAD) which I intend to address in a follow up patch.
-
Hans Wennborg authored
It caused builds to assert with: (StackSize == 0 && "We already have the CFA offset!"), function generateCompactUnwindEncoding, file AArch64AsmBackend.cpp, line 624 when targeting iOS. See comment on the code review for a reproducer.

> This patch rearranges emission of CFI instructions, so the resulting
> DWARF and `.eh_frame` information is precise at every instruction.
>
> The current state is that the unwind info is emitted only after the
> function prologue. This is fine for synchronous (e.g. C++) exceptions,
> but the information is generally incorrect when the program counter is
> at an instruction in the prologue or the epilogue, for example:
>
> ```
> stp x29, x30, [sp, #-16]! // 16-byte Folded Spill
> mov x29, sp
> .cfi_def_cfa w29, 16
> ...
> ```
>
> after the `stp` is executed the (initial) rule for the CFA still says
> the CFA is in the `sp`, even though it's already offset by 16 bytes.
>
> A correct unwind info could look like:
>
> ```
> stp x29, x30, [sp, #-16]! // 16-byte Folded Spill
> .cfi_def_cfa_offset 16
> mov x29, sp
> .cfi_def_cfa w29, 16
> ...
> ```
>
> Having this information precise up to an instruction is useful for
> sampling profilers that would like to get a stack backtrace. The end
> goal (towards which this patch is just a step) is to have fully working
> `-fasynchronous-unwind-tables`.
>
> Reviewed By: danielkiss, MaskRay
>
> Differential Revision: https://reviews.llvm.org/D111411

This reverts commit 32e8b550.
-
zhongyunde authored
`mov w8, #1325400064` + `fmov s0, w8` ==> `movi v0.2s, 0x4f, lsl 24` Fixes https://github.com/llvm/llvm-project/issues/53651 Reviewed By: dmgreen, fhahn Differential Revision: https://reviews.llvm.org/D120452
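A quick check of the constant involved (a standalone sketch, not backend code): 0x4f shifted left by 24 is 0x4F000000, the immediate from the `mov`, and that bit pattern is the IEEE-754 single-precision encoding of 2^31, so the `movi` materializes the same float value in one instruction.

```cpp
#include <cstdint>
#include <cstring>

// Reinterpret a 32-bit pattern as an IEEE-754 single-precision float.
float bitsToFloat(uint32_t Bits) {
  float F;
  std::memcpy(&F, &Bits, sizeof(F)); // type-pun via memcpy, not a cast
  return F;
}
```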
-
Richard Howell authored
Remove the executable name from the test match as this will have a `.exe` suffix on windows. Reviewed By: drodriguez Differential Revision: https://reviews.llvm.org/D121000
-
Michel Weber authored
This patch makes coalesce skip the comparison of any pair of IntegerPolyhedrons involving LocalIds, rather than crashing on them. The heuristics to handle these cases will be upstreamed later on. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D120995
-
Richard Howell authored
Add the -o flag to specify an output path for llvm-bitcode-strip. This matches the interface to the Xcode bitcode_strip tool. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D120731
-
William S. Moses authored
We can simplify an extractvalue of an insertvalue to extract out of the base of the insertvalue, if the insert and extract are at distinct and non-prefix'd indices Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120915
-
Louis Dionne authored
-
Balazs Benics authored
-
Augie Fackler authored
Prior to this change LLVM would happily elide a call to any allocation function and a call to any free function operating on the same unused pointer. This can cause problems in some obscure cases, for example if the body of operator new can be inlined but the body of operator delete can't, as in this example from jyknight:

```
#include <stdlib.h>
#include <stdio.h>

int allocs = 0;

void *operator new(size_t n) {
  allocs++;
  void *mem = malloc(n);
  if (!mem)
    abort();
  return mem;
}

__attribute__((noinline)) void operator delete(void *mem) noexcept {
  allocs--;
  free(mem);
}

void deleteit(int *i) { delete i; }

int main() {
  int *i = new int;
  deleteit(i);
  if (allocs != 0)
    printf("MEMORY LEAK! allocs: %d\n", allocs);
}
```

This patch addresses the issue by introducing the concept of an allocator function family and uses it to make sure that alloc/free function pairs are only removed if they're in the same family. Differential Revision: https://reviews.llvm.org/D117356
-
Karl Meakin authored
Differential Revision: https://reviews.llvm.org/D120422
-
Nikita Popov authored
-
Simon Pilgrim authored
Similar to what we already do in combineAndnp, if either operand is a constant then we can improve the demanded elts/bits.
-
David Spickett authored
This reverts commit 6b2482f6 due to test failures on AArch64 bots: https://lab.llvm.org/buildbot/#/builders/183/builds/3684
-
Andrzej Warzynski authored
In dd875dd8 I added a missing MLIR dependency in Flang. However, that particular CMake target is not exported as something available to standalone builds. In this patch I switch to `MLIRIR` instead, which depends on `MLIRBuiltinAttributeInterfacesIncGen` - the missing dependency added previously. Differential Revision: https://reviews.llvm.org/D120986
-