Commits · 731206f3684af5979e3a794970db83f9a34b4541 · Lorenzo Albano / LLVM bpEVL

May 11, 2021

[mlir] Move move capture in SparseElementsAttr::getValues · 731206f3
River Riddle authored May 11, 2021
```
This was a TODO for the move to C++14. Now that the move has been completed, we can resolve it.
```
731206f3

[lld][WebAssembly] Remove relocation target verification · b49a798e

Sam Clegg authored May 11, 2021

We have this extra step in wasm-ld that doesn't exist in other lld
backend which verifies the existing contents of the relocation targets.
This was originally intended as an extra form of double checking and an
aid to compiler developers.   However it has always been somewhat
controversial and there have been suggestions in the past the we simply
remove it.

My motivation for removing it now is that its causing me a headache
when trying to fix an issue with negative addends.  In the case of
negative addends that final result can be wrapped/negative but this
checking code would require significant modification to be able to deal
with that case.  For example with some test cases I'm looking at I'm
seeing error like this:

```
wasm-ld: warning: /usr/local/google/home/sbc/dev/wasm/llvm-build/tools/lld/test/wasm/Output/merge-string.s.tmp.o:(.rodata_relocs): unexpected existing value for R_WASM_MEMORY_ADDR_I32: existing=FFFFFFFA expected=FFFFFFFFFFFFFFFA
```

Rather than try to refactor `calcExpectedValue` to somehow return two
different types of results (32 and 64-bit) depending on the relocation
type, I think we can just remove this code.

Differential Revision: https://reviews.llvm.org/D102265

b49a798e

Add an "interrupt timeout" to Process, and pipe that through the · 9558b602

Jim Ingham authored May 06, 2021

ProcessGDBRemote plugin layers.

Also fix a bug where if we tried to interrupt, but the ReadPacket
wakeup timer woke us up just after the timeout, we would break out
the switch, but then since we immediately check if the response is
empty & fail if it is, we could end up actually only giving a
small interval to the interrupt.

Differential Revision: https://reviews.llvm.org/D102085

9558b602

[libc++] Run `substitutes-in-compile-flags.sh.cpp` test on Windows. · 384dd9dd

Vladimir Vereschaka authored May 11, 2021

Fix for substitutes-in-compile-flags.sh.cpp to run it properly on Windows platform.

Differential Revision: https://reviews.llvm.org/D102048

384dd9dd

[OpenMP] Use compound operators for reduction combiner if available. · f90abac6

Mike Rice authored Mar 19, 2021

The OpenMP spec seems to require the compound operators be used for
+, *, &, |, and ^ reduction.  So use these if a class has those operators.
If not try the simple operators as we did previously to limit the impact
to existing code.

Fixes: https://bugs.llvm.org/show_bug.cgi?id=48584

Differential Revision: https://reviews.llvm.org/D101941

f90abac6

[clang] Support -fpic -fno-semantic-interposition for RISCV · 2075f2b2

Fangrui Song authored May 11, 2021

-fno-semantic-interposition (only effective with -fpic) can optimize default
visibility external linkage (non-ifunc-non-COMDAT) variable access and function
calls to avoid GOT/PLT, by using local aliases, e.g.
```
int var;
__attribute__((optnone)) int fun(int x) { return x * x; }
int test() { return fun(var); }
```

-fpic (var and fun are dso_preemptable)
```
test:
.LBB1_1:
        auipc   a0, %got_pcrel_hi(var)
        ld      a0, %pcrel_lo(.LBB1_1)(a0)
        lw      a0, 0(a0)
// fun is preemptible by default in ld -shared mode. ld will create a PLT.
        tail    fun@plt
```

vs -fpic -fno-semantic-interposition (var and fun are dso_local)
```
test:
.Ltest$local:
.LBB1_1:
        auipc   a0, %pcrel_hi(.Lvar$local)
        addi    a0, a0, %pcrel_lo(.LBB1_1)
        lw      a0, 0(a0)
// The assembler either resolves .Lfun$local at assembly time (-mno-relax
// -fno-function-sections), or produces a relocation referencing a non-preemptible
// local symbol (which can avoid PLT).
        tail    .Lfun$local
```

Note: Clang's default -fpic is more aggressive than GCC -fpic: interprocedural
optimizations (including inlining) are available but local aliases are not used.
-fpic -fsemantic-interposition can disable interprocedural optimizations.

Depends on D101875

Reviewed By: luismarques

Differential Revision: https://reviews.llvm.org/D101876

2075f2b2

[lld][WebAssembly] Convert test to assembly. NFC. · b2f227c6
Sam Clegg authored May 11, 2021
```
Differential Revision: https://reviews.llvm.org/D102264
```
b2f227c6
[X86] X86TTIImpl::getInterleavedMemoryOpCostAVX2(): canonicalize to integer type · 97e04d41
Roman Lebedev authored May 11, 2021
```
This way we don't have to duplicate i32/f32 and i64/f64 entries,
which was already forgotten to be done for a few tuples.
```
97e04d41

[GlobalOpt] Remove heap SROA · 129f466e

Fangrui Song authored May 11, 2021

GlobalOpt implements a heap SROA (SROA for an malloc allocatated struct or array
of structs) which is largely undertested (heap-sra-[1234].ll are basically the
same test with very little difference) and does not trigger at all when
bootstrapping clang (it only supports the case of one single store).

The heap SROA implementation causes PR50027 (GEP is not properly handled; crash or miscompile).
Just drop the implementation. I have deleted some obviously duplicated tests
but kept `heap-sra-[12]{,-no-nullopt}.ll`.

Reviewed By: aeubanks

Differential Revision: https://reviews.llvm.org/D102257

129f466e

[AArch64][GlobalISel] Support truncstorei8/i16 w/ combine to form truncating G_STOREs. · ae2b36e8

Amara Emerson authored Jan 24, 2021

This needs some tablegen changes so that we can actually import the patterns properly.

Differential Revision: https://reviews.llvm.org/D102204

ae2b36e8

[RISCV] Prefer to lower MC_GlobalAddress operands to .Lfoo$local · ec27c5f1

Fangrui Song authored May 11, 2021

Similar to X86 D73230 and AArch64 D101872

With this change, we can set dso_local in clang's -fpic -fno-semantic-interposition mode,
for default visibility external linkage non-ifunc-non-COMDAT definitions.

For such dso_local definitions, variable access/taking the address of a
function/calling a function will go through a local alias to avoid GOT/PLT.

Reviewed By: jrtc27, luismarques

Differential Revision: https://reviews.llvm.org/D101875

ec27c5f1

[ArgumentPromotion] Fix byval alignment handling. · 61cbbba7

Eli Friedman authored Oct 20, 2020

Make sure the alignment of the generated operations matches the
alignment of the byval argument.  Previously, we were just ignoring
alignment and getting lucky.

While I'm here, also delete the unnecessary "tail" handling.
Passing a pointer to a byval argument to a "tail" call is UB, so
rewriting to an alloca doesn't require any special handling.

Differential Revision: https://reviews.llvm.org/D89819

61cbbba7

[mlir][ODS]: Add per-op cppNamespace. · 49755871

Sean Silva authored May 10, 2021

This is useful for dialects that have logical subparts.

Differential Revision: https://reviews.llvm.org/D102200

49755871

[libcxx] [test] Fix filesystem permission tests for windows · 68de58cd

Martin Storsjö authored Feb 26, 2021

On Windows, the permission bits are mapped down to essentially only
two possible states; readonly or readwrite. Normalize the checked
permission bitmask to match what the implementation will return.

Differential Revision: https://reviews.llvm.org/D101728

68de58cd

[git-clang-format] Do not apply clang-format to symlinks · 0fd0a010

Pirama Arumuga Nainar authored May 04, 2021

This fixes PR46992.

Git stores symlinks as text files and we should not format them even if
they have one of the requested extensions.

(Move the call to `cd_to_toplevel()` up a few lines so we can also print
the skipped symlinks during verbose output.)

Differential Revision: https://reviews.llvm.org/D101878

0fd0a010

[lld/mac] Implement -sectalign · 9ab49ae5

Nico Weber authored May 11, 2021

clang sometimes passes this flag along (see D68351), so we should implement it.

Differential Revision: https://reviews.llvm.org/D102247

9ab49ae5

Re-apply "[ORC-RT] Add unit test infrastructure, extensible_rtti..." · e0b6c992
Lang Hames authored May 11, 2021
```
This reapplies 6d263b6f (which was reverted in 1c7c6f2b) with a fix for a
CMake issue.
```
e0b6c992

[flang] Allow large and erroneous ac-implied-do's · 5a9497d6

Peter Steinfeld authored May 10, 2021

We sometimes unroll an ac-implied-do of an array constructor into a flat list
of values. We then re-analyze the array constructor that contains the
resulting list of expressions. Such a list may or may not contain errors.

But when processing an array constructor with an unrolled ac-implied-do, the
compiler was building an expression to represent the extent of the resulting
array constructor containing the list of values. The number of operands
in this extent expression was based on the number of elements in the
unrolled list of values. For very large lists, this created an
expression so large that it could not be evaluated by the compiler
without overflowing the stack.

I fixed this by continuously folding the extent expression as each operand is
added to it. I added the test .../flang/test/Semantics/array-constr-big.f90
that will cause the compiler to seg fault without this change.

Also, when the unrolled ac-implied-do expression contains errors, we were
repeating the same error message referencing the same source line for every
instance of the erroneous expression in the unrolled list. This potentially
resulted in a very long list of messages for a single error in the source code.

I fixed this by comparing the message being emitted to the previously emitted
message. If they are the same, I do not emit the message. This change is also
tested by the new test array-constr-big.f90.

Several of the existing tests had duplicate error messages for the same source
line, and this change caused differences in their output. So I adjusted the
tests to match the new message emitting behavior.

Differential Revision: https://reviews.llvm.org/D102210

5a9497d6

[TextAPI] Reformat llvm_unreachable message · cba508fb

Sam Powell authored May 11, 2021

Change llvm_unreachable message from "Unknown llvm.MachO.PlatformKind
enum" to "Unknown llvm::MachO::PlatformKind enum".

Differential revision: https://reviews.llvm.org/D102250

cba508fb

Revert "[ORC-RT] Add unit test infrastructure, extensible_rtti..." · 1c7c6f2b
Lang Hames authored May 11, 2021
```
This reverts commit 6d263b6f while I investigate the CMake failures that it
causes in some configurations.
```
1c7c6f2b

Reland "[Coverage] Fix branch coverage merging in... · eccb9251

Alan Phipps authored May 11, 2021

Reland "[Coverage] Fix branch coverage merging in FunctionCoverageSummary::get() for instantiation""

Originally landed in: 6400905a
Reverted in: 668dccc3

Fix branch coverage merging in FunctionCoverageSummary::get() for instantiation
groups.

This change corrects the implementation for the branch coverage summary to do
the same thing for branches that is done for lines and regions.  That is,
across function instantiations in an instantiation group, the maximum branch
coverage found in any of those instantiations is returned, with the total
number of branches being the same across instantiations.

Differential Revision: https://reviews.llvm.org/D102193

eccb9251

[X86][SSE] Add tests for permute(phaddw(phaddw(x,y),phaddw(z,w))) ->... · 4f80340f

Simon Pilgrim authored May 11, 2021

[X86][SSE] Add tests for permute(phaddw(phaddw(x,y),phaddw(z,w))) -> phaddw(phaddw(),phaddw()) folds.

We currently only fold if NumEltsPerLane == 4

4f80340f

[libcxx][tests] Fix incomplte.verify tests by disabling them on clang-10. · db13f832

zoecarver authored May 11, 2021

For some reason clang-10 can't match the expected errors produced by
passing icomplete arrays to range access functions. Disabling the tests
is a stop-gap solution to fix the bots.

db13f832

[RISCV] Use fractional LMULs for fixed length types smaller than riscv-v-vector-bits-min. · ce6e4f27

Craig Topper authored May 11, 2021

My thought process is that if v2i64 is an LMUL=1 type then v2i32
should be an LMUL=1/2 type. We limit the fractional LMUL so that
SEW=64 clips to LMUL=1, SEW=32 clips to LMUL=1/2, etc. This
ensures there's always a fractional LMUL available to truncate a type.
This does reduce the number of vsetvlis in some cases.

Some tests increase vsetvlis because the best container type for a
mask type is dependent on the LMUL+SEW that the mask was produced
from, but you can't tell that from the type. I think this is
something we need to solve this in the machine IR when optimizing
vsetvlis.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D101215

ce6e4f27

[X86][Codegen] Shift amount mod: sh? i64 x, (32-y) --> sh? i64 x, -(y+32) · 5f78ba00

Roman Lebedev authored May 11, 2021

I've seen this in the RawSpeed's BitPumpMSB*::push() hotpath,
after fixing the buffer abstraction to a more sane one,
when looking into a +5% runtime regression.
I was hoping that this would fix it, but it does not look it does.

This seems to be at least not worse than the original pattern.
But i'm actually mainly interested in the case where we already
compute `(y+32)` (see last test),

https://alive2.llvm.org/ce/z/ZCzJio

Reviewed By: spatel

Differential Revision: https://reviews.llvm.org/D101944

5f78ba00

[RISCV] Match trunc_vector_vl+sra_vl/srl_vl with splat shift amount to vnsra/vnsrl. · dc00cbb5

Craig Topper authored May 10, 2021

Limited to splats because we would need to truncate the shift
amount vector otherwise.

I tried to do this with new ISD nodes and a DAG combine to
avoid such a large pattern, but we don't form the splat until
LegalizeDAG and need DAG combine to remove a scalable->fixed->scalable
cast before it becomes visible to the shift node. By the time that
happens we've already visited the truncate node and won't revisit it.

I think I have an idea how to improve i64 on RV32 I'll save for a
follow up.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D102019

dc00cbb5

Revert "Fix branch coverage merging in FunctionCoverageSummary::get() for instantiation" · 668dccc3
Alan Phipps authored May 11, 2021
```
This reverts commit 6400905a.
```
668dccc3
[libc++] Remove more unnecessary _VSTD:: from type names. NFCI. · 6491d99e
Arthur O'Dwyer authored May 10, 2021
```
Differential Revision: https://reviews.llvm.org/D102181
```
6491d99e
[libc++] s/_VSTD::is_unsigned/is_unsigned/ in <random>. NFCI. · 866b2795
Arthur O'Dwyer authored May 10, 2021

866b2795
[libc++] s/_VSTD::chrono/chrono/g. NFCI. · aa5e3bee
Arthur O'Dwyer authored May 10, 2021

aa5e3bee
[libc++] s/std::size_t/size_t/g. NFCI. · 0b8da5fa
Arthur O'Dwyer authored May 10, 2021

0b8da5fa
[libc++] s/_VSTD::declval/declval/g. NFCI. · ab3fcc50
Arthur O'Dwyer authored May 10, 2021

ab3fcc50

[libomptarget][nfc] Add hook to easily disable building amdgcn bclib · 72995a4b

Jon Chesterfield authored May 11, 2021

[libomptarget][nfc] Add hook to easily disable building amdgcn bclib

This is useful when building LLVM with a toolchain that can't emit code
for amdgcn, e.g. because it overrides the include search path with headers
from another architecture, or the clang compiler is missing builtins.

Reviewed By: tianshilei1992

Differential Revision: https://reviews.llvm.org/D102229

72995a4b

[mlir] Use static shape knowledge when lowering memref.reshape · b20e150c

Benjamin Kramer authored May 11, 2021

This is actually necessary for correctness, as memref.reinterpret_cast
doesn't verify if the output shape doesn't match the static sizes.

Differential Revision: https://reviews.llvm.org/D102232

b20e150c

Add null-pointer checks when accessing a TypeSystem's SymbolFile · ec28e43e

Augusto Noronha authored May 11, 2021

A type system is not guaranteed to have a symbol file. This patch adds null-pointer checks so we don't crash when trying to access a type system's symbol file.

Reviewed By: aprantl, teemperor

Differential Revision: https://reviews.llvm.org/D101539

ec28e43e

Change Target::ReadMemory to ensure the amount of memory read from the... · 6c82b8a3

Augusto Noronha authored May 11, 2021

Change Target::ReadMemory to ensure the amount of memory read from the file-cache is the amount requested.

This change ensures that if for whatever reason we read less bytes than expected (for example, when trying to read memory that spans multiple sections), we try reading from the live process as well.

Reviewed By: jasonmolenda

Differential Revision: https://reviews.llvm.org/D101390

6c82b8a3

Fix branch coverage merging in FunctionCoverageSummary::get() for instantiation · 6400905a

Alan Phipps authored May 10, 2021

groups.

This change corrects the implementation for the branch coverage
summary to do the same thing for branches that is done for lines and regions.
That is, across function instantiations in an instantiation group, the maximum
branch coverage found in any of those instantiations is returned, with the
total number of branches being the same across instantiations.

Differential Revision: https://reviews.llvm.org/D102193

6400905a

[NFC][X86] Precommit another testcase for D101944 · 2c1f9f39
Roman Lebedev authored May 11, 2021

2c1f9f39

Produce warning for performing pointer arithmetic on a null pointer. · dfc1e31d

Jamie Schmeiser authored May 11, 2021

Summary:
Test and produce warning for subtracting a pointer from null or subtracting
null from a pointer.  Reuse existing warning that this is undefined
behaviour.  Also add unit test for both warnings.

Reformat to satisfy clang-format.

Respond to review comments:  add additional test.

Respond to review comments:  Do not issue warning for nullptr - nullptr
in C++.

Fix indenting to satisfy clang-format.

Respond to review comments:  Add C++ tests.

Author: Jamie Schmeiser <schmeise@ca.ibm.com>
Reviewed By: efriedma (Eli Friedman), nickdesaulniers (Nick Desaulniers)
Differential Revision: https://reviews.llvm.org/D98798

dfc1e31d

[IR][AutoUpgrade] Drop align attribute from void return types · 4eff9469

Steven Wu authored May 11, 2021

Since D87304, `align` become an invalid attribute on none pointer types and
verifier will reject bitcode that has invalid `align` attribute.

The problem is before the change, DeadArgumentElimination can easily
turn a pointer return type into a void return type without removing
`align` attribute. Teach Autograde to remove invalid `align` attribute
from return types to maintain bitcode compatibility.

rdar://77022993

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D102201

4eff9469