Commits · e97aab8d1510a484a19016cf079e1a8f913cf7cd · Lorenzo Albano / LLVM bpEVL

Feb 21, 2021

[ThinLTO] Fix import of multiply defined global variables · e97aab8d

Kristina Bessonova authored Feb 02, 2021

Currently, if there is a module that contains a strong definition of
a global variable and a module that has both a weak definition for
the same global and a reference to it, it may result in an undefined symbol error
while linking with ThinLTO.

It happens because:
* the strong definition become internal because it is read-only and can be imported;
* the weak definition gets replaced by a declaration because it's non-prevailing;
* the strong definition failed to be imported because the destination module
  already contains another definition of the global yet this def is non-prevailing.

The patch adds a check to computeImportForReferencedGlobals() that allows
considering a global variable for being imported even if the module contains
a definition of it in the case this def has an interposable linkage type.

Note that currently the check is based only on the linkage type
(and this seems to be enough at the moment), but it might be worth to account
the information whether the def is prevailing or not.

Reviewed By: tejohnson

Differential Revision: https://reviews.llvm.org/D95943

e97aab8d

[DAG] Match USUBSAT patterns through zext/trunc · 38ab47c8

Simon Pilgrim authored Feb 21, 2021

This patch handles usubsat patterns hidden through zext/trunc and uses the getTruncatedUSUBSAT helper to determine if the USUBSAT can be correctly performed in the truncated form:

zext(x) >= y ? x - trunc(y) : 0 --> usubsat(x,trunc(umin(y,SatLimit)))
zext(x) >  y ? x - trunc(y) : 0 --> usubsat(x,trunc(umin(y,SatLimit)))

Based on original examples:

void foo(unsigned short *p, int max, int n) {
    int i;
    unsigned m;
    for (i = 0; i < n; i++) {
        m = *--p;
        *p = (unsigned short)(m >= max ? m-max : 0);
    }
}

Differential Revision: https://reviews.llvm.org/D25987

38ab47c8

[X86][AVX] Fold concat(extract_subvector(v0,c0), extract_subvector(v1,c1)) -> vperm2x128 · a6a258f1
Simon Pilgrim authored Feb 21, 2021
```
Fixes regression exposed by removing bitcasts across logic-ops in D96206.

Differential Revision: https://reviews.llvm.org/D96206
```
a6a258f1

[X86] Fold bitcast(logic(bitcast(X), Y)) --> logic'(X, bitcast(Y)) for int-int bitcasts · 2885d125

Simon Pilgrim authored Feb 21, 2021

Extend the existing combine that handles bitcasting for fp-logic ops to also help remove logic ops across bitcasts to/from the same integer types.

This helps improve AVX512 predicate handling for D/Q logic ops and also allows DAGCombine's scalarizeExtractedBinop to remove some annoying gpr->simd->gpr transfers.

The concat_vectors regression in pr40891.ll will be addressed in a followup commit on this patch.

Differential Revision: https://reviews.llvm.org/D96206

2885d125

[RISCV] Add test cases for add/sub/mul overflow intrinsics. NFC · d9207d3f
Craig Topper authored Feb 21, 2021
```
Largely copied from AArch64/arm64-xaluo.ll
```
d9207d3f

[lld][ELF] __start_/__stop_ refs don't retain C-ident named group sections · 1a3f3a3f

Petr Hosek authored Feb 15, 2021

The special root semantics for identifier-named sections is meant
specifically for the metadata sections. In the context of group
semantics, where group members are always retained or discarded as a
unit, it's natural not to have this semantics apply to a section in a
group, otherwise we would never discard the group defeating the purpose
of using the group in the first place.

This change modifies the GC behavior so that __start_/__stop_ references
don't retain C identifier named sections in section groups which allows
for these groups to be collected. This matches the behavior of BFD ld.

The only kind of existing case that might break is interdependent
metadata sections that are all in a group together, but that group
doesn't contain any other sections referenced by anything except
implicit inclusion in a `__start_` and/or `__stop_`-referenced
identifier-named section, but such cases should be unlikely.

Differential Revision: https://reviews.llvm.org/D96753

1a3f3a3f

[CodeGen] Use range-based for loops (NFC) · 0b417ba2
Kazu Hirata authored Feb 20, 2021

0b417ba2
[TableGen] Use ListSeparator (NFC) · 9e4033b0
Kazu Hirata authored Feb 20, 2021

9e4033b0
[dfsan] Comment out unused methods by D97087 temporarily · 9524632f
Jianzhou Zhao authored Feb 21, 2021

9524632f
[clang][Driver][OpenBSD] libcxx also requires pthread · b42d57a1
Brad Smith authored Feb 20, 2021

b42d57a1

[lldb] Refine ThreadPlan::ShouldAutoContinue · b0186c25

Dave Lee authored Feb 19, 2021

Adjust `ShouldAutoContinue` to be available to any thread plan previous to the plan that
explains a stop, not limited to the parent to the plan that explains the stop.

Before this change, `Thread::ShouldStop` did the following:

1. find the plan that explains the stop
2. if it's not a master plan, continue processing previous (aka parent) plans
3. first, call `ShouldAutoContinue` on the immediate parent of the explaining plan
4. then loop over previous plans, calling `ShouldStop` and `MischiefManaged`

Of note, the iteration in step 4 does not call `ShouldAutoContinue`, so again only the
plan just prior to the explaining plan is given the opportunity to override whether to
continue or stop.

This commit changes the loop call `ShouldAutoContinue`, giving each plan the opportunity
to override `ShouldStop` of previous plans.

Why? This allows a plan to do the following:

1. mark itself done and be popped off the stack
2. allow parent plans to finish their work, and to also be popped off the stack
3. and finally, have the thread continue, not stop

This is useful for stepping into async functions. A plan will would step far enough
enough to set a breakpoint on the async target, and then use `ShouldAutoContinue` to
unwind the necessary stepping, and then have the calling thread continue.

Differential Revision: https://reviews.llvm.org/D97076

b0186c25

Update test error string post pass registration change · fa211f3c
Jacques Pienaar authored Feb 20, 2021

fa211f3c
[mlir] Register the print-op-graph pass using ODS · 02d7b260
Jacques Pienaar authored Feb 20, 2021
```
Move over to ODS & use pass options.
```
02d7b260
[NFC] Refactor PreferMemberInitializerCheck · 557d2ade
Nathan James authored Feb 20, 2021

557d2ade

Feb 20, 2021

[libcxx] [test] Call create_directory_symlink when linking directories · 3d6ca4b8

Martin Storsjö authored Feb 20, 2021

This makes the symlinks work properly on windows.

A similar round of cleanup was done in
c41bda7f, but these tests were
added after that.

Differential Revision: https://reviews.llvm.org/D97089

3d6ca4b8

[libcxx] Make path::format a non-class enum · 26005c78

Martin Storsjö authored Nov 06, 2020

The spec doesn't declare it as an enum class, and being declared
as an enum class breaks referring to the values as e.g.
path::auto_format.

Differential Revision: https://reviews.llvm.org/D97084

26005c78

[InstrProfiling] Use nobits as __llvm_prf_cnts section type in ELF · 6b286d93

Petr Hosek authored Feb 19, 2021

This can reduce the binary size because counters will no longer occupy
space in the binary, instead they will be allocated by dynamic linker.

Differential Revision: https://reviews.llvm.org/D97110

6b286d93

[clang-tidy] Simplify throw keyword missing check · 77056fe5

Stephen Kelly authored Dec 28, 2020

Extend test to verify that it does not match in template instantiations.

Differential Revision: https://reviews.llvm.org/D96132

77056fe5

[clang-tidy] Simplify function complexity check · 6852a29a

Stephen Kelly authored Dec 26, 2020

Update test to note use of lambda instead of the invisible operator().

Differential Revision: https://reviews.llvm.org/D96131

6852a29a

[RISCV] Add another test case showing failure to use remw when the RHS has... · 038bd147
Craig Topper authored Feb 20, 2021
```
[RISCV] Add another test case showing failure to use remw when the RHS has been zero extended from less than i32. NFC
```
038bd147

[clang-itdy] Simplify virtual near-miss check · 9a4b574d

Stephen Kelly authored Dec 29, 2020

Diagnose the problem in templates in the context of the template
declaration instead of in the context of all of the (possibly very many)
template instantiations.

Differential Revision: https://reviews.llvm.org/D96224

9a4b574d

[ConstantRange] Handle wrapping ranges in min/max (PR48643) · a852234f

Nikita Popov authored Jan 01, 2021

When one of the inputs is a wrapping range, intersect with the
union of the two inputs. The union of the two inputs corresponds
to the result we would get if we treated the min/max as a simple
select.

This fixes PR48643.

a852234f

[InstCombine] fold fdiv with exp/exp2 divisor (PR49147) · e772618f

Sanjay Patel authored Feb 20, 2021

Follow-up to:
D96648 / b40fde06
...for the special-case base calls.

From the earlier commit:
This is unusual in the general (non-reciprocal) case because we need
an extra instruction, but that should be better for general FP
reassociation and codegen. We conservatively check for "arcp" FMF
here as we do with existing fdiv folds, but it is not strictly
necessary to have that.

e772618f

[InstCombine] add tests for fdiv of exp/exp2; NFC · fbca27bf
Sanjay Patel authored Feb 19, 2021

fbca27bf

[ConstantRange] Handle wrapping range in binaryNot() · b6088f74

Nikita Popov authored Feb 20, 2021

We don't need any special handling for wrapping ranges (or empty
ranges for that matter). The sub() call will already compute a
correct and precise range.

We only need to adjust the test expectation: We're now computing
an optimal result, rather than an unsigned envelope.

b6088f74

[OpenMP] libomp: cleanup some resource leaks · 1611e547

AndreyChurbanov authored Feb 20, 2021

Close mutexattr and condattr local objects to eliminate resource leaks.

Differential Revision: https://reviews.llvm.org/D96892

1611e547

[RISCV] Add an additional remw test to rv64m-exhaustive-w-insts.ll. NFC · 09966a66

Craig Topper authored Feb 20, 2021

This adds the IR for this C code

int32_t foo(uint16_t x, int16_t y) {
  x %= y;
  return x;
}

Note the dividend is unsigned and the divisor is signed. C type
promotion rules will extend them and use a 32-bit srem and the
function returns a 32-bit result.

We fail to use remw for this case. The zero extended input has
enough sign bits, but we won't consider (i64 AssertZext X, i16) in
the sexti32 isel pattern.

We also end up with a extra shifts to zero upper bits on the result.
computeKnownBits knew the result was positive before type legalization
and allowed the SIGN_EXTEND to become ZERO_EXTEND. But after promoting
to i64 we no longer know that bit 31 (and all bits above it) should
be 0.

09966a66

[Clang][OpenMP] Update driver test case for OpenMP offload to use sm_35 · 33d66093

Shilei Tian authored Feb 20, 2021

`sm_35` is the minimum requirement for OpenMP offloading on NVPTX device.
Current driver test case is using `sm_20`. D97003 is going to switch the minimum
CUDA version to 9.2, which only supports `sm_30+`. This patch makes step for the
change.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D97120

33d66093

[clang-tidy] Simplify braced init check · e8b8f896

Stephen Kelly authored Dec 29, 2020

The normalization of matchers means that this now works in all language
modes.

Differential Revision: https://reviews.llvm.org/D96135

e8b8f896

clang: Exclude efi_main from -Wmissing-prototypes · 7dd42ecf

Daan De Meyer authored Jan 30, 2021

When compiling UEFI applications, the main function is named
efi_main() instead of main(). Let's exclude efi_main() from
-Wmissing-prototypes as well to avoid warnings when working
on UEFI applications.

Differential Revision: https://reviews.llvm.org/D95746

7dd42ecf

[ConstantRangeTest] Print detailed information on failure (NFC) · 5ec75c60

Nikita Popov authored Feb 20, 2021

When the optimality check fails, print the inputs, the computed
range and the better range that was found. This makes it much
simpler to identify the cause of the failure.

Make sure that full ranges (which, unlikely all the other cases,
have multiple ways to construct them that all result in the same
range) only print one message by handling them separately.

5ec75c60

[lld/mac] reject -undefined warning and -undefined suppress with -twolevel_namespace · 28d9953a

Nico Weber authored Feb 18, 2021

See discussion on https://reviews.llvm.org/D93263

-flat_namespace isn't implemented yet, and neither is -undefined dynamic,
so this makes -undefined pretty pointless in lld/MachO for now. But once
we implement -flat_namespace (which we need to do anyways to get check-llvm
to pass with lld as host linker), the code's already there.

Follow-up to https://reviews.llvm.org/D93263#2491865

Differential Revision: https://reviews.llvm.org/D96963

28d9953a

[ASTMatchers] Fix hasUnaryOperand matcher for postfix operators · 559f3728
Stephen Kelly authored Feb 19, 2021
```
Differential Revision: https://reviews.llvm.org/D97095
```
559f3728

[LTO] Fix cloning of llvm*.used when splitting module · fde55a9c

Teresa Johnson authored Feb 18, 2021

Refines the fix in 3c4c2050 to only
put globals whose defs were cloned into the split regular LTO module
on the cloned llvm*.used globals. This avoids an issue where one of the
attached values was a local that was promoted in the original module
after the module was cloned. We only need to have the values defined in
the new module on those globals.

Fixes PR49251.

Differential Revision: https://reviews.llvm.org/D97013

fde55a9c

[OpenMP][NFC] clang-format the whole openmp project · 309b00a4

Shilei Tian authored Feb 20, 2021

Same script as D95318. Test files are excluded.

Reviewed By: AndreyChurbanov

Differential Revision: https://reviews.llvm.org/D97088

309b00a4

Revert "Implement nullPointerConstant() using a better API." · 6984e0d4

Stephen Kelly authored Feb 14, 2021

This reverts commit 9148302a (2019-08-22) which broke the pre-existing
unit test for the matcher. Also revert commit 518b2266 (Fix the
nullPointerConstant() test to get bots back to green., 2019-08-22) which
incorrectly changed the test to expect the broken behavior.

Differential Revision: https://reviews.llvm.org/D96665

6984e0d4

[RISCV] Support extraction of misaligned subvectors · 3e1317fd

Fraser Cormack authored Feb 18, 2021

This patch extends the support for RVV EXTRACT_SUBVECTOR to cover those
which don't align to a vector register boundary. It accomplishes this by
extracting the nearest register-sized subvector (a subregister
operation), then sliding the vector down with VSLIDEDOWN and extracting
the subvector from the first position (a COPY operation).

Since this procedure involves the use of VSCALE and multiplication, the
handling of such operations is done during lowering to simplify the
implementation and make use of DAG combining. This necessitated moving
some helper functions from RISCVISelDAGToDAG to RISCVTargetLowering.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D96959

3e1317fd

[RISCV] Improve register allocation around vector masks · 9aa20cae

Fraser Cormack authored Feb 19, 2021

With vector mask registers only allocatable to V0 (VMV0Regs) it is
relatively simple to generate code which uses multiple masks and naively
requires spilling.

This patch aims to improve codegen in such cases by telling LLVM it can
use VRRegs to hold masks. This will prevent spilling in many cases by
having LLVM copy to an available VR register.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D97055

9aa20cae

[lit testing] "END." not "END:" · 4550fdff
David Zarzycki authored Feb 20, 2021

4550fdff

[InstCombine] matchBSwapOrBitReverse - remove pattern matching early-out. NFCI. · 609d0c97

Simon Pilgrim authored Feb 20, 2021

recognizeBSwapOrBitReverseIdiom + collectBitParts have pattern matching to bail out early if a bswap/bitreverse pattern isn't possible - we should be able to rely on this instead without any notable change in compile time.

This is part of a cleanup towards letting matchBSwapOrBitReverse /recognizeBSwapOrBitReverseIdiom use 'root' instructions that aren't ORs (FSHL/FSHRs in particular which can be prematurely created).

Differential Revision: https://reviews.llvm.org/D97056

609d0c97