Commits · e8b8f896024620eb86bf924be73100d83c74c736 · Lorenzo Albano / LLVM bpEVL

Feb 20, 2021

[clang-tidy] Simplify braced init check · e8b8f896

Stephen Kelly authored Dec 29, 2020

The normalization of matchers means that this now works in all language
modes.

Differential Revision: https://reviews.llvm.org/D96135

e8b8f896

clang: Exclude efi_main from -Wmissing-prototypes · 7dd42ecf

Daan De Meyer authored Jan 30, 2021

When compiling UEFI applications, the main function is named
efi_main() instead of main(). Let's exclude efi_main() from
-Wmissing-prototypes as well to avoid warnings when working
on UEFI applications.

Differential Revision: https://reviews.llvm.org/D95746

7dd42ecf

[ConstantRangeTest] Print detailed information on failure (NFC) · 5ec75c60

Nikita Popov authored Feb 20, 2021

When the optimality check fails, print the inputs, the computed
range and the better range that was found. This makes it much
simpler to identify the cause of the failure.

Make sure that full ranges (which, unlikely all the other cases,
have multiple ways to construct them that all result in the same
range) only print one message by handling them separately.

5ec75c60

[lld/mac] reject -undefined warning and -undefined suppress with -twolevel_namespace · 28d9953a

Nico Weber authored Feb 18, 2021

See discussion on https://reviews.llvm.org/D93263

-flat_namespace isn't implemented yet, and neither is -undefined dynamic,
so this makes -undefined pretty pointless in lld/MachO for now. But once
we implement -flat_namespace (which we need to do anyways to get check-llvm
to pass with lld as host linker), the code's already there.

Follow-up to https://reviews.llvm.org/D93263#2491865

Differential Revision: https://reviews.llvm.org/D96963

28d9953a

[ASTMatchers] Fix hasUnaryOperand matcher for postfix operators · 559f3728
Stephen Kelly authored Feb 19, 2021
```
Differential Revision: https://reviews.llvm.org/D97095
```
559f3728

[LTO] Fix cloning of llvm*.used when splitting module · fde55a9c

Teresa Johnson authored Feb 18, 2021

Refines the fix in 3c4c2050 to only
put globals whose defs were cloned into the split regular LTO module
on the cloned llvm*.used globals. This avoids an issue where one of the
attached values was a local that was promoted in the original module
after the module was cloned. We only need to have the values defined in
the new module on those globals.

Fixes PR49251.

Differential Revision: https://reviews.llvm.org/D97013

fde55a9c

[OpenMP][NFC] clang-format the whole openmp project · 309b00a4

Shilei Tian authored Feb 20, 2021

Same script as D95318. Test files are excluded.

Reviewed By: AndreyChurbanov

Differential Revision: https://reviews.llvm.org/D97088

309b00a4

Revert "Implement nullPointerConstant() using a better API." · 6984e0d4

Stephen Kelly authored Feb 14, 2021

This reverts commit 9148302a (2019-08-22) which broke the pre-existing
unit test for the matcher. Also revert commit 518b2266 (Fix the
nullPointerConstant() test to get bots back to green., 2019-08-22) which
incorrectly changed the test to expect the broken behavior.

Differential Revision: https://reviews.llvm.org/D96665

6984e0d4

[RISCV] Support extraction of misaligned subvectors · 3e1317fd

Fraser Cormack authored Feb 18, 2021

This patch extends the support for RVV EXTRACT_SUBVECTOR to cover those
which don't align to a vector register boundary. It accomplishes this by
extracting the nearest register-sized subvector (a subregister
operation), then sliding the vector down with VSLIDEDOWN and extracting
the subvector from the first position (a COPY operation).

Since this procedure involves the use of VSCALE and multiplication, the
handling of such operations is done during lowering to simplify the
implementation and make use of DAG combining. This necessitated moving
some helper functions from RISCVISelDAGToDAG to RISCVTargetLowering.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D96959

3e1317fd

[RISCV] Improve register allocation around vector masks · 9aa20cae

Fraser Cormack authored Feb 19, 2021

With vector mask registers only allocatable to V0 (VMV0Regs) it is
relatively simple to generate code which uses multiple masks and naively
requires spilling.

This patch aims to improve codegen in such cases by telling LLVM it can
use VRRegs to hold masks. This will prevent spilling in many cases by
having LLVM copy to an available VR register.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D97055

9aa20cae

[lit testing] "END." not "END:" · 4550fdff
David Zarzycki authored Feb 20, 2021

4550fdff

[InstCombine] matchBSwapOrBitReverse - remove pattern matching early-out. NFCI. · 609d0c97

Simon Pilgrim authored Feb 20, 2021

recognizeBSwapOrBitReverseIdiom + collectBitParts have pattern matching to bail out early if a bswap/bitreverse pattern isn't possible - we should be able to rely on this instead without any notable change in compile time.

This is part of a cleanup towards letting matchBSwapOrBitReverse /recognizeBSwapOrBitReverseIdiom use 'root' instructions that aren't ORs (FSHL/FSHRs in particular which can be prematurely created).

Differential Revision: https://reviews.llvm.org/D97056

609d0c97

[libc++] Fix the build for AppleClang. · 84dbcdd5

Mark de Wever authored Feb 20, 2021

Forgot to add some parts of D93593, this should disable the tests on
Apple. Seems Louis was right ;-)

84dbcdd5

[RISCV] Pre-commit test case for D97055. NFC. · 55b75d83
Fraser Cormack authored Feb 20, 2021
```
This adds a test which unnecessarily spills mask registers.
```
55b75d83
[X86][SSE] Use llvm min/max intrinsics instead of (deprecated) sse intrinsics. NFCI. · ee0dee7d
Simon Pilgrim authored Feb 20, 2021
```
These are auto-upgraded to the equivalent llvm variants now.
```
ee0dee7d

[X86][SSE] vector-compare-combines.ll - use llvm min/max intrinsics instead of... · a274062b

Simon Pilgrim authored Feb 20, 2021

[X86][SSE] vector-compare-combines.ll - use llvm min/max intrinsics instead of (deprecated) sse intrinsics. NFCI.

These are auto-upgraded to the equivalent llvm variants now.

a274062b

[X86][AVX] Remove AVX2 min/max intrinsics tests · 75e66b88

Simon Pilgrim authored Feb 20, 2021

These are now autoupgraded to the llvm equivalents and the tests already moved avx2-intrinsics-x86-upgrade.ll

75e66b88

[X86][SSE] Remove SSE41 min/max intrinsics tests · b9f3b305

Simon Pilgrim authored Feb 20, 2021

These are now autoupgraded to the llvm equivalents and the tests already moved sse41-intrinsics-x86-upgrade.ll

b9f3b305

[X86][SSE2] Remove SSE2 min/max intrinsics tests · 63422bcb

Simon Pilgrim authored Feb 20, 2021

These are now autoupgraded to the llvm equivalents and the tests already moved sse2-intrinsics-x86-upgrade.ll

63422bcb

[X86] KnownBits - use llvm min/max intrinsics instead of (deprecated) sse intrinsics. NFCI. · ff51bcee
Simon Pilgrim authored Feb 20, 2021
```
These are auto-upgraded to the equivalent llvm variants now.
```
ff51bcee
[DAG] foldSubToUSubSat - fold sub(a,trunc(umin(zext(a),b))) -> usubsat(a,trunc(umin(b,SatLimit))) · 761bbed2
Simon Pilgrim authored Feb 19, 2021
```
This moves the last custom x86 USUBSAT fold to generic DAGCombine.

Completes PR40111

Differential Revision: https://reviews.llvm.org/D96703
```
761bbed2

[ConstantRangeTest] Make exhaustive testing more principled (NFC) · 2b729548

Nikita Popov authored Sep 23, 2020

The current infrastructure for exhaustive ConstantRange testing is
somewhat confusing in what exactly it tests and currently cannot even
be used for operations that produce smallest-size results, rather than
signed/unsigned envelopes.

This patch makes the testing more principled by collecting the exact
set of results of an operation into a bit set and then comparing it
against the range approximation by:

 * Checking conservative correctness: All elements in the set must be
   in the range.
 * Checking optimality under a given preference function: None of the
   (slack-free) ranges that can be constructed from the set are
   preferred over the computed range.

Implemented preference functions are:

 * PreferSmallest: Smallest range regardless of signed/unsigned wrapping
   behavior. Probably what we would call "optimal" without further
   qualification.
 * PreferSmallestUnsigned/Signed: Smallest range that has no
   unsigned/signed wrapping. We use this if our calculation is precise
   only up to signed/unsigned envelope.
 * PreferSmallestNonFullUnsigned/Signed: Smallest range that has no
   unsigned/signed wrapping -- but preferring a smaller wrapping range
   over a (non-wrapping) full range. We use this if we have a fully
   precise calculation but apply a sign preference to the result
   (union/intersection). Even with a sign preference, returning a
   wrapping range is still "strictly better" than returning a full one.

This also addresses PR49273 by replacing the fragile manual range
construction logic in testBinarySetOperationExhaustive() with generic
code that isn't specialized to the particular form of ranges that set
operations can produces.

Differential Revision: https://reviews.llvm.org/D88356

2b729548

[Sanitizers][NFC] Fix typo · 7c31661b
Luís Marques authored Feb 20, 2021

7c31661b

[lit] Add --xfail and --filter-out (inverse of --filter) · 45d058e5

David Zarzycki authored Feb 16, 2021

In semi-automated environments, XFAILing or filtering out known regressions without actually committing changes or temporarily modifying the test suite can be quite useful.

Reviewed By: yln

Differential Revision: https://reviews.llvm.org/D96662

45d058e5

Update BPFAdjustOpt.cpp to accept select form of or as well · e4d751c2

Juneyoung Lee authored Feb 20, 2021

This is a minor pattern-match update to BPFAdjustOpt.cpp to accept
not only 'or i1 a, b' but also 'select i1 a, i1 true, i1 b'.
This resolves regression after SimplifyCFG's creating select form
of and/or instead (https://reviews.llvm.org/D95026).
This is a small change, and currently such select form isn't created
or doesn't reach to the late pipeline (because InstCombine eagerly
folds it into and/or i1), so I chose to commit without a review process.

e4d751c2

[AArch64][GlobalISel] Add selection support for G_VECREDUCE of <2 x i32> · 067ec53d
Amara Emerson authored Feb 20, 2021
```
This selects to a pairwise add and a subreg copy.
```
067ec53d

[libcxx] [test] Remove two unnecesary files/variables in a test · 7b5f0050

Martin Storsjö authored Feb 19, 2021

These don't seem to have any function in the test.

The non_regular_file one seems to have been added in
0f8c8f59, without any apparent
purpose there.

Differential Revision: https://reviews.llvm.org/D97083

7b5f0050

[libcxx] Rename a method in PathParser for clarity. NFC. · 8c305a5d
Martin Storsjö authored Jan 09, 2021
```
Differential Revision: https://reviews.llvm.org/D97081
```
8c305a5d

[libc++] Fixes _LIBCPP_HAS_NO_CONCEPTS · 1a5c92f6

Mark de Wever authored Feb 20, 2021

Before the define was in a GCC specific part. Now it's available for all
compilers. The patch had its CI run in D93593.

1a5c92f6

[InstCombine] Add more tests to nonnull-select.ll (NFC) · 3b8cfef4
Juneyoung Lee authored Feb 20, 2021

3b8cfef4
[CodeGen] Use range-based for loops (NFC) · a205fa5c
Kazu Hirata authored Feb 19, 2021

a205fa5c
[TableGen] Use ListSeparator (NFC) · f169c027
Kazu Hirata authored Feb 19, 2021

f169c027
Fixed failing test · 501b4fe4
Dávid Bolvanský authored Feb 20, 2021

501b4fe4
Reduce the number of attributes attached to each function · ee51c42e
Dávid Bolvanský authored Feb 20, 2021
```
This takes advantage of the implicit default behavior to reduce the number of
attributes.
```
ee51c42e
Reland "[Libcalls, Attrs] Annotate libcalls with noundef" · cd54c579
Dávid Bolvanský authored Feb 20, 2021
```
Fixed Clang tests.
```
cd54c579

[ValueTracking] Improve impliesPoison · aacf7878

Juneyoung Lee authored Feb 18, 2021

This patch improves ValueTracking's impliesPoison(V1, V2) to do this reasoning:

```
  %res = call { i64, i1 } @llvm.umul.with.overflow.i64(i64 %a, i64 %b)
  %overflow = extractvalue { i64, i1 } %res, 1
  %mul      = extractvalue { i64, i1 } %res, 0

	; If %mul is poison, %overflow is also poison, and vice versa.
```

This improvement leads to supporting this optimization under `-instcombine-unsafe-select-transform=0`:

```
define i1 @test2_logical(i64 %a, i64 %b, i64* %ptr) {
; CHECK-LABEL: @test2_logical(
; CHECK-NEXT:    [[MUL:%.*]] = mul i64 [[A:%.*]], [[B:%.*]]
; CHECK-NEXT:    [[TMP1:%.*]] = icmp ne i64 [[A]], 0
; CHECK-NEXT:    [[TMP2:%.*]] = icmp ne i64 [[B]], 0
; CHECK-NEXT:    [[OVERFLOW_1:%.*]] = and i1 [[TMP1]], [[TMP2]]
; CHECK-NEXT:    [[NEG:%.*]] = sub i64 0, [[MUL]]
; CHECK-NEXT:    store i64 [[NEG]], i64* [[PTR:%.*]], align 8
; CHECK-NEXT:    ret i1 [[OVERFLOW_1]]
;

  %res = tail call { i64, i1 } @llvm.umul.with.overflow.i64(i64 %a, i64 %b)
  %overflow = extractvalue { i64, i1 } %res, 1
  %mul = extractvalue { i64, i1 } %res, 0
  %cmp = icmp ne i64 %mul, 0
  %overflow.1 = select i1 %overflow, i1 true, i1 %cmp
  %neg = sub i64 0, %mul
  store i64 %neg, i64* %ptr, align 8
  ret i1 %overflow.1
}
```

Previously, this didn't happen because the flag prevented `select i1 %overflow, i1 true, i1 %cmp` from being `or i1 %overflow, %cmp`.
Note that the select -> or conversion happens only when `impliesPoison(%cmp, %overflow)` returns true.
This improvement allows `impliesPoison` to do the reasoning.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D96929

aacf7878

[mlir][sparse] convert function pass to module pass · f32b3401

Aart Bik authored Feb 19, 2021

Rationale:
Touching function level information can only be done within a module pass.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D97102

f32b3401

[SampleFDO] Skip PreLink ICP for better profile quality of MonoLTO PostLink · a952d729

Wenlei He authored Feb 18, 2021

For ThinLTO, PreLink ICP is skipped to favor better profile annotation during LTO PostLink. This change applies the same tweak for MonoLTO. Note that PreLink ICP not only makes PostLink profile annotation harder, it is also uncoordinated with PostLink ICP so duplicated ICP could happen.

Differential Revision: https://reviews.llvm.org/D97028

a952d729

Revert "[Libcalls, Attrs] Annotate libcalls with noundef" · 94d034fb
Dávid Bolvanský authored Feb 20, 2021
```
This reverts commit 33b0c637. Bots are failing. Some Clang tests need to be updated too.
```
94d034fb

[RISCV] Teach our custom vector load/store intrinsic isel code to propagate... · 71b68fe5

Craig Topper authored Feb 19, 2021

[RISCV] Teach our custom vector load/store intrinsic isel code to propagate memory operands if we have them.

We don't currently create memory operands for these intrinsics,
but there was a suggestion of using the indexed load/store
intrinsics to implement isel for scalable vector gather/scatter.
That may propagate the memory operand from the gather/scatter
ISD nodes.

71b68fe5