Commits · d2dcff60fe230bf5e5f7aeae931c982af4ef3721 · Lorenzo Albano / LLVM bpEVL

Jul 02, 2020

[Alignment][NFC] VectorLayout now uses Align internally · d2dcff60

Guillaume Chatelet authored Jul 02, 2020

By rewritting `ScalarizerVisitor::getVectorLayout` in such a way it returns `VectorLayout` (or `None`) it becomes obvious that `VectorLayout::VecAlign` cannot be `0`.

This patch is part of a series to introduce an Alignment type.
See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html
See this patch for the introduction of the type: https://reviews.llvm.org/D64790

Differential Revision: https://reviews.llvm.org/D82981

d2dcff60

[AArch64][SVE] Add reg+imm addressing mode for unpredicated stores · fd6193d5

Kerry McLaughlin authored Jul 02, 2020

Reviewers: sdesmalen, efriedma, david-arm

Reviewed By: efriedma

Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82985

fd6193d5

[InstCombine] Add some (vXi1 trunc(lshr(x,c))) -> icmp_eq(and(x,c')) tests for non-uniform vectors · 421c02e5
Simon Pilgrim authored Jul 02, 2020
```
As noticed on PR46531
```
421c02e5
Regenerate apint-shift tests and replace %tmp variable names to silence update_test_checks warnings · 11c4bb0c
Simon Pilgrim authored Jul 02, 2020

11c4bb0c

[LV] Enable the LoopVectorizer to create pointer inductions · a8fe1206

Anna Welker authored Jul 02, 2020

This patch enables the LoopVectorizer to build a phi of pointer
type and provide the vector loads and stores with vector type
getelementptrs built from the pointer induction variable, which
produces much less instructions than the previous approach of
creating scalar getelementpointers and glue them together to a
vector.

Differential Revision: https://reviews.llvm.org/D81267

a8fe1206

Regenerate llvm/test/CodeGen/X86/optimize-max-0.ll · 58a56ef4
Roman Lebedev authored Jul 02, 2020
```
It surprizingly appears to be affected by the last SCEV patch
```
58a56ef4

[ScalarEvolution] createSCEV(): recognize `udiv`/`urem` disguised as an `sdiv`/`srem` · 2c16100e

Roman Lebedev authored Jul 02, 2020

Summary:
While InstCombine trivially converts that `srem` into a `urem`,
it might happen later than wanted, in particular i'd like
for that to happen on  https://godbolt.org/z/bwuEmJ test case
early in pipeline, before first instcombine run, just before `-mem2reg`.

SCEV should recognize this case natively.

Reviewers: mkazantsev, efriedma, nikic, reames

Reviewed By: efriedma

Subscribers: clementval, hiraditya, javed.absar, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82721

2c16100e

[BasicAA] New basic-aa-recphi test. NFC · 68498ce8
David Green authored Jul 02, 2020

68498ce8
[gn build] Port 804d9687 · 559685d0
LLVM GN Syncbot authored Jul 02, 2020

559685d0

[SVE] Add warnings checks in four more LLVM SVE tests · 00f59216

David Sherwood authored Jul 01, 2020

I have added CHECK lines to the following tests:

  llvm/test/CodeGen/AArch64/sve-breakdown-scalable-vectortype.ll
  llvm/test/CodeGen/AArch64/sve-calling-convention-tuple-types.ll
  llvm/test/CodeGen/AArch64/sve-intrinsics-create-tuple.ll
  llvm/test/CodeGen/AArch64/sve-intrinsics-loads.ll

since they are now free of warnings related to invalid use of
EVT::getVectorNumElements() and VectorType::getNumElements().

Differential Revision: https://reviews.llvm.org/D82957

00f59216

[Support][Windows] Prevent 2s delay when renaming a file that does not exist · a27478e5
Ben Dunbobbin authored Jul 01, 2020
```
Differential Revision: https://reviews.llvm.org/D82542
```
a27478e5
DSE: fix builtin function recognition to take decl into account · 7f903873
Nuno Lopes authored Jul 02, 2020

7f903873
[AMDGPU] Fix formatting in MIR tests · 6f169475
Jay Foad authored Jul 02, 2020

6f169475

[CodeGen][SVE] Don't drop scalable flag in DAGCombiner::visitEXTRACT_SUBVECTOR · 143e324e

Sander de Smalen authored Jul 02, 2020

There was a rogue 'assert' in AArch64ISelLowering for the tuple.get intrinsics,
that shouldn't really have been there (I suspect this was a remnant from when
we expected the wider vector always to have come from a vector CONCAT).

When I tried to create a more minimal reproducer, I found a bug in
DAGCombiner where it drops the scalable flag when trying to fold:

      extract_subv (bitcast X), Index --> bitcast (extract_subv X, Index')

This patch fixes both issues.

Reviewers: david-arm, efriedma, spatel

Reviewed By: efriedma

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82910

143e324e

[AArch64][SVE] Add unpred load/store patterns for bf16 types · 07bda98b

Sander de Smalen authored Jul 02, 2020

Reviewers: kmclaughlin, c-rhodes, efriedma

Reviewed By: efriedma

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82909

07bda98b

[NFC] Fix typo in triples from unkown to unknown · aa4fd7d8
Qiu Chaofan authored Jul 02, 2020

aa4fd7d8

[ARM] Rearrange SizeReduction when using -Oz · dc8e4d85

Nicholas Guy authored Jun 18, 2020

Move the Thumb2SizeReduce pass to before IfConversion when optimising
for minimal code size.

Running the Thumb2SizeReduction pass before IfConversionallows T1
instructions to propagate to the final output, rather than the
ifConverter modifying T2 instructions and preventing them from being
reduced later.

This change does introduce a regression regarding execution time, so
it's only applied when optimising for size.

Running the LLVM Test Suite with this change produces a geomean
difference of -0.1% for the size..text metric.

Differential Revision: https://reviews.llvm.org/D82439

dc8e4d85

[CodeGen] Fix warnings in getCopyToPartsVector · c7df35d2

David Sherwood authored Jun 29, 2020

Whilst trying to assemble the following test:

  clang/test/CodeGen/aarch64-sve-intrinsics/acle_sve_set2.c

I discovered we were hitting some warnings about possible invalid
calls to getVectorNumElements() in getCopyToPartsVector(). I've
tried to fix these by using ElementCount types where possible and
I've made the assumption that we don't support using a fixed width
vector to copy parts of a scalable vector, and vice versa. Looking
at how the copy is implemented I think that's the right thing for
now.

Differential Revision: https://reviews.llvm.org/D82744

c7df35d2

[X86] Enable multibyte NOPs in 64-bit mode for padding/alignment. · 0aad8294

Craig Topper authored Jul 01, 2020

The default CPU used by llvm-mc doesn't have the NOPL feature, but
if we know we're compiling in 64-bit mode we should be able to
use nopl.

0aad8294

This patch adds basic debug info support with basic block sections. · e4b3c138

Krzysztof Pszeniczny authored Jul 01, 2020

This patch uses ranges for debug information when a function contains basic block sections rather than using [lowpc, highpc]. This is also the first in a series of patches for debug info and does not contain the support for linker relaxation. That will be done as a follow up patch.

Differential Revision: https://reviews.llvm.org/D78851

e4b3c138

[AMDGPU] Control num waves per EU for implicit work-group size · e1a31f52

Pushpinder Singh authored Jun 17, 2020

Summary:
If amdgpu-flat-work-group-size is not specified in LLVM IR, the backend
uses default value of 1024. For this, minimum waves per EU should be 4.
However, backend is still setting minimum value to 1 instead of calculated
value. This is not observed normally as frontend always provide
amdgpu-flat-work-group-size attribute.

Reviewers: rampitec, b-sumner, sameerds, msearles

Reviewed By: rampitec

Subscribers: qcolombet, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81991

e1a31f52

[PowerPC]Implement Vector Shift Double Bit Immediate Builtins · 88874f07

Biplob Mishra authored Jul 01, 2020

Implement Vector Shift Double Bit Immediate Builtins in LLVM/Clang.
  * vec_sldb ();
  * vec_srdb ();

Differential Revision: https://reviews.llvm.org/D82440

88874f07

[flang][openmp] Use common Directive and Clause enum from llvm/Frontend · 2ddba308

Valentin Clement authored Jul 01, 2020

Summary:
This patch is removing the custom enumeration for OpenMP Directives and Clauses and replace them
with the newly tablegen generated one from llvm/Frontend. This is a first patch and some will follow to share the same
infrastructure where possible. The next patch should use the clauses allowance defined in the tablegen file.

Reviewers: jdoerfert, DavidTruby, sscalpone, kiranchandramohan, ichoyjx

Reviewed By: DavidTruby, ichoyjx

Subscribers: jholewinski, cfe-commits, dblaikie, MaskRay, ymandel, ichoyjx, mgorny, yaxunl, guansong, jfb, sstefan1, aaron.ballman, llvm-commits

Tags: #llvm, #flang, #clang

Differential Revision: https://reviews.llvm.org/D82906

2ddba308

[X86-64] Support Intel AMX instructions · aded4f0c

Xiang1 Zhang authored Jul 02, 2020

Summary:
INTEL ADVANCED MATRIX EXTENSIONS (AMX).
AMX is a new programming paradigm, it has a set of 2-dimensional registers
(TILES) representing sub-arrays from a larger 2-dimensional memory image and
operate on TILES.

Spec can be found in Chapter 3 here https://software.intel.com/content/www/us/en/develop/download/intel-architecture-instruction-set-extensions-programming-reference.html

Reviewers: LuoYuanke, annita.zhang, pengfei, RKSimon, xiangzhangllvm

Reviewed By: xiangzhangllvm

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82705

aded4f0c

[PowerPC][NFC] Update doc for FeatureISA3_1/FeatureISA3_0 definitions · 99c4207d
Lei Huang authored Jul 01, 2020

99c4207d

[PowerPC] Exploit xxspltiw and xxspltidp instructions · c5b4f03b

Anil Mahmud authored Jul 01, 2020

Exploits the VSX Vector Splat Immediate Word and
VSX Vector Splat Immediate Double Precision instructions:

  xxspltiw XT,IMM32
  xxspltidp XT,IMM32

Differential Revision: https://reviews.llvm.org/D82911

c5b4f03b

[NFCI] Actually provide correct check lines in sdiv.ll · e7da7d94
Roman Lebedev authored Jul 02, 2020

e7da7d94

AMDGPU: Set more mov flags on V_ACCVGPR_{READ|WRITE}_B32 · d2e74fad

Matt Arsenault authored Jul 01, 2020

This fixes extra copies when materializing constants in AGPRs. This
made it a lot harder to trigger the spilling in spill-agpr.ll

d2e74fad

RegAllocGreedy: Use TargetInstrInfo already in the class · afb3bd99
Matt Arsenault authored Jul 01, 2020

afb3bd99

AMDGPU: Fix missing tracksRegLiveness in tests · a230f1db

Matt Arsenault authored Jul 01, 2020

I have no idea why this is considered optional, or why it's not the
default. Also add uses of the copied registers for more useful
liveness testing.

a230f1db

[AMDGPU] Limit promote alloca to vector with VGPR budget · 54e2dc75

Stanislav Mekhanoshin authored Jul 01, 2020

Allow only up to 1/4 of available VGPRs for the vectorization
of any given alloca.

Differential Revision: https://reviews.llvm.org/D82990

54e2dc75

Revert "[flang][openmp] Use common Directive and Clause enum from llvm/Frontend" · 5c37b2a5
clementval authored Jul 01, 2020
```
This reverts commit 7f1e7767.
```
5c37b2a5
[NFC][ScalarEvolution] Add udiv-disguised-as-sdiv test · 51ff7642
Roman Lebedev authored Jun 29, 2020
```
Much like 25521150,
but with division instead of remainder.

See https://reviews.llvm.org/D82721
```
51ff7642
Revert "[X86] Enable multibyte NOPs in 64-bit mode for padding/alignment." · c4207621
Craig Topper authored Jul 01, 2020
```
Looks like lld tests need updates too

This reverts commit 3367e9da.
```
c4207621

Jul 01, 2020

[RISCV][NFC] Pre-commit tests for D82660 · 003a086f
Ben Shi authored Jul 01, 2020

003a086f
[InstSimplify] Move assume icmp test (NFC) · a59dc55c
Nikita Popov authored Jul 01, 2020
```
Move this test from InstCombine into InstSimplify.
```
a59dc55c

[CallGraph] Add support for callback call sites · cb8faaac

Sergey Dmitriev authored Jun 29, 2020

Summary:
This patch changes call graph analysis to recognize callback call sites
and add an artificial 'reference' call record from the broker function
caller to the callback function in the call graph. A presence of such
reference enforces bottom-up traversal order for callback functions in
CG SCC pass manager because callback function logically becomes a callee
of the broker function caller.

Reviewers: jdoerfert, hfinkel, sstefan1, baziotis

Reviewed By: jdoerfert

Subscribers: hiraditya, kuter, sstefan1, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D82572

cb8faaac

[AMDGPU] Update DWARF proposal · 31fdcf64
Tony authored Jul 01, 2020
```
- Add reference to implicit conversion description.
```
31fdcf64

[flang][openmp] Use common Directive and Clause enum from llvm/Frontend · 7f1e7767

Valentin Clement authored Jul 01, 2020

Reviewers: jdoerfert, DavidTruby, sscalpone, kiranchandramohan, ichoyjx

Reviewed By: DavidTruby, ichoyjx

Subscribers: ichoyjx, mgorny, yaxunl, guansong, jfb, sstefan1, aaron.ballman, llvm-commits

Tags: #llvm, #flang

Differential Revision: https://reviews.llvm.org/D82906

7f1e7767

[X86] Speculatively apply the same fix from... · 51e92b22

Craig Topper authored Jul 01, 2020

[X86] Speculatively apply the same fix from 361853c9 to PromoteIntOp_MGATHER.

The UpdateNodeOperands here is also subject to CSE.

51e92b22