Commits · 54dbcfe5f017265ae829f054f28745b2680a0fa2 · Roger Ferrer / llvm-epi

Apr 29, 2019

Fix additional cases of more that two dashes for options in tests. · 54dbcfe5
Don Hinton authored Apr 29, 2019
```
llvm-svn: 359484
```
54dbcfe5

[CommandLine] Don't allow unlimitted dashes for options. Part 1 or 5 · 89e583b8

Don Hinton authored Apr 29, 2019

Summary:
Prior to this patch, the CommandLine parser would strip an
unlimitted number of dashes from options.  This patch limits it to
two.

Reviewers: rnk

Reviewed By: rnk

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61229

llvm-svn: 359480

89e583b8

[llvm-mca][x86] Fix MMX PMOVMSKB test · 9d99372f

Simon Pilgrim authored Apr 29, 2019

This is defined as part of SSE1, XMM PMOVMSKB doesn't appear until SSE2

llvm-svn: 359477

9d99372f

[DAG] Refactor DAGCombiner::ReassociateOps · 82099457

Bjorn Pettersson authored Apr 29, 2019

Summary:
Extract the logic for doing reassociations
from DAGCombiner::reassociateOps into a helper
function DAGCombiner::reassociateOpsCommutative,
and use that helper to trigger reassociation
on the original operand order, or the commuted
operand order.

Codegen is not identical since the operand order will
be different when doing the reassociations for the
commuted case. That causes some unfortunate churn in
some test cases. Apart from that this should be NFC.

Reviewers: spatel, craig.topper, tstellar

Reviewed By: spatel

Subscribers: dmgreen, dschuff, jvesely, nhaehnle, javed.absar, sbc100, jgravelle-google, hiraditya, aheejin, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61199

llvm-svn: 359476

82099457

FileCheck [3/12]: Stricter parsing of @LINE expressions · 15cb1f15

Thomas Preud'homme authored Apr 29, 2019

Summary:
This patch is part of a patch series to add support for FileCheck
numeric expressions. This specific patch gives earlier and better
diagnostics for the @LINE expressions.

Rather than detect parsing errors at matching time, this commit adds
enhance parsing to detect issues with @LINE expressions at parse time
and diagnose them more accurately.

Copyright:
    - Linaro (changes up to diff 183612 of revision D55940)
    - GraphCore (changes in later versions of revision D55940 and
                 in new revision created off D55940)

Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk

Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60383

llvm-svn: 359475

15cb1f15

[llvm-extract] Expose the group extraction feature of the BlockExtractor · 2d977935

Quentin Colombet authored Apr 29, 2019

This patch extends the `-bb` option to be able to use the group
extraction feature from the BlockExtractor.
In particular, `-bb=func:bb` is modified to support a list of basic
blocks per function: `-bb=func:bb1[;bb2...]` that will be extracted
together if at all possible (region must be single entry.)

Differential Revision: https://reviews.llvm.org/D60973

llvm-svn: 359464

2d977935

[BlockExtractor] Change the basic block separator from ',' to ';' · ae2cbb34

Quentin Colombet authored Apr 29, 2019

This change aims at making the file format be compatible with the
way LLVM handles command line options.

Differential Revision: https://reviews.llvm.org/D60970

llvm-svn: 359462

ae2cbb34

Add AVX support to this test. · a25c9283
Kevin P. Neal authored Apr 29, 2019
```
Requested by Craig Topper and Andrew Kaylor as part of D55897.

llvm-svn: 359461
```
a25c9283

[AArch64][SVE] Asm: add aliases for unpredicated bitwise logical instructions · 2c0d5043

Cullen Rhodes authored Apr 29, 2019

This patch adds aliases for element sizes .B/.H/.S to the
AND/ORR/EOR/BIC bitwise logical instructions. The assembler now accepts
these instructions with all element sizes up to 64-bit (.D). The
preferred disassembly is .D.

llvm-svn: 359457

2c0d5043

[X86][SSE] Add scalar horizontal add/sub tests for non-0/1 element extractions · 9d4ed24f
Simon Pilgrim authored Apr 29, 2019
```
llvm-svn: 359454
```
9d4ed24f

FileCheck [2/12]: Stricter parsing of -D option · 5a330470

Thomas Preud'homme authored Apr 29, 2019

Summary:
This patch is part of a patch series to add support for FileCheck
numeric expressions. This specific patch gives earlier and better
diagnostics for the -D option.

Prior to this change, parsing of -D option was very loose: it assumed
that there is an equal sign (which to be fair is now checked by the
FileCheck executable) and that the part on the left of the equal sign
was a valid variable name. This commit adds logic to ensure that this
is the case and gives diagnostic when it is not, making it clear that
the issue came from a command-line option error. This is achieved by
sharing the variable parsing code into a new function ParseVariable.

Copyright:
    - Linaro (changes up to diff 183612 of revision D55940)
    - GraphCore (changes in later versions of revision D55940 and
                 in new revision created off D55940)

Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk

Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60382

llvm-svn: 359447

5a330470

[X86][SSE] Moved haddps test from phaddsub.ll to haddsub.ll (D61245) · c570b2a2
Simon Pilgrim authored Apr 29, 2019
```
Also merged duplicate PR39921 + PR39936 tests

llvm-svn: 359437
```
c570b2a2
[InstCombine][X86] Add PACKSS tests for truncation of sign-extended comparisons · 46128cdf
Simon Pilgrim authored Apr 29, 2019
```
llvm-svn: 359435
```
46128cdf

[ARM] Add bitcast/extract_subvec. of fp16 vectors · d95abb17

Diogo N. Sampaio authored Apr 29, 2019

Summary:
This patch adds some basic operations for fp16
vectors, such as bitcast from fp16 to i16,
required to perform extract_subvector (also added
here) and extract_element.

Reviewers: SjoerdMeijer, DavidSpickett, t.p.northover, ostannard

Reviewed By: ostannard

Subscribers: javed.absar, kristof.beyls, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60618

llvm-svn: 359433

d95abb17

[ARM] Add v4f16 and v8f16 types to the CallingConv · 2078eb74

Diogo N. Sampaio authored Apr 29, 2019

Summary:
The Procedure Call Standard for the Arm Architecture
states that float16x4_t and float16x8_t behave just
as uint16x4_t and uint16x8_t for argument passing.
This patch adds the fp16 vectors to the
ARMCallingConv.td file.

Reviewers: miyuki, ostannard

Reviewed By: ostannard

Subscribers: ostannard, javed.absar, kristof.beyls, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60720

llvm-svn: 359431

2078eb74

[DebugInfo] Terminate more location-list ranges at the end of blocks · 055aee1d

Jeremy Morse authored Apr 29, 2019

This patch fixes PR40795, where constant-valued variable locations can
"leak" into blocks placed at higher addresses. The root of this is that
DbgEntityHistoryCalculator terminates all register variable locations at
the end of each block, but not constant-value variable locations.

Fixing this requires constant-valued DBG_VALUE instructions to be
broadcast into all blocks where the variable location remains valid, as
documented in the LiveDebugValues section of SourceLevelDebugging.rst,
and correct termination in DbgEntityHistoryCalculator.

Differential Revision: https://reviews.llvm.org/D59431

llvm-svn: 359426

055aee1d

[DWARF] Fix dump of local/foreign TU lists in .debug_names · 97b8cd54
Fangrui Song authored Apr 29, 2019
```
Differential Revision: https://reviews.llvm.org/D61241

llvm-svn: 359425
```
97b8cd54

[X86] Remove some intel syntax aliases on (v)cvtpd2(u)dq, (v)cvtpd2ps,... · 9202d5f8

Craig Topper authored Apr 29, 2019

[X86] Remove some intel syntax aliases on (v)cvtpd2(u)dq, (v)cvtpd2ps, (v)cvt(u)qq2ps. Add 'x' and'y' suffix aliases to masked version of the same in att syntax.

The 128/256 bit version of these instructions require an 'x' or 'y' suffix to
disambiguate the memory form in att syntax.

We were allowing the same suffix in intel syntax, but it appears gas does not
do that.

gas does allow the 'x' and 'y' suffix on register and broadcast forms even
though its not needed. We were allowing it on unmasked register form, but not on
masked versions or on masked or unmasked broadcast form.

While there fix some test coverage holes so they can be extended with the 'x'
and 'y' suffix tests.

llvm-svn: 359418

9202d5f8

[llvm-nm] -print-size => --print-size · 2d5e7de5
Fangrui Song authored Apr 29, 2019
```
llvm-svn: 359417
```
2d5e7de5

Apr 28, 2019

[X86] Add PR39921 HADD pairwise reduction test and AVX2 test coverage · 65f12f66
Simon Pilgrim authored Apr 28, 2019
```
llvm-svn: 359409
```
65f12f66
[X86][AVX] Add fast-hops target for add/fadd reduction tests · 85bacd0f
Simon Pilgrim authored Apr 28, 2019
```
llvm-svn: 359408
```
85bacd0f
[X86] Add PR39936 HADD Tests · e375257e
Simon Pilgrim authored Apr 28, 2019
```
llvm-svn: 359407
```
e375257e
[X86][AVX] Enabled AVX512F tests and add PR40815 test case · d3941952
Simon Pilgrim authored Apr 28, 2019
```
llvm-svn: 359401
```
d3941952

[X86][AVX] Combine non-lane crossing binary shuffles using X86ISD::VPERMV3 · 22d1476b

Simon Pilgrim authored Apr 28, 2019

Some of the combines might be further improved if we lower more shuffles with X86ISD::VPERMV3 directly, instead of waiting to combine the results.

llvm-svn: 359400

22d1476b

[SelectionDAG] include FP min/max variants as binary operators · ce8cfe96

Sanjay Patel authored Apr 28, 2019

The x86 test diffs don't look great because of extra move ops,
but FP min/max should clearly be included in the list.

llvm-svn: 359399

ce8cfe96

[DAGCombiner] try repeated fdiv divisor transform before building estimate · fb9a5307

Sanjay Patel authored Apr 28, 2019

This was originally part of D61028, but it's an independent diff.

If we try the repeated divisor reciprocal transform before producing an estimate sequence,
then we have an opportunity to use scalar fdiv. On x86, the trade-off is 1 divss vs. 5
vector FP ops in the default estimate sequence. On recent chips (Skylake, Ryzen), the
full-precision division is only 3 cycle throughput, so that's probably the better perf
default option and avoids problems from x86's inaccurate estimates.

The last 2 tests show that users still have the option to override the defaults by using
the function attributes for reciprocal estimates, but those patterns are potentially made
faster by converting the vector ops (including ymm ops) to scalar math.

Differential Revision: https://reviews.llvm.org/D61149

llvm-svn: 359398

fb9a5307

[MCA] Fix typo in AVX2 gather tests. NFC · 43003f0f
Andrea Di Biagio authored Apr 28, 2019
```
llvm-svn: 359397
```
43003f0f

[X86][SSE] Optimize llvm.experimental.vector.reduce.xor.vXi1 parity reduction (PR38840) · 93ad4821

Simon Pilgrim authored Apr 28, 2019

An xor reduction of a bool vector can be optimized to a parity check of the MOVMSK/BITCAST'd integer - if the population count is odd return 1, else return 0.

Differential Revision: https://reviews.llvm.org/D61230

llvm-svn: 359396

93ad4821

[X86][AVX] Add AVX512DQ coverage for masked memory ops tests (PR34584) · fed302ae
Simon Pilgrim authored Apr 28, 2019
```
llvm-svn: 359395
```
fed302ae

[X86] Remove (V)MOV64toSDrr/m and (V)MOVDI2SSrr/m. Use 128-bit result... · bd35a309

Craig Topper authored Apr 28, 2019

[X86] Remove (V)MOV64toSDrr/m and (V)MOVDI2SSrr/m. Use 128-bit result MOVD/MOVQ and COPY_TO_REGCLASS instead

Summary:
The register form of these instructions are CodeGenOnly instructions that cover
GR32->FR32 and GR64->FR64 bitcasts. There is a similar set of instructions for
the opposite bitcast. Due to the patterns using bitcasts these instructions get
marked as "bitcast" machine instructions as well. The peephole pass is able to
look through these as well as other copies to try to avoid register bank copies.

Because FR32/FR64/VR128 are all coalescable to each other we can end up in a
situation where a GR32->FR32->VR128->FR64->GR64 sequence can be reduced to
GR32->GR64 which the copyPhysReg code can't handle.

To prevent this, this patch removes one set of the 'bitcast' instructions. So
now we can only go GR32->VR128->FR32 or GR64->VR128->FR64. The instruction that
converts from GR32/GR64->VR128 has no special significance to the peephole pass
and won't be looked through.

I guess the other option would be to add support to copyPhysReg to just promote
the GR32->GR64 to a GR64->GR64 copy. The upper bits were basically undefined
anyway. But removing the CodeGenOnly instruction in favor of one that won't be
optimized seemed safer.

I deleted the peephole test because it couldn't be made to work with the bitcast
instructions removed.

The load version of the instructions were unnecessary as the pattern that selects
them contains a bitcasted load which should never happen.

Fixes PR41619.

Reviewers: RKSimon, spatel

Reviewed By: RKSimon

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61223

llvm-svn: 359392

bd35a309

Apr 27, 2019
- Revert rL359389: [X86][SSE] Add support for <64 x i1> bool reduction · 03c4e266
  Simon Pilgrim authored Apr 27, 2019
```
Minor generalization of the existing <32 x i1> pre-AVX2 split code.
........
Causing irregular buildbot failures.

llvm-svn: 359391
```
  03c4e266
- [X86][AVX] Add additional SSE/AVX expandload and compressstore targets · 1a4a4325
  Simon Pilgrim authored Apr 27, 2019
```
llvm-svn: 359390
```
  1a4a4325
- [X86][SSE] Add support for <64 x i1> bool reduction · 4118be3a
  Simon Pilgrim authored Apr 27, 2019
```
Minor generalization of the existing <32 x i1> pre-AVX2 split code.

llvm-svn: 359389
```
  4118be3a
- [X86][AVX] Cleanup and add additional expandload and compressstore tests · 399746ea
  Simon Pilgrim authored Apr 27, 2019
```
sort order by types and add vXi32/vXi16/vXi8 test coverage

llvm-svn: 359388
```
  399746ea
- [X86][AVX512] Improve vector bool reductions · 2a2d4224
  Simon Pilgrim authored Apr 27, 2019
```
As predicate masks are legal on AVX512 targets, we avoid MOVMSK in these cases, but we can just bitcast the bool vector to the integer equivalent directly - avoiding expansion of the reduction to a shuffle pattern.

llvm-svn: 359386
```
  2a2d4224
- [X86] Add vector boolean reduction tests (PR38840) · 913bfd33
  Simon Pilgrim authored Apr 27, 2019
```
AND/OR/XOR tests for the @llvm.experimental.vector.reduce intrinsics

AND/OR are pretty good (pre-AVX512), XOR (not so common but used for parity reduction) is still pretty bad.

llvm-svn: 359385
```
  913bfd33
- [llvm-nm][llvm-readelf] Avoid single-dash -long-option in tests · 763a2e1f
  Fangrui Song authored Apr 27, 2019
```
llvm-svn: 359383
```
  763a2e1f
- Fix check-prefixes typo · 5cf61653
  Simon Pilgrim authored Apr 27, 2019
```
llvm-svn: 359382
```
  5cf61653
- [X86][SSE] Add initial test case for subvector insert/extract of illegal types · 3879b2cd
  Simon Pilgrim authored Apr 27, 2019
```
Suggested by @nikic on D59188

llvm-svn: 359379
```
  3879b2cd
- [X86][AVX] Merge mask select with shuffles across extract_subvector (PR40332) · acc1e6d1
  Simon Pilgrim authored Apr 27, 2019
```
Fixes PR40332 in the limited case where we're selecting between a target shuffle and a zero vector.

We can extend this in the future to handle more opcodes and non-zero selections.

llvm-svn: 359378
```
  acc1e6d1