Commits · 7e64c1ee9123071d64cd019e3344ca7781fd8b3b · Lorenzo Albano / LLVM bpEVL

Jul 11, 2018

Recommit r336653: [VPlan] Add VPlanTestBase.h with helper · 7e64c1ee
Florian Hahn authored Jul 11, 2018
```
The original version caused a memsan failure.

llvm-svn: 336792
```
7e64c1ee

[AArch64][SVE] Asm: Support for COMPACT instruction. · ea45a89e

Sander de Smalen authored Jul 11, 2018

The compact instruction shuffles active elements of vector
into lowest numbered elements and sets remaining elements
to zero. 

e.g.
  compact z0.s, p0, z1.s

llvm-svn: 336789

ea45a89e

Fix check-prefix vs check-prefixes typo in updated test · f6ff75c4
Simon Pilgrim authored Jul 11, 2018
```
llvm-svn: 336787
```
f6ff75c4
[AArch64] Regenerate SDIV tests · 1975efe5
Simon Pilgrim authored Jul 11, 2018
```
Will make codegen diffs much easier to grok in a future patch

llvm-svn: 336786
```
1975efe5

[NFC][InstCombine] icmp-logical.ll: add a few more tests. · a042fae6

Roman Lebedev authored Jul 11, 2018

The @masked_and_notA_slightly_optimized and @masked_or_A
will break when PR38123 will be fixed:
https://rise4fun.com/Alive/Rny
Clearly, they aren't optimized currently.

https://rise4fun.com/Alive/ERo

llvm-svn: 336784

a042fae6

[AArch64][SVE] Asm: Support for LAST(A|B) and CLAST(A|B) instructions. · a90530f7

Sander de Smalen authored Jul 11, 2018

The LASTB and LASTA instructions extract the last active element,
or element after the last active, from the source vector.

The added variants are:

  Scalar:
  last(a|b)  w0, p0, z0.b
  last(a|b)  w0, p0, z0.h
  last(a|b)  w0, p0, z0.s
  last(a|b)  x0, p0, z0.d

  SIMD & FP Scalar:
  last(a|b)  b0, p0, z0.b
  last(a|b)  h0, p0, z0.h
  last(a|b)  s0, p0, z0.s
  last(a|b)  d0, p0, z0.d

The CLASTB and CLASTA conditionally extract the last or element after
the last active element from the source vector.

The added variants are:

  Scalar:
  clast(a|b)  w0, p0, w0, z0.b
  clast(a|b)  w0, p0, w0, z0.h
  clast(a|b)  w0, p0, w0, z0.s
  clast(a|b)  x0, p0, x0, z0.d

  SIMD & FP Scalar:
  clast(a|b)  b0, p0, b0, z0.b
  clast(a|b)  h0, p0, h0, z0.h
  clast(a|b)  s0, p0, s0, z0.s
  clast(a|b)  d0, p0, d0, z0.d

  Vector:
  clast(a|b)  z0.b, p0, z0.b, z1.b
  clast(a|b)  z0.h, p0, z0.h, z1.h
  clast(a|b)  z0.s, p0, z0.s, z1.s
  clast(a|b)  z0.d, p0, z0.d, z1.d

Please refer to the architecture specification for more details on
the semantics of the added instructions.

llvm-svn: 336783

a90530f7

[llvm-readobj] Add -hex-dump (-x) option · b98f5048
Paul Semel authored Jul 11, 2018
```
Differential Revision: https://reviews.llvm.org/D48281

llvm-svn: 336782
```
b98f5048
[NFC][InstCombine] Fix extra space padding in icmp-mul-zext.ll test · 5260c9ef
Roman Lebedev authored Jul 11, 2018
```
update_test_checks will drop it anyway, creating noise..

llvm-svn: 336781
```
5260c9ef
[NFC][InstCombine] Add variable names and regenerate icmp-logical.ll test. · c5e437e5
Roman Lebedev authored Jul 11, 2018
```
llvm-svn: 336780
```
c5e437e5

[SelectionDAG] Add constant buildvector support to isKnownNeverZero · 075b04a5

Simon Pilgrim authored Jul 11, 2018

This allows us to use SelectionDAG::isKnownNeverZero in DAGCombiner::visitREM (visitSDIVLike/visitUDIVLike handle the checking for constants).

llvm-svn: 336779

075b04a5

[llvm-mca] Add tests for partial register writes. · 2b3a4f9c

Andrea Di Biagio authored Jul 11, 2018

llvm-mca doesn't know that on modern AMD processors, portions of a general
purpose register are not treated independently. So, a partial register write has
a false dependency on the super-register.

The issue with partial register writes will be addressed by a follow-up patch.

llvm-svn: 336778

2b3a4f9c

[mips] Remove dead code. NFC · 6cb1c6b3
Simon Atanasyan authored Jul 11, 2018
```
llvm-svn: 336777
```
6cb1c6b3

[DAGCombiner] Support non-uniform X%C -> X-(X/C)*C folds · df9d5977

Simon Pilgrim authored Jul 11, 2018

First stage in PR38057 - support non-uniform constant vectors in the combine to reuse the division-by-constant logic.

We can definitely do better for srem pow2 remainders (and avoid that extra multiply....) but this at least helps keep everything on the vector unit.

Differential Revision: https://reviews.llvm.org/D48975

llvm-svn: 336774

df9d5977

[DAGCombiner] Add (urem X, -1) -> select(X == -1, 0, x) fold · 97cf1116
Simon Pilgrim authored Jul 11, 2018
```
llvm-svn: 336773
```
97cf1116

[TableGen] Add missing std::moves to fix build failure. · 09f25657

Simon Tatham authored Jul 11, 2018

gcc 4.7 seems to disagree with gcc 5.3 about whether you need to say
'return std::move(thing)' instead of just 'return thing'. All the
json::Arrays and json::Objects that I was implicitly turning into
json::Values by returning them from functions now have explicit
std::move wrappers, so hopefully 4.7 will be happy now.

llvm-svn: 336772

09f25657

[TableGen] Add a general-purpose JSON backend. · 6a8c6cad

Simon Tatham authored Jul 11, 2018

The aim of this backend is to output everything TableGen knows about
the record set, similarly to the default -print-records backend. But
where -print-records produces output in TableGen's input syntax
(convenient for humans to read), this backend produces it as
structured JSON data, which is convenient for loading into standard
scripting languages such as Python, in order to extract information
from the data set in an automated way.

The output data contains a JSON representation of the variable
definitions in output 'def' records, and a few pieces of metadata such
as which of those definitions are tagged with the 'field' prefix and
which defs are derived from which classes. It doesn't dump out
absolutely every piece of knowledge it _could_ produce, such as type
information and complicated arithmetic operator nodes in abstract
superclasses; the main aim is to allow consumers of this JSON dump to
essentially act as new backends, and backends don't generally need to
depend on that kind of data.

The new backend is implemented as an EmitJSON() function similar to
all of llvm-tblgen's other EmitFoo functions, except that it lives in
lib/TableGen instead of utils/TableGen on the basis that I'm expecting
to add it to clang-tblgen too in a future patch.

To test it, I've written a Python script that loads the JSON output
and tests properties of it based on comments in the .td source - more
or less like FileCheck, except that the CHECK: lines have Python
expressions after them instead of textual pattern matches.

Reviewers: nhaehnle

Reviewed By: nhaehnle

Subscribers: arichardson, labath, mgorny, llvm-commits

Differential Revision: https://reviews.llvm.org/D46054

llvm-svn: 336771

6a8c6cad

[WebAssembly] Only call llvm::value::dump() in debug build. · 867b0e41

Eric Liu authored Jul 11, 2018

This fixes compile error in r336759. llvm::value::dump is not available
in released build.

llvm-svn: 336770

867b0e41

[X86] The TEST instruction is eliminated when BSF/TZCNT is used · 02867f0f

Craig Topper authored Jul 11, 2018

Summary:
These changes cover the PR#31399.
Now the ffs(x) function is lowered to (x != 0) ? llvm.cttz(x) + 1 : 0
and it corresponds to the following llvm code:
  %cnt = tail call i32 @llvm.cttz.i32(i32 %v, i1 true)
  %tobool = icmp eq i32 %v, 0
  %.op = add nuw nsw i32 %cnt, 1
  %add = select i1 %tobool, i32 0, i32 %.op
and x86 asm code:
  bsfl     %edi, %ecx
  addl     $1, %ecx
  testl    %edi, %edi
  movl     $0, %eax
  cmovnel  %ecx, %eax
In this case the 'test' instruction can't be eliminated because
the 'add' instruction modifies the EFLAGS, namely, ZF flag
that is set by the 'bsf' instruction when 'x' is zero.

We now produce the following code:
  bsfl     %edi, %ecx
  movl     $-1, %eax
  cmovnel  %ecx, %eax
  addl     $1, %eax

Patch by Ivan Kulagin

Reviewers: davide, craig.topper, spatel, RKSimon

Reviewed By: craig.topper

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D48765

llvm-svn: 336768

02867f0f

Revert r336760: "[ORC] Add unit tests for the reexports utility that were..." · 709f773a
Lang Hames authored Jul 11, 2018
```
This patch broke a few buildbots. I will investigate and re-apply when I have
a fix.

llvm-svn: 336767
```
709f773a

[X86] Remove some composite MOVSS/MOVSD isel patterns. · 1d6a80cd

Craig Topper authored Jul 11, 2018

These patterns looked for a MOVSS/SD followed by a scalar_to_vector. Or a scalar_to_vector followed by a load.

In both cases we emitted a MOVSS/SD for the MOVSS/SD part, a REG_CLASS for the scalar_to_vector, and a MOVSS/SD for the load.

But we have patterns that do each of those 3 things individually so there's no reason to build large patterns.

Most of the test changes are just reorderings. The one test that had a meaningful change is pr30430.ll and it appears to be a regression. But its doing -O0 so I think it missed a lot of opportunities and was just getting lucky before.

llvm-svn: 336762

1d6a80cd

[ORC] Remove a shadowing definition. · a53aa290

Lang Hames authored Jul 11, 2018

There is already a VSO member V in the CoreAPIsStandardTest test fixture.

llvm-svn: 336761

a53aa290

[ORC] Add unit tests for the reexports utility that were left out of r336741, · fdf1a855
Lang Hames authored Jul 11, 2018
```
and fix a bug that these exposed.

llvm-svn: 336760
```
fdf1a855

[WebAssembly] Add pass to infer prototypes for prototype-less functions · 92617559

Sam Clegg authored Jul 11, 2018

See https://bugs.llvm.org/show_bug.cgi?id=35385

Differential Revision: https://reviews.llvm.org/D48471

llvm-svn: 336759

92617559

[ORC] Drop constexpr in unit test to appease a bot. · fcd1b66a
Lang Hames authored Jul 11, 2018
```
llvm-svn: 336758
```
fcd1b66a
[ORC] Use a gtest fixture to remove a bunch of boilerplate in CoreAPIsTest.cpp. · 58ba7812
Lang Hames authored Jul 11, 2018
```
llvm-svn: 336757
```
58ba7812

[Power9] Add remaining __flaot128 builtin support for FMA round to odd · b9d01aa2

Stefan Pintilie authored Jul 11, 2018

Implement this as it is done on GCC:

__float128 a, b, c, d;
a = __builtin_fmaf128_round_to_odd (b, c, d);         // generates xsmaddqpo
a = __builtin_fmaf128_round_to_odd (b, c, -d);        // generates xsmsubqpo
a = - __builtin_fmaf128_round_to_odd (b, c, d);       // generates xsnmaddqpo
a = - __builtin_fmaf128_round_to_odd (b, c, -d);      // generates xsnmsubpqp

Differential Revision: https://reviews.llvm.org/D48218

llvm-svn: 336754

b9d01aa2

· fb361d25

Chen Zheng authored Jul 11, 2018

  [test cases] add test cases for find more abs pattern

  Differential Revision: https://reviews.llvm.org/D49123

llvm-svn: 336752

fb361d25

[TableGen] Fix some bad formatting. NFC · 6d775a27
Craig Topper authored Jul 11, 2018
```
llvm-svn: 336751
```
6d775a27

[LangRef] Clarify alloca of zero bytes. · 18f882c8

Eli Friedman authored Jul 11, 2018

Let's be conservative here; it matches what we actually implemented, and
it should be rare in practice anyway.

Differential Revision: https://reviews.llvm.org/D49042

llvm-svn: 336744

18f882c8

[ARM] Treat cmn immediates as legal in isLegalICmpImmediate. · d2c73923

Eli Friedman authored Jul 10, 2018

The original code attempted to do this, but the std::abs() call didn't
actually do anything due to implicit type conversions.  Fix the type
conversions, and perform the correct check for negative immediates.

This probably has very little practical impact, but it's worth fixing
just to avoid confusion in the future, I think.

Differential Revision: https://reviews.llvm.org/D48907

llvm-svn: 336742

d2c73923

[ORC] Generalize alias materialization to support re-exports (i.e. aliasing of · a3c473e6

Lang Hames authored Jul 10, 2018

symbols in another VSO).

Also fixes a bug where chained aliases within a single VSO would deadlock on
materialization.

llvm-svn: 336741

a3c473e6

Sort includes + include a missing `extern "C"` header · 34828716

George Burgess IV authored Jul 10, 2018

If we don't include Initialization.h,
`LLVMInitializeAggressiveInstCombiner` won't see its `extern "C"` decl.
This causes sadness, name mangling, and linker errors.

Reported on the mailing lists by Vladimir Vissoultchev. Thanks!

llvm-svn: 336736

34828716

[X86] Remove AddedComplexity from all patterns that use X86vzmovl as their root. · 27c77fe4

Craig Topper authored Jul 10, 2018

Some added 20 and some added 15. Its unclear when to use which value and whether they are required at all.

This patch removes them all. If we start finding real world issues we may need to add them back with proper tests.

llvm-svn: 336735

27c77fe4

Fix -Wmismatched-tags warning · d5e57ed9
Richard Trieu authored Jul 10, 2018
```
class -> struct in forward declaration.

llvm-svn: 336733
```
d5e57ed9

[X86] Teach X86InstrInfo::commuteInstructionImpl to use MOVSD/MOVSS for BLEND... · 860ab496

Craig Topper authored Jul 10, 2018

[X86] Teach X86InstrInfo::commuteInstructionImpl to use MOVSD/MOVSS for BLEND under optsize when the immediate allows it.

Isel currently emits movss/movsd a lot of the time and an accidental double commute turns it into a blend.

Ideally we'd select blend directly in isel under optspeed and not rely on the double commute to create blend.

llvm-svn: 336731

860ab496

Jul 10, 2018

[NFC] typo · a929fd7f
JF Bastien authored Jul 10, 2018
```
llvm-svn: 336730
```
a929fd7f

[X86] Remove X86ISD::MOVLPS and X86ISD::MOVLPD. NFCI · dea0b88b

Craig Topper authored Jul 10, 2018

These ISD nodes try to select the MOVLPS and MOVLPD instructions which are special load only instructions. They load data and merge it into the lower 64-bits of an XMM register. They are logically equivalent to our MOVSD node plus a load.

There was only one place in X86ISelLowering that used MOVLPD and no places that selected MOVLPS. The one place that selected MOVLPD had to choose between it and MOVSD based on whether there was a load. But lowering is too early to tell if the load can really be folded. So in isel we have patterns that use MOVSD for MOVLPD if we can't find a load.

We also had patterns that select the MOVLPD instruction for a MOVSD if we can find a load, but didn't choose the MOVLPD ISD opcode for some reason.

So it seems better to just standardize on MOVSD ISD opcode and manage MOVSD vs MOVLPD instruction with isel patterns.

llvm-svn: 336728

dea0b88b

[AMDGPU] Fix layering issue with AMDGPUHSAMetadataStreamer (NFC) · 01ce144d
Scott Linder authored Jul 10, 2018
```
llvm-svn: 336722
```
01ce144d

[ThinLTO] Use std::map to get determistic imports files · c0320ef4

Teresa Johnson authored Jul 10, 2018

Summary:
I noticed that the .imports files emitted for distributed ThinLTO
backends do not have consistent ordering. This is because StringMap
iteration order is not guaranteed to be deterministic. Since we already
have a std::map with this information, used when emitting the individual
index files (ModuleToSummariesForIndex), use it for the imports files as
well.

This issue is likely causing some unnecessary rebuilds of the ThinLTO
backends in our distributed build system as the imports files are inputs
to those backends.

Reviewers: pcc, steven_wu, mehdi_amini

Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits

Differential Revision: https://reviews.llvm.org/D48783

llvm-svn: 336721

c0320ef4

[X86] Remove dead SDNode object from X86InstrFragmentsSIMD.td. NFC · fb302d01
Craig Topper authored Jul 10, 2018
```
It points to an opcode that doesn't exist.

llvm-svn: 336720
```
fb302d01