Commits · d83c883d297b5a2bc407c20527cd4e5a1ee6e39e · Roger Ferrer / llvm-epi

Apr 13, 2018

[PowerPC] add fsub-fneg test; NFC · 9adb386a

Sanjay Patel authored Apr 12, 2018

This is a test for a transform that was suggested in the post-commit
mailing list thread for rL329821. The target in question is not in
trunk, so PPC gets to stand in for it because it's the only in-tree
target that sets 'isFPExtFree()' to 'true'.

llvm-svn: 329963

9adb386a

Apr 12, 2018

Let llvm-diff correctly deal with Undef/ConstantAggregateZero/ConstantVector/IndirectBr · d0d2ce94

Brian Gesiak authored Apr 12, 2018

Summary:
llvm-diff incorrectly reports that there's a diff when input IR contains undef/zeroinitializer/constantvector/indirectbr.
(This happens even if two identical files are given, e.g. `llvm-diff x.ll x.ll`)

This is fix to the bug report https://bugs.llvm.org/show_bug.cgi?id=33623 .



Reviewers: dexonsmith, rjmccall

Reviewed By: rjmccall

Subscribers: chenwj, mgrang, llvm-commits

Differential Revision: https://reviews.llvm.org/D34856

llvm-svn: 329957

d0d2ce94

AArch64: Introduce a DAG combine for folding offsets into addresses. · 00db326b

Peter Collingbourne authored Apr 12, 2018

This is a code size win in code that takes offseted addresses
frequently, such as C++ constructors that typically need to compute
an offseted address of a vtable. This reduces the size of Chromium
for Android's .text section by 108KB.

Differential Revision: https://reviews.llvm.org/D45199

llvm-svn: 329956

00db326b

[DebugInfo] Create merged locations for instructions other than calls · 65b0d4df

Vedant Kumar authored Apr 12, 2018

This lifts a restriction on DILocation::getMergedLocation(), allowing it
to create merged locations for instructions other than calls.

Instruction::applyMergedLocation() now defaults to creating merged
locations for all instructions.

The default behavior of getMergedLocation() is unchanged: callers which
invoke it directly are unaffected.

This change will enable a follow-up Mem2Reg fix which improves crash
reporting.

Differential Revision: https://reviews.llvm.org/D45396

llvm-svn: 329955

65b0d4df

[WebAssembly] libObject: Don't include the name the size of custom sections · 9745afa6
Sam Clegg authored Apr 12, 2018
```
Differential Revision: https://reviews.llvm.org/D45579

llvm-svn: 329947
```
9745afa6

[RISCV] Add c.mv rs1, rs2 pattern for addi rs1, rs2, 0 · e8b7ff30

Sameer AbuAsal authored Apr 12, 2018

Summary:
GCC compresses the pseudo instruction "mv rd, rs",  which is an alias of
"addi rd, rs, 0", to "c.mv rd, rs".

In LLVM we rely on the canonical MC instruction (MCInst) to do our compression
checks and since there is no rule to compress "addi rd, rs, 0" --> "c.mv
rd, rs" we lose this compression opportunity to gcc.

 In this patch we fix that by adding an addi to c.mv compression pattern, the
 instruction "mv rd, rs" will be compressed to "c.mv rd, rs" just like
 gcc does.

Patch by Zhaoshi Zheng (zzheng) and Sameer (sabuasal).

Reviewers: asb, apazos, zzheng, mgrang, shiva0217

Reviewed By: asb

Subscribers: rbar, johnrusso, simoncook, jordy.potman.lists, niosHD, kito-cheng, llvm-commits

Differential Revision: https://reviews.llvm.org/D45583

llvm-svn: 329939

e8b7ff30

[X86] Introduce LLVM wbinvd intrinsic · 297c1387

Gabor Buella authored Apr 12, 2018

A previously missing intrinsic for an old instruction.

Reviewers: craig.topper, echristo

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D45312

llvm-svn: 329936

297c1387

[Power9]Legalize and emit code for converting (Un)Signed DWord to Quad-Precision · 10367eb4

Lei Huang authored Apr 12, 2018

Legalize and emit code for:

  * xscvsdqp
  * xscvudqp

Differential Revision: https://reviews.llvm.org/D45230

llvm-svn: 329931

10367eb4

[AArch64] Move AFI->setRedZone(false) to top of emitPrologue · 8aa6cd5c

Jessica Paquette authored Apr 12, 2018

AFI->setRedZone(false) was put in the wrong place before, and so it only fired
on functions that didn't have stack frames. This moves that to the top of
emitPrologue to make sure that every function without a redzone has it set
correctly.

This also adds a function representing one of the early exit cases (GHC calling
convention) to the MachineOutliner noredzone test to ensure that we can outline
from functions like these, where we never use a redzone.

llvm-svn: 329922

8aa6cd5c

revert r328921 - [DAGCombine] (float)((int) f) --> ftrunc (PR36617) · 5ace2b76

Sanjay Patel authored Apr 12, 2018

This change is exposing UB in source code - as was warned/predicted. :)
See D44909 for discussion. Reverting while we figure out how to fix things.

llvm-svn: 329920

5ace2b76

[mips] Correct the predicates of the load/store (double)word for coprocessor 3. · d886aba3
Simon Dardis authored Apr 12, 2018
```
llvm-svn: 329913
```
d886aba3

[IRCE] isKnownNonNegative helper function · 97375359

Sam Parker authored Apr 12, 2018

Created a helper function to query for non negative SCEVs. Uses the
SGE predicate to catch constants that could be interpreted as
negative.

Differential Revision: https://reviews.llvm.org/D45481

llvm-svn: 329907

97375359

[mips] Correct the predicates for special nops, tlb ctrl instrs, software breakpoint and prefx. · a5a3c38c
Simon Dardis authored Apr 12, 2018
```
Reviewers: atanasyan, abeserminji

Differential Revision: https://reviews.llvm.org/D44436

llvm-svn: 329905
```
a5a3c38c

[InstCombine][NFC]: Add tests: foldSelectICmpAndAnd(): and is commutative · 53271ba1

Roman Lebedev authored Apr 12, 2018

Summary:
The fold added in D45108 did not account for the fact that
the and instruction is commutative, and if the mask is a variable,
the mask variable and the fold variable may be swapped.

I have noticed this by accident when looking into [[ https://bugs.llvm.org/show_bug.cgi?id=6773 | PR6773 ]]

Reviewers: spatel, craig.topper

Reviewed By: spatel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D45538

llvm-svn: 329901

53271ba1

[RISCV] Change function alignment to 4 bytes, and 2 bytes for RVC · b48b027d

Shiva Chen authored Apr 12, 2018

Summary:

According RISC-V ELF psABI specification, base RV32 and RV64 ISAs only
allow 32-bit instruction alignment, but instruction allow to be aligned
to 16-bit boundaries for C-extension.

So we just align to 4 bytes and 2 bytes for C-extension is enough.

Reviewers: asb, apazos

Differential Revision: https://reviews.llvm.org/D45560

Patch by Kito Cheng.

llvm-svn: 329899

b48b027d

[MIPS GlobalISel] minor update to MIR tests added in r329819 · 984db9ec

Petar Jovanovic authored Apr 12, 2018

Remove 'registers' section, as suggested (D. Sanders) at code review

https://reviews.llvm.org/D44304

llvm-svn: 329888

984db9ec

[NFC] fix trivial typos in documents and comments · bcadfee2
Hiroshi Inoue authored Apr 12, 2018
```
"is is" -> "is", "if if" -> "if", "or or" -> "or"

llvm-svn: 329878
```
bcadfee2

[RISCV] Codegen support for RV32D floating point comparison operations · 21d28fe8

Alex Bradbury authored Apr 12, 2018

Also add double-prevoius-failure.ll which captures a test case that at one
point triggered a compiler crash, while developing calling convention support
for f64 on RV32D with soft-float ABI.

llvm-svn: 329877

21d28fe8

[RISCV] Codegen support for RV32D floating point conversion operations · 60baa2e0

Alex Bradbury authored Apr 12, 2018

This also includes support and a test for truncating stores, which are now
possible thanks to the fpround pattern.

llvm-svn: 329876

60baa2e0

[RISCV] Add codegen support for RV32D floating point arithmetic operations · 5d0dfa5e
Alex Bradbury authored Apr 12, 2018
```
llvm-svn: 329874
```
5d0dfa5e
[RISCV] Add tests missed in r329871 · 8f296478
Alex Bradbury authored Apr 12, 2018
```
llvm-svn: 329872
```
8f296478

[DeadArgElim] Remove allocsize attributes on callsites · 48ee59b6

George Burgess IV authored Apr 12, 2018

We're already removing allocsize attributes from Functions that we
remove args from, since removing arguments from a function may make the
allocsize attribute incorrect. It appears we forgot to also remove them
from callsites.

Without this, I get verifier errors on `@Test2`.

It probably wouldn't be too hard to make DAE properly update allocsize
attributes instead of dropping them, but I can't think of a scenario
where that'd be useful in practice.

llvm-svn: 329868

48ee59b6

Revert "[llvm-objcopy] Switch over to using TableGen for parsing arguments" · a8c689e7

Jake Ehrlich authored Apr 12, 2018

TableGen seems to work differently on windows. I'll need to revert this

This reverts commit 7a153ddea067b24da59f6a66c733d79205969501.

llvm-svn: 329867

a8c689e7

[llvm-objcopy] Switch over to using TableGen for parsing arguments · cafa1128

Jake Ehrlich authored Apr 11, 2018

Swithces from using the command line library to using TableGen. This will allow
llvm-strip to exist and allow refinements of the command line syntax.

Differential Revision: https://reviews.llvm.org/D44236

llvm-svn: 329863

cafa1128

Apr 11, 2018

[PowerPC] Fix condition for 64-bit rotate when replacing r+r instr with r+i · c564dc06

Nemanja Ivanovic authored Apr 11, 2018

This patch fixes https://bugs.llvm.org/show_bug.cgi?id=37039
The condition only covers one of the two 64-bit rotate instructions. This just
adds the second (RLDICLo).

Patch by Josh Stone.

llvm-svn: 329852

c564dc06

Attempting to work around a non-determinism issue. · 0cba63c0

Puyan Lotfi authored Apr 11, 2018

The main thing that matters with this test is that the COPYs
are moved together not where the REG_SEQUENCES are.

llvm-svn: 329850

0cba63c0

[X86] Describe wbnoinvd instruction · 2ef36f35

Gabor Buella authored Apr 11, 2018

Similar to the wbinvd instruction, except this
one does not invalidate caches. Ring 0 only.
The encoding matches a wbinvd instruction with
an F3 prefix.

Reviewers: craig.topper, zvi, ashlykov

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D43816

llvm-svn: 329847

2ef36f35

[DSE] Add tests for atomic memory intrinsics (NFC) · 381cdf3e

Daniel Neilson authored Apr 11, 2018

Summary:
These tests show that DSE currently does nothing with the atomic memory
intrinsics. Future work will teach DSE how to simplify these.

llvm-svn: 329845

381cdf3e

[DSE] Regenerate tests with update_test_checks.py (NFC) · 9cfa786f

Daniel Neilson authored Apr 11, 2018

Summary:
In preparation for a future commit, this regenerates the test checks for
test/Transforms/DeadStoreElimination/OverwriteStoreBegin.ll
test/Transforms/DeadStoreElimination/OverwriteStoreEnd.ll

llvm-svn: 329839

9cfa786f

CodeGen: Don't try to canonicalize Unix-style paths in CodeView debug info. · cb8a666f

Peter Collingbourne authored Apr 11, 2018

Most importantly, we should not replace slashes with backslashes
because that would invalidate the path.

Differential Revision: https://reviews.llvm.org/D45473

llvm-svn: 329838

cb8a666f

[X86][Atom] Convert Atom scheduler model to SchedRW (PR32431) · 8fc2b496

Simon Pilgrim authored Apr 11, 2018

Atom is the only x86 target that still uses schedule itineraries, if we can remove this then we can begin the work on removing x86 itineraries. I've also found that it will help with PR36550.

I've focussed on matching the existing model as closely as possible (relying on the schedule tests), PR36895 indicated a lot of these were incorrect but we can just as easily fix these after this patch as before. Hopefully we can get llvm-exegesis to help here,

There are a few instructions that rely on itinerary scheduling (mainly push/pop/return) of multiple resource stages, but I don't think any of these are show stoppers.

There are also a few codegen changes that seem related to the post-ra scheduler acting a little differently, I haven't tracked these down but they don't seem critical.

NOTE: I don't have access to any Atom hardware, so this hasn't been tested in the wild.

Differential Revision: https://reviews.llvm.org/D45486

llvm-svn: 329837

8fc2b496

[NVPTX] Removed 'satom' feature which is no longer used. · 2f8efcf3
Artem Belevich authored Apr 11, 2018
```
Differential Revision: https://reviews.llvm.org/D45061

llvm-svn: 329830
```
2f8efcf3

[AMDGPU] Ensure there are enough registers for wave dispatch · fd8d4af3

Tim Renouf authored Apr 11, 2018

Summary:
This fixes the number of SGPRs and VGPRs in the *_RSRC1 register to
allow for registers set up in wave dispatch, even if those registers are
not used in the shader.

Re-landed after noticing that the buildbot failure from 329808 seemed to
be unrelated.

Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits

Differential Revision: https://reviews.llvm.org/D45503

Change-Id: I6575f0e0d2a528d1319d0b289f0ebe4510fa5771
llvm-svn: 329826

fd8d4af3

[DSE] Regenerate tests with update_test_checks.py (NFC) · 7e2e5c3c

Daniel Neilson authored Apr 11, 2018

Summary:
In preparation for a future commit, this regenerates the test checks for
test/Transforms/DeadStoreElimination/simple.ll
test/Transforms/DeadStoreElimination/memintrinsics.ll

llvm-svn: 329824

7e2e5c3c

[FastISel] Disable local value sinking by default · 08286994

Reid Kleckner authored Apr 11, 2018

This is causing compilation timeouts on code with long sequences of
local values and calls (i.e. foo(1); foo(2); foo(3); ...).  It turns out
that code coverage instrumentation is a great way to create sequences
like this, which how our users ran into the issue in practice.

Intel has a tool that detects these kinds of non-linear compile time
issues, and Andy Kaylor reported it as PR37010.

The current sinking code scans the whole basic block once per local
value sink, which happens before emitting each call. In theory, local
values should only be introduced to be used by instructions between the
current flush point and the last flush point, so we should only need to
scan those instructions.

llvm-svn: 329822

08286994

[InstCombine] limit X - (cast(-Y) --> X + cast(Y) with hasOneUse() · ff98682c
Sanjay Patel authored Apr 11, 2018
```
llvm-svn: 329821
```
ff98682c

[DWARFv5] Fuss with asm syntax for conveying MD5 checksum. · 0195469a

Paul Robinson authored Apr 11, 2018

Previously the MD5 option of the .file directive provided the checksum
as a quoted hex string; now it's a normal hex number with 0x prefix,
same as the .octa directive accepts.

Differential Revision: https://reviews.llvm.org/D45459

llvm-svn: 329820

0195469a

[MIPS GlobalISel] Select add i32, i32 · 366857a2

Petar Jovanovic authored Apr 11, 2018

Add the minimal support necessary to lower a function that returns the
sum of two i32 values.
Support argument/return lowering of i32 values through registers only.
Add tablegen for regbankselect and instructionselect.

Patch by Petar Avramovic.

Differential Revision: https://reviews.llvm.org/D44304

llvm-svn: 329819

366857a2

[SLP] update a test case. NFC. · 5ba37955
Haicheng Wu authored Apr 11, 2018
```
llvm-svn: 329818
```
5ba37955

[AMDGPU] Fix lowering enqueue_kernel · 9381ae97

Yaxun Liu authored Apr 11, 2018

Two issues were fixed:

runtime has difficulty to allocate memory for an external symbol of a
kernel and set the address of the external symbol, therefore make the runtime
handle of an enqueued kernel an ordinary global variable. Runtime only needs
to store the address of the loaded kernel to the handle and has verified
that this approach works.

handle the situation where __enqueue_kernel* gets inlined therefore
the enqueued kernel may be used through a constant expr instead
of an instruction.

Differential Revision: https://reviews.llvm.org/D45187

llvm-svn: 329815

9381ae97