Commits · 4506e447c1b6b8bb1da0f4bd2ea3b353d72f4a87 · Lorenzo Albano / LLVM bpEVL

Dec 27, 2016

[InstCombine][X86] Add DemandedElts support for PMULDQ/PMULUDQ instructions · c9cf7fc7

Simon Pilgrim authored Dec 26, 2016

PMULDQ/PMULUDQ vXi64 instructions use only the even-numbered v2Xi32 input elements, which SimplifyDemandedVectorElts should take advantage of.

Differential Revision: https://reviews.llvm.org/D28119

llvm-svn: 290554
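
The lane mapping described in this commit can be sketched in plain C++. This is a scalar model under stated assumptions, not the InstCombine code: `pmuludq` and `demandedInputLanes` are hypothetical helper names, `std::vector` stands in for the vector operands and the demanded-elements mask, and only the unsigned PMULUDQ variant is modeled.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Scalar model of PMULUDQ (hypothetical helper): 64-bit result lane I is the
// product of the even-numbered 32-bit input lanes 2*I of A and B.
// Odd-numbered input lanes are never read. Assumes A and B have equal size.
std::vector<std::uint64_t> pmuludq(const std::vector<std::uint32_t> &A,
                                   const std::vector<std::uint32_t> &B) {
  std::vector<std::uint64_t> Result(A.size() / 2);
  for (std::size_t I = 0; I < Result.size(); ++I)
    Result[I] = std::uint64_t(A[2 * I]) * std::uint64_t(B[2 * I]);
  return Result;
}

// Hypothetical helper: given which 64-bit result lanes are demanded, return
// which 32-bit input lanes are demanded. Only even input lanes can be set.
std::vector<bool> demandedInputLanes(const std::vector<bool> &DemandedResult) {
  std::vector<bool> DemandedInput(DemandedResult.size() * 2, false);
  for (std::size_t I = 0; I < DemandedResult.size(); ++I)
    if (DemandedResult[I])
      DemandedInput[2 * I] = true;
  return DemandedInput;
}
```

In this model, any input lane that `demandedInputLanes` leaves unset can be replaced by an arbitrary value without changing the demanded part of the result.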

Dec 26, 2016

clang-format NewGVN files · 85f91b0e
Daniel Berlin authored Dec 26, 2016
```
llvm-svn: 290551
```

Misc cleanups and simplifications for NewGVN. · 85cbc8c0

Daniel Berlin authored Dec 26, 2016

Mostly use a bit more idiomatic C++ where we can,
so we can combine some things later.

Reviewers: davide

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D28111

llvm-svn: 290550

Don't use our own incorrect version of isTriviallyDeadInstruction in NewGVN. Fixes PR/31472 · d59e8010
Daniel Berlin authored Dec 26, 2016
```
llvm-svn: 290549
```

[NewGVN] Add a flag to enable the pass via `-mllvm`. · fe7a3ee5

Davide Italiano authored Dec 26, 2016

NewGVN can be tested by passing `-mllvm -enable-newgvn` to clang.

Differential Revision: https://reviews.llvm.org/D28059

llvm-svn: 290548

[NewGVN] Fold lookupOperandLeader() when there's only one use. NFCI. · a312ca84
Davide Italiano authored Dec 26, 2016
```
llvm-svn: 290543
```
[InstCombiner] Simplify lib calls to `round{,f}` · b5e03b61
Bryant Wong authored Dec 26, 2016
```
Differential Revision: https://reviews.llvm.org/D28110

llvm-svn: 290542
```
[AVX-512] Fix some patterns to use extended register classes. · 5ef13ba1
Craig Topper authored Dec 26, 2016
```
llvm-svn: 290536
```

[AVX-512][InstCombine] Teach InstCombine to turn scalar add/sub/mul/div with... · 7b788ada

Craig Topper authored Dec 26, 2016

[AVX-512][InstCombine] Teach InstCombine to turn scalar add/sub/mul/div with rounding intrinsics into normal IR operations if the rounding mode is CUR_DIRECTION.

Summary:
I only do this for unmasked cases for now because isel is failing to fold the mask. I'll try to fix that soon.

I'll do the same thing for packed add/sub/mul/div in a future patch.

Reviewers: delena, RKSimon, zvi, craig.topper

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D27879

llvm-svn: 290535
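
A minimal sketch of the fold this commit describes, written as a scalar C++ model rather than as InstCombine code. `foldsToPlainOp` and `scalarAddLowLane` are hypothetical names; the constant `CurDirection = 4` mirrors the value of `_MM_FROUND_CUR_DIRECTION` in the x86 intrinsics headers, and the 4-lane float vector is an assumption made for illustration.

```cpp
#include <array>

// Value of _MM_FROUND_CUR_DIRECTION in the x86 intrinsics headers.
constexpr int CurDirection = 4;

using Vec4 = std::array<float, 4>;

// Hypothetical predicate: the fold applies only when the rounding argument
// says "use the current rounding mode" and the operation is unmasked.
bool foldsToPlainOp(int RoundingMode, bool IsMasked) {
  return RoundingMode == CurDirection && !IsMasked;
}

// What the folded scalar add computes: an ordinary add on lane 0, with the
// remaining lanes passed through from the first operand.
Vec4 scalarAddLowLane(Vec4 A, const Vec4 &B) {
  A[0] = A[0] + B[0];
  return A;
}
```

Masked calls are excluded in the predicate because the commit message says only the unmasked cases are handled for now.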

[AVX-512] Don't assume that the rounding mode argument to intrinsics is a... · f56d985f

Craig Topper authored Dec 26, 2016

[AVX-512] Don't assume that the rounding mode argument to intrinsics is a constant. While clang will guarantee this, nothing in the backend will.

A non-constant value will now result in an isel error instead of just asserting or crashing due to a bad cast during lowering.

llvm-svn: 290532

[AVX-512][InstCombine] Teach InstCombine to convert masked vpermv intrinsics... · e3280457

Craig Topper authored Dec 25, 2016

[AVX-512][InstCombine] Teach InstCombine to convert masked vpermv intrinsics into shufflevector instructions

Summary:
This patch adds support for converting the masked vpermv intrinsics into shufflevector instructions if the indices are constants.

We also need to wrap a select instruction around the shuffle to take care of the masking part. InstCombine will take care of optimizing the select if the mask is constant, so I didn't bother checking for that.

Reviewers: zvi, delena, spatel, RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D27825

llvm-svn: 290530
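
The shuffle-plus-select shape described in this commit can be modeled on plain arrays. This is only a sketch: `shuffle`, `selectByMask`, and `maskedPermute` are hypothetical names, the 4-lane `int` vector is chosen for brevity, and reducing each index modulo the lane count is an assumption of this model.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

constexpr std::size_t Lanes = 4;
using Vec = std::array<int, Lanes>;
using Indices = std::array<std::size_t, Lanes>;

// Step 1: the permute with constant indices, expressed as a shuffle.
// Each index is reduced modulo the lane count (an assumption of this model).
Vec shuffle(const Vec &Src, const Indices &Idx) {
  Vec Out{};
  for (std::size_t I = 0; I < Lanes; ++I)
    Out[I] = Src[Idx[I] % Lanes];
  return Out;
}

// Step 2: the select wrapped around the shuffle. Bit I of Mask picks the
// shuffled lane when set and the pass-through lane when clear.
Vec selectByMask(std::uint8_t Mask, const Vec &OnTrue, const Vec &OnFalse) {
  Vec Out{};
  for (std::size_t I = 0; I < Lanes; ++I)
    Out[I] = ((Mask >> I) & 1u) ? OnTrue[I] : OnFalse[I];
  return Out;
}

// The masked permute is the composition of the two steps.
Vec maskedPermute(const Vec &Src, const Indices &Idx, const Vec &PassThru,
                  std::uint8_t Mask) {
  return selectByMask(Mask, shuffle(Src, Idx), PassThru);
}
```

With an all-ones mask the select returns the shuffled vector unchanged, which is the constant-mask case the commit message leaves to later InstCombine simplification.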

[MemorySSA] Define a restricted upward AccessList splice. · 4213d941
Bryant Wong authored Dec 25, 2016
```
Differential Revision: https://reviews.llvm.org/D26661

llvm-svn: 290527
```

Dec 25, 2016

[AliasAnalysis] Teach BasicAA about memcpy. · a07d9b14
Bryant Wong authored Dec 25, 2016
```
Differential Revision: https://reviews.llvm.org/D27034

llvm-svn: 290526
```

Value number stores and memory states so we can detect when memory states are... · d7c12ee5

Daniel Berlin authored Dec 25, 2016

Value number stores and memory states so we can detect when memory states are equivalent (i.e., a store of the same value to memory).

Reviewers: davide

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D28084

llvm-svn: 290525

Rename GVNExpression *ops_ members to *op_* to match conventions in the rest of LLVM · 65f5f0d7
Daniel Berlin authored Dec 25, 2016
```
llvm-svn: 290524
```

[Orc][RPC] Add a ParallelCallGroup utility for dispatching and waiting on · c9d0ff13

Lang Hames authored Dec 25, 2016

multiple asynchronous RPC calls.

ParallelCallGroup allows multiple asynchronous calls to be dispatched,
and provides a wait method that blocks until all asynchronous calls have
been executed on the remote and all return value handlers run on the
local machine.

This will allow, for example, the JIT client to issue memory allocation calls
for all sections in parallel, then block until all memory has been allocated
on the remote and the allocated addresses registered with the client, at which
point the JIT client can proceed to applying relocations.

llvm-svn: 290523
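
The dispatch-then-wait pattern described in this commit can be sketched with a counter, a mutex, and a condition variable. `CallGroup` below is a hypothetical stand-in written for illustration; it is not the Orc RPC `ParallelCallGroup` interface, and the `Done` callback convention is an assumption of this sketch.

```cpp
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>

// Hypothetical stand-in for the idea behind ParallelCallGroup.
class CallGroup {
public:
  // Starts one asynchronous call. StartCall receives a Done callback that it
  // must invoke exactly once, after the call's result handler has run.
  void dispatch(const std::function<void(std::function<void()>)> &StartCall) {
    {
      std::lock_guard<std::mutex> Lock(M);
      ++Outstanding;
    }
    StartCall([this] {
      std::lock_guard<std::mutex> Lock(M);
      if (--Outstanding == 0)
        CV.notify_all();
    });
  }

  // Blocks until every dispatched call has reported completion.
  void wait() {
    std::unique_lock<std::mutex> Lock(M);
    CV.wait(Lock, [this] { return Outstanding == 0; });
  }

  // Number of calls dispatched but not yet completed.
  std::size_t outstanding() {
    std::lock_guard<std::mutex> Lock(M);
    return Outstanding;
  }

private:
  std::mutex M;
  std::condition_variable CV;
  std::size_t Outstanding = 0;
};
```

In the JIT scenario from the commit message, each section's allocation request would be one `dispatch`, and relocations would be applied only after `wait` returns.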

revert commit 290516 · 86602e85
Michael Zuckerman authored Dec 25, 2016
```
llvm-svn: 290517
```
Commit try added new empty line · 45aa4206
Michael Zuckerman authored Dec 25, 2016
```
llvm-svn: 290516
```
[DebugInfo] Added support for Checksum debug info feature. · 7faeecc8
Amjad Aboud authored Dec 25, 2016
```
Differential Revision: https://reviews.llvm.org/D27642

llvm-svn: 290514
```

MetadataLoader: replace the tracking of ForwardReferences and UnresolvedNodes... · 690952d1

Mehdi Amini authored Dec 25, 2016

MetadataLoader: replace the tracking of ForwardReferences and UnresolvedNodes with a set-based solution (NFC)

This makes the exact list to handle explicit, and it looks
much easier to manipulate and understand than the previous
custom tracking of min/max to express the range to search.

Differential Revision: https://reviews.llvm.org/D28089

llvm-svn: 290507

MetadataLoader: add an extra assertion in Placeholders flush (NFC) · 4f90ee00
Mehdi Amini authored Dec 25, 2016
```
We don't expect any forward reference at this point.

llvm-svn: 290506
```

Dec 24, 2016

[NewGVN] Prefer `auto` to explicit type when the latter is obvious. · 463c32ea
Davide Italiano authored Dec 24, 2016
```
llvm-svn: 290499
```
[SelectionDAG] Early out from computeKnownBits when we know we will have no common bits. · 0d66d296
Simon Pilgrim authored Dec 24, 2016
```
Avoid extra (recursive) calls to computeKnownBits if we already know that there are no common known bits.

llvm-svn: 290490
```
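
A minimal sketch of the early-out idea from the computeKnownBits commit above, assuming an operation whose result keeps only the bits known in both operands. The `KnownBits` struct and `commonKnownBits` function here are hypothetical and written for illustration; they are not the SelectionDAG code, and the commit does not say which node kinds are affected.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical record of what is known about a 32-bit value.
struct KnownBits {
  std::uint32_t Zero = 0; // bits known to be 0
  std::uint32_t One = 0;  // bits known to be 1
  bool hasNoKnownBits() const { return (Zero | One) == 0; }
};

// Intersects the known bits of two operands. Each operand is passed as a
// callback so that the (potentially recursive) analysis of the second
// operand can be skipped entirely.
KnownBits commonKnownBits(const std::function<KnownBits()> &ComputeLHS,
                          const std::function<KnownBits()> &ComputeRHS) {
  KnownBits LHS = ComputeLHS();
  if (LHS.hasNoKnownBits())
    return LHS; // Early out: the intersection is empty whatever RHS knows.
  KnownBits RHS = ComputeRHS();
  KnownBits Common;
  Common.Zero = LHS.Zero & RHS.Zero;
  Common.One = LHS.One & RHS.One;
  return Common;
}
```

The saving comes from never invoking `ComputeRHS` when the first operand already contributes nothing.
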
[PM] Try to improve the comments here to make what's going on more · 534d644b
Chandler Carruth authored Dec 24, 2016
```
clear.

Based on post-commit review suggestion from Sean. (Thanks!)

llvm-svn: 290488
```
Mark isOnlyReachableViaThisEdge as const · 8a6a8614
Daniel Berlin authored Dec 24, 2016
```
llvm-svn: 290468
```
Add an assertion for cl::opt names: they can't start with '-' · 4fe6a8c8
Mehdi Amini authored Dec 23, 2016
```
llvm-svn: 290467
```

[PM] Teach the always inlining test case to be much more strict about · 4eaff12b

Chandler Carruth authored Dec 23, 2016

whether functions are removed, and fix the new PM's always inliner to
actually pass this test.

Without this, the new PM's always inliner leaves all the functions
kicking around which won't work out very well given the semantics of
always inline.

Doing this really highlights how frustrating the current alwaysinline
semantic contract is though -- why can we put it on *external*
functions, etc?

Also I've added a number of tricky and interesting test cases for
removing functions with the always inliner. There is one remaining case
not handled -- fully removing comdats -- and I've left a FIXME about
this.

llvm-svn: 290457

Dec 23, 2016

[PM] Add support for building a default AA pipeline to the PassBuilder. · 060ad61f

Chandler Carruth authored Dec 23, 2016

Pretty boring and lame as-is but necessary. This is definitely a place
we'll end up with extension hooks longer term. =]

Differential Revision: https://reviews.llvm.org/D28076

llvm-svn: 290449

Function-import: Disable IRVerifier on lazy-loaded modules: the ODR... · 94f86ad4

Mehdi Amini authored Dec 23, 2016

Function-import: Disable IRVerifier on lazy-loaded modules: the ODR TypeUniquing generates invalid debug info.

llvm-svn: 290442

Fix build after r290437 (missing include) · fc06b83e
Mehdi Amini authored Dec 23, 2016
```
llvm-svn: 290438
```
FunctionImport: fix typo '#ifndef NDEBUG' instead of '#ifndef DEBUG' · 9a9077fd
Mehdi Amini authored Dec 23, 2016
```
llvm-svn: 290437
```
AMDGPU: split ret/noret patterns for global atomics · 206a510e
Jan Vesely authored Dec 23, 2016
```
Differential Revision: https://reviews.llvm.org/D27989

llvm-svn: 290435
```
[LICM] Plug a leak freeing the ASTs before clearing the map. · b9ff23a4
Davide Italiano authored Dec 23, 2016
```
llvm-svn: 290433
```
[MemDep] NFC changes · 383edba1
Piotr Padlewski authored Dec 23, 2016
```
llvm-svn: 290428
```

[LICM] Work around LICM's need to maintain state across loops. · 34f94384

Davide Italiano authored Dec 23, 2016

The pass creates some state which expects to be cleaned up by
a later instance of the same pass. opt-bisect happens to expose
this less-than-ideal design because calling skipLoop() will result in
this state not being cleaned up at times and an assertion firing
in `doFinalization()`. Chandler tells me the new pass manager will
give us options to avoid these design traps, but until it's ready,
we need a workaround for the current pass infrastructure. Fix provided
by Andy Kaylor; see the review for a complete discussion.

Differential Revision: https://reviews.llvm.org/D25848

llvm-svn: 290427

[AArch64] Cortex-A57 FDIV/FSQRT scheduling fix (W-unit) · 21da340f

Renato Golin authored Dec 23, 2016

According to the Cortex-A57 documentation, FDIV/FSQRT instructions should use
the F0 unit (W-unit in AArch64SchedA57.td, the same as cryptography instructions),
not the F1 unit (X-unit in td, like ASIMD absolute diff accum SABA/UABA).

This patch changes FDIV/FSQRT scheduling declarations to use A57UnitW
instead of A57UnitX. Also, latencies for those instructions are
corrected.

Patch by Andrew Zhogin.

llvm-svn: 290426

Revert r290423 because it broke the sanitizer-x86_64-linux-autoconf buildbot. · 898127fe
Florian Hahn authored Dec 23, 2016
```
llvm-svn: 290425
```

[framelowering] Skip dbg values when getting next/previous instruction. · 1d6b1a7b

Florian Hahn authored Dec 23, 2016

Summary:
In mergeSPUpdates, debug values need to be ignored when getting the
previous element, otherwise debug data could have an impact on codegen.

In eliminateCallFramePseudoInstr, debug values after the erased element
could have an impact on codegen and should be skipped.

Closes PR31319 (https://llvm.org/bugs/show_bug.cgi?id=31319)

Reviewers: mkuper, MatzeB, aprantl

Subscribers: gbedwell, llvm-commits

Differential Revision: https://reviews.llvm.org/D27688

llvm-svn: 290423
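
Skipping debug values when looking for a neighbouring instruction can be sketched on a flat list. The `Instr` struct and the two helpers below are hypothetical and use integer indices for brevity; the real code works on machine basic block iterators.

```cpp
#include <string>
#include <vector>

// Hypothetical instruction record; IsDebugValue marks debug-only entries.
struct Instr {
  std::string Name;
  bool IsDebugValue;
};

// Returns the index of the nearest non-debug instruction strictly before
// Pos, or -1 if there is none.
int prevNonDebug(const std::vector<Instr> &Block, int Pos) {
  for (int I = Pos - 1; I >= 0; --I)
    if (!Block[I].IsDebugValue)
      return I;
  return -1;
}

// Returns the index of the nearest non-debug instruction at or after Pos,
// or Block.size() if there is none.
int nextNonDebug(const std::vector<Instr> &Block, int Pos) {
  int End = static_cast<int>(Block.size());
  for (int I = Pos; I < End; ++I)
    if (!Block[I].IsDebugValue)
      return I;
  return End;
}
```

Because both helpers return the same answer whether or not debug entries are present in between, decisions based on the neighbouring instruction no longer depend on debug data.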

[NewGVN] Remove (for now) unused code. NFCI. · 0ff94162
Davide Italiano authored Dec 23, 2016
```
llvm-svn: 290420
```
[ThinLTO] Verify lazy-loaded source module for function importing when assertions are enabled (NFC) · 96cdc493
Mehdi Amini authored Dec 23, 2016
```
llvm-svn: 290416
```