Commits · 977daf307ddcf2469bb4ca57a988369266a6f83d · Roger Ferrer / llvm-epi

Jul 14, 2016

Speculatively fix the sphinx build, which does not think the original code was... · 977daf30

Aaron Ballman authored Jul 14, 2016

Speculatively fix the sphinx build, which does not think the original code was valid nasm (http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/11854/steps/docs-llvm-html/logs/stdio).

llvm-svn: 275408

[X86][AVX] Add support for narrowing 128-bit+ shuffle mask elements to 64-bits to allow combining · 053d3290

Simon Pilgrim authored Jul 14, 2016

Primarily this is to allow blend with zero instead of having to use vperm2f128, but we can use this in the future to deal with AVX512 cases where we need to keep the original element size to correctly fold masked operations.
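
To illustrate what narrowing shuffle mask elements involves, here is a minimal sketch, not the X86 backend code. The name `narrowMask` and the convention that negative entries are sentinels (undef or zero lanes) are assumptions made for this example.

```cpp
#include <vector>

// Hypothetical helper: rewrites a shuffle mask over wide elements as an
// equivalent mask over elements that are `scale` times narrower.
// Negative entries are assumed to be sentinels and are copied unchanged.
std::vector<int> narrowMask(const std::vector<int> &mask, int scale) {
  std::vector<int> out;
  for (int m : mask)
    for (int j = 0; j < scale; ++j)
      out.push_back(m < 0 ? m : m * scale + j);
  return out;
}
```

For example, a mask `{1, -1}` over 128-bit elements becomes `{2, 3, -1, -1}` over 64-bit elements when `scale` is 2.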

llvm-svn: 275406

This converts a signed remainder instruction to unsigned remainder, which · 716abbb2

Sjoerd Meijer authored Jul 14, 2016

enables the code size optimisation to fold a rem and div into a single
aeabi_uidivmod call. This was not happening before because sdiv was converted
but srem not, and instructions with different signedness are not combined.
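
A small sketch of why the conversion helps, under an assumption the message does not spell out: that both operands are known to be non-negative, so signed and unsigned division agree. `udivmod` is a hypothetical stand-in for a combined divide-and-remainder runtime helper, not the real ARM routine.

```cpp
#include <cassert>
#include <cstdint>

struct DivRem {
  uint32_t quot;
  uint32_t rem;
};

// Hypothetical combined helper: one call yields quotient and remainder.
DivRem udivmod(uint32_t n, uint32_t d) {
  assert(d != 0);
  return {n / d, n % d};
}

// For n >= 0 and d > 0, signed '/' and '%' match the unsigned results,
// so both can be served by a single unsigned helper call.
bool signedMatchesUnsigned(int32_t n, int32_t d) {
  assert(n >= 0 && d > 0);
  DivRem r = udivmod(static_cast<uint32_t>(n), static_cast<uint32_t>(d));
  return n / d == static_cast<int32_t>(r.quot) &&
         n % d == static_cast<int32_t>(r.rem);
}
```

If only the division were unsigned while the remainder stayed signed, the two operations would not share a helper, which is the situation the message describes.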

Differential Revision: http://reviews.llvm.org/D22214

llvm-svn: 275403

[X86][AVX] Add 128-bit wide shuffle tests that should combine to blend-with-zero · 700e4a1a
Simon Pilgrim authored Jul 14, 2016
```
llvm-svn: 275402
```

code hoisting pass based on GVN · 63847d04

Sebastian Pop authored Jul 14, 2016

This pass hoists duplicated computations in the program. The primary goal of
gvn-hoist is to reduce the size of functions before inline heuristics to reduce
the total cost of function inlining.
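
As a source-level illustration of the idea (the pass itself works on LLVM IR, and the function names here are made up for the example), hoisting moves a computation that is duplicated in both branches to a single point before the branch:

```cpp
// Before: 'a * b' is computed separately in each branch.
int before(bool c, int a, int b) {
  if (c)
    return a * b + 1;
  return a * b - 1;
}

// After hoisting: the shared computation appears once, ahead of the branch,
// so the function body is smaller.
int after(bool c, int a, int b) {
  int t = a * b;
  if (c)
    return t + 1;
  return t - 1;
}
```

Both versions return the same value for every input; only the amount of code differs.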

Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki.
Important algorithmic contributions by Daniel Berlin in the form of reviews.

Differential Revision: http://reviews.llvm.org/D19338

llvm-svn: 275401

[X86][AVX] Add VBROADCASTF128/VBROADCASTI128 shuffle comments support · a76a8e50
Simon Pilgrim authored Jul 14, 2016
```
llvm-svn: 275400
```
Remove extra ';' to appease -Wpedantic · 086639a6
Dean Michael Berris authored Jul 14, 2016
```
Summary:

Reviewers: dok

Subscribers: llvm-commits
llvm-svn: 275399
```
[X86][AVX] Regenerate broadcast upgrade tests · 9e812169
Simon Pilgrim authored Jul 14, 2016
```
llvm-svn: 275398
```
[X86][AVX2] VBROADCASTSSrr/VBROADCASTSSYrr require AVX2 not AVX · b8c261c9
Simon Pilgrim authored Jul 14, 2016
```
llvm-svn: 275391
```

This implements a more optimal algorithm for selecting a base constant in · 38c2cd0c

Sjoerd Meijer authored Jul 14, 2016

constant hoisting. It not only takes into account the number of uses and the
cost of expressions in which constants appear, but now also the resulting
integer range of the offsets. Thus, the algorithm maximizes the number of uses
within an integer range that will enable more efficient code generation. On
ARM, for example, this will enable code size optimisations because fewer
negative offsets will be created. Negative offsets/immediates are not supported
by Thumb1, which prevents a more compact instruction encoding.
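
The selection idea can be sketched as follows. This is a simplified model, not the pass's actual cost computation: it ignores expression cost, and `Candidate`, `pickBase`, and the `[0, maxOffset]` range are assumptions made for the example.

```cpp
#include <cstdint>
#include <vector>

struct Candidate {
  int64_t value;  // the constant
  unsigned uses;  // how many times it is used
};

// Returns the candidate value that, taken as the base, keeps the largest
// number of uses at an offset inside [0, maxOffset]. Returns 0 if empty.
int64_t pickBase(const std::vector<Candidate> &cs, int64_t maxOffset) {
  int64_t best = 0;
  unsigned bestUses = 0;
  bool found = false;
  for (const Candidate &base : cs) {
    unsigned covered = 0;
    for (const Candidate &c : cs) {
      int64_t off = c.value - base.value;
      if (off >= 0 && off <= maxOffset)
        covered += c.uses;
    }
    if (!found || covered > bestUses) {
      best = base.value;
      bestUses = covered;
      found = true;
    }
  }
  return best;
}
```

With constants 100 (1 use), 104 (3 uses), and 108 (2 uses) and a generous range, the sketch picks 100, so no offset is negative, even though 104 has the most uses on its own.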

Differential Revision: http://reviews.llvm.org/D21183

llvm-svn: 275382

[InstCombine] Masked loads with undef masks can fold to normal loads · 666aa945

David Majnemer authored Jul 14, 2016

We were able to fold masked loads with an all-ones mask to a normal
load.  However, we couldn't turn a masked load with a mask with mixed
ones and undefs into a normal load.
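
The condition can be modelled with a three-state mask lane. This is an illustrative sketch only; `Lane` and `canFoldToPlainLoad` are hypothetical names, not InstCombine's API.

```cpp
#include <vector>

enum class Lane { Zero, One, Undef };

// A masked load can be treated as a plain load when no lane is known to be
// disabled: each undef lane may be chosen to be one.
bool canFoldToPlainLoad(const std::vector<Lane> &mask) {
  for (Lane l : mask)
    if (l == Lane::Zero)
      return false;
  return true;
}
```

A mask of `{One, Undef}` qualifies, while `{One, Zero}` does not, because the zero lane must not be read from memory.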

llvm-svn: 275380

Simplify llvm.masked.load w/ undef masks · 17a95aaa

David Majnemer authored Jul 14, 2016

We can always pick the passthru value if the mask is undef: we are
permitted to treat the mask as if it were filled with zeros.

llvm-svn: 275379

[AVX512] Implement EXTLOAD lowering with patterns to select existing VPMOVZX... · 6840f115

Craig Topper authored Jul 14, 2016

[AVX512] Implement EXTLOAD lowering with patterns to select existing VPMOVZX instructions instead of creating CodeGenOnly instructions.

llvm-svn: 275378

[X86] Fix stupid typo in isel lowering. · 17e8ea18

Eli Friedman authored Jul 14, 2016

Apparently someone miscounted the number of zeros in the immediate.
Fixes https://llvm.org/bugs/show_bug.cgi?id=28544 .

llvm-svn: 275376

AMDGPU/R600: Delete/rename intrinsics no longer used by mesa · ca7f5701
Matt Arsenault authored Jul 14, 2016
```
Use the replacement pass to update the tests, and delete old names.

llvm-svn: 275375
```
AMDGPU/R600: Remove intrinsics with no tests and no users · 648e422b
Matt Arsenault authored Jul 14, 2016
```
Mesa removed this path, so nothing is using these anymore.

llvm-svn: 275372
```
AMDGPU: Remove unused intrinsics · 897eee41
Matt Arsenault authored Jul 14, 2016
```
llvm-svn: 275371
```

AMDGPU: Fix test not actually testing anything · aa94c1e7

Matt Arsenault authored Jul 14, 2016

It wasn't actually running the pass, and since it is
missing the llvm prefix, the eh intrinsic was not
really an IntrinsicInst.

Also add missing test for lifetime markers.

llvm-svn: 275370

AMDGPU: Remove dead code · 0bf9984b
Matt Arsenault authored Jul 14, 2016
```
llvm-svn: 275369
```

XRay: Add entry and exit sleds · 52735fc4

Dean Michael Berris authored Jul 14, 2016

Summary:
In this patch we implement the following parts of XRay:

- Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches.
- Supporting a function attribute named 'xray-instruction-threshold' used to decide whether a function is instrumented, based on a minimum number of instructions (IR instruction count).
- X86-specific nop sleds as described in the white paper.
- A machine function pass that adds the different instrumentation marker instructions at a very late stage.
- A way of identifying which return opcode is considered "normal" for each architecture.
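
One plausible reading of how the two attributes interact is sketched below. The policy is an assumption for illustration, not a description of the pass's actual logic, and `FunctionInfo` and `shouldInstrument` are hypothetical names.

```cpp
#include <string>

// Hypothetical summary of the two attributes on one function.
struct FunctionInfo {
  std::string functionInstrument;  // value of 'function-instrument'; empty if absent
  bool hasThreshold;               // whether 'xray-instruction-threshold' is present
  unsigned threshold;              // its value, meaningful only if hasThreshold
  unsigned instructionCount;       // number of instructions in the function
};

// Assumed policy: 'xray-always' forces instrumentation; otherwise the function
// must carry a threshold and contain at least that many instructions.
bool shouldInstrument(const FunctionInfo &f) {
  if (f.functionInstrument == "xray-always")
    return true;
  return f.hasThreshold && f.instructionCount >= f.threshold;
}
```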

There are some caveats here:

1) We don't handle PATCHABLE_RET on platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time, so that by default it is unpacked on platforms where XRay is not available yet.

2) The generated section for X86 is different from what is described in the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper in this respect to allow us to get richer information from the runtime library.

Reviewers: sanjoy, eugenis, kcc, pcc, echristo, rnk

Subscribers: niravd, majnemer, atrick, rnk, emaste, bmakam, mcrosier, mehdi_amini, llvm-commits

Differential Revision: http://reviews.llvm.org/D19904

llvm-svn: 275367

[SCCP] Pass a Value * instead of templating this function. NFC. · ed4d5ea8
Davide Italiano authored Jul 14, 2016
```
Thanks to Eli for the suggestion!

llvm-svn: 275366
```
clarify a bit. · 0bd88229
Chris Lattner authored Jul 14, 2016
```
llvm-svn: 275364
```

[IPSCCP] Constant fold struct argument/instructions when all the lattice values are constant. · 7dac027e

Davide Italiano authored Jul 14, 2016

This now should also work with the interprocedural variant of the pass.
Slightly easier now that the yak is shaved.

Differential Revision: http://reviews.llvm.org/D22329

llvm-svn: 275363

[Object] Re-apply r275316 now that I have the corresponding LLD patch ready. · fc209623
Lang Hames authored Jul 14, 2016
```
llvm-svn: 275361
```
Teach fast isel about thiscall (and callee-pop) calls. · af7e8465
Nico Weber authored Jul 14, 2016
```
http://reviews.llvm.org/D22315

llvm-svn: 275360
```

[Scalarizer] PR28108: Skip over nullptr rather than crashing on it. · 8484f92f

Mehdi Amini authored Jul 14, 2016

Summary:
In Scalarizer::gather we see if we already have a scattered form of Op,
and in that case use the new form.

In the particular case of PR28108, the found ValueVector SV has size 2,
where the first Value is nullptr, and the second is indeed a proper Value.
The nullptr then caused an assert to blow when we tried to do
cast<Instruction>(SV[I]).

With this patch we check SV[I] before doing the cast, and if it's nullptr
we just skip over it.

I don't know the Scalarizer well enough to know if this is the best fix
or if something should be done elsewhere to prevent the nullptr from
being in the ValueVector at all, but at least this avoids the crash,
and the test case output looks reasonable.
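
The shape of the check described above, in a standalone sketch. `Value` here is a stand-in struct and `collectIds` a hypothetical function; the real code operates on LLVM values and uses `cast<Instruction>`.

```cpp
#include <vector>

// Stand-in for an IR value; only an id is needed for the illustration.
struct Value {
  int id;
};

// Visits every entry of a scattered-value vector, skipping null slots
// instead of dereferencing them.
std::vector<int> collectIds(const std::vector<const Value *> &sv) {
  std::vector<int> ids;
  for (const Value *v : sv) {
    if (!v)
      continue;  // a missing element is skipped rather than treated as an error
    ids.push_back(v->id);
  }
  return ids;
}
```

With a two-element vector whose first slot is null, as in the PR28108 case described, only the second element is processed.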

Reviewers: hfinkel, frasercrmck, wala, mehdi_amini

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D21518

llvm-svn: 275359

Add missing test for r275347 "[IPRA] Set callee saved registers to none for... · 9e332a77

Mehdi Amini authored Jul 14, 2016

Add missing test for r275347 "[IPRA] Set callee saved registers to none for local function when IPRA is enabled."

llvm-svn: 275358

[SCCP] Generalize tryToReplaceInstWithConstant to work also with arguments. · 6ed6d779
Davide Italiano authored Jul 14, 2016
```
llvm-svn: 275357
```

MIRParser: Fix MIRParser not reporting nullptr on error. · d6f9562b

Matthias Braun authored Jul 14, 2016

While some code paths in MIRParserImpl::parse() already returned nullptr
in case of error, one of the important ones did not.

llvm-svn: 275355

Synchronize LLVM and clang's ObjCDeclSpec::ObjCPropertyAttributeKind. · 0418ef26

Adrian Prantl authored Jul 14, 2016

This adds Clang-specific DWARF constants for nullability and ObjC
class properties that are already generated by clang. This patch adds
dwarfdump support and a more comprehensive testcase.

<rdar://problem/27335745>

llvm-svn: 275354

[Object] Revert r275316, Archive::child_iterator changes, while I update lld. · ae610ab5
Lang Hames authored Jul 14, 2016
```
Should fix the bots broken by r275316.

llvm-svn: 275353
```

[ConstantFolding] Fold masked loads · 7f781aba

David Majnemer authored Jul 14, 2016

We can constant fold a masked load if the operands are appropriately
constant.
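
When memory contents, mask, and passthru are all known, the result of a masked load can be computed lane by lane. The sketch below models that with plain arrays; `foldMaskedLoad` is a hypothetical name, and the real folding works on LLVM constants rather than `std::array`.

```cpp
#include <array>
#include <cstddef>

// Lane-wise model of a masked load: enabled lanes read memory,
// disabled lanes take the passthru value.
template <std::size_t N>
std::array<int, N> foldMaskedLoad(const std::array<int, N> &memory,
                                  const std::array<bool, N> &mask,
                                  const std::array<int, N> &passthru) {
  std::array<int, N> result{};
  for (std::size_t i = 0; i < N; ++i)
    result[i] = mask[i] ? memory[i] : passthru[i];
  return result;
}
```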

Differential Revision: http://reviews.llvm.org/D22324

llvm-svn: 275352

Force a semicolon at the end of the LLVM_ENABLE_BITMASK_ENUMS_IN_NAMESPACE() macro. · d5bbd856
Justin Lebar authored Jul 13, 2016
```
This silences a warning about an extra semicolon on gcc.

llvm-svn: 275349
```
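
A common way to make a macro require a trailing semicolon is to end its expansion with a declaration that lacks one. The sketch below shows the technique with made-up names (`detail`, `IMPORT_HELPERS`, `client`); it is not the LLVM macro itself.

```cpp
namespace detail {
inline int twice(int x) { return 2 * x; }
inline int thrice(int x) { return 3 * x; }
}  // namespace detail

// The last using-declaration deliberately has no ';', so the caller must
// supply it. 'IMPORT_HELPERS();' then contains no stray empty declaration.
#define IMPORT_HELPERS() \
  using detail::twice;   \
  using detail::thrice

namespace client {
IMPORT_HELPERS();
inline int six(int x) { return twice(thrice(x)); }
}  // namespace client
```

Had the macro ended with its own semicolon, writing `IMPORT_HELPERS();` would leave an extra `;` at namespace scope, which is the kind of construct pedantic warnings flag.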

Add EnableIPRA to TargetOptions, and move the cl::opt -enable-ipra to TargetMachine.cpp · cfed2564

Mehdi Amini authored Jul 13, 2016

Avoid exposing a cl::opt in a public header and instead promote this
option in the API.
Alternatively, we could land the cl::opt in CommandFlags.h so that
it is available to every tool, but we would still have to find an
option for clang.

llvm-svn: 275348

[IPRA] Set callee saved registers to none for local function when IPRA is enabled. · 4beea662

Mehdi Amini authored Jul 13, 2016

IPRA tries to optimize caller-saved registers by propagating register
usage information from callee to caller, so it is beneficial to have
caller-saved registers rather than callee-saved registers when IPRA
is enabled. A more detailed explanation is available here:
https://groups.google.com/d/msg/llvm-dev/XRzGhJ9wtZg/tjAJqb0eEgAJ.

This change makes local functions have no callee-preserved
registers when IPRA is enabled. A simple test case is also added to
verify this change.

Patch by Vivek Pandya <vivekvpandya@gmail.com>

Differential Revision: http://reviews.llvm.org/D21561

llvm-svn: 275347

[JumpThreading] Delete commented out debug code; NFC · 931df67a
Sanjoy Das authored Jul 13, 2016
```
llvm-svn: 275346
```

[ConstantFolding] Extend FoldReinterpretLoadFromConstPtr to handle negative offsets · f89660ab

David Majnemer authored Jul 13, 2016

Treat loads which clip before the start of a global initializer the same
way we treat clipping beyond the end of the initializer: use zeros.
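
At the byte level, the rule can be sketched like this. It is a simplified model: `readClipped` is a hypothetical helper over a raw byte vector, whereas the real function works on LLVM constants and the data layout.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Reads 'width' bytes starting at 'offset' (which may be negative) from an
// initializer. Bytes before the start or past the end read as zero.
std::vector<uint8_t> readClipped(const std::vector<uint8_t> &init,
                                 int64_t offset, std::size_t width) {
  std::vector<uint8_t> out(width, 0);
  for (std::size_t i = 0; i < width; ++i) {
    int64_t idx = offset + static_cast<int64_t>(i);
    if (idx >= 0 && idx < static_cast<int64_t>(init.size()))
      out[i] = init[static_cast<std::size_t>(idx)];
  }
  return out;
}
```

Reading 4 bytes at offset -2 from `{1, 2, 3, 4}` gives `{0, 0, 1, 2}`, mirroring how a read at offset 2 gives `{3, 4, 0, 0}`.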

llvm-svn: 275345

Move a transform from InstCombine to InstSimplify. · d77a3b61

David Majnemer authored Jul 13, 2016

This transform doesn't require any new instructions, so it can safely live
in InstSimplify.

llvm-svn: 275344

Fix copy/paste bug in r275340. · 4d36e770
Michael Kuperstein authored Jul 13, 2016
```
llvm-svn: 275343
```

MIRParser: Move SlotMapping and SourceMgr refs to PFS; NFC · e35861d6

Matthias Braun authored Jul 13, 2016

Code cleanup: Move references to SlotMapping and SourceMgr into the
PerFunctionMIParsingState to avoid unnecessary passing around in
parameters.

llvm-svn: 275342
