Commits · 0b6773e189951331af5903113bc5525ad5d86d31 · Roger Ferrer / llvm-epi

Jul 18, 2016

[Hexagon] Handle returning small structures by value · 14412ef0

Krzysztof Parzyszek authored Jul 18, 2016

This is not compliant with the official ABI, but allows experimentation
with calling conventions.

llvm-svn: 275825

14412ef0

[Hexagon] Revert r275822: mistake in commit message · 4661a958
Krzysztof Parzyszek authored Jul 18, 2016
```
llvm-svn: 275824
```
4661a958
[X86][AVX] Add target shuffle decode support for VBROADCAST · c941f6b3
Simon Pilgrim authored Jul 18, 2016
```
Currently we only decode broadcasts from a vector of the same size.

llvm-svn: 275823
```
c941f6b3

[Hexagon] Handle returning small structures by value · 5948ea78

Krzysztof Parzyszek authored Jul 18, 2016

This is compliant with the official ABI, but allows experimentation with
calling conventions.

llvm-svn: 275822

5948ea78

[X86] Accept SELECT op code for x86-64 fp128 type · 4d9f2c15

Chih-Hung Hsieh authored Jul 18, 2016

DAGTypeLegalizer::CanSkipSoftenFloatOperand should allow
SELECT op code for x86_64 fp128 type for MME targets,
so SoftenFloatOperand does not abort on SELECT op code.

Differential Revision: http://reviews.llvm.org/D21758

llvm-svn: 275818

4d9f2c15

[MathExtras] Fix UB in minIntN · a2a218fb

David Majnemer authored Jul 18, 2016

We negated a value with a signed type which invited problems when that
value was the most negative signed number.  Use an unsigned type
for the value instead.  It will compute the same twos complement
result without the UB.

llvm-svn: 275815

a2a218fb

[LoopDist] This test does not require ASSERTS · d6ba0bf8

Adam Nemet authored Jul 18, 2016

Only its counterpart, diagnostics-with-hotness-lazy-BFI.ll, which
invokes opt with -debug-only=.

llvm-svn: 275812

d6ba0bf8

[LoopDist] Port to new PM · b2593f78

Adam Nemet authored Jul 18, 2016

Summary:
The direct motivation for the port is to ensure that the OptRemarkEmitter
tests work with the new PM.

This remains a function pass because we not only create multiple loops
but could also version the original loop.

In the test I need to invoke opt
with -passes='require<aa>,loop-distribute'.  LoopDistribute does not
directly depend on AA however LAA does.  LAA uses getCachedResult so
I *think* we need manually pull in 'aa'.

Reviewers: davidxl, silvas

Subscribers: sanjoy, llvm-commits, mzolotukhin

Differential Revision: https://reviews.llvm.org/D22437

llvm-svn: 275811

b2593f78

[OptRemarkEmitter] Port to new PM · 79ac42a5

Adam Nemet authored Jul 18, 2016

Summary:
The main goal is to able to start using the new OptRemarkEmitter
analysis from the LoopVectorizer.  Since the vectorizer was recently
converted to the new PM, it makes sense to convert this analysis as
well.

This pass is currently tested through the LoopDistribution pass, so I am
also porting LoopDistribution to get coverage for this analysis with the
new PM.

Reviewers: davidxl, silvas

Subscribers: llvm-commits, mzolotukhin

Differential Revision: https://reviews.llvm.org/D22436

llvm-svn: 275810

79ac42a5

Sort include headers · 3beef418
Adam Nemet authored Jul 18, 2016
```
llvm-svn: 275809
```
3beef418
[X86][AVX2] Added tests that demonstrate duplicate broadcasts · 4ac74206
Simon Pilgrim authored Jul 18, 2016
```
We don't yet decode broadcasts as a target shuffle

llvm-svn: 275808
```
4ac74206
[Hexagon] Misc changes to HexagonMachineScheduler, NFC · 2be7eadb
Krzysztof Parzyszek authored Jul 18, 2016
```
- Remove duplicated code.
- Convert loop to range-for.

llvm-svn: 275806
```
2be7eadb

[Hexagon] Enable .cur formation in MISched for Hexagon V60 · 786333ff

Krzysztof Parzyszek authored Jul 18, 2016

Schedule a load and its use in the same packet in MISched. Previously,
isResourceAvailable was returning false for dependences in the same
packet, which prevented MISched from packetizing a load and its use in
the same packet for v60.

Patch by Ikhlas Ajbar.

llvm-svn: 275804

786333ff

Revert "r275571 [DSE]Enhance shorthening MemIntrinsic based on OverlapIntervals" · 63dd36fa
Alexander Kornienko authored Jul 18, 2016
```
Causes https://llvm.org/bugs/show_bug.cgi?id=28588

llvm-svn: 275801
```
63dd36fa
[Hexagon] Add verbose debugging mode to Hexagon MI Scheduler · f05dc4d5
Krzysztof Parzyszek authored Jul 18, 2016
```
Patch by Sergei Larin.

llvm-svn: 275799
```
f05dc4d5

[PowerPC] Remove redundant direct moves when extracting integers and converting to FP · d3c284f6

Nemanja Ivanovic authored Jul 18, 2016

This patch corresponds to review:
https://reviews.llvm.org/D21354

We use direct moves for extracting integer elements from vectors. We also use
direct moves when converting integers to FP. When these operations are chained,
we get a direct move out of a VSR followed by a direct move back into a VSR.
These are redundant - all we need to do is line up the element and convert.

llvm-svn: 275796

d3c284f6

[MC] Cleanup Error Handling in AsmParser · a645433c

Nirav Dave authored Jul 18, 2016

Add parseToken and compatriot functions to stitch error checks in
straight linear code. As part of this fix some erronous handling of
directives where the EndOfStatement token either was not checked or
Lexed on termination.

Reviewers: rnk, majnemer

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D22312

llvm-svn: 275795

a645433c

[Hexagon] Use timing class info as tie-breaker in machine scheduler · 393b3793
Krzysztof Parzyszek authored Jul 18, 2016
```
Patch by Sirish Pande.

llvm-svn: 275794
```
393b3793

[Hexagon] HexagonMachineScheduler should account for resources · 3467e9d0

Krzysztof Parzyszek authored Jul 18, 2016

The machine scheduler needs to account for available resources
more accurately in order to avoid scheduling an instruction that
forces a new packet to be created.

This occurs in two ways: First, an instruction without an available
resource may have a large priority due to other metrics and be
scheduled when there are other instructions with available resources.
Second, an instruction with a non-zero latency may become available
prematurely. In both these cases, we attempt change the priority
in order to allow a better instruction to be scheduled.

Patch by Brendon Cahoon.

llvm-svn: 275793

3467e9d0

[Hexagon] Fix zero latency instructions with multiple predecessors · 748d3efe

Krzysztof Parzyszek authored Jul 18, 2016

An instruction may have multiple predecessors that are candidates
for using .cur. However, only one of them can use .cur in the
packet. When this case occurs, we need to make sure that only
one of the dependences gets a 0 latency value.

Patch by Brendon Cahoon.

llvm-svn: 275790

748d3efe

Fixed errors in docs. · d80f6265
Alexander Kornienko authored Jul 18, 2016
```
llvm-svn: 275789
```
d80f6265
[SLPVectorizer][X86] Added sqrt vectorization tests · 1b2ab113
Simon Pilgrim authored Jul 18, 2016
```
llvm-svn: 275788
```
1b2ab113

[inlineasm] Propagate operand constraints to the backend · d32a2d30

Simon Dardis authored Jul 18, 2016

When SelectionDAGISel transforms a node representing an inline asm
block, memory constraint information is not preserved. This can cause
constraints to be broken when a memory offset is of the form:

offset + frame index

when the frame is resolved.

By propagating the constraints all the way to the backend, targets can
enforce memory operands of inline assembly to conform to their constraints.

For MIPSR6, some instructions had their offsets reduced to 9 bits from
16 bits such as ll/sc. This becomes problematic when using inline assembly
to perform atomic operations, as an offset can generated that is too big to
encode in the instruction.

Reviewers: dsanders, vkalintris

Differential Review: https://reviews.llvm.org/D21615

llvm-svn: 275786

d32a2d30

AMDGPU: Disable AMDGPUPromoteAlloca pass for shader calling conventions. · bef1ceb8

Nicolai Haehnle authored Jul 18, 2016

Summary:
The work item intrinsics are not available for the shader
calling conventions. And even if we did hook them up most
shader stages haves some extra restrictions on the amount
of available LDS.

Reviewers: tstellarAMD, arsenm

Subscribers: nhaehnle, arsenm, llvm-commits, kzhuravl

Differential Revision: https://reviews.llvm.org/D20728

llvm-svn: 275779

bef1ceb8

[ARM] Update test to use CHECK-LABEL. NFCI. · 6731f134
Diana Picus authored Jul 18, 2016
```
llvm-svn: 275777
```
6731f134

[ARM] Skip inline asm memory operands in DAGToDAGISel · 73ed44d3

Diana Picus authored Jul 18, 2016

The current logic for handling inline asm operands in DAGToDAGISel interprets
the operands by looking for constants, which should represent the flags
describing the kind of operand we're dealing with (immediate, memory, register
def etc). The operands representing actual data are skipped only if they are
non-const, with the exception of immediate operands which are skipped explicitly
when a flag describing an immediate is found.

The oversight is that memory operands may be const too (e.g. for device drivers
reading a fixed address), so we should explicitly skip the operand following a
flag describing a memory operand. If we don't, we risk interpreting that
constant as a flag, which is definitely not intended.

Fixes PR26038

Differential Revision: https://reviews.llvm.org/D22103

llvm-svn: 275776

73ed44d3

[AVX512] Add EVEX versions of scalar ADD/SUB/MUL/DIV to load folding tables. · a3c55f59
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275775
```
a3c55f59
[X86] Fix test checks to include leading 'v' on avx mnemonic names. · 83613bb4
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275774
```
83613bb4

[ARM] Honour ABI for rem under -O0 for EABI, GNUEABI, Android and Musl · 774d157a

Diana Picus authored Jul 18, 2016

At higher optimization levels, we generate the libcall for DIVREM_Ix, which is
fine: aeabi_{u|i}divmod. At -O0 we generate the one for REM_Ix, which is the
default {u}mod{q|h|s|d}i3.

This commit makes sure that we don't generate REM_Ix calls for ABIs that
don't support them (i.e. where we need to use DIVREM_Ix instead). This is
achieved by bailing out of FastISel, which can't handle non-double multi-reg
returns, and letting the legalization infrastructure expand the REM_Ix calls.

It also updates the divmod-eabi.ll test to run under -O0 as well, and adds some
Windows checks to it to make sure we don't break things for it.

Fixes PR27068

Differential Revision: https://reviews.llvm.org/D21926

llvm-svn: 275773

774d157a

[AVX512] Add KADD/KAND/KOR/KXOR to X86InstrInfo::isAssociativeAndCommutative. · 16a07449
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275771
```
16a07449
[X86] Add VPMULLW/D/Q instructions to X86InstrInfo::isAssociativeAndCommutative. · 463f949a
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275770
```
463f949a
[X86] Add VPADD instructions to X86InstrInfo::isAssociativeAndCommutative. · 1af6cc00
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275769
```
1af6cc00
[X86] Add floating point packed logical ops to X86InstrInfo::isAssociativeAndCommutative. · ba9b93d7
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275768
```
ba9b93d7
[X86] Add AVX512 instructions to X86InstrInfo::isAssociativeAndCommutative. · 3a99de40
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275767
```
3a99de40

[X86] Add more AVX512 instructions to X86InstrInfo::isHighLatencyDef. Also add... · fe5a6dc5

Craig Topper authored Jul 18, 2016

[X86] Add more AVX512 instructions to X86InstrInfo::isHighLatencyDef. Also add all packed fp division instructions.

llvm-svn: 275766

fe5a6dc5

[X86] Add AVX512 load opcodes and a couple AVX load opcodes to... · f7a06c29

Craig Topper authored Jul 18, 2016

[X86] Add AVX512 load opcodes and a couple AVX load opcodes to X86InstrInfo::areLoadsFromSameBasePtr.

llvm-svn: 275765

f7a06c29

[X86] Add more opcodes to isFrameLoadOpcode/isFrameStoreOpcode. Mainly AVX-512 related. · 650a15e2
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275764
```
650a15e2
[AVX512] Use VMOVAPSZ128rr/VMOVAPS256rr for VR128X/VR256X physreg moves when VLX is supported. · 5c913e84
Craig Topper authored Jul 18, 2016
```
Ideally we would use VEX encoded moves instead of EVEX if the high 16 registers aren't referenced, but this a good first step.

llvm-svn: 275763
```
5c913e84
[X86] Fix 80-column violations. NFC · 53f3d1b4
Craig Topper authored Jul 18, 2016
```
llvm-svn: 275762
```
53f3d1b4

[GVNHoist] Change the key for VNtoInsns to a pair · 04c7c225

David Majnemer authored Jul 18, 2016

While debugging GVNHoist, I found it confusing that the entries in a
VNtoInsns were not always value numbers.  They _usually_ were except for
StoreInst in which case they were a hash of two different value numbers.

This leads to two observations:
- It is more difficult to debug things when the semantic contents of
  VNtoInsns changes over time.
- Using a single value number is not much cheaper, the value of
  VNtoInsns is a SmallVector.
- It is not immediately clear what the algorithm would do if there were
  hash collisions in the StoreInst case.

Using a DenseMap of std::pair sidesteps all of this.

N.B.  The changes in the test were due their sensitivity to the
iteration order of VNtoInsns which has changed.

llvm-svn: 275761

04c7c225