Commits · b37674dca0cc31a7f7fb1425d4af959bb228cdc9 · Lorenzo Albano / LLVM bpEVL

Jun 10, 2016
- [X86][AVX512] Added avx512 VPSLLDQ/VPSRLDQ instruction comments · 643734c5
  Simon Pilgrim authored Jun 09, 2016
```
llvm-svn: 272319
```
  643734c5
Jun 09, 2016

[LiveRangeEdit] Fix a crash in eliminateDeadDef. · d307909a

Quentin Colombet authored Jun 09, 2016

When we delete a live-range, we check if that live-range is the origin of others
to keep it around for rematerialization. For that we check that the instruction
we are about to remove is the same as the definition of the VNI of the original
live-range.
If this is the case, we just shrink the live-range to an empty one.

Now, when we try to delete one of the children of such live-range (product of
splitting), we do the same check.
However, now the original live-range is empty and there is no way we can
access the VNI to check its definition, and we crash.

When we cannot get the VNI for the original live-range, that means we are not in
the presence of the original definition. Thus, this check does not need to happen
in that case and the crash is sloved!

This bug was introduced in r266162 | wmi | 2016-04-12 20:08:27. It affects every
target that uses the greedy register allocator.
To happen, we need to delete both a the original instruction and its split
products, in that order. This is likely to happen when rematerialization comes
into play.

Trying to produce a more robust test case. Will follow in a coming commit.

This fixes llvm.org/PR27983.

rdar://problem/26651519 

llvm-svn: 272314

d307909a

[X86][AVX512] Dropped avx512 VPSLLDQ/VPSRLDQ intrinsics · f718682e

Simon Pilgrim authored Jun 09, 2016

Auto-upgrade to generic shuffles like sse/avx2 implementations now that we can lower to VPSLLDQ/VPSRLDQ 

llvm-svn: 272308

f718682e

[X86][AVX512] Fixed issue with v16i32 shuffles lowering to VPALIGNR · 47c76e20
Simon Pilgrim authored Jun 09, 2016
```
llvm-svn: 272307
```
47c76e20

BitcodeReader: Use std:::piecewise_construct when upgrading type refs · c3f89973

Duncan P. N. Exon Smith authored Jun 09, 2016

r267296 used std::piecewise_construct without using
std::forward_as_tuple, and r267298 hacked it out (using an emplace_back
followed by a couple of reset() calls) because of a problem on a bot.
I'm finally circling back to call forward_as_tuple as I should have to
begin with (thanks to David Blaikie for pointing out the missing piece).

Note that this code uses emplace_back() instead of
push_back(make_pair()) because the move constructor for TrackingMDRef is
expensive (cheaper than a copy, but still expensive).

llvm-svn: 272306

c3f89973

[X86][AVX512] Added support for lowering 512-bit vector shuffles to bit/byte shifts · 0ab9d302

Simon Pilgrim authored Jun 09, 2016

512-bit VPSLLDQ/VPSRLDQ can only be used for avx512bw targets so lowerVectorShuffleAsShift had to be adjusted to include the subtarget

llvm-svn: 272300

0ab9d302

[NVPTX] Add intrinsics for shfl instructions. · ed2c282d

Justin Lebar authored Jun 09, 2016

Summary:
Currently clang emits these instructions via inline (volatile) asm in
the CUDA headers.  Switching to intrinsics will let the optimizer reason
across calls to these intrinsics.

Reviewers: tra

Subscribers: llvm-commits, jholewinski

Differential Revision: http://reviews.llvm.org/D21160

llvm-svn: 272298

ed2c282d

[PM] Port LCSSA to the new PM. · e12c487b
Easwaran Raman authored Jun 09, 2016
```
Differential Revision: http://reviews.llvm.org/D21090

llvm-svn: 272294
```
e12c487b

AMDGPU/SI: Fix 32-bit fdiv lowering · ed0f97fa

Wei Ding authored Jun 09, 2016

We were using the fast fdiv lowering for all division, implementation of
IEEE754 fdiv is added.

http://reviews.llvm.org/D20557

llvm-svn: 272292

ed0f97fa

[LV] Use vector phis for some secondary induction variables · c5edcdeb

Michael Kuperstein authored Jun 09, 2016

Previously, we materialized secondary vector IVs from the primary scalar IV,
by offseting the primary to match the correct start value, and then broadcasting
it - inside the loop body. Instead, we can use a real vector IV, like we do for
the primary.

This enables using vector IVs for secondary integer IVs whose type matches the
type of the primary.

Differential Revision: http://reviews.llvm.org/D20932

llvm-svn: 272283

c5edcdeb

SelectionDAG: Implement expansion of {S,U}MIN/MAX in integer legalization · 2da0cba5

Jan Vesely authored Jun 09, 2016

Fixes {u,}long_{min,max,clamp} opencl piglit regressions on EG.

Reviewers: arsenm
Differential Revision: http://reviews.llvm.org/D17898

llvm-svn: 272272

2da0cba5

Reapply "[MBP] Reduce code size by running tail merging in MBP."" · 5b458cc1

Haicheng Wu authored Jun 09, 2016

This reapplies commit r271930, r271915, r271923.  They hit a bug in
Thumb which is fixed in r272258 now.

The original message:

The code layout that TailMerging (inside BranchFolding) works on is not the
final layout optimized based on the branch probability. Generally, after
BlockPlacement, many new merging opportunities emerge.

This patch calls Tail Merging after MBP and calls MBP again if Tail Merging
merges anything.

llvm-svn: 272267

5b458cc1

[SystemZ] Enable long displacement constraints for inline ASM operands · 79564611

Ulrich Weigand authored Jun 09, 2016

This enables use of the 'S' constraint for inline ASM operands on
SystemZ, which allows for a memory reference with a signed 20-bit
immediate displacement. This patch includes corresponding documentation
and test case updates.

I've changed the 'T' constraint to match the new behavior for 'S', as
'T' also uses a long displacement (though index constraints are still
not implemented). I also changed 'm' to match the behavior for 'S' as
this will allow for a wider range of displacements for 'm', though
correct me if that's not the right decision.

Author: colpell
Differential Revision: http://reviews.llvm.org/D21097

llvm-svn: 272266

79564611

[CodeGen] Change getSDagStackGuard to get an internal sym. · bd4243c5
Davide Italiano authored Jun 09, 2016
```
Fixes a crash in the backend during an LTO build of rtld(1) in
FreeBSD.

llvm-svn: 272262
```
bd4243c5
[mips][microMIPS] Implement BOVC, BNVC, EXT, INS and JALRC instructions · c962c493
Hrvoje Varga authored Jun 09, 2016
```
Differential Revision: http://reviews.llvm.org/D11798

llvm-svn: 272259
```
c962c493

[Thumb] A branch is not part of an IT block · a7dbf987

James Molloy authored Jun 09, 2016

ReplaceTailWithBranchTo assumed that if an instruction is predicated, it must be part of an IT block. This is not correct for conditional branches.

No testcase as this was triggered by the reverted patch r272017 - test coverage will occur when that patch is re-reverted and there is no known way to trigger this in the meantime.

llvm-svn: 272258

a7dbf987

[AVX512] Remove masked_move/blendm intrinsic from back-end. · f635367e

Igor Breger authored Jun 09, 2016

This is complement patch to D21060.

Differential Revision: http://reviews.llvm.org/D21174

llvm-svn: 272257

f635367e

[mips][microMIPS] Add CodeGen support for SEL.*, SELEQZ, SELNEZ, SELEQZ.*,... · cd242c16

Zlatko Buljan authored Jun 09, 2016

[mips][microMIPS] Add CodeGen support for SEL.*, SELEQZ, SELNEZ, SELEQZ.*, SELNEZ.* and CMP.condn.fmt instructions
Differential Revision: http://reviews.llvm.org/D20862

llvm-svn: 272256

cd242c16

[AMDGPU] Disassembler: Support for sdwa instructions · c9bdcb75

Sam Kolton authored Jun 09, 2016

Reviewers: vpykhtin, tstellarAMD

Subscribers: arsenm, kzhuravl

Differential Revision: http://reviews.llvm.org/D21129

llvm-svn: 272255

c9bdcb75

[AVX512] Fix shuffle decode printing for several instructions with write... · 6f7288dc

Craig Topper authored Jun 09, 2016

[AVX512] Fix shuffle decode printing for several instructions with write masks. There are still more bugs here with UNPCK and PALIGN for sure. But these were the easiest ones to fix.

llvm-svn: 272252

6f7288dc

[Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated · feb9f424

James Molloy authored Jun 09, 2016

If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead;

  int i(int a) {
    return a & 0xfffffeec;
  }

Used to produce:
    ldr r1, [CONSTPOOL]
    ands r0, r1
  CONSTPOOL: 0xfffffeec

And now produces:
    movs    r1, #255
    adds    r1, #20  ; Less costly immediate generation
    bics    r0, r1

llvm-svn: 272251

feb9f424

[X86] Bring consistent naming to the SSE/AVX and AVX512 PALIGNR instructions.... · 7a299309

Craig Topper authored Jun 09, 2016

[X86] Bring consistent naming to the SSE/AVX and AVX512 PALIGNR instructions. Then add shuffle decode printing for the EVEX forms which is made easier by having the naming structure more similar to other instructions.

llvm-svn: 272249

7a299309

[X86] Fix bad comment in assert. NFC · 565a5b54
Craig Topper authored Jun 09, 2016
```
llvm-svn: 272248
```
565a5b54
Revert r272194 No need for it if loop Analysis Manager is used · ecde1c7f
Xinliang David Li authored Jun 09, 2016
```
llvm-svn: 272243
```
ecde1c7f

AArch64: support the `.arch` directive in the IAS · 6c19ffc8

Saleem Abdulrasool authored Jun 09, 2016

Add support to the AArch64 IAS for the `.arch` directive.  This allows the
assembly input to use architectural functionality in part of a file.  This is
used in existing code like BoringSSL.

Resolves PR26016!

llvm-svn: 272241

6c19ffc8

[libFuzzer] add one more OOM test, which we currently don't handle very well · f7798526
Kostya Serebryany authored Jun 09, 2016
```
llvm-svn: 272240
```
f7798526

[ThinLTO/gold] Enable summary-based internalization · 7ab1f692

Teresa Johnson authored Jun 09, 2016

Summary: Enable existing summary-based importing support in the gold-plugin.

Reviewers: mehdi_amini

Subscribers: llvm-commits, mehdi_amini

Differential Revision: http://reviews.llvm.org/D21080

llvm-svn: 272239

7ab1f692

Minor clean up in loopHasNoAbnormalExits; NFC · 1eade915
Sanjoy Das authored Jun 09, 2016
```
llvm-svn: 272238
```
1eade915

Be wary of abnormal exits from loop when exploiting UB · c7f69b92

Sanjoy Das authored Jun 09, 2016

We can safely rely on a NoWrap add recurrence causing UB down the road
only if we know the loop does not have a exit expressed in a way that is
opaque to ScalarEvolution (e.g. by a function call that conditionally
calls exit(0)).

I believe with this change PR28012 is fixed.

Note: I had to change some llvm-lit tests in LoopReroll, since it looks
like they were depending on this incorrect behavior.

llvm-svn: 272237

c7f69b92

Factor out a loopHasNoAbnormalExits; NFC · 97cd7d5d
Sanjoy Das authored Jun 09, 2016
```
llvm-svn: 272236
```
97cd7d5d

Search for llvm-symbolizer binary in the same directory as argv[0], before · 2ad6d48b

Richard Smith authored Jun 09, 2016

looking for it along $PATH. This allows installs of LLVM tools outside of
$PATH to find the symbolizer and produce pretty backtraces if they crash.

llvm-svn: 272232

2ad6d48b

[codeview] Skip DIGlobalVariables with no variable · 6d1d2754
Reid Kleckner authored Jun 09, 2016
```
They have probably been discarded during optimization.

llvm-svn: 272231
```
6d1d2754

[pdbdump] Verify part of TPI hash streams. · c41cd6dc

Rui Ueyama authored Jun 09, 2016

TPI hash table contains a parallel array for the type records.
For each type record R, a hash value is calculated by `H(R) % NumBuckets`
where H is a hash function, and the result is stored to a bucket element.
H is TPI1::hashPrec function in microsoft-pdb repository.

Our hash function does not support all type record types yet.
Currently it supports only records for line number.
I'll extend it in a follow up patch.

The aim of verify the hash table is not only detect corrupted files.
It ensures that our understanding of how the hash values are calculated
is correct.

llvm-svn: 272229

c41cd6dc

[cpu-detection] Add missing break statements in outer switches · 080241b7

Alina Sbirlea authored Jun 09, 2016

Summary:
Break on all switch cases for outer and inner switches.
No functionality changed.

Reviewers: llvm-commits, sanjoy

Differential Revision: http://reviews.llvm.org/D21158

llvm-svn: 272228

080241b7

[MIR] Check that generic virtual registers get a size. · 2c646968

Quentin Colombet authored Jun 08, 2016

Without that check it was possible to write test cases where the size
was not specified and we ended up with weird asserts down the road,
because the default value (1) would not make sense.

llvm-svn: 272226

2c646968

Function names should start with lowercase letters. · f05f360d
Rui Ueyama authored Jun 08, 2016
```
llvm-svn: 272225
```
f05f360d

[LoopSimplify] Preserve LCSSA when merging exit blocks. · 8e7e7672

Michael Zolotukhin authored Jun 08, 2016

Summary:
This fixes PR26682. Also add LCSSA as a preserved pass to LoopSimplify,
that looks correct to me and allows to write a test for the issue.

Reviewers: chandlerc, bogner, sanjoy

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D21112

llvm-svn: 272224

8e7e7672

[PDB] Move PDB functions to a separate file. · 170988f2

Rui Ueyama authored Jun 08, 2016

We are going to use the hash functions from TPI streams.

Differential Revision: http://reviews.llvm.org/D21142

llvm-svn: 272223

170988f2

[LoopUnroll] Check that DT is available before trying to verify it. · aa547616
Michael Zolotukhin authored Jun 08, 2016
```
llvm-svn: 272221
```
aa547616

Jun 08, 2016
- [RegBankSelect] Print out the actual mapping of the operands. · 33406457
  Quentin Colombet authored Jun 08, 2016
```
This improves the debuggability of the pass.

llvm-svn: 272210
```
  33406457