Commits · 9934c54cca1bc94e02f58c3fe1209e72312c115b · Lorenzo Albano / LLVM bpEVL

Nov 09, 2016

[Hexagon] Separate Hexagon subreg indices for different register classes · a540997c

Krzysztof Parzyszek authored Nov 09, 2016

For pairs of 32-bit registers: isub_lo, isub_hi.
For pairs of vector registers: vsub_lo, vsub_hi.

Add generic subreg indices: ps_sub_lo, ps_sub_hi, and a function
  HexagonRegisterInfo::getHexagonSubRegIndex(RegClass, GenericSubreg)
that returns the appropriate subreg index for RegClass.

llvm-svn: 286377

a540997c

[Hexagon] Eliminate Insert4 pseudo-instruction, use combines instead · 601d7eb1
Krzysztof Parzyszek authored Nov 09, 2016
```
llvm-svn: 286368
```
601d7eb1
[SystemZ] A few fixes in scheduler files. · e127fe70
Jonas Paulsson authored Nov 09, 2016
```
Review: U Weigand
llvm-svn: 286362
```
e127fe70
Remove TimeValue usage from Scalar/SROA.cpp. NFC. · c207bec3
Pavel Labath authored Nov 09, 2016
```
llvm-svn: 286361
```
c207bec3

Zero-initialize chrono duration objects · 775bbc37

Pavel Labath authored Nov 09, 2016

The default duration constructor does not zero-initialize the object, so we need to
do that manually.
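
A minimal sketch of the pitfall, using only standard `<chrono>`. The helper names `totalElapsed` and `demoZeroInit` are hypothetical and not taken from the patch; the point is only that an accumulator duration should be value-initialized (or set from `duration::zero()`) before use.

```cpp
#include <cassert>
#include <chrono>
#include <vector>

// Hypothetical helper for illustration; not code from the patch.
inline std::chrono::milliseconds
totalElapsed(const std::vector<std::chrono::milliseconds> &Parts) {
  // `std::chrono::milliseconds Total;` would leave the count indeterminate.
  // Empty braces value-initialize it to zero; duration::zero() also works.
  std::chrono::milliseconds Total{};
  for (const auto &Part : Parts)
    Total += Part;
  return Total;
}

// Usage sketch: the accumulator starts at zero in every case.
inline void demoZeroInit() {
  using std::chrono::milliseconds;
  assert(totalElapsed({}).count() == 0);
  assert(totalElapsed({milliseconds(5), milliseconds(7)}) == milliseconds(12));
  assert(milliseconds::zero().count() == 0);
}
```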

llvm-svn: 286359

775bbc37

[dsymutil] Replace TimeValue with TimePoint · 62d72041

Pavel Labath authored Nov 09, 2016

Summary:
All changes are pretty straightforward. I chose to use TimePoints with
second precision, as that is all that seems to be required here.
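
As a rough illustration of what second precision means here, the sketch below uses only standard `<chrono>`. The alias `SecondsTimePoint` and the helper `toSeconds` are assumptions made for this example and are not copied from the patch.

```cpp
#include <cassert>
#include <chrono>

// Local alias for illustration only: a system_clock time point that counts
// whole seconds.
using SecondsTimePoint =
    std::chrono::time_point<std::chrono::system_clock, std::chrono::seconds>;

// Truncate a finer-grained time point to whole seconds.
inline SecondsTimePoint toSeconds(std::chrono::system_clock::time_point TP) {
  return std::chrono::time_point_cast<std::chrono::seconds>(TP);
}

// Usage sketch: 1500 ms after the epoch truncates to 1 s.
inline void demoSecondPrecision() {
  using namespace std::chrono;
  system_clock::time_point TP(
      duration_cast<system_clock::duration>(milliseconds(1500)));
  assert(toSeconds(TP).time_since_epoch().count() == 1);
}
```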

Reviewers: friss, zturner

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D25908

llvm-svn: 286358

62d72041

[mips] Add non-const getter for the Elf_Mips_Options class. NFC · 96b4b713
Simon Atanasyan authored Nov 09, 2016
```
llvm-svn: 286351
```
96b4b713

[MachineScheduler] Comments fixing. · 28f29487

Jonas Paulsson authored Nov 09, 2016

The name/comment of the third argument to the ScheduleDAGMI constructor
is RemoveKillFlags and not IsPostRA. Only the comments are changed.

Review: A Trick
llvm-svn: 286350

28f29487

[ARM] Loop Strength Reduction crashes when targeting ARM or Thumb. · 0ee3ec2f

Alexandros Lamprineas authored Nov 09, 2016

Scalar Evolution asserts when not all the operands of an Add Recurrence
Expression are loop invariants. Loop Strength Reduction should only
create affine Add Recurrences, so that both the start and the step of
the expression are loop invariants.
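
To make "affine" concrete: an affine add recurrence has a fixed start and a fixed step, so its value at iteration n is start + n * step. The struct below is a toy model written for this note (the name `AffineAddRec` is hypothetical); it is not LLVM's SCEV representation.

```cpp
#include <cassert>
#include <cstdint>

// Illustrative model only; this is not LLVM's SCEVAddRecExpr.
struct AffineAddRec {
  std::int64_t Start; // loop-invariant start value
  std::int64_t Step;  // loop-invariant step added on every iteration

  // Closed form for the value at a given iteration number.
  std::int64_t evaluateAt(std::int64_t Iter) const {
    return Start + Iter * Step;
  }
};

// Usage sketch: the closed form matches a running sum over the loop.
inline void demoAffineAddRec() {
  AffineAddRec Rec{10, 3};
  std::int64_t Running = Rec.Start;
  for (std::int64_t Iter = 0; Iter < 5; ++Iter) {
    assert(Rec.evaluateAt(Iter) == Running);
    Running += Rec.Step;
  }
}
```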

Differential Revision: https://reviews.llvm.org/D26185

llvm-svn: 286347

0ee3ec2f

[AVX-512] Add lowering to cvttpd2udq/cvttps2udq for fptoui v2f64/2f32 to 2i32 · f334ac19

Craig Topper authored Nov 09, 2016

This patch adds support for fptoui to 2i32 from both 2f64 and 2f32, building on Simon's change for the signed version in r284459 and using AVX-512 instructions.

If we don't have VLX support we need to use a 512-bit operation for v2f64->v2i32 and extract the result.

It also recognises that cvttpd2udq zeroes the upper 64 bits of the xmm result.

Differential Revision: https://reviews.llvm.org/D26331

llvm-svn: 286345

f334ac19

[X86] Lower AVX512 and SSE intrinsics for CVTTPD2DQ to X86ISD::CVTTPD2DQ. · 731bf9c5

Craig Topper authored Nov 09, 2016

Summary: This allows the SSE intrinsic to use the EVEX instruction when available. It also fixes EVEX to not use a weird (v4i32 (fp_to_sint v2f64)) node and it merges some isel patterns. This also fixes some cases that weren't combining vzmovl with cvttpd2dq to remove extra moves.

Reviewers: delena, zvi, RKSimon

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D26330

llvm-svn: 286344

731bf9c5

[AVX-512] Add more varied alignments to tests for storing the lower 128-bits... · ef1807fb

Craig Topper authored Nov 09, 2016

[AVX-512] Add more varied alignments to tests for storing the lower 128-bits of a 256 or 512-bit subvector extract.

llvm-svn: 286343

ef1807fb

[AVX-512] Use alignedstore256 in patterns that look for stores of the lower... · 28e3dfc0

Craig Topper authored Nov 09, 2016

[AVX-512] Use alignedstore256 in patterns that look for stores of the lower 256-bits of a 512-bit vector to use a 256-bit aligned store.

Previously we were only checking for 16 byte alignment instead of 32 byte alignment. Fixes PR30947.

llvm-svn: 286342

28e3dfc0

[AVX-512] Add test cases to demonstrate PR30947. We accidentally use 32 byte... · abf50415

Craig Topper authored Nov 09, 2016

[AVX-512] Add test cases to demonstrate PR30947. We accidentally use 32 byte aligned store instructions when the original store was only 16 byte aligned if the store is from the lower bits of a subvector extract.

llvm-svn: 286341

abf50415

[AVX-512] Make VBMI instruction set enabling imply that the BWI instruction set is also enabled. · 5c842be9

Craig Topper authored Nov 09, 2016

Summary:
This is needed to make the v64i8 and v32i16 types legal for the 512-bit VBMI instructions. Fixes PR30912.

Reviewers: delena, zvi

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D26322

llvm-svn: 286339

5c842be9

[XRay][docs] Fix llvm snippets to be well-formed · 0f1ddfa8
Dean Michael Berris authored Nov 09, 2016
```
llvm-svn: 286330
```
0f1ddfa8

Revert "[ThinLTO] Prevent exporting of locals used/defined in module level asm" · b6a11a78

Mehdi Amini authored Nov 09, 2016

This reverts commit r286297.
Introduces a dependency from libAnalysis to libObject, which I missed
during the review.

llvm-svn: 286329

b6a11a78

[doc] Remove explicit CMake version requirement for MSVC · 0695e5b9
Mehdi Amini authored Nov 09, 2016
```
The global minimum one is way past this version.

llvm-svn: 286328
```
0695e5b9

Bitcode: Remove the remnants of the BitcodeDiagnosticInfo class. · 7576cb0f

Peter Collingbourne authored Nov 09, 2016

The BitcodeReader no longer produces BitcodeDiagnosticInfo diagnostics.
The only remaining reference was in the gold plugin; the code there has been
dead since we stopped producing InvalidBitcodeSignature error codes in r225562.
While at it remove the InvalidBitcodeSignature error code.

llvm-svn: 286326

7576cb0f

Enable Loop Sink pass for functions that has profile. · 947dbe12

Dehao Chen authored Nov 09, 2016

Summary: For functions with profile data, we are confident that loop sink will be optimal in sinking code.

Reviewers: davidxl, hfinkel

Subscribers: mehdi_amini, mzolotukhin, llvm-commits

Differential Revision: https://reviews.llvm.org/D26155

llvm-svn: 286325

947dbe12

Bitcode: Change the BitcodeReader to use llvm::Error internally. · 58f7f075
Peter Collingbourne authored Nov 09, 2016
```
Differential Revision: https://reviews.llvm.org/D26430

llvm-svn: 286323
```
58f7f075

[XRay][Docs] Add documentation for XRay in LLVM · f3da16bf

Dean Michael Berris authored Nov 09, 2016

Summary:
This is the initial version of the documentation for how to use XRay as
it stands in LLVM, Clang, and compiler-rt. We leave some room for later
expansion, mentioning what is work in progress and what could be
expected moving forward.

We also give a high level overview of future work that's both ongoing
and planned.

Reviewers: echristo, dblaikie, chandlerc

Subscribers: mehdi_amini, llvm-commits

Differential Revision: https://reviews.llvm.org/D26386

llvm-svn: 286319

f3da16bf

[ValueTracking] recognize obfuscated variants of umin/umax · e1045544

Sanjay Patel authored Nov 09, 2016

The smallest tests that expose this are codegen tests (because SelectionDAGBuilder::visitSelect() uses matchSelectPattern
to create UMAX/UMIN nodes), but it's also possible to see the effects in IR alone with folds of min/max pairs.

If these were written as unsigned compares in IR, InstCombine canonicalizes the unsigned compares to signed compares.
That is, running the optimizer pessimizes the codegen for this case without this patch:

define <4 x i32> @umax_vec(<4 x i32> %x) {
  %cmp = icmp ugt <4 x i32> %x, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
  %sel = select <4 x i1> %cmp, <4 x i32> %x, <4 x i32> <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647>
  ret <4 x i32> %sel
}

$ ./opt umax.ll -S | ./llc -o - -mattr=avx

vpmaxud LCPI0_0(%rip), %xmm0, %xmm0

$ ./opt -instcombine umax.ll -S | ./llc -o - -mattr=avx

vpxor %xmm1, %xmm1, %xmm1
vpcmpgtd  %xmm0, %xmm1, %xmm1
vmovaps LCPI0_0(%rip), %xmm2    ## xmm2 = [2147483647,2147483647,2147483647,2147483647]
vblendvps %xmm1, %xmm0, %xmm2, %xmm0
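
The equivalence behind the two code sequences can be checked in scalar form. This is a sketch with hypothetical function names, not code from the patch: an unsigned compare against 2147483647 is true exactly when the sign bit is set, so the select can be driven by a signed compare against zero. The conversion to `std::int32_t` assumes two's-complement wrapping, which is guaranteed from C++20 and holds on mainstream compilers before that.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Direct form: unsigned max against 0x7fffffff.
inline std::uint32_t umaxDirect(std::uint32_t X) {
  return std::max<std::uint32_t>(X, 0x7fffffffu);
}

// "Obfuscated" form: the same select, driven by a signed compare with zero.
inline std::uint32_t umaxViaSignedCompare(std::uint32_t X) {
  // X >u 0x7fffffff holds exactly when the sign bit of X is set.
  bool SignBitSet = static_cast<std::int32_t>(X) < 0;
  return SignBitSet ? X : 0x7fffffffu;
}

// Usage sketch: both forms agree on boundary values.
inline void demoUmaxForms() {
  const std::uint32_t Samples[] = {0u, 1u, 0x7ffffffeu, 0x7fffffffu,
                                   0x80000000u, 0xffffffffu};
  for (std::uint32_t X : Samples)
    assert(umaxDirect(X) == umaxViaSignedCompare(X));
}
```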

Differential Revision: https://reviews.llvm.org/D26096

llvm-svn: 286318

e1045544

[cmake] Fix handling compiler-rt in LLVM_ENABLE_PROJECTS by turning any "-" into "_" · 03c62656
Mehdi Amini authored Nov 09, 2016
```
llvm-svn: 286317
```
03c62656

Added the ability to dump hex bytes easily into a raw_ostream. · bde0a163

Greg Clayton authored Nov 09, 2016

Unit tests were added to verify this functionality keeps working correctly.

Example output for raw hex bytes:
llvm::ArrayRef<uint8_t> Bytes = ...;
llvm::outs() << format_hex_bytes(Bytes);
554889e5 4881ec70 04000048 8d051002
00004c8d 05fd0100 004c8b0d d0020000

Example output for raw hex bytes with offsets:
llvm::outs() << format_hex_bytes(Bytes, 0x100000d10);
0x0000000100000d10: 554889e5 4881ec70 04000048 8d051002
0x0000000100000d20: 00004c8d 05fd0100 004c8b0d d0020000

Example output for raw hex bytes with ASCII with offsets:
llvm::outs() << format_hex_bytes_with_ascii(Bytes, 0x100000d10);
0x0000000100000d10: 554889e5 4881ec70 04000048 8d051002 |UH.?H.?p...H....|
0x0000000100000d20: 00004c8d 05fd0100 004c8b0d d0020000 |..L..?...L..?...|

The default groups bytes into 4 byte groups, but this can be changed to 1 byte:
llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 16 /*NumPerLine*/, 1 /*ByteGroupSize*/);
0x0000000100000d10: 55 48 89 e5 48 81 ec 70 04 00 00 48 8d 05 10 02
0x0000000100000d20: 00 00 4c 8d 05 fd 01 00 00 4c 8b 0d d0 02 00 00

llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 16 /*NumPerLine*/, 2 /*ByteGroupSize*/);
0x0000000100000d10: 5548 89e5 4881 ec70 0400 0048 8d05 1002
0x0000000100000d20: 0000 4c8d 05fd 0100 004c 8b0d d002 0000

llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 8 /*NumPerLine*/, 1 /*ByteGroupSize*/);
0x0000000100000d10: 55 48 89 e5 48 81 ec 70
0x0000000100000d18: 04 00 00 48 8d 05 10 02
0x0000000100000d20: 00 00 4c 8d 05 fd 01 00
0x0000000100000d28: 00 4c 8b 0d d0 02 00 00
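
The grouping rule visible in the outputs above can be sketched in standalone C++. The function `hexDump` and its parameters are hypothetical names for this illustration; it omits the offset column and is not the implementation behind `format_hex_bytes`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Standalone sketch: NumPerLine bytes per line, with a space inserted
// between groups of ByteGroupSize bytes.
inline std::string hexDump(const std::vector<std::uint8_t> &Bytes,
                           std::size_t NumPerLine = 16,
                           std::size_t ByteGroupSize = 4) {
  assert(NumPerLine > 0 && ByteGroupSize > 0 && "sizes must be nonzero");
  static const char Digits[] = "0123456789abcdef";
  std::string Out;
  for (std::size_t I = 0; I < Bytes.size(); ++I) {
    std::size_t Col = I % NumPerLine;
    if (I != 0 && Col == 0)
      Out += '\n'; // start a new line
    else if (Col != 0 && Col % ByteGroupSize == 0)
      Out += ' '; // separate groups within a line
    Out += Digits[Bytes[I] >> 4];
    Out += Digits[Bytes[I] & 0xf];
  }
  return Out;
}

// Usage sketch with the first eight bytes from the examples above.
inline void demoHexDump() {
  std::vector<std::uint8_t> Bytes = {0x55, 0x48, 0x89, 0xe5,
                                     0x48, 0x81, 0xec, 0x70};
  assert(hexDump(Bytes, 8, 1) == "55 48 89 e5 48 81 ec 70");
  assert(hexDump(Bytes, 16, 4) == "554889e5 4881ec70");
  assert(hexDump(Bytes, 4, 2) == "5548 89e5\n4881 ec70");
}
```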

https://reviews.llvm.org/D26405

llvm-svn: 286316

bde0a163

[InstCombine] fix profitability equation for max-of-nots transform · 4e9d6cd3

Sanjay Patel authored Nov 09, 2016

As the test change shows, we can increase the critical path by adding
a 'not' instruction, so make sure that we're actually removing an
instruction if we do this transform.

This transform could also cause us to miss folds of min/max pairs.
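
For context, the sketch below checks the identity that a max-of-nots rewrite presumably relies on, namely max(~a, ~b) == ~min(a, b). The function names are hypothetical and the code is not taken from InstCombine. Counting operations shows the profitability concern: the left side uses two nots and a max, the right side one min and one not, so the rewrite only pays off if the original nots actually go away.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Two 'not' operations feeding a max.
inline std::int32_t maxOfNots(std::int32_t A, std::int32_t B) {
  return std::max(~A, ~B);
}

// One min followed by a single 'not'. Bitwise not reverses signed ordering,
// so the two forms compute the same value.
inline std::int32_t notOfMin(std::int32_t A, std::int32_t B) {
  return ~std::min(A, B);
}

// Usage sketch: the forms agree on a small grid including the extremes.
inline void demoMaxOfNots() {
  const std::int32_t Samples[] = {INT32_MIN, -7, -1, 0, 1, 42, INT32_MAX};
  for (std::int32_t A : Samples)
    for (std::int32_t B : Samples)
      assert(maxOfNots(A, B) == notOfMin(A, B));
}
```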

llvm-svn: 286315

4e9d6cd3

[InstCombine] reduce indentation; NFC · 99dc5fef
Sanjay Patel authored Nov 08, 2016
```
llvm-svn: 286314
```
99dc5fef

Nov 08, 2016

Fix some size_t / uint32_t ambiguity errors. · 44728f40
Zachary Turner authored Nov 08, 2016
```
llvm-svn: 286305
```
44728f40

[CodeView] Hook up CodeViewRecordIO to type serialization path. · 4efa0a42

Zachary Turner authored Nov 08, 2016

Previously support had been added for using CodeViewRecordIO
to read (deserialize) CodeView type records.  This patch adds
support for writing those same records.  With this patch,
reading and writing of CodeView type records finally uses a single
codepath.

Differential Revision: https://reviews.llvm.org/D26253

llvm-svn: 286304

4efa0a42

Emit the DW_AT_type for a C++ static member definition · 3502f208

Adrian Prantl authored Nov 08, 2016

if it is more specific than the one in its DW_AT_specification.

If a static member is an array, the translation unit containing the
member definition may have a more specific type (including its length)
than TUs only seeing the class declaration. This patch adds a
DW_AT_type to the member's DW_TAG_variable in addition to the
DW_AT_specification in these cases. The member type in the
DW_AT_specification still shows the more generic type (without the
length) to avoid defeating type uniquing.

The DWARF standard discourages “duplicating” a DW_AT_type in a member
variable definition but doesn’t explicitly forbid it.  Having the more
specific type (with the array length) available is what allows the
debugger to print the contents of a static array member variable.

https://reviews.llvm.org/D26368
rdar://problem/28706946

llvm-svn: 286302

3502f208

GlobalISel: make sure debugging variables are appropriately elided in release builds. · e09ae201

David L. Jones authored Nov 08, 2016

Summary:
There are two variables here that break release builds. This change constrains both of them to
debug builds (via DEBUG() or #ifndef NDEBUG).
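
The general pattern looks roughly like the sketch below, which uses only the standard `assert` and `NDEBUG` mechanism. The function `sumPositive` and the counter `NumChecked` are hypothetical and not from the patch; LLVM's `DEBUG()` macro is not reproduced here.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical function for illustration; not code from the patch.
inline int sumPositive(const std::vector<int> &Values) {
#ifndef NDEBUG
  // Only read by the assert below, so it exists only when assertions are
  // enabled; in a release build it would otherwise be an unused variable.
  std::size_t NumChecked = 0;
#endif
  int Sum = 0;
  for (int V : Values) {
    if (V > 0)
      Sum += V;
#ifndef NDEBUG
    ++NumChecked;
#endif
  }
  assert(NumChecked == Values.size() && "every element should be visited");
  return Sum;
}
```

Compiling with and without `-DNDEBUG` should give the same result from `sumPositive`; only the bookkeeping variable disappears.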

Reviewers: bkramer, t.p.northover

Subscribers: mehdi_amini, vkalintiris

Differential Revision: https://reviews.llvm.org/D26421

llvm-svn: 286300

e09ae201

[libFuzzer] minor docs update · b506466a
Kostya Serebryany authored Nov 08, 2016
```
llvm-svn: 286299
```
b506466a

[ThinLTO] Prevent exporting of locals used/defined in module level asm · 6955feeb

Teresa Johnson authored Nov 08, 2016

Summary:
This patch uses the same approach added for inline asm in r285513 to
similarly prevent promotion/renaming of locals used or defined in module
level asm.

All static global values defined in normal IR and used in module level asm
should be included on either the llvm.used or llvm.compiler.used global.
The former were already being flagged as NoRename in the summary, and
I've simply added llvm.compiler.used values to this handling.

Module level asm may also contain defs of values. We need to prevent
export of any refs to local values defined in module level asm (e.g. a
ref in normal IR), since that also requires renaming/promotion of the
local. To do that, the summary index builder looks at all values in the
module level asm string that are not marked Weak or Global, which is
exactly the set of locals that are defined. A summary is created for
each of these local defs and flagged as NoRename.

This required adding handling to the BitcodeWriter to look at GV
declarations to see if they have a summary (rather than skipping them
all).

Finally, added an assert to IRObjectFile::CollectAsmUndefinedRefs to
ensure that an MCAsmParser is available, otherwise the module asm parse
would silently fail. Initialized the asm parser in the opt tool for use
in testing this fix.

Fixes PR30610.

Reviewers: mehdi_amini

Subscribers: johanengelen, krasin, llvm-commits

Differential Revision: https://reviews.llvm.org/D26146

llvm-svn: 286297

6955feeb

[asan] Speed up compilation of large C++ stringmaps (tons of allocas) with ASan · a49dcbb7

Kuba Brecka authored Nov 08, 2016

This addresses PR30746, <https://llvm.org/bugs/show_bug.cgi?id=30746>. The ASan pass iterates over entry-block instructions and checks whether each alloca is in NonInstrumentedStaticAllocaVec, which is apparently slow. This patch instead gathers the instructions to move during visitAllocaInst.

Differential Revision: https://reviews.llvm.org/D26380

llvm-svn: 286296

a49dcbb7

[BasicAA] Teach BasicAA to handle the inaccessiblememonly and... · 9604f349

Andrew Kaylor authored Nov 08, 2016

[BasicAA] Teach BasicAA to handle the inaccessiblememonly and inaccessiblemem_or_argmemonly attributes

Differential Revision: https://reviews.llvm.org/D26382

llvm-svn: 286294

9604f349

AArch64DeadRegisterDefinitionsPass: Fix Changed flag · c53cbbb1
Matthias Braun authored Nov 08, 2016
```
Fix a bug in the calculation of the changed flag introduced in r285488.

llvm-svn: 286293
```
c53cbbb1
Use a default constructor. (NFC) · 72845a5f
Adrian Prantl authored Nov 08, 2016
```
Thanks to David Blaikie for suggesting this.

llvm-svn: 286292
```
72845a5f

[TBAA] Drop support for "old style" scalar TBAA tags · 2582e690

Sanjoy Das authored Nov 08, 2016

Summary:
We've had support for auto upgrading old style scalar TBAA access
metadata tags into the "new" struct path aware TBAA metadata for 3 years
now.  The only way to actually generate old style TBAA was explicitly
through the IRBuilder API.  I think this is a good time for dropping
support for old style scalar TBAA.

I'm not removing support for textual or bitcode upgrade -- if you have
IR with the old style scalar TBAA tags that go through the AsmParser or
the bitcode parser before LLVM sees them, they will keep working as
usual.

Note:

  %val = load i32, i32* %ptr, !tbaa !N
  !N = < scalar tbaa node >

is equivalent to

  %val = load i32, i32* %ptr, !tbaa !M
  !N = < scalar tbaa node >
  !M = !{!N, !N, 0}

Reviewers: manmanren, chandlerc, sunfish

Subscribers: mcrosier, llvm-commits, mgorny

Differential Revision: https://reviews.llvm.org/D26229

llvm-svn: 286291

2582e690

GlobalISel: allow CodeGen to fallback on VReg type/class issues. · 6cddfc14

Tim Northover authored Nov 08, 2016

After instruction selection we perform some checks on each VReg just before
discarding the type information. These checks were assertions before, but that
breaks the fallback path so this patch moves the logic into the main flow and
reports a better error on failure.

llvm-svn: 286289

6cddfc14

[SystemZ] Add missing FP extension instructions · 05effca2

Ulrich Weigand authored Nov 08, 2016

This completes assembler / disassembler support for all BFP
instructions provided by the floating-point extensions facility.
The instructions added here are not currently used for codegen.

llvm-svn: 286285

05effca2