- Jan 06, 2017
-
-
Michal Gorny authored
Remove config.test_examples from lit.site.cfg and the relevant ENABLE_EXAMPLES definition from CMake. It is not used anywhere. Differential Revision: https://reviews.llvm.org/D28283 llvm-svn: 291283
-
David Majnemer authored
We know that urem %V, C can be optimized away to %V if %V is ult C. llvm-svn: 291282
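A minimal IR sketch of the fold (hypothetical function name; the `and` is just one way to establish the ult fact):

```llvm
define i32 @urem_fold(i32 %x) {
  %v = and i32 %x, 3      ; %v is now known to be ult 4
  %r = urem i32 %v, 4     ; since %v < 4, %v mod 4 == %v
  ret i32 %r              ; so this can simplify to: ret i32 %v
}
```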
-
Mehdi Amini authored
This is fixing a bug where Loop Vectorization is widening a load but with a lower alignment. Hoisting the load without propagating the alignment will allow inst-combine to later deduce a higher alignment than what the pointer actually has. Differential Revision: https://reviews.llvm.org/D28408 llvm-svn: 291281
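A rough sketch of the hazard (assumed shapes, not the exact IR from the bug): in LLVM IR, a load without an explicit `align` implicitly claims the type's ABI alignment, so dropping the annotation while widening over-promises alignment:

```llvm
define <4 x float> @widened(<4 x float>* %pv) {
  ; The scalar loads being widened were only 4-byte aligned, so the
  ; widened load must keep "align 4". Omitting the annotation would
  ; implicitly claim the ABI alignment of <4 x float> (16 bytes on x86),
  ; which the pointer does not actually guarantee.
  %v = load <4 x float>, <4 x float>* %pv, align 4
  ret <4 x float> %v
}
```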
-
Jan Vesely authored
This will make transition to SCRATCH_MEMORY easier Differential Revision: https://reviews.llvm.org/D24746 llvm-svn: 291279
-
Simon Pilgrim authored
Made no sense for them to be different and caused useless diffs in assembly remarks. llvm-svn: 291274
-
Simon Pilgrim authored
llvm-svn: 291269
-
Matthias Braun authored
Re-apply r288561: This time with a fix where the ADDs that are part of a 3-instruction LOH would not invalidate the "LastAdrp" state. This fixes http://llvm.org/PR31361. Previously this pass was using up to 5% of compile time in some cases, which is a bit much for what it is doing. The pass featured a full-blown data-flow analysis which in the default configuration was restricted to a single block. This rewrites the pass under the assumption that we only ever work on a single block, in a single pass that maintains a state machine per general-purpose register to catch LOH patterns. Differential Revision: https://reviews.llvm.org/D27329 This reverts commit 9e6cedb0a4f14364d6511597a9160305e7d34493. llvm-svn: 291266
-
Sanjay Patel authored
llvm-svn: 291265
-
Sanjay Patel authored
As discussed here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html ...we should be able to better optimize this pattern. llvm-svn: 291262
-
Wolfgang Pieb authored
…order to avoid jumpy line tables. Calls are left alone because they may be inlined. Differential Revision: https://reviews.llvm.org/D28390 llvm-svn: 291258
-
Chad Rosier authored
Differential Revision: https://reviews.llvm.org/D28403 llvm-svn: 291254
-
Konstantin Zhuravlyov authored
Differential Revision: https://reviews.llvm.org/D27732 llvm-svn: 291245
-
Simon Pilgrim authored
The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291242
-
Simon Pilgrim authored
The EVEX -> VEX fix means that AVX/AVX512 code is more likely the same now. llvm-svn: 291241
-
Filipe Cabecinhas authored
Summary: Previously we only supported constant-masked loads and stores. Reviewers: kcc, RKSimon, pgousseau, gbedwell, vitalybuka Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28370 llvm-svn: 291238
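For illustration, this is the kind of operation that is now covered: a masked store whose mask is a runtime value rather than a constant vector (a sketch using the generic masked-store intrinsic of that era):

```llvm
declare void @llvm.masked.store.v4i32.p0v4i32(<4 x i32>, <4 x i32>*, i32, <4 x i1>)

define void @store_masked(<4 x i32> %v, <4 x i32>* %p, <4 x i1> %mask) {
  ; %mask is not a constant like <i1 1, i1 0, i1 1, i1 0>; it is only
  ; known at run time, so the instrumentation cannot statically tell
  ; which lanes touch memory.
  call void @llvm.masked.store.v4i32.p0v4i32(<4 x i32> %v, <4 x i32>* %p, i32 4, <4 x i1> %mask)
  ret void
}
```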
-
Simon Pilgrim authored
Set the costs on the lowest target that supports the type. llvm-svn: 291229
-
Simon Pilgrim authored
Added a test demonstrating a bug in AVX512 division costs. llvm-svn: 291228
-
Daniel Jasper authored
It is a common convention that our internal test runner depends upon. llvm-svn: 291227
-
Craig Topper authored
llvm-svn: 291214
-
Craig Topper authored
The ones with the bitcast need additional work to fold the mask operation properly. This will be fixed in a future commit. llvm-svn: 291213
-
Peter Collingbourne authored
This change separates how type identifiers are resolved from how intrinsic calls are lowered. All information required to lower an intrinsic call is stored in a new TypeIdLowering data structure. The idea is that this data structure can either be initialized using the module itself during regular LTO, or using the module summary in ThinLTO backends. Differential Revision: https://reviews.llvm.org/D28341 llvm-svn: 291205
-
David Majnemer authored
We used the logBase2 of the high end of the range instead of its ceilLogBase2, producing incorrect results for certain values. For example, this resulted in an i1 AssertZExt when the exclusive portion of the range was 3. llvm-svn: 291196
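As a worked example (assumed shape): with an exclusive upper bound of 3 the possible values are {0, 1, 2}, which need ceil(log2(3)) = 2 bits; floor(log2(3)) = 1 bit is an under-estimate:

```llvm
define i32 @in_range(i32* %p) {
  ; !range gives a half-open interval [0, 3): the value can be 0, 1 or 2.
  ; Asserting that this fits in 1 bit (i1) would be wrong; it needs i2.
  %v = load i32, i32* %p, !range !0
  ret i32 %v
}
!0 = !{i32 0, i32 3}
```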
-
- Jan 05, 2017
-
-
Geoff Berry authored
Summary: Extend AArch64 foldMemoryOperandImpl() to handle folding spills of subreg COPYs with read-undef defs like:
  %vreg0:sub_32<def,read-undef> = COPY %WZR; GPR64:%vreg0
by widening the spilled physical source reg and generating:
  STRXui %XZR <fi#0>
as well as folding fills of similar COPYs like:
  %vreg0:sub_32<def,read-undef> = COPY %vreg1; GPR64:%vreg0, GPR32:%vreg1
by generating:
  %vreg0:sub_32<def,read-undef> = LDRWui <fi#0>
Reviewers: MatzeB, qcolombet Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D27425 llvm-svn: 291180
-
Teresa Johnson authored
Summary: Using the linker-supplied list of "preserved" symbols, we can compute the list of "dead" symbols, i.e. the ones that are not transitively reachable from a "preserved" symbol on the reference graph. Right now we are using this information to mark these functions as non-eligible for import. The impact is twofold:
- Reduction of compile time: we don't import these functions anywhere, nor do we import the functions these symbols are calling.
- The limited number of imports/exports leads to better internalization.
Patch originally by Mehdi Amini. Reviewers: mehdi_amini, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23488 llvm-svn: 291177
-
Joerg Sonnenberger authored
…do not use .cfi_sections. This requires checking if any non-declaration function in the module needs an unwind table. llvm-svn: 291172
-
Michael Kuperstein authored
Promotion is always legal when a store within the loop is guaranteed to execute. However, this is not a necessary condition: for promotion to preserve memory model semantics, it is enough to have a store that dominates every exit block, because if the store dominates every exit block, the fact that an exit block was executed implies the original store was executed as well. Differential Revision: https://reviews.llvm.org/D28147 llvm-svn: 291171
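A minimal sketch of the weaker condition (hypothetical function names): the call below may never return, so the store is not "guaranteed to execute" on every trip through the body, yet it still dominates the only exit, so reaching the exit implies the store ran:

```llvm
declare void @maybe_diverge() nounwind

define void @sum(i32* %p, i32 %n) {
entry:
  br label %loop
loop:
  %i = phi i32 [ 0, %entry ], [ %i.next, %loop ]
  call void @maybe_diverge()      ; may loop forever, so the store below is
                                  ; not guaranteed to execute...
  %v = load i32, i32* %p
  %v.inc = add i32 %v, 1
  store i32 %v.inc, i32* %p       ; ...but it dominates the exit: leaving the
                                  ; loop is only possible after storing.
  %i.next = add i32 %i, 1
  %cmp = icmp slt i32 %i.next, %n
  br i1 %cmp, label %loop, label %exit
exit:
  ret void
}
```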
-
Matthias Braun authored
- Add an assert that checks whether liveins are up to date before they are used.
- Do not print liveins into .mir files anymore in situations where they are out of date anyway.
- The assert in the RegisterScavenger is superseded by the new one in livein_begin().
- Skip parts of the liveness updating logic in IfConversion.cpp when liveness isn't tracked anymore (just enough to avoid hitting the new assert()).
Differential Revision: https://reviews.llvm.org/D27562 llvm-svn: 291169
-
Sanjay Patel authored
llvm-svn: 291151
-
Simon Pilgrim authored
Matches the other MUL/ADD/SUB 256-bit cases on AVX1. llvm-svn: 291149
-
Chad Rosier authored
llvm-svn: 291140
-
Zvi Rackover authored
llvm-svn: 291127
-
Simon Pilgrim authored
Currently only for broadcasts with input and output of the same width. Differential Revision: https://reviews.llvm.org/D27811 llvm-svn: 291122
-
Zvi Rackover authored
Summary: For instructions such as PSLLW/PSLLD/PSLLQ a variable shift amount may be passed in an XMM register. The lower 64 bits of the register are evaluated to determine the shift amount. This patch improves the construction of the vector containing the shift amount. Reviewers: craig.topper, delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28353 llvm-svn: 291120
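As a sketch of the idea (assumed shape, not the exact codegen): since the hardware reads only the low 64 bits of the shift-amount operand, a scalar count only has to land in element 0 of the vector:

```llvm
declare <8 x i16> @llvm.x86.sse2.psll.w(<8 x i16>, <8 x i16>)

define <8 x i16> @shift(<8 x i16> %v, i64 %count) {
  ; Only the low 64 bits of the second operand matter, so inserting the
  ; count into element 0 is enough; the upper lane can stay undef.
  %amt64 = insertelement <2 x i64> undef, i64 %count, i32 0
  %amt   = bitcast <2 x i64> %amt64 to <8 x i16>
  %r = call <8 x i16> @llvm.x86.sse2.psll.w(<8 x i16> %v, <8 x i16> %amt)
  ret <8 x i16> %r
}
```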
-
Chad Rosier authored
llvm-svn: 291117
-
Tony Jiang authored
The instructions fctidu[.], fctiwu[.], ftdiv, and ftsqrt were not implemented. This patch implements them and adds corresponding test cases. llvm-svn: 291116
-
Chad Rosier authored
llvm-svn: 291112
-
Teresa Johnson authored
Summary: This adds a new summary flag NotEligibleToImport that subsumes several existing flags (NoRename, HasInlineAsmMaybeReferencingInternal and IsNotViableToInline). It also subsumes the checking of references on the summary that was being done during the thin link by eligibleForImport() for each candidate. It is much more efficient to do that checking once during the per-module summary build and record it in the summary. Reviewers: mehdi_amini Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28169 llvm-svn: 291108
-
Mohammed Agabaria authored
Currently isLikelyComplexAddressComputation tries to figure out if the given stride seems to be 'complex', i.e. needs some extra cost for address computation handling. This decision is target dependent and may not be the same for all targets. This patch passes the decision of whether the given stride is complex to the target by sending stride information via SCEV to getAddressComputationCost instead of the 'IsComplex' flag. Specifically, on X86 targets we don't see any significant address computation cost for strided accesses in general. Differential Revision: https://reviews.llvm.org/D27518 llvm-svn: 291106
-
Kristof Beyls authored
To make this work, pointers from the MachineBasicBlock to the LLVM-IR-level basic blocks need to be initialized, as the AsmPrinter uses this link to be able to print out labels for the basic blocks that are address-taken. Most of the changes in this commit are about adapting existing tests to include the basic block name that is now printed out in the MIR format, now that the name becomes available as the link to the LLVM-IR basic block is initialized. The relevant test changes for the functionality added in this patch are the added "(address-taken)" strings in test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll. Differential Revision: https://reviews.llvm.org/D28123 llvm-svn: 291105
-
Kristof Beyls authored
This commit does this using a trivial chain of conditional branches. In the future, we probably want to reuse the optimized switch lowering used in SelectionDAG. Differential Revision: https://reviews.llvm.org/D28176 llvm-svn: 291099
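Conceptually (a sketch at the IR level, not the exact MIR the translator emits), a switch becomes a ladder of compare-and-branch:

```llvm
define void @lowered(i32 %x) {
entry:
  ; switch i32 %x, label %def [ i32 0, label %a
  ;                             i32 1, label %b ]
  ; is lowered to a chain of conditional branches:
  %c0 = icmp eq i32 %x, 0
  br i1 %c0, label %a, label %next
next:
  %c1 = icmp eq i32 %x, 1
  br i1 %c1, label %b, label %def
a:
  ret void
b:
  ret void
def:
  ret void
}
```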
-