Commits · 74df05471ecca697e0da9c4d788339a6697740fa · Lorenzo Albano / LLVM bpEVL

May 11, 2017

Richard Smith authored May 11, 2017

It's failing due to Hexagon calling convention lowering being broken (empty
structs are not passed even if they have nontrivial destructors / copy ctors).

llvm-svn: 302825

74df0547

[Libcxxabi]: Support using compiler-rt for MinGW64 · 53877bc5
Martell Malone authored May 11, 2017
```
Reviewers: EricWF

Differential Revision: https://reviews.llvm.org/D33098

llvm-svn: 302824
```
53877bc5

De-virtualize GlobalValue · e7c7854c

Reid Kleckner authored May 11, 2017

The erase/remove from parent methods now use a switch table to remove
themselves from their appropriate parent ilist.

The copyAttributesFrom method is now completely non-virtual, since we
only ever copy attributes from a global of the appropriate type.

Pre-requisite to de-virtualizing Value to save a vptr
(https://reviews.llvm.org/D31261).

NFC

llvm-svn: 302823

e7c7854c

[AArch64][MachineCombine] Fold FNMUL+FSUB -> FNMADD. · aeffffdb
Chad Rosier authored May 11, 2017
```
Differential Revision: http://reviews.llvm.org/D33101.

llvm-svn: 302822
```
aeffffdb
[AMDGPU] Placate unused variable warning in release builds. · 0dcc015a
Davide Italiano authored May 11, 2017
```
llvm-svn: 302821
```
0dcc015a

[MSP430] Generate EABI-compliant libcalls · 38e30197

Vadzim Dambrouski authored May 11, 2017

Updates the MSP430 target to generate EABI-compatible libcall names.
As a byproduct, adjusts the hardware multiplier options available in
the MSP430 target, adds support for promotion of the ISD::MUL operation
for 8-bit integers, and correctly marks R11 as used by call instructions.

Patch by Andrew Wygle.

Differential Revision: https://reviews.llvm.org/D32676

llvm-svn: 302820

38e30197

[LiveVariables] Switch Kill/Defs sets to be DenseSet(s). · 36acbc71

Davide Italiano authored May 11, 2017

The testcase in PR32984 shows a non linear compile time increase
after a change that made the LoopUnroll pass more aggressive
(increasing the threshold).

My profiling shows all the time of PHI elimination goes to
llvm::LiveVariables::addNewBlock. This is because we keep
Defs/Kills registers in a SmallSet and vfind(const T &V); is O(N).

Switching to a DenseSet reduces the time spent in the pass from
297 seconds to 97 seconds. Profiling still shows a lot of time is
spent iterating the data structure, so I guess there's room for
improvement.

Dan tells me GCC uses real set operations for live registers and
it takes no-time on this testcase. Matthias points out we might
want to switch all this to LiveIntervalAnalysis so it's not entirely
sure if a rewrite is worth it.

Differential Revision:  https://reviews.llvm.org/D33088

llvm-svn: 302819

36acbc71

Work around different -std= default for PS4 target. · 2cbd1f6c
Richard Smith authored May 11, 2017
```
llvm-svn: 302818
```
2cbd1f6c

PR22877: When constructing an array via a constructor with a default argument · 72236372

Richard Smith authored May 11, 2017

in list-initialization, run cleanups for the default argument after each
iteration of the initialization loop.

We previously only ran the destructor for any temporary once, at the end of the
complete loop, rather than once per iteration!

Re-commit of r302750, reverted in r302776.

llvm-svn: 302817

72236372

[APInt] Remove an APInt copy from the return of APInt::multiplicativeInverse. · dbd6219f
Craig Topper authored May 11, 2017
```
llvm-svn: 302816
```
dbd6219f
[APInt] Fix typo in comment. NFC · 3fbecada
Craig Topper authored May 11, 2017
```
llvm-svn: 302815
```
3fbecada

AMDGPU: Remove tfe bit from flat instruction definitions · 47ccafe7

Matt Arsenault authored May 11, 2017

We don't use it and it was removed in gfx9, and the encoding
bit repurposed.

Additionally actually using it requires changing the output register
class, which wasn't done anyway.

llvm-svn: 302814

47ccafe7

AMDGPU: Pull fneg out of extract_vector_elt · bf5482e4

Matt Arsenault authored May 11, 2017

This allows folding source modifiers in more f16 cases.
Makes it easier to select per-component packed neg modifiers.

llvm-svn: 302813

bf5482e4

[AMDGPU] Fix incorrect register pressure calculation · 33a97ec4

Stanislav Mekhanoshin authored May 11, 2017

Earlier fix D32572 introduced a bug where live-ins were calculated
for basic block instead of scheduling region. This change fixes it.

Differential Revision: https://reviews.llvm.org/D33086

llvm-svn: 302812

33a97ec4

[SLP] Emit optimization remarks · 0aca09fc

Adam Nemet authored May 11, 2017

The approach I followed was to emit the remark after getTreeCost concludes
that SLP is profitable. I initially tried emitting them after the
vectorizeRootInstruction calls in vectorizeChainsInBlock but I vaguely
remember missing a few cases for example in HorizontalReduction::tryToReduce.

ORE is placed in BoUpSLP so that it's available from everywhere (notably
HorizontalReduction::tryToReduce).

We use the first instruction in the root bundle as the locator for the remark.
In order to get a sense how far the tree is spanning I've include the size of
the tree in the remark. This is not perfect of course but it gives you at
least a rough idea about the tree. Then you can follow up with -view-slp-tree
to really see the actual tree.

llvm-svn: 302811

0aca09fc

[PowerPC] Eliminate integer compare instructions - vol. 1 · 96c3d626

Nemanja Ivanovic authored May 11, 2017

This patch is the first in a series of patches to provide code gen for
doing compares in GPRs when the compare result is required in a GPR.

It adds the infrastructure to select GPR sequences for i1->i32 and i1->i64
extensions. This first patch handles equality comparison on i32 operands with
the result sign or zero extended.

Differential Revision: https://reviews.llvm.org/D31847

llvm-svn: 302810

96c3d626

Add a test that local submodule visibility has no effect on debug info · 40b201c3
Adrian Prantl authored May 11, 2017
```
rdar://problem/27876262

llvm-svn: 302809
```
40b201c3
[DAGCombine] Use SelectionDAG::getAnyExtOrTrunc helper. NFCI. · 6faddcbd
Simon Pilgrim authored May 11, 2017
```
llvm-svn: 302808
```
6faddcbd

[asan] Test 'strndup_oob_test.cc' added in r302781 fails on the... · 9ce59db4

Pierre Gousseau authored May 11, 2017

[asan] Test 'strndup_oob_test.cc' added in r302781 fails on the clang-cmake-thumbv7-a15-full-sh bot.
Marking as unsupported on armv7l-unknown-linux-gnueabihf, same as strdup_oob_test.cc

llvm-svn: 302807

9ce59db4

Fix -DLLVM_ENABLE_THREADS=OFF build after r302748 · 905da745
Hans Wennborg authored May 11, 2017
```
llvm-svn: 302806
```
905da745

[Simplify] Remove identical scalar writes. · 07e315e7

Michael Kruse authored May 11, 2017

After DeLICM, it is possible to have two writes of the same value to
the same location in the same statement when it determined that those
writes do not conflict (write the same value).

Teach -polly-simplify to remove one of the writes. It interferes with
the pattern matching of matrix-multiplication kernels and also seem
to not be optimized away by LLVM.

The algorthm is simple, has O(n^2) behaviour (n = max number of
MemoryAccesses in a statement) and only matches the most obvious cases,
but seem to be enough to pattern-match Boost ublas gemm.

Not handled cases include:
- StoreInst instructions (a.k.a. explicit writes), since the value might
  be loaded or overwritten between the two stores.
- PHINode, especially LCSSA, when the PHI value matches with on other's.
- Partial writes (in preparation)

llvm-svn: 302805

07e315e7

[X86][AVX] Added zeroall/zeroupper scheduler tests · e2c055b8
Simon Pilgrim authored May 11, 2017
```
Missing on SandyBridge and Btver2 models

llvm-svn: 302804
```
e2c055b8

Modules: fix modules build. · a4241175

Tim Northover authored May 11, 2017

A recent commit made GlobalVariable.h depend on intrinsics generation, so (I
think) it needs to be in the lower-level module. I'll confirm with others, but
this should fix the bots.

llvm-svn: 302803

a4241175

Mark LWG#2782 as complete. No functionality change; we already do this. Just... · 35f62e32
Marshall Clow authored May 11, 2017
```
Mark LWG#2782 as complete. No functionality change; we already do this. Just added a few more tests.

llvm-svn: 302802
```
35f62e32
Renumber test line number expectations after r302783. · 71ed2e64
Benjamin Kramer authored May 11, 2017
```
Also remove a confused stable-runtimes requirement.

llvm-svn: 302801
```
71ed2e64

Replace a nested namespace used for overload resolution with a struct. Richard... · 7e154cdc

Marshall Clow authored May 11, 2017

Replace a nested namespace used for overload resolution with a struct. Richard Smith says that using the namespace results in an ODR violation, but I disagree. Nevertheless, the struct works just as well.

llvm-svn: 302800

7e154cdc

Mark LWG#2850 as complete. No functionality change; we had tests that covered... · afda4a9a

Marshall Clow authored May 11, 2017

Mark LWG#2850 as complete. No functionality change; we had tests that covered it already. Just added comments to the tests. Thanks to K-ballo for the heads up.

llvm-svn: 302799

afda4a9a

Mark LWG#2796 as complete. No functionality change; we had tests that covered... · 9630f46d

Marshall Clow authored May 11, 2017

Mark LWG#2796 as complete. No functionality change; we had tests that covered it already. Just added comments to the tests

llvm-svn: 302798

9630f46d

[CodeCompletion] Provide member completions for dependent expressions whose · e6afa397

Alex Lorenz authored May 11, 2017

type is a TemplateSpecializationType or InjectedClassNameType

Fixes PR30847. Partially fixes PR20973 (first position only).

PR17614 is still not working, its expression has the dependent
builtin type. We'll have to teach the completion engine how to "resolve"
dependent expressions to fix it.

rdar://29818301

llvm-svn: 302797

e6afa397

[CodeCompletion] NFC, extract a function that generates member · 0fe0d985
Alex Lorenz authored May 11, 2017
```
completion results for records

llvm-svn: 302796
```
0fe0d985
Fix two-stage build on windows using DistributionExample cmake cache · 25f1a6ed
NAKAMURA Takumi authored May 11, 2017
```
Thanks to Matthew Larionov <matthewtff@gmail.com>

llvm-svn: 302795
```
25f1a6ed

[IR] Allow attributes with global variables · f3d7904d

Javed Absar authored May 11, 2017

This patch extends llvm-ir to allow attributes to be set on global variables.
An RFC was sent out earlier by my colleague James Molloy: http://lists.llvm.org/pipermail/cfe-dev/2017-March/053100.html
A key part of that proposal was to extend LLVM-IR to carry attributes on global variables.
This generic feature could be useful for multiple purposes.
In our present context, it would be useful to carry user specified sections for bss/rodata/data.

Reviewed by: Jonathan Roelofs, Reid Kleckner
Differential Revision: https://reviews.llvm.org/D32009

llvm-svn: 302794

f3d7904d

[GlobalISel][X86] Remove hand-written G_FADD/F_SUB selection. · a44fc83d
Igor Breger authored May 11, 2017
```
Now it handle by TableGen.

llvm-svn: 302793
```
a44fc83d

[ELF] - Make text section location explicit in early-assign-symbol.s test. · f2cd0f9d

George Rimar authored May 11, 2017

Testcase itself depends on .text section location, which was orphan earlier.

Suggested by Rafael Espíndola

llvm-svn: 302792

f2cd0f9d

[X86] Moving X86Local namespace from .cpp to .h file to use it in memory folding TableGen backend. · 3c18f190
Ayman Musa authored May 11, 2017
```
Differential Revision: https://reviews.llvm.org/D32797

llvm-svn: 302791
```
3c18f190

[LV] Refactor ILV.vectorize{Loop}() by introducing LVP.executePlan(); NFC · 58b28d54

Ayal Zaks authored May 11, 2017

Introduce LoopVectorizationPlanner.executePlan(), replacing ILV.vectorize() and
refactoring ILV.vectorizeLoop(). Method collectDeadInstructions() is moved from
ILV to LVP. These changes facilitate building VPlans and using them to generate
code, following https://reviews.llvm.org/D28975 and its tentative breakdown.

Method ILV.createEmptyLoop() is renamed ILV.createVectorizedLoopSkeleton() to
improve clarity; it's contents remain intact.

Differential Revision: https://reviews.llvm.org/D32200

llvm-svn: 302790

58b28d54

[asan] Test 'strndup_oob_test.cc' added in r302781 fails on clang-s390x-linux. · 24090e59
Pierre Gousseau authored May 11, 2017
```
Marking it as unsupported for now to hopefully make the bot green.

llvm-svn: 302789
```
24090e59
[msan] add a regression test for PR32842 · 65de5715
Alexander Potapenko authored May 11, 2017
```
Make sure MSan doesn't miss a bug comparing two integers with defined low bits.

llvm-svn: 302788
```
65de5715

[msan] Fix PR32842 · a658ae8f

Alexander Potapenko authored May 11, 2017

It turned out that MSan was incorrectly calculating the shadow for int comparisons: it was done by truncating the result of (Shadow1 OR Shadow2) to i1, effectively rendering all bits except LSB useless.
This approach doesn't work e.g. in the case where the values being compared are even (i.e. have the LSB of the shadow equal to zero).
Instead, if CreateShadowCast() has to cast a bigger int to i1, we replace the truncation with an ICMP to 0.

This patch doesn't affect the code generated for SPEC 2006 binaries, i.e. there's no performance impact.

For the test case reported in PR32842 MSan with the patch generates a slightly more efficient code:

  orq     %rcx, %rax
  jne     .LBB0_6
, instead of:

  orl     %ecx, %eax
  testb   $1, %al
  jne     .LBB0_6

llvm-svn: 302787

a658ae8f

[MSAN] test failed randomly on ARM when XFAILED for MIPS · 29006dc7
Renato Golin authored May 11, 2017
```
llvm-svn: 302786
```
29006dc7