Commits · cfd1ce6a526a0d66853260a0f0eb777abdd0119b · Roger Ferrer / llvm-epi

Dec 14, 2016

[AVR] Add the integrated testing tool to the .gitignore · cfd1ce6a
Dylan McKay authored Dec 14, 2016
```
We build it as an LLVM tool.

llvm-svn: 289645
```
cfd1ce6a

[Assembler] Better error messages for .org directive · 268f42f1

Oliver Stannard authored Dec 14, 2016

Currently, the error messages we emit for the .org directive when the
expression is not absolute or is out of range do not include the line
number of the directive, so it can be hard to track down the problem if
a file contains many .org directives.

This patch stores the source location in the MCOrgFragment, so that it
can be used for diagnostics emitted during layout.

Since layout is an iterative process, and the errors are detected during
each iteration, it would have been possible for errors to be reported
multiple times. To prevent this, I've made the assembler bail out after
each iteration if any errors have been reported. This will still allow
multiple unrelated errors to be reported in the common case where they
are all detected in the first round of layout.

Differential Revision: https://reviews.llvm.org/D27411

llvm-svn: 289643

268f42f1

[AVR] Add a function instrumentation pass · 3abd1d3e
Dylan McKay authored Dec 14, 2016
```
This will be used for an on-chip test suite.

llvm-svn: 289641
```
3abd1d3e

[X86][InstCombine] Handle demanded elements for operand of AVX-512 scalar... · aeaa52cc

Craig Topper authored Dec 14, 2016

[X86][InstCombine] Handle demanded elements for operand of AVX-512 scalar floating point to integer conversion intrinsics.

llvm-svn: 289639

aeaa52cc

[PowerPC] Fix logic dealing with nop after calls (and tail-call eligibility) · 065b7565

Hal Finkel authored Dec 14, 2016

This change aims to unify and correct our logic for when we need to allow for
the possibility of the linker adding a TOC restoration instruction after a
call. This comes up in two contexts:

1. When determining tail-call eligibility. If we make a tail call (i.e.
directly branch to a function) then there is no place for the linker to add
a TOC restoration.
2. When determining when we need to add a nop instruction after a call.
Likewise, if there is a possibility that the linker might need to add a
TOC restoration after a call, then we need to put a nop after the call
(the bl instruction).

First problem: We were using similar, but different, logic to decide (1) and
(2). This is just wrong. Both the resideInSameModule function (used when
determining tail-call eligibility) and the isLocalCall function (used when
deciding if the post-call nop is needed) were supposed to be determining the
same underlying fact (i.e. might a TOC restoration be needed after the call).
The same logic should be used in both places.

Second problem: The logic in both places was wrong. We only know that two
functions will share the same TOC when both functions come from the same
section of the same object. Otherwise the linker might cause the functions to
use different TOC base addresses (unless the multi-TOC linker option is
disabled, in which case only shared-library boundaries are relevant). There are
a number of factors that can cause functions to be placed in different sections
or come from different objects (-ffunction-sections, explicitly-specified
section names, COMDAT, weak linkage, etc.). All of these need to be checked.
The existing logic only checked properties of the callee, but the properties of
the caller must also be checked (for example, calling from a function in a
COMDAT section means calling between sections).

There was a conceptual error in the resideInSameModule function in that it
allowed tail calls to functions with weak linkage and protected/hidden
visibility. While protected/hidden visibility does prevent the function
implementation from being replaced at runtime (via interposition), it does not
prevent the linker from using an alternate implementation at link time (i.e.
using some strong definition to replace the provided weak one during linking).
If this happens, then we're still potentially looking at a required TOC
restoration upon return.

Otherwise, in general, the post-call nop is needed wherever ELF interposition
needs to be supported. We don't currently support ELF interposition at the IR
level (see http://lists.llvm.org/pipermail/llvm-dev/2016-November/107625.html
for more information), and I don't think we should try to make it appear to
work in the backend in spite of that fact. This will yield subtle bugs if
interposition is attempted. As a result, regardless of whether we're in PIC
mode, we don't assume that we need to add the nop to support the possibility of
ELF interposition. However, the necessary check is in place (i.e. calling
GV->isInterposable and TM.shouldAssumeDSOLocal) so when we have functions for
which interposition is allowed at the IR level, we'll add the nop as necessary.
In the mean time, we'll generate more tail calls and fewer nops when compiling
position-independent code.

Differential Revision: https://reviews.llvm.org/D27231

llvm-svn: 289638

065b7565

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle masked scalar... · 268b3abe

Craig Topper authored Dec 14, 2016

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle masked scalar add/sub/mul/div/max/min intrinsics better.

Now we can remove these intrinsics if element 0 isn't used. Also fix undef element tracking.

llvm-svn: 289636

268b3abe

[X86][InstCombine] Handle scalar fmadd intrinsics correctly in SimplifyDemandedVectorElts. · dfd268d7
Craig Topper authored Dec 14, 2016
```
Now we pass a modified version of DemandedElts to each operand and we calculate undef elts correctly.

llvm-svn: 289632
```
dfd268d7

[ThinLTO] Add an API to trigger file-based API for returning objects to the linker · 8e13bc45

Mehdi Amini authored Dec 14, 2016

Summary:
The motivation is to support better the -object_path_lto option on
Darwin. The linker needs to write down the generate object files on
disk for later use by lldb or dsymutil (debug info are not present
in the final binary). We're moving this into libLTO so that we can
be smarter when a cache is enabled and hard-link when possible
instead of duplicating the files.

Reviewers: tejohnson, deadalnix, pcc

Subscribers: dexonsmith, llvm-commits

Differential Revision: https://reviews.llvm.org/D27507

llvm-svn: 289631

8e13bc45

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle scalar round... · eb6a20e7

Craig Topper authored Dec 14, 2016

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle scalar round intrinsics more correctly.

Now we only pass bit 0 of the DemandedElts to optimize operand 1 as we recurse since the upper bits are unused. Similarly we clear bit 0 for optimizing operand 0.

Also calculate UndefElts correctly.

Simplify InstCombineCalls for these instrinics to just call SimplifyDemandedVectorElts for the call instrution to reuse this support.

llvm-svn: 289629

eb6a20e7

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle scalar... · a0372dec

Craig Topper authored Dec 14, 2016

[X86][InstCombine] Teach SimplifyDemandedVectorElts to handle scalar min/max/cmp intrinsics more correctly.

Now we only pass bit 0 of the DemandedElts to optimize operand 1 as we recurse since the upper bits are unused.

Also calculate UndefElts correctly.

Simplify InstCombineCalls for these instrinics to just call SimplifyDemandedVectorElts for the call instrution to reuse this support.

llvm-svn: 289628

a0372dec

Don't double-initialize cl::opt for iterating in reverse order to uncover... · 76a00b51

Mehdi Amini authored Dec 14, 2016

Don't double-initialize cl::opt for iterating in reverse order to uncover non-determinism in codegen by default

Bots are broken and needs to be fixed before having this on by default.
The feature was committed in r289619.

I tried to disable it in r289624 and failed because it was initialized in two places.

llvm-svn: 289626

76a00b51

Disable Iterating SmallPtrSet in reverse order to uncover non-determinism in codegen by default · fd1184ef
Mehdi Amini authored Dec 14, 2016
```
Bots are broken and needs to be fixed before having this on by default.
The feature was committed in r289619.

llvm-svn: 289624
```
fd1184ef
[libFuzzer] document one more desired feature of a fuzz target · 8efb35b4
Kostya Serebryany authored Dec 14, 2016
```
llvm-svn: 289622
```
8efb35b4
LTO: Add support for multi-module bitcode files. · 1a0720e8
Peter Collingbourne authored Dec 14, 2016
```
Differential Revision: https://reviews.llvm.org/D27313

llvm-svn: 289621
```
1a0720e8

[DWARF] Preserve column number when emitting 'line 0' record · 8fec3da0

Paul Robinson authored Dec 14, 2016

Follow-up to r289256, address a FIXME to avoid resetting the column
number. This reduced .debug_line by 2.6% in a RelWithDebInfo
self-build of clang.

llvm-svn: 289620

8fec3da0

[llvm] Iterate SmallPtrSet in reverse order to uncover non-determinism in codegen · f6b069c7

Mandeep Singh Grang authored Dec 14, 2016

Summary:
Given a flag (-mllvm -reverse-iterate) this patch will enable iteration of SmallPtrSet in reverse order.
The idea is to compile the same source with and without this flag and expect the code to not change.
If there is a difference in codegen then it would mean that the codegen is sensitive to the iteration order of SmallPtrSet.
This is enabled only with LLVM_ENABLE_ABI_BREAKING_CHECKS.

Reviewers: chandlerc, dexonsmith, mehdi_amini

Subscribers: mgorny, emaste, llvm-commits

Differential Revision: https://reviews.llvm.org/D26718

llvm-svn: 289619

f6b069c7

[ARM] Fix typo in checking prefix · 54eb192b
Evandro Menezes authored Dec 14, 2016
```
llvm-svn: 289617
```
54eb192b
Add support for Samsung Exynos M3 (NFC) · aeec780e
Evandro Menezes authored Dec 13, 2016
```
llvm-svn: 289613
```
aeec780e
Update the header docs to match a recent checkin. · 74c265e5
Greg Clayton authored Dec 13, 2016
```
llvm-svn: 289612
```
74c265e5

Switch functions that returned bool and filled in a DWARFFormValue arg with... · 1cbf3fa9

Greg Clayton authored Dec 13, 2016

Switch functions that returned bool and filled in a DWARFFormValue arg with ones that return Optional<DWARFFormValue>

Differential Revision: https://reviews.llvm.org/D27737

llvm-svn: 289611

1cbf3fa9

llvm-cat: Allow bitcode files to be created with no modules. · 98d40e05
Peter Collingbourne authored Dec 13, 2016
```
llvm-svn: 289610
```
98d40e05
[llvm-config] Fixing one check where shared libs implied dylib · da1c84c0
Chris Bieneman authored Dec 13, 2016
```
We shouldn't print the dylib if LinkDylib is false.

llvm-svn: 289609
```
da1c84c0

llvm-config: Set LinkMode in addition to LinkDyLib when using --ignore-llvm · 7ff587a9

Derek Schuff authored Dec 13, 2016

Summary:
LinkDyLib is only used (before arg processing) to set up the default for
LinkMode. So reset LinkMode as well, and process before --link-shared or
--link-static to allow those flags to continue to override it.

Reviewers: beanz

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D27736

llvm-svn: 289608

7ff587a9

Dec 13, 2016

[libFuzzer] fix an UB (invalid shift) spotted by ubsan. The code worked fine... · f6f82c2c

Kostya Serebryany authored Dec 13, 2016

[libFuzzer] fix an UB (invalid shift) spotted by ubsan. The code worked fine by luck, because the way shifts actually work on clang+x86

llvm-svn: 289607

f6f82c2c

[llvm-config] Add --ignore-libllvm · 7f6611cf

Chris Bieneman authored Dec 13, 2016

This flag forces off linking libLLVM. This should resolve some issues reported on llvm-commits.

llvm-svn: 289605

7f6611cf

[Hexagon] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). · 82085927
Eugene Zelenko authored Dec 13, 2016
```
llvm-svn: 289604
```
82085927
Change CoverageTracker from a global variable to member variable to avoid... · 0f35fa90
Dehao Chen authored Dec 13, 2016
```
Change CoverageTracker from a global variable to member variable to avoid breaking thread-safety. (NFC)

llvm-svn: 289603
```
0f35fa90

Re-land "[SCEVExpander] Use llvm data structures; NFC" · c02dda2a

Sanjoy Das authored Dec 13, 2016

This change re-lands r289215, by reverting r289482.  The underlying
issue that caused it to be reverted has been fixed by Tim Northover in
r289496.

Original commit message for r289215:

[SCEVExpander] Use llvm data structures; NFC

Original commit message for r289482:

Revert "[SCEVExpander] Use llvm data structures; NFC"

This reverts r289215 (git SHA1 cb7b86a1).  It breaks the ubsan build
because a DenseMap that keys off of `AssertingVH<T>` will hit UB when it
tries to cast the empty and tombstone keys to `T *` (due to insufficient
alignment).

This is the relevant stack trace (thanks to Mike Aizatsky):

    #0 0x25cf100 in llvm::AssertingVH<llvm::PHINode>::getValPtr() const llvm/include/llvm/IR/ValueHandle.h:212:39
    #1 0x25cea20 in llvm::AssertingVH<llvm::PHINode>::operator=(llvm::AssertingVH<llvm::PHINode> const&) llvm/include/llvm/IR/ValueHandle.h:234:19
    #2 0x25d0092 in llvm::DenseMapBase<llvm::DenseMap<llvm::AssertingVH<llvm::PHINode>, llvm::detail::DenseSetEmpty, llvm::DenseMapInfo<llvm::AssertingVH<llvm::PHINode> >, llvm::detail::DenseSetPair<llvm::AssertingVH<llvm::PHINode> > >, llvm::AssertingVH<llvm::PHINode>, llvm::detail::DenseSetEmpty, llvm::DenseMapInfo<llvm::AssertingVH<llvm::PHINode> >, llvm::detail::DenseSetPair<llvm::AssertingVH<llvm::PHINode> > >::clear() llvm/include/llvm/ADT/DenseMap.h:113:23

llvm-svn: 289602

c02dda2a

[IRCE] Avoid loop optimizations on pre and post loops · 65ca8e91

Anna Thomas authored Dec 13, 2016

Summary:
This patch will add loop metadata on the pre and post loops generated by IRCE.
Currently, we have metadata for disabling optimizations such as vectorization,
unrolling, loop distribution and LICM versioning (and confirmed that these
optimizations check for the metadata before proceeding with the transformation).

The pre and post loops generated by IRCE need not go through loop opts (since
these are slow paths).

Added two test cases as well.

Reviewers: sanjoy, reames

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D26806

llvm-svn: 289588

65ca8e91

[LV] Don't vectorize when we have a small static bound on trip count · 3d23d4a2

Michael Kuperstein authored Dec 13, 2016

We currently check if the exact trip count is known and is smaller than the
"tiny loop" bound. We should be checking the maximum bound on the trip count
instead.

Differential Revision: https://reviews.llvm.org/D27690

llvm-svn: 289583

3d23d4a2

ADT: Use delete[] to delete the array owned by OwningArrayRef, as we created it with new[]. · b56a1034
Peter Collingbourne authored Dec 13, 2016
```
llvm-svn: 289582
```
b56a1034

ADT: Add OwningArrayRef class. · d9af2996

Peter Collingbourne authored Dec 13, 2016

This is a MutableArrayRef that owns its array.
I plan to use this in D22296.

Differential Revision: https://reviews.llvm.org/D27723

llvm-svn: 289579

d9af2996

Object: Make IRObjectFile own multiple modules and enumerate symbols from all modules. · 45102a24
Peter Collingbourne authored Dec 13, 2016
```
This implements multi-module support in IRObjectFile.

Differential Revision: https://reviews.llvm.org/D26951

llvm-svn: 289578
```
45102a24
Object: Remove module accessors from IRObjectFile, and hide its constructor. · c5fecb4f
Peter Collingbourne authored Dec 13, 2016
```
Differential Revision: https://reviews.llvm.org/D27079

llvm-svn: 289577
```
c5fecb4f
LTO: Port the legacy LTO API to ModuleSymbolTable. · 77f4c30d
Peter Collingbourne authored Dec 13, 2016
```
Differential Revision: https://reviews.llvm.org/D27078

llvm-svn: 289576
```
77f4c30d
LTO: Port the new LTO API to ModuleSymbolTable. · ad90369a
Peter Collingbourne authored Dec 13, 2016
```
Differential Revision: https://reviews.llvm.org/D27077

llvm-svn: 289574
```
ad90369a

Generalize strided store pattern in interleave access pass · 77c5eaae

Alina Sbirlea authored Dec 13, 2016

Summary:
This patch aims to generalize matching of the strided store accesses to more general masks.
The more general rule is to have consecutive accesses based on the stride:
[x, y, ... z, x+1, y+1, ...z+1, x+2, y+2, ...z+2, ...]
All elements in the masks need not form a contiguous space, there may be gaps.
As before, undefs are allowed and filled in with adjacent element loads.

Reviewers: HaoLiu, mssimpso

Subscribers: mkuper, delena, llvm-commits

Differential Revision: https://reviews.llvm.org/D23646

llvm-svn: 289573

77c5eaae

Revert "AArch64CollectLOH: Rewrite as block-local analysis." · fde00fc2

Matthias Braun authored Dec 13, 2016

This is not always behaving as expected as it turns out block live-in
lists are only correct most of the time. Still waiting for reviews on
https://reviews.llvm.org/D27559 to have them correct all of the time.

See also http://llvm.org/PR31361, rdar://25117107

This reverts commit r288567.
This reverts commit r288561.

llvm-svn: 289570

fde00fc2

[bpf] change llvm-objdump to print dec instead of hex · 3b9efca8

Alexei Starovoitov authored Dec 13, 2016



since bpf instruction stream is multiple of 8 change llvm-objdump
to print decimal instruction number instead of hex address, so that
users don't have to do this math manually to match kernel verifier output

Signed-off-by: Alexei Starovoitov <ast@kernel.org>
llvm-svn: 289569

3b9efca8

GlobalISel: fix GOT accesses on AArch64. · fe7c59ad

Tim Northover authored Dec 13, 2016

We were using the correct pseudo-instruction, but because the operand's flags
weren't set correctly we still ended up emitting incorrect relocations during
MC lowering.

llvm-svn: 289566

fe7c59ad