Commits · 48977c3364d978b234734e6125adf39e55e9603d · Lorenzo Albano / LLVM bpEVL

Aug 04, 2015

[InstCombine] Moved SSE vector shift constant folding into its own helper function. NFCI. · dcfd7a3f
Simon Pilgrim authored Aug 04, 2015
```
This will make some upcoming bugfixes + improvements easier to manage.

llvm-svn: 243962
```

Linker: Fix references to uniqued nodes after r243883 · 706f37e8

Duncan P. N. Exon Smith authored Aug 04, 2015

r243883 started moving 'distinct' nodes instead of duplicating them in
lib/Linker.  This had the side-effect of sometimes not cloning uniqued
nodes that reference them.  I missed a corner case:

    !named = !{!0}
    !0 = !{!1}
    !1 = distinct !{!0}

!0 is the entry point for "remapping", and a temporary clone (say,
!0-temp) is created and mapped in case we need to model a uniquing
cycle.

    Recursive descent into !1.  !1 is distinct, so we leave it alone,
    but update its operand to !0-temp.

Pop back out to !0.  Its only operand, !1, hasn't changed, so we don't
need to use !0-temp.  !0-temp goes out of scope, and we're finished
remapping, but we're left with:

    !named = !{!0}
    !0 = !{!1}
    !1 = distinct !{null} ; uh oh...

Previously, if !0 and !0-temp ended up with identical operands, then
!0-temp couldn't have been referenced at all.  Now that distinct nodes
don't get duplicated, that assumption is invalid.  We need to
!0-temp->replaceAllUsesWith(!0) before freeing !0-temp.
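
As a rough illustration of the fix described above, here is a minimal sketch using a toy node type. `Node`, `replaceAllUsesWith` and `remapKeepsCycleIntact` are hypothetical stand-ins written for this example and are not the real `MDNode` or ValueMapper API.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for a metadata node; not the real llvm::MDNode.
struct Node {
  std::vector<Node *> operands;
};

// Hypothetical helper mimicking the effect of replaceAllUsesWith: every
// operand in `graph` that points at `from` is redirected to `to`.
inline void replaceAllUsesWith(const std::vector<Node *> &graph, Node *from,
                               Node *to) {
  for (Node *n : graph)
    for (Node *&op : n->operands)
      if (op == from)
        op = to;
}

// Reproduce the corner case: !0 = !{!1}, !1 = distinct !{!0}.
// Returns true if !1 points at !0 again once the temporary is discarded.
inline bool remapKeepsCycleIntact() {
  Node n0, n1, temp;
  n0.operands = {&n1};
  n1.operands = {&n0};
  std::vector<Node *> graph = {&n0, &n1};

  // Descent into !1 rewrites its operand to the temporary clone of !0.
  n1.operands[0] = &temp;

  // !0's own operands did not change, so the temporary is not needed.
  // Forward its users back to !0 before it goes away.
  replaceAllUsesWith(graph, &temp, &n0);

  return n1.operands[0] == &n0;
}
```

Without the `replaceAllUsesWith` call, `n1` would keep pointing at the discarded temporary, which corresponds to the `distinct !{null}` result shown above.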

I found this while running an internal `-flto -g` bootstrap.  Strangely,
there was no case of this in the open source bootstrap I'd done before
commit...

llvm-svn: 243961


Remove the configure and cmake checks for sys/wait.h · c7e3f3ab

Justin Bogner authored Aug 04, 2015

If we don't have sys/wait.h and we're on a unix system, several of the
llvm tools cannot work at all. This includes clang.

Just remove the configure and cmake checks entirely - we'll get a
build error instead of building something broken now.

llvm-svn: 243957


[SDAG] Fix a result chain in ExpandUnalignedLoad · caf1149b

Hal Finkel authored Aug 04, 2015

On the code path in ExpandUnalignedLoad which expands an unaligned vector/fp
value in terms of a legal integer load of the same size, the ChainResult needs
to be the chain result of the integer load.

No in-tree test case is currently available.

Patch by Jan Hranac!

llvm-svn: 243956


[LAA] Remove unused pointer partition argument from addRuntimeCheck, NFC · 87011187

Adam Nemet authored Aug 04, 2015

This variant of addRuntimeCheck is only used now from the LoopVectorizer
which does not use this parameter.

llvm-svn: 243955


Introduce enum value for previously defined metadata -- make.implicit · 00038784

Chen Li authored Aug 04, 2015

Summary: This patch adds an enum value for an existing metadata type, make.implicit. Using a preassigned enum value gives compile-time type checking and avoids string construction and comparison. The patch also changes uses of make.implicit from string metadata to enum metadata. There is no functionality change.

Reviewers: reames

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D11698

llvm-svn: 243954


ARM: support windows division routines · 0a2672bb

Saleem Abdulrasool authored Aug 04, 2015

This adds the software division routines for the Windows RTABI.  These are not
expected to be used often, though, as most modern Windows ARM-capable targets
support hardware division.  If the target CPU doesn't support
hardware division, this will be the fallback.

llvm-svn: 243952


ARM: make Darwin libcall registration table driven (NFC) · 67697a7e

Saleem Abdulrasool authored Aug 04, 2015

Make the libcall updating table driven, similar to the approach that the Linux
and Windows codepaths use below.  NFC.

llvm-svn: 243951


[UB] Don't allocate space for contained types and then try to copy the · 77711979

Chandler Carruth authored Aug 04, 2015

contained types into the space when we have no contained types. This
fixes the UB stemming from a call to memcpy with a null pointer. This
also reduces the calls to allocate because this actually happens in
a notable client - Clang.

Found by UBSan.
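
A minimal sketch of the pattern, assuming a plain array of `int` rather than the real contained-types storage. `copyElements` is a hypothetical helper written for this example. Passing a null pointer to `memcpy` is undefined behaviour even when the size is zero, so the empty case returns early.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Copy `count` ints from `src` to `dst`, skipping memcpy entirely when
// there is nothing to copy.
inline void copyElements(int *dst, const int *src, std::size_t count) {
  if (count == 0)
    return; // src and dst may legitimately be null here
  assert(dst && src && "non-empty copy needs valid pointers");
  std::memcpy(dst, src, count * sizeof(int));
}
```

The same early exit is also a natural place to skip the allocation when there is nothing to store.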

llvm-svn: 243944


Revert "[LSR] Generate and use zero extends" · 215df9ed
Sanjoy Das authored Aug 04, 2015
```
This reverts commit r243348 and r243357.  They caused PR24347.

llvm-svn: 243939
```

[AArch64] Rename FP formats to be more consistent. NFC. · 81fda188

Ahmed Bougacha authored Aug 04, 2015

Some are named "FP", others "SD", others still "FP*SD".
Rename all this to just use "FP", which, except for conversions
(which don't use this format naming scheme), implies "SD" anyway.

llvm-svn: 243936


[AArch64] Add isel support for f16 indexed LD/ST. · e0e12db8
Ahmed Bougacha authored Aug 04, 2015
```
llvm-svn: 243935
```
[UB] Fix yet another use of memcpy with a null pointer argument. I think · 1c156f73
Chandler Carruth authored Aug 04, 2015
```
this is the last of them in my build of LLVM. Haven't tried Clang yet.

Found via UBSan.

llvm-svn: 243934
```

[AArch64][v8.1a] The "pan" sysreg isn't MSR-specific. NFCI. · e8ea9ac3

Ahmed Bougacha authored Aug 04, 2015

It's already in SysRegMappings, no need to also have it in MSRMappings:
the latter is only used if we didn't find a match in the former.

llvm-svn: 243933


[AArch64] Remove unnecessary "break". NFC. · 0cbe2efc
Ahmed Bougacha authored Aug 04, 2015
```
llvm-svn: 243931
```
[AArch64] Use SDValue bool operator. NFC. · 239d635d
Ahmed Bougacha authored Aug 04, 2015
```
llvm-svn: 243930
```

[AArch64] Vector FCOPYSIGN supports Custom-lowering: mark it as such. · b0ae36f0

Ahmed Bougacha authored Aug 04, 2015

There's a bunch of code in LowerFCOPYSIGN that does smart lowering, and
is actually already vector-aware; let's use it instead of scalarizing!

The only interesting change is that for v2f32, we previously always used
v4i32 as the integer vector type.
Use v2i32 instead, and mark FCOPYSIGN as Custom.

llvm-svn: 243926


[CodeGen] Fix FCOPYSIGN legalization to account for mismatched types. · f65371a2

Ahmed Bougacha authored Aug 04, 2015

We used to legalize it as if it were any other binary operation.  It's not,
because it accepts mismatched operand types.  Because of that, we used
to hit various asserts and miscompiles.

Specialize vector legalizations to, in the worst case, unroll, or, when
possible, to just legalize the operand that needs legalization.
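
To make the "mismatched operand types" point concrete, here is a small sketch of the worst-case unroll: an element-wise copysign where the magnitude vector is `float` and the sign vector is `double`. `copySignUnrolled` is a hypothetical name used only for this illustration. It models the semantics in plain C++, not the SelectionDAG legalizer.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>

// Element-wise copysign with different element types for the two operands,
// written as an explicit per-element loop (the "unroll" fallback).
template <std::size_t N>
std::array<float, N> copySignUnrolled(const std::array<float, N> &mag,
                                      const std::array<double, N> &sign) {
  std::array<float, N> out{};
  for (std::size_t i = 0; i < N; ++i) {
    float m = std::fabs(mag[i]);
    // Only the sign bit of the second operand matters, so its type does
    // not have to match the type of the result.
    out[i] = std::signbit(sign[i]) ? -m : m;
  }
  return out;
}
```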

Scalarization isn't covered, because I can't think of a target where
some but not all of the 1-element vector types are to be scalarized.

llvm-svn: 243924


MIR Serialization: Serialize the 'volatile' machine memory operand flag. · a518b796
Alex Lorenz authored Aug 04, 2015
```
llvm-svn: 243923
```
[LAA] Remove unused needsAnyChecking(), NFC · 53e30aec
Adam Nemet authored Aug 03, 2015
```
llvm-svn: 243921
```

[LoopVer] Remove unused needsRuntimeChecks(), NFC · 6b6082dc

Adam Nemet authored Aug 03, 2015

The previous commits moved this functionality into the client.

Also remove the now unused member variable.

llvm-svn: 243920


MIR Serialization: Initial serialization of the machine memory operands. · 4af7e610
Alex Lorenz authored Aug 03, 2015
```
Reviewers: Duncan P. N. Exon Smith
llvm-svn: 243915
```

-Wdeprecated-clean: Fix cases of violating the rule of 5 in ways that are deprecated in C++11 · 774b584f

David Blaikie authored Aug 03, 2015

Various value handles needed to be copy constructible and copy
assignable (mostly for their use in DenseMap). But to avoid an API that
might allow accidental slicing, make these members protected in the base
class and make derived classes final (the special members become
implicitly public there - but disallowing further derived classes that
might be sliced to the intermediate type).
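
A minimal sketch of the protected-copy-plus-`final` pattern, assuming a toy hierarchy. `HandleBase` and `TrackingHandle` are hypothetical names and do not correspond to the actual value handle classes.

```cpp
#include <cassert>
#include <type_traits>

class HandleBase {
public:
  explicit HandleBase(int v) : value(v) {}
  virtual ~HandleBase() = default;
  int get() const { return value; }

protected:
  // Protected copy operations: outside code cannot copy a HandleBase
  // directly, so a derived object cannot be sliced by accident.
  HandleBase(const HandleBase &) = default;
  HandleBase &operator=(const HandleBase &) = default;

private:
  int value;
};

// `final` rules out further derived classes that could be sliced to this
// type. The implicit copy operations here are public and call the
// protected ones in the base.
class TrackingHandle final : public HandleBase {
public:
  using HandleBase::HandleBase;
};

static_assert(!std::is_copy_constructible<HandleBase>::value,
              "the base is not publicly copyable");
static_assert(std::is_copy_constructible<TrackingHandle>::value,
              "the final derived class is copyable");
```

With this shape, `TrackingHandle` can be stored in containers that copy their elements, while `HandleBase b = someTrackingHandle;` fails to compile.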

Might be worth having a warning a bit like -Wnon-virtual-dtor that
catches public move/copy assign/ctors in classes with virtual functions.
(suppressible in the same way: by making them protected in the base
and making the derived classes final).  Could be fancier and only diagnose
them when they're actually called, potentially.

Also allow a few default implementations where custom implementations
(especially with non-standard return types) were implemented.

llvm-svn: 243909


ARM: remove horrible printf left over from debugging · 9c340ec6
Tim Northover authored Aug 03, 2015
```
llvm-svn: 243907
```

Aug 03, 2015

Fix with a bit more care. (but only a bit) · 871b4113
David Blaikie authored Aug 03, 2015
```
llvm-svn: 243903
```

[Unroll] Improve the brute force loop unroll estimate by propagating · 87adb7a2

Chandler Carruth authored Aug 03, 2015

through PHI nodes across iterations.

This patch teaches the new advanced loop unrolling heuristics to propagate
constants into the loop from the preheader and around the backedge after
simulating each iteration. This lets us brute force solve simple recurrences
that aren't modeled effectively by SCEV. It also makes it more clear why we
need to process the loop in-order rather than bottom-up which might otherwise
make much more sense (for example, for DCE).

This came out of an attempt I'm making to develop a principled way to account
for dead code in the unroll estimation. When I implemented
a forward-propagating version of that it produced incorrect results due to
failing to propagate *cost* between loop iterations through the PHI nodes, and
it occurred to me we really should at least propagate simplifications across
those edges, and it is quite easy thanks to the loop being in canonical and
LCSSA form.
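
The idea of carrying a constant from the preheader into the PHI and then around the backedge can be sketched with a toy simulation. `simulatePhi` and its parameters are hypothetical names for this example, and the sketch models a single PHI node only, not the actual unroll analysis.

```cpp
#include <cassert>
#include <functional>
#include <vector>

// Simulate `tripCount` iterations of a loop with one PHI node.
// `preheaderValue` is the constant flowing in from the preheader and
// `step` computes the backedge value from the PHI's current value.
// Returns the constant the PHI holds in each simulated iteration.
inline std::vector<long> simulatePhi(long preheaderValue, unsigned tripCount,
                                     const std::function<long(long)> &step) {
  std::vector<long> phiValues;
  long phi = preheaderValue; // the first iteration takes the preheader value
  for (unsigned i = 0; i < tripCount; ++i) {
    phiValues.push_back(phi);
    phi = step(phi); // value carried around the backedge
  }
  return phiValues;
}
```

Each iteration needs the value produced by the previous one, which is why the simulation has to walk the loop in order.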

Differential Revision: http://reviews.llvm.org/D11706

llvm-svn: 243900


Try to fix the build for C++ standard libraries missing std::map::emplace · 69374412
David Blaikie authored Aug 03, 2015
```
llvm-svn: 243899
```

-Wdeprecated-clean: Fix cases of violating the rule of 5 in ways that are deprecated in C++11 · e44a8a70

David Blaikie authored Aug 03, 2015

Some functions return concrete ByteStreamers by value - explicitly
support that in the base class. (dtor can be virtual, no one seems to be
polymorphically owning/destroying them)

llvm-svn: 243897


Recommit r243824: -Wdeprecated-clean: Fix cases of violating the rule of 5 in... · adbda4b9

David Blaikie authored Aug 03, 2015

Recommit r243824: -Wdeprecated-clean: Fix cases of violating the rule of 5 in ways that are deprecated in C++11

This reverts commit r243888, recommitting r243824.

This broke the Windows build due to a difference in the C++ standard
library implementation. Using emplace/forward_as_tuple should ensure
there's no need to copy ValIDs.
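
For reference, a small sketch of the emplace/forward_as_tuple idiom with a key type that cannot be copied. `Key` and `insertInPlace` are hypothetical and only loosely stand in for the `ValID` case mentioned above.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <tuple>
#include <utility>

// Hypothetical key type with copying disabled.
struct Key {
  explicit Key(int k) : id(k) {}
  Key(const Key &) = delete;
  Key &operator=(const Key &) = delete;
  bool operator<(const Key &rhs) const { return id < rhs.id; }
  int id;
};

// Construct both key and value directly inside the map node, so no Key
// copy is ever required. Returns true if a new element was inserted.
inline bool insertInPlace(std::map<Key, std::string> &m, int id,
                          const char *name) {
  return m.emplace(std::piecewise_construct, std::forward_as_tuple(id),
                   std::forward_as_tuple(name))
      .second;
}
```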

llvm-svn: 243896


Convert some AArch64 code to foreach loops. NFC. · 7be8f8f0

Pete Cooper authored Aug 03, 2015

Also converted a cast<> to dyn_cast while I was working on the same
line of code.

llvm-svn: 243894


Revert "-Wdeprecated-clean: Fix cases of violating the rule of 5 in ways that... · e28b9cbd

Reid Kleckner authored Aug 03, 2015

Revert "-Wdeprecated-clean: Fix cases of violating the rule of 5 in ways that are deprecated in C++11"

This reverts commit r243824.

It broke the build on Windows.

llvm-svn: 243888


DI: Disallow uniquable DICompileUnits · 55ca964e

Duncan P. N. Exon Smith authored Aug 03, 2015

Since r241097, `DIBuilder` has only created distinct `DICompileUnit`s.
The backend is liable to start relying on that (if it hasn't already),
so make uniquable `DICompileUnit`s illegal and automatically upgrade old
bitcode.  This is a nice cleanup, since we can remove an unnecessary
`DenseSet` (and the associated uniquing info) from `LLVMContextImpl`.

Almost all the testcases were updated with this script:

    git grep -e '= !DICompileUnit' -l -- test |
    grep -v test/Bitcode |
    xargs sed -i '' -e 's,= !DICompileUnit,= distinct !DICompileUnit,'

I imagine something similar should work for out-of-tree testcases.

llvm-svn: 243885


ARM: prefer allocating VFP regs at stride 4 on Darwin. · 910dde7a

Tim Northover authored Aug 03, 2015

This is necessary for WatchOS support, where the compact unwind format assumes
this kind of layout. For now we only want this on Swift-like CPUs though, where
it's been the Xcode behaviour for ages. Also, since it can expand the prologue
we don't want it at -Oz.

llvm-svn: 243884


Linker: Move distinct MDNodes instead of cloning · 4fb46cb8

Duncan P. N. Exon Smith authored Aug 03, 2015

Instead of cloning distinct `MDNode`s when linking in a module, just
move them over.  The module linker destroys the source module, so the
old node would otherwise just be leaked on the context.  Create the new
node in place.  This also reduces the number of cloned uniqued nodes
(since it's less likely their operands have changed).

This mapping strategy is only correct when we're discarding the source,
so the linker turns it on via a ValueMapper flag, `RF_MoveDistinctMDs`.

There's nothing observable in terms of `llvm-link` output here: the
linked module should be semantically identical.

I'll be adding more 'distinct' nodes to the debug info metadata graph in
order to break uniquing cycles, so the benefits of this will partly come
in future commits.  However, we should get some gains immediately, since
we have a fair number of 'distinct' `DILocation`s being linked in.

llvm-svn: 243883


Refactor AtomicExpand::expandAtomicRMWToCmpXchg into a standalone function. · e8aad299

JF Bastien authored Aug 03, 2015

Summary:
This is useful for PNaCl's `RewriteAtomics` pass. NaCl intrinsics don't exist for some of the more exotic RMW instructions, so by refactoring this function into a standalone one, `RewriteAtomics` can share code rewriting those atomics with `AtomicExpand` while additionally saving a few cycles by generating the `cmpxchg` NaCl-specific intrinsic with the callback. Without this patch, `RewriteAtomics` would require two extra passes over functions, by first requiring use of the full `AtomicExpand` pass to just expand the leftover exotic RMWs and then running itself again to expand resulting `cmpxchg`s.

NFC
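
As a source-level analogy for expanding an RMW operation into a compare-exchange loop, here is a sketch using `std::atomic`. `fetchNand` is a hypothetical helper written for this example. It shows the general shape of such a loop, not the IR that `AtomicExpand` emits.

```cpp
#include <atomic>
#include <cassert>

// Atomic NAND built from a compare-exchange loop, since std::atomic has
// no fetch_nand. Returns the value observed before the update.
inline unsigned fetchNand(std::atomic<unsigned> &target, unsigned operand) {
  unsigned expected = target.load();
  // On failure compare_exchange_weak stores the current value into
  // `expected`, so the desired value is recomputed from a fresh read.
  while (!target.compare_exchange_weak(expected, ~(expected & operand))) {
  }
  return expected;
}
```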

Reviewers: jfb

Subscribers: jfb, llvm-commits

Differential Revision: http://reviews.llvm.org/D11422

llvm-svn: 243880


Currently string attributes on function arguments/return values can be... · 17376c4e

Artur Pilipenko authored Aug 03, 2015

Currently string attributes on function arguments/return values can be generated using the LLVM API. However, they are not supported in the parser. So the following scenario will fail:
* generate a function with a string attribute using the API,
* dump it in LL format,
* try to parse it.

Add parser support for string attributes to fix the issue.

Reviewed By: reames, hfinkel

Differential Revision: http://reviews.llvm.org/D11058

llvm-svn: 243877


[ARM] Make GlobalMerge merge extern globals by default · f3324cf1

John Brawn authored Aug 03, 2015

Enabling merging of extern globals appears to be generally either beneficial or
harmless. On some benchmark suites (on Cortex-M4F, Cortex-A9, and Cortex-A57)
it gives improvements in the 1-5% range, but in the rest the overall effect is
zero.

Differential Revision: http://reviews.llvm.org/D10966

llvm-svn: 243874


[GlobalMerge] Allow targets to enable merging of extern variables, NFC. · 8b954241

John Brawn authored Aug 03, 2015

Adjust the GlobalMergeOnExternal option so that the default behaviour is to
do whatever the Target thinks is best. Explicitly enabling or disabling the
option will override this default.
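
The "target decides unless the user says otherwise" behaviour can be sketched as a tri-state flag. `FlagState` and `mergeExternalGlobals` are hypothetical names for this illustration and are not the option's actual implementation.

```cpp
#include <cassert>

// Hypothetical tri-state for a command-line flag: Unset means the user
// gave no explicit value.
enum class FlagState { Unset, Enabled, Disabled };

// Resolve the effective setting: an explicit flag wins, otherwise fall
// back to whatever the target prefers.
inline bool mergeExternalGlobals(FlagState flag, bool targetDefault) {
  switch (flag) {
  case FlagState::Enabled:
    return true;
  case FlagState::Disabled:
    return false;
  case FlagState::Unset:
    break;
  }
  return targetDefault;
}
```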

Differential Revision: http://reviews.llvm.org/D10965

llvm-svn: 243873


Be less conservative about forming IT blocks. · 6967e5e4

James Molloy authored Aug 03, 2015

In http://reviews.llvm.org/rL215382, IT forming was made more conservative under
the belief that a flag-setting instruction was unpredictable inside an IT block on ARMv6M.

But actually, ARMv6M doesn't even support IT blocks so that's impossible. In the ARMARM for
v7M, v7AR and v8AR it states that the semantics of such an instruction change inside an
IT block - it doesn't set the flags. So actually it is fine to use one inside an IT block
as long as the flags register is dead afterwards.

This gives significant performance improvements in a variety of MPEG based workloads.

Differential revision: http://reviews.llvm.org/D11680

llvm-svn: 243869


ValueMapper: Only check for cycles if operands change · 50f8969e

Duncan P. N. Exon Smith authored Aug 03, 2015

This is a minor optimization to only check for unresolved operands
inside `mapDistinctNode()` if the operands have actually changed.  This
shouldn't really cause any change in behaviour.  I didn't actually see a
slowdown in a profile; I was just poking around nearby and saw the
opportunity.

llvm-svn: 243866
