Commits · ab48abeafafef67d5a27daf4da1e207d4e40c3dc · Roger Ferrer / llvm-epi

Aug 18, 2015
- [WebAssembly] Don't default to ELF in the triple. · ab48abea
  Dan Gohman authored Aug 17, 2015
```
WebAssembly doesn't yet have a specified binary format, and it may not
end up being ELF, so we don't want the Triple class defaulting to ELF
for it at this time.

llvm-svn: 245254
```
  ab48abea
- Align SP adjustment in function getSPAdjust · f66d3844
  Guozhi Wei authored Aug 17, 2015
```
This commit adds a new function TargetFrameLowering::alignSPAdjust
and calls it from TargetInstrInfo::getSPAdjust. It fixes PR24142.

llvm-svn: 245253
```
  f66d3844
- [WebAssembly] Make getArchTypePrefix return "wasm". · 4e2d799c
  Dan Gohman authored Aug 17, 2015
```
The arch prefix string isn't currently being used for anything on
WebAssembly, but if it were to be used, it makes sense to use the
same arch prefix string for wasm32 and wasm64.

llvm-svn: 245252
```
  4e2d799c
- MIR Serialization: Serialize the local offsets for the stack objects. · a56ba6a6
  Alex Lorenz authored Aug 17, 2015
```
llvm-svn: 245249
```
  a56ba6a6
- MIR Serialization: Serialize the memory operand's range metadata node. · eb625686
  Alex Lorenz authored Aug 17, 2015
```
llvm-svn: 245247
```
  eb625686
- MIR Serialization: Serialize the memory operand's noalias metadata node. · 03e940d1
  Alex Lorenz authored Aug 17, 2015
```
llvm-svn: 245246
```
  03e940d1
- MIR Serialization: Serialize the memory operand's alias scope metadata node. · a16f624d
  Alex Lorenz authored Aug 17, 2015
```
llvm-svn: 245245
```
  a16f624d
- MIR Serialization: Serialize the memory operand's TBAA metadata node. · a617c916
  Alex Lorenz authored Aug 17, 2015
```
llvm-svn: 245244
```
  a617c916
Aug 17, 2015

[WinEHPrepare] Replace unreasonable funclet terminators with unreachable · 83f4bb23

David Majnemer authored Aug 17, 2015

It is possible to be in a situation where more than one funclet token is
a valid SSA value.  If we see a terminator which exits a funclet which
doesn't use the funclet's token, replace it with unreachable.

Differential Revision: http://reviews.llvm.org/D12074

llvm-svn: 245238

83f4bb23

[SPARC]: recognize '.' as the start of an assembler expression. · 685a7d1a
Douglas Katzman authored Aug 17, 2015
```
llvm-svn: 245232
```
685a7d1a

[ARM] Fix crash when targetting CPU without NEON · 974838f2

James Molloy authored Aug 17, 2015

We emulate a scalar vmin/vmax with NEON instructions as they don't exist in the VFP ISA. So only mark these as legal when NEON is available.

Found here: https://code.google.com/p/chromium/issues/detail?id=521671

llvm-svn: 245231

974838f2

[ScalarEvolutionExpander] Reuse findExistingExpansion during expansion cost... · 06044f97

Igor Laevsky authored Aug 17, 2015

[ScalarEvolutionExpander] Reuse findExistingExpansion during expansion cost calculation for division

Primary purpose of this change is to reuse existing code inside findExistingExpansion. However it introduces very slight semantic change - findExistingExpansion now looks into exiting blocks instead of a loop latches. Originally heuristic was based on the fact that we want to look at the loop exit conditions. And since all exiting latches will be listed in the ExitingBlocks, heuristic stays roughly the same.

Differential Revision: http://reviews.llvm.org/D12008

llvm-svn: 245227

06044f97

[CostModel][AArch64] Increase cost of vector insert element and add missing cast costs · b322aa6f

Silviu Baranga authored Aug 17, 2015

Summary:
Increase the estimated costs for insert/extract element operations on
AArch64. This is motivated by results from benchmarking interleaved
accesses.

Add missing costs for zext/sext/trunc instructions and some integer to
floating point conversions. These costs were previously calculated
by scalarizing these operation and were affected by the cost increase of
the insert/extract element operations.

Reviewers: rengolin

Subscribers: mcrosier, aemerson, rengolin, llvm-commits

Differential Revision: http://reviews.llvm.org/D11939

llvm-svn: 245226

b322aa6f

[CostModel][ARM] Increase cost of insert/extract operations · d5ac2693

Silviu Baranga authored Aug 17, 2015

Summary:
This change limits the minimum cost of an insert/extract
element operation to 2 in cases where this would result
in mixing of NEON and VFP code.

Reviewers: rengolin

Subscribers: mssimpso, aemerson, llvm-commits, rengolin

Differential Revision: http://reviews.llvm.org/D12030

llvm-svn: 245225

d5ac2693

[BasicAliasAnalysis] Do not check ModRef table for intrinsics · b20bda77

Igor Laevsky authored Aug 17, 2015

All possible ModRef behaviours can be completely represented using existing LLVM IR attributes.

Differential Revision: http://reviews.llvm.org/D12033

llvm-svn: 245224

b20bda77

Take alignment into account in isSafeToSpeculativelyExecute and isSafeToLoadUnconditionally. · 34d8ba84
Artur Pilipenko authored Aug 17, 2015
```
Reviewed By: hfinkel, sanjoy, MatzeB

Differential Revision: http://reviews.llvm.org/D9791

llvm-svn: 245223
```
34d8ba84

Extend MCAsmLexer so that it can peek forward several tokens · 1ee99a8b

Benjamin Kramer authored Aug 17, 2015

This commit adds a virtual `peekTokens()` function to `MCAsmLexer`
which can peek forward an arbitrary number of tokens.

It also makes the `peekTok()` method call `peekTokens()` method, but
only requesting one token.

The idea is to better support targets which more more ambiguous
assembly syntaxes.

Patch by Dylan McKay!

llvm-svn: 245221

1ee99a8b

Correcting a -Woverflow warning where 0xFFFF was overflowing an implicit constant conversion. · aa3d810b
Aaron Ballman authored Aug 17, 2015
```
llvm-svn: 245220
```
aa3d810b

[WinEHPrepare] Fix catchret successor phi demotion · 7031c9fc

Joseph Tremoulet authored Aug 17, 2015

Summary:
When demoting an SSA value that has a use on a phi and one of the phi's
predecessors terminates with catchret, the edge needs to be split and the
load inserted in the new block, else we'll still have a cross-funclet SSA
value.

Add a test for this, and for the similar case where a def to be spilled is
on and invoke and a critical edge, which was already implemented but
missing a test.

Reviewers: majnemer

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D12065

llvm-svn: 245218

7031c9fc

Revert "Disable targetdatalayoutcheck" · 58fdd887

Tobias Grosser authored Aug 17, 2015

I committed by accident a local hack that should not have made it upstream.
Sorry for the noise.

llvm-svn: 245212

58fdd887

Disable targetdatalayoutcheck · 607b8b26
Tobias Grosser authored Aug 17, 2015
```
llvm-svn: 245210
```
607b8b26

[mips] [IAS] Add support for the DLA pseudo-instruction and fix problems with DLI · a39ef1c6

Daniel Sanders authored Aug 17, 2015

Summary: It is the same as LA, except that it can also load 64-bit addresses and it only works on 64-bit MIPS architectures.

Reviewers: tomatabacu, seanbruno, vkalintiris

Subscribers: brooks, seanbruno, emaste, llvm-commits

Differential Revision: http://reviews.llvm.org/D9524

llvm-svn: 245208

a39ef1c6

[GMR] isNonEscapingGlobalNoAlias() should look through Bitcasts/GEPs when looking at loads. · adc4e9c4
Michael Kuperstein authored Aug 17, 2015
```
This fixes yet another case from PR24288.

Differential Revision: http://reviews.llvm.org/D12064

llvm-svn: 245207
```
adc4e9c4
Remove hand-rolled matching for fmin and fmax. · 88edc824
James Molloy authored Aug 17, 2015
```
SDAGBuilder now does this all for us.

llvm-svn: 245198
```
88edc824
Rip out hand-rolled matching code for VMIN, VMAX, VMINNM and VMAXNM · c617be55
James Molloy authored Aug 17, 2015
```
This is no longer needed - SDAGBuilder will do this for us.

llvm-svn: 245197
```
c617be55

Generate FMINNAN/FMINNUM/FMAXNAN/FMAXNUM from SDAGBuilder. · ef183397

James Molloy authored Aug 17, 2015

These only get generated if the target supports them. If one of the variants is not legal and the other is, and it is safe to do so, the other variant will be emitted.

For example on AArch32 (V8), we have scalar fminnm but not fmin.

Fix up a couple of tests while we're here - one now produces better code, and the other was just plain wrong to start with.

llvm-svn: 245196

ef183397

Fix PR24469 resulting from r245025 and re-enable dead store elimination across basicblocks. · 3af28945

Karthik Bhat authored Aug 17, 2015

PR24469 resulted because DeleteDeadInstruction in handleNonLocalStoreDeletion was
deleting the next basic block iterator. Fixed the same by resetting the basic block iterator
post call to DeleteDeadInstruction.

llvm-svn: 245195

3af28945

Revert "[InstCombinePHI] Partial simplification of identity operations." · 8ed559ad
David Majnemer authored Aug 17, 2015
```
This reverts commit r244887, it caused PR24470.

llvm-svn: 245194
```
8ed559ad

[PM] Port ScalarEvolution to the new pass manager. · 2f1fd165

Chandler Carruth authored Aug 17, 2015

This change makes ScalarEvolution a stand-alone object and just produces
one from a pass as needed. Making this work well requires making the
object movable, using references instead of overwritten pointers in
a number of places, and other refactorings.

I've also wired it up to the new pass manager and added a RUN line to
a test to exercise it under the new pass manager. This includes basic
printing support much like with other analyses.

But there is a big and somewhat scary change here. Prior to this patch
ScalarEvolution was never *actually* invalidated!!! Re-running the pass
just re-wired up the various other analyses and didn't remove any of the
existing entries in the SCEV caches or clear out anything at all. This
might seem OK as everything in SCEV that can uses ValueHandles to track
updates to the values that serve as SCEV keys. However, this still means
that as we ran SCEV over each function in the module, we kept
accumulating more and more SCEVs into the cache. At the end, we would
have a SCEV cache with every value that we ever needed a SCEV for in the
entire module!!! Yowzers. The releaseMemory routine would dump all of
this, but that isn't realy called during normal runs of the pipeline as
far as I can see.

To make matters worse, there *is* actually a key that we don't update
with value handles -- there is a map keyed off of Loop*s. Because
LoopInfo *does* release its memory from run to run, it is entirely
possible to run SCEV over one function, then over another function, and
then lookup a Loop* from the second function but find an entry inserted
for the first function! Ouch.

To make matters still worse, there are plenty of updates that *don't*
trip a value handle. It seems incredibly unlikely that today GVN or
another pass that invalidates SCEV can update values in *just* such
a way that a subsequent run of SCEV will incorrectly find lookups in
a cache, but it is theoretically possible and would be a nightmare to
debug.

With this refactoring, I've fixed all this by actually destroying and
recreating the ScalarEvolution object from run to run. Technically, this
could increase the amount of malloc traffic we see, but then again it is
also technically correct. ;] I don't actually think we're suffering from
tons of malloc traffic from SCEV because if we were, the fact that we
never clear the memory would seem more likely to have come up as an
actual problem before now. So, I've made the simple fix here. If in fact
there are serious issues with too much allocation and deallocation,
I can work on a clever fix that preserves the allocations (while
clearing the data) between each run, but I'd prefer to do that kind of
optimization with a test case / benchmark that shows why we need such
cleverness (and that can test that we actually make it faster). It's
possible that this will make some things faster by making the SCEV
caches have higher locality (due to being significantly smaller) so
until there is a clear benchmark, I think the simple change is best.

Differential Revision: http://reviews.llvm.org/D12063

llvm-svn: 245193

2f1fd165

[ADT] Teach FoldingSet to be movable. · b596ba23

Chandler Carruth authored Aug 16, 2015

This is a very minimal move support - it leaves the moved-from object in
a zombie state that is only valid for destruction and move assignment.
This seems fine to me, and leaving it in the default constructed state
would require adding more state to the object and potentially allocating
memory (!!!) and so seems like a Bad Idea.

llvm-svn: 245192

b596ba23

Aug 16, 2015
- [TableGen] Use range-based for loop. · 802d3d39
  Craig Topper authored Aug 16, 2015
```
llvm-svn: 245191
```
  802d3d39
- [TableGen] Move the ConversionRow vector into the ConversionTable instead of copying. · c4de7ee7
  Craig Topper authored Aug 16, 2015
```
llvm-svn: 245190
```
  c4de7ee7
- [SimplifyLibCalls] Drop default template args. No functional change. · bb70d751
  Benjamin Kramer authored Aug 16, 2015
```
llvm-svn: 245189
```
  bb70d751
- [IR] Simplify code. No functionality change. · dc1d1cbd
  Benjamin Kramer authored Aug 16, 2015
```
llvm-svn: 245188
```
  dc1d1cbd
- transform fmin/fmax calls when possible (PR24314) · 57fd1dc5
  Sanjay Patel authored Aug 16, 2015
```
If we can ignore NaNs, fmin/fmax libcalls can become compare and select
(this is what we turn std::min / std::max into).

This IR should then be optimized in the backend to whatever is best for
any given target. Eg, x86 can use minss/maxss instructions.

This should solve PR24314:
https://llvm.org/bugs/show_bug.cgi?id=24314

Differential Revision: http://reviews.llvm.org/D11866

llvm-svn: 245187
```
  57fd1dc5
- [LSR][NFC] Don’t duplicate entity name at the beginning of the comment. · 94c4aecf
  Sanjoy Das authored Aug 16, 2015
```
llvm-svn: 245183
```
  94c4aecf
- [LSR][NFC] Use camelCase for method names in Formula and RegUseTracker. · 302bfd04
  Sanjoy Das authored Aug 16, 2015
```
llvm-svn: 245182
```
  302bfd04
- use SDValue bool operator; NFCI · 3ab4a73b
  Sanjay Patel authored Aug 16, 2015
```
llvm-svn: 245181
```
  3ab4a73b
- Add missing include guard. · 178c4652
  Yaron Keren authored Aug 16, 2015
```
llvm-svn: 245173
```
  178c4652
- Revert "Add support for cross block dse. This patch enables dead stroe... · e04443ba
  David Majnemer authored Aug 16, 2015
```
Revert "Add support for cross block dse. This patch enables dead stroe elimination across basicblocks."

This reverts commit r245025, it caused PR24469.

llvm-svn: 245172
```
  e04443ba