Commits · d2d1504805f9bf2a9925dd4f09bfe5329f819030 · Lorenzo Albano / LLVM bpEVL

Apr 23, 2016
- Revert r267210, it makes clang assert (PR27490). · 0aa9845d
  Nico Weber authored Apr 22, 2016
```
llvm-svn: 267232
```
- Re-commit optimization bisect support (r267022) without new pass manager support. · aa641a51
  Andrew Kaylor authored Apr 22, 2016
```
The original commit was reverted because of a buildbot problem with LazyCallGraph::SCC handling (not related to the OptBisect handling).

Differential Revision: http://reviews.llvm.org/D19172

llvm-svn: 267231
```
Apr 22, 2016

[PGO] change the interface for createPGOFuncNameMetadata() · f8f051cb

Rong Xu authored Apr 22, 2016

This patch changes the interface for createPGOFuncNameMetadata() where we add
another PGOFuncName argument.

Differential Revision: http://reviews.llvm.org/D19433

llvm-svn: 267216


[unordered] sink unordered stores at end of blocks · 5f0e3694

Philip Reames authored Apr 22, 2016

The existing code turned out to be completely correct when audited, so this only makes minor code changes and adds a couple of tests.

llvm-svn: 267215


Fold compares for distinct allocations · f97229d6

Sanjoy Das authored Apr 22, 2016

Summary:
We can fold compares to false when two distinct allocations within a
function are compared for equality.

Patch by Anna Thomas!

Reviewers: majnemer, reames, sanjoy

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D19390

llvm-svn: 267214


[unordered] Extend load/store type canonicalization to handle unordered operations · eedef73b

Philip Reames authored Apr 22, 2016

Extend the type canonicalization logic to work for unordered atomic loads and stores.  Note that while this change itself is fairly simple and low risk, there's a reasonable chance this will expose problems in the backends by suddenly generating IR they wouldn't have seen before.  Anything of this nature will be an existing bug in the backend (you could write an atomic float load), but this will definitely change the frequency with which such cases are encountered.  If you see problems, feel free to revert this change, but please make sure you collect a test case.  

llvm-svn: 267210


PM: Port SinkingPass to the new pass manager · b9394908
Justin Bogner authored Apr 22, 2016
```
llvm-svn: 267199
```
PM: Reorder the functions used for SinkingPass. NFC · 82077c4a
Justin Bogner authored Apr 22, 2016
```
This will make the port to the new PM easier to follow.

llvm-svn: 267198
```

[DeadStoreElimination] Shorten beginning of memset overwritten by later stores · d29a24e4

Jun Bum Lim authored Apr 22, 2016

Summary: This change will shorten memset if the beginning of memset is overwritten by later stores.
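The range arithmetic behind this kind of shortening can be sketched as follows. This is a minimal illustration only: `WriteRange` and `trimOverwrittenFront` are hypothetical names, and the sketch ignores alignment and any other constraints the real pass may apply.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical byte range written by a memset or a store.
struct WriteRange {
  std::int64_t Offset; // first byte written
  std::int64_t Size;   // number of bytes written
};

// If Later overwrites a strict prefix of Earlier, return Earlier with that
// prefix trimmed off. Otherwise return Earlier unchanged.
WriteRange trimOverwrittenFront(WriteRange Earlier, WriteRange Later) {
  assert(Earlier.Size >= 0 && Later.Size >= 0 && "sizes must be non-negative");
  const std::int64_t EarlierEnd = Earlier.Offset + Earlier.Size;
  const std::int64_t LaterEnd = Later.Offset + Later.Size;
  const bool CoversFront = Later.Offset <= Earlier.Offset &&
                           LaterEnd > Earlier.Offset && LaterEnd < EarlierEnd;
  if (!CoversFront)
    return Earlier;
  // Only the bytes after the later store still need the earlier write.
  return WriteRange{LaterEnd, EarlierEnd - LaterEnd};
}
```

For example, a 32-byte memset at offset 0 followed by an 8-byte store at offset 0 would leave a 24-byte memset starting at offset 8.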

Reviewers: hfinkel, eeckstein, dberlin, mcrosier

Subscribers: mgrang, mcrosier, llvm-commits

Differential Revision: http://reviews.llvm.org/D18906

llvm-svn: 267197


PM: Port DCE to the new pass manager · 395c2127

Justin Bogner authored Apr 22, 2016

Also add a very basic test, since apparently there aren't any tests
for DCE whatsoever to add the new pass version to.

llvm-svn: 267196


[LoopUtils] Extend findStringMetadataForLoop to return the value for metadata · fe3def7c

Adam Nemet authored Apr 22, 2016

E.g. for:

  !1 = {"llvm.distribute", i32 1}

it now returns the MDOperand for 1.

I will use this in LoopDistribution to check the value of the metadata.

Note that the change is backward-compatible with its current use in
LoopVersioningLICM.  An Optional implicitly converts to a bool depending
on whether it contains a value or not.
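The backward-compatibility point can be illustrated with a small sketch. It uses `std::optional` as a stand-in for LLVM's `Optional`, and `LoopMD`, `findValueFor`, and `hasMetadata` are hypothetical names rather than the real LoopUtils API. With `std::optional` the conversion to `bool` is a contextual one (it applies in an `if` condition), which is all an existing presence check needs.

```cpp
#include <cassert>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for loop metadata: (name, value) pairs.
using LoopMD = std::vector<std::pair<std::string, int>>;

// New-style lookup: returns the value attached to Name, if any.
std::optional<int> findValueFor(const LoopMD &MD, const std::string &Name) {
  assert(!Name.empty() && "metadata name must be non-empty");
  for (const auto &Entry : MD)
    if (Entry.first == Name)
      return Entry.second;
  return std::nullopt;
}

// Old-style caller that only cares whether the metadata is present.
bool hasMetadata(const LoopMD &MD, const std::string &Name) {
  if (findValueFor(MD, Name)) // true when a value is present, even if it is 0
    return true;
  return false;
}
```

A caller that needs the value dereferences the result instead, e.g. `*findValueFor(MD, "llvm.distribute")`.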

llvm-svn: 267190


[EarlyCSE/CVP] Add stats for CVPs and make sure to account for any Changes. · 1a4bc110
Chad Rosier authored Apr 22, 2016
```
llvm-svn: 267187
```

[MemorySSA] Fix bug in CachingMemorySSAWalker::invalidateInfo · 9fe26e6d

Geoff Berry authored Apr 22, 2016

Summary:
CachingMemorySSAWalker::invalidateInfo was using IsCall to determine
which cache map needed to be cleared of entries referring to the invalidated
MemoryAccess, but there could also be entries referring to it in the
other cache map (value entries, not key entries).  This change just
clears both tables to be conservatively correct.

Also add a verifyRemoved() function, called when expensive
checks (i.e. XDEBUG) are enabled to verify that the invalidated
MemoryAccess object is not referenced in any of the caches.

Reviewers: dberlin, george.burgess.iv

Subscribers: mcrosier, llvm-commits

Differential Revision: http://reviews.llvm.org/D19388

llvm-svn: 267157


[EarlyCSE] Don't add the overflow flags to the hash · bfd695d5

David Majnemer authored Apr 22, 2016

We take the intersection of overflow flags while CSE'ing.
This permits us to consider two instructions with different overflow
behavior to be replaceable.

llvm-svn: 267153


[InstCombine] Preserve fast math flags when combining PHIs · e985c76b

Silviu Baranga authored Apr 22, 2016

Summary:
When optimizing PHIs which have inputs floating point binary
operators, we preserve all IR flags except the fast math
flags.

This change removes the logic which tracked some of the IR flags
(no wrap, exact) and replaces it by doing an and on the IR flags of
all inputs to the PHI - which will also handle the fast math
flags.

Reviewers: majnemer

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D19370

llvm-svn: 267139


Revert "Initial implementation of optimization bisect support." · 6013f45f

Vedant Kumar authored Apr 22, 2016

This reverts commit r267022, due to an ASan failure:

  http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan_check/1549

llvm-svn: 267115


[GVN] Respect fast-math-flags on fcmps · d0ce8f14

David Majnemer authored Apr 22, 2016

We assumed that flags were only present on binary operators.  This is
not true, they may also be present on calls and fcmps.

llvm-svn: 267113


[EarlyCSE] Take the intersection of flags on instructions · 9554c133

David Majnemer authored Apr 22, 2016

EarlyCSE had inconsistent behavior with regard to flagged instructions:
- In some cases, it would pessimize if the available instruction had
  different flags by not performing CSE.
- In other cases, it would miscompile if it replaced an instruction
  which had no flags with an instruction which has flags.

Fix this by being more consistent with our flag handling by utilizing
andIRFlags.
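The idea of intersecting flags can be sketched on a simplified model. The `Inst` struct and `intersectFlags` below are hypothetical and much simpler than LLVM's `Instruction` and `andIRFlags`; the point is only that the surviving instruction keeps a guarantee when both instructions made it.

```cpp
#include <cassert>

// Hypothetical, simplified instruction model; not LLVM's Instruction class.
struct Inst {
  int Opcode = 0;
  bool NoUnsignedWrap = false;
  bool NoSignedWrap = false;
  bool Exact = false;
};

// Available is about to replace Redundant. Keep only the flags both carry,
// so the surviving instruction never promises more than either original did.
void intersectFlags(Inst &Available, const Inst &Redundant) {
  assert(Available.Opcode == Redundant.Opcode &&
         "only equivalent instructions can be merged");
  Available.NoUnsignedWrap = Available.NoUnsignedWrap && Redundant.NoUnsignedWrap;
  Available.NoSignedWrap = Available.NoSignedWrap && Redundant.NoSignedWrap;
  Available.Exact = Available.Exact && Redundant.Exact;
}
```

Under this scheme CSE can always proceed: differing flags no longer block the replacement, and a flagged instruction never silently stands in for an unflagged one.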

llvm-svn: 267111


ValueMapper/Enumerator: Clean up code in post-order traversals, NFC · 71480bd0

Duncan P. N. Exon Smith authored Apr 22, 2016

Re-layer the functions in the new (i.e., newly correct) post-order
traversals in ValueEnumerator (r266947) and ValueMapper (r266949).
Instead of adding a node to the worklist in a helper function and
returning a flag to say what happened, return the node itself.  This
makes the code way cleaner: the worklist is local to the main function,
there is no flag for an early loop exit (since we can cleanly bury the
loop), and it's perfectly clear when pointers into the worklist might be
invalidated.

I'm fixing both algorithms in the same commit to avoid repeating the
commit message; if you take the time to understand one the other should
be easy.  The diff itself isn't entirely obvious since the traversals
have some noise (i.e., things to do), but here's the high-level change:

    auto helper = [&WL](T *Op) {     auto helper = [](T **&I, T **E) {
                                 =>    while (I != E) {
      if (shouldVisit(Op)) {             T *Op = *I++;
        WL.push(Op, Op->begin());        if (shouldVisit(Op)) {
        return true;                       return Op;
      }                                }
      return false;                    return nullptr;
    };                               };
                                 =>
    WL.push(S, S->begin());          WL.push(S, S->begin());
    while (!empty()) {               while (!empty()) {
      auto *N = WL.top().N;            auto *N = WL.top().N;
      auto *&I = WL.top().I;           auto *&I = WL.top().I;
      bool DidChange = false;
      while (I != N->end())
        if (helper(*I++)) {      =>    if (T *Op = helper(I, N->end())) {
          DidChange = true;              WL.push(Op, Op->begin());
          break;                         continue;
        }                              }
      if (DidChange)
        continue;

      POT.push(WL.pop());        =>    POT.push(WL.pop());
    }                                }

Thanks to Mehdi for helping me find a better way to layer this.
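As a self-contained illustration of the right-hand shape above, here is a sketch of an iterative post-order traversal in which the helper returns the next operand to visit instead of a flag. The `Node` type, the `Seen` set, and the use of an index rather than an iterator are assumptions made for this example; they are not the ValueEnumerator or ValueMapper code.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <utility>
#include <vector>

// Hypothetical node type: each node lists its operands (assumed non-null).
struct Node {
  int Id;
  std::vector<Node *> Ops;
};

// Returns node ids in post-order (operands before users), starting at Root.
std::vector<int> postOrder(Node *Root) {
  assert(Root && "traversal needs a root node");
  std::vector<int> POT;
  std::unordered_set<Node *> Seen{Root};
  // Each entry pairs a node with the index of its next operand to examine.
  std::vector<std::pair<Node *, std::size_t>> WL;

  // Advance I past operands already seen; return the first new one, if any.
  auto nextUnvisited = [&Seen](Node *N, std::size_t &I) -> Node * {
    while (I != N->Ops.size()) {
      Node *Op = N->Ops[I++];
      if (Seen.insert(Op).second)
        return Op;
    }
    return nullptr;
  };

  WL.emplace_back(Root, 0);
  while (!WL.empty()) {
    Node *N = WL.back().first;
    std::size_t &I = WL.back().second; // reference into the worklist
    if (Node *Op = nextUnvisited(N, I)) {
      WL.emplace_back(Op, 0); // may invalidate I, so do not touch it again
      continue;
    }
    POT.push_back(N->Id); // all operands handled: emit the node itself
    WL.pop_back();
  }
  return POT;
}
```

The only place a reference into the worklist can be invalidated is the `emplace_back`, and the loop continues immediately after it, which is the property the commit message highlights.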

llvm-svn: 267099


Fixed flag description · 243b71fd

Mike Aizatsky authored Apr 21, 2016

Summary:
asan-use-after-return controls the feature we call use-after-return or
stack-use-after-return.

Reviewers: kcc, aizatsky, eugenis

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D19284

llvm-svn: 267064


Apr 21, 2016

[esan] EfficiencySanitizer instrumentation pass · d862c178

Derek Bruening authored Apr 21, 2016

Summary:
Adds an instrumentation pass for the new EfficiencySanitizer ("esan")
performance tuning family of tools.  Multiple tools will be supported
within the same framework.  Preliminary support for a cache fragmentation
tool is included here.

The shared instrumentation includes:
+ Turn mem{set,cpy,move} intrinsics into library calls.
+ Slowpath instrumentation of loads and stores via callouts to
  the runtime library.
+ Fastpath instrumentation will be per-tool.
+ Which memory accesses to ignore will be per-tool.

Reviewers: eugenis, vitalybuka, aizatsky, filcab

Subscribers: filcab, vkalintiris, pcc, silvas, llvm-commits, zhaoqin, kcc

Differential Revision: http://reviews.llvm.org/D19167

llvm-svn: 267058


NFC: fix copy / paste comment · c22d2998
JF Bastien authored Apr 21, 2016
```
llvm-svn: 267039
```
NFC: fix nonsensical comment · 3e2e69f6
JF Bastien authored Apr 21, 2016
```
llvm-svn: 267036
```

Folding compares with unescaped allocations · a085cfc1

Sanjoy Das authored Apr 21, 2016

Summary:
If we know that the pointer allocated within a function does not escape,
we can fold away comparisons that are done with global pointers

Patch by Anna Thomas!

Reviewers: reames, majnemer, sanjoy

Subscribers: mgrang, mcrosier, majnemer, llvm-commits

Differential Revision: http://reviews.llvm.org/D19276

llvm-svn: 267035


[instcombine][unordered] Extend load(select) transform to handle unordered loads · a98c7ead
Philip Reames authored Apr 21, 2016
```
llvm-svn: 267023
```

Initial implementation of optimization bisect support. · f0f27929

Andrew Kaylor authored Apr 21, 2016

This patch implements an optimization bisect feature, which will allow optimizations to be selectively disabled at compile time in order to track down test failures that are caused by incorrect optimizations.

The bisection is enabled using a new command line option (-opt-bisect-limit). Individual passes that may be skipped call the OptBisect object (via an LLVMContext) to see if they should be skipped based on the bisect limit. A finer level of control (disabling individual transformations) can be managed through an additional OptBisect method, but this is not yet used.

The skip checking in this implementation is based on (and replaces) the skipOptnoneFunction check. Where that check was being called, a new call has been inserted in its place which checks the bisect limit and the optnone attribute. A new function call has been added for module and SCC passes that behaves in a similar way.
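A counter-based limit of this kind can be sketched roughly as below. `BisectCounter`, `shouldRun`, and `skipFunction` are hypothetical names for illustration, not the OptBisect interface, and treating a negative limit as "no limit" is an assumption of this sketch.

```cpp
#include <cassert>
#include <limits>

// Hypothetical counter: each query consumes one slot of the bisect limit.
class BisectCounter {
  int Limit;       // negative is assumed to mean "no limit"
  int LastNum = 0; // number of optional passes queried so far

public:
  explicit BisectCounter(int Limit) : Limit(Limit) {}

  // Returns true if the next optional pass is still within the limit.
  bool shouldRun() {
    assert(LastNum != std::numeric_limits<int>::max() && "pass counter overflow");
    ++LastNum;
    return Limit < 0 || LastNum <= Limit;
  }
};

// Combined check: skip when the bisect limit is exceeded or when the
// function is marked optnone (modeled here as a plain bool).
bool skipFunction(BisectCounter &BC, bool HasOptNone) {
  if (!BC.shouldRun())
    return true;
  return HasOptNone;
}
```

Because each query consumes one number, raising or lowering the limit and rerunning the compile narrows down the first pass whose execution makes a test fail.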

Differential Revision: http://reviews.llvm.org/D19172

llvm-svn: 267022


[unordered] unordered loads from null are still unreachable · 3ac07184
Philip Reames authored Apr 21, 2016
```
llvm-svn: 267019
```
[LoopUtils] Fix typo in comment · 6dcf0788
Adam Nemet authored Apr 21, 2016
```
llvm-svn: 267016
```
[LoopUtils] Add asserts to findStringMetadataForLoop. NFC · 293be666
Adam Nemet authored Apr 21, 2016
```
These ensure that operand array has at least one element and it is the
self-reference.

llvm-svn: 267015
```
[LoopUtils] Move def of findStringMetadataForLoop to LoopUtils.cpp. NFC · 963341c8
Adam Nemet authored Apr 21, 2016
```
The decl is in LoopUtils.h.  I think that this was added to
LoopVersioningLICM.cpp by mistake.

llvm-svn: 267014
```

[LoopUtils] Rename {check->find}StringMetadata{Into->For}Loop. NFC · f787826b

Adam Nemet authored Apr 21, 2016

"Into" was misleading.  I am also planning to use this helper to look
for loop metadata and return the argument, so find seems like a better
name.

llvm-svn: 267013


[instcombine][unordered] Implement *-load forwarding for unordered atomics · ac55090e

Philip Reames authored Apr 21, 2016

This builds on 266999 which made FindAvailableValue do the right thing.  Tests included show the newly enabled transforms and those which are disabled either due to conservatism or correctness requirements.

llvm-svn: 267006


[SimplifyCFG] Fold `llvm.guard(false)` to unreachable · 54a3a006

Sanjoy Das authored Apr 21, 2016

Summary:
`llvm.guard(false)` always bails out of the current compilation unit, so
we can prune any control flow following it.

Reviewers: hfinkel, pcc, reames

Subscribers: majnemer, reames, mcrosier, llvm-commits

Differential Revision: http://reviews.llvm.org/D19245

llvm-svn: 266955


ValueMapper: Map uniqued nodes in post-order · 0ab44dbf

Duncan P. N. Exon Smith authored Apr 21, 2016

The iterative algorithm from r265456 claimed but failed to create a
post-order traversal. It had the same error that was fixed in the
ValueEnumerator in r266947: now, instead of pushing all operands on the
worklist at once, we pause whenever an operand gets pushed in order to
go depth-first (I know, it sounds obvious).

Sadly, I have no idea how to observe this from outside the algorithm and
so I haven't written a test. The output should be the same; it should
just use fewer temporary nodes now. I've added some comments that I
hope make the current logic clear enough it's unlikely to regress.

llvm-svn: 266949


ThinLTO/ModuleLinker: add a flag to not always pull-in linkonce when performing importing · bda3c97c

Mehdi Amini authored Apr 21, 2016

Summary:
The function importer already decided what symbols need to be pulled
in. Also these magically added ones will not be in the export list
for the source module, which can confuse the internalizer for
instance.

Reviewers: tejohnson, rafael

Subscribers: joker.eph, llvm-commits

Differential Revision: http://reviews.llvm.org/D19096

From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 266948


Refine instruction weight annotation algorithm for sample profiler. · a8bae823

Dehao Chen authored Apr 20, 2016

Summary:
This patch refines the instruction weight annotation algorithm:
1. Do not use dbg_value intrinsics for annotation.
2. Annotate cold calls if the call is inlined in profile, but not inlined before preparation. This indicates that the annotation preparation step found no sample for the inlined callsite, thus the call should be very cold.

Reviewers: dnovillo, davidxl

Subscribers: mgrang, llvm-commits

Differential Revision: http://reviews.llvm.org/D19286

llvm-svn: 266936


Apr 20, 2016

Rename asan-check-lifetime into asan-stack-use-after-scope · a83bfeac

Kostya Serebryany authored Apr 20, 2016

Summary:
This is done for consistency with asan-use-after-return.
I see no other users than tests.

Reviewers: aizatsky, kcc

Differential Revision: http://reviews.llvm.org/D19306

llvm-svn: 266906


Typo. · b346dcbc
Chad Rosier authored Apr 20, 2016
```
llvm-svn: 266905
```
[ValueTracking] Make isImpliedCondition return an Optional<bool>. NFC. · 41dd31f0
Chad Rosier authored Apr 20, 2016
```
Phabricator Revision: http://reviews.llvm.org/D19277

llvm-svn: 266904
```

[ThinLTO] Prevent importing of "llvm.used" values · b35cc691

Teresa Johnson authored Apr 20, 2016

Summary:
This patch prevents importing from (and therefore exporting from) any
module with a "llvm.used" local value. Local values need to be promoted
and renamed when importing, and their presence on the llvm.used variable
indicates that there are opaque uses that won't see the rename. One such
example is a use in inline assembly.

See also the discussion at:
http://lists.llvm.org/pipermail/llvm-dev/2016-April/098047.html

As part of this, move collectUsedGlobalVariables out of Transforms/Utils
and into IR/Module so that it can be used more widely. There are several
other places in LLVM that used copies of this code that can be cleaned
up as a follow on NFC patch.

Reviewers: joker.eph

Subscribers: pcc, llvm-commits, joker.eph

Differential Revision: http://reviews.llvm.org/D18986

llvm-svn: 266877
