Commits · 74e7fb9aaeb1b21cfb26cbaf0352ba6b5f9fb272 · Lorenzo Albano / LLVM bpEVL

Jan 16, 2015

Fix Reassociate handling of constant in presence of undef float · 590a2700
Mehdi Amini authored Jan 16, 2015
```
http://reviews.llvm.org/D6993

llvm-svn: 226245
```
590a2700

Add a new pass "inductive range check elimination" · a1837a34

Sanjoy Das authored Jan 16, 2015

IRCE eliminates range checks of the form

  0 <= A * I + B < Length

by splitting a loop's iteration space into three segments in a way
that the check is completely redundant in the middle segment.  As an
example, IRCE will convert

  len = < known positive >
  for (i = 0; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }

to

  len = < known positive >
  limit = smin(n, len)
  // no first segment
  for (i = 0; i < limit; i++) {
    if (0 <= i && i < len) { // this check is fully redundant
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }
  for (i = limit; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }


IRCE can deal with multiple range checks in the same loop (it takes
the intersection of the ranges that will make each of them redundant
individually).

Currently IRCE does not do any profitability analysis.  That is a
TODO.

Please note that the status of this pass is *experimental*, and it is
not part of any default pass pipeline.  Having said that, I will love
to get feedback and general input from people interested in trying
this out.

This pass was originally r226201.  It was reverted because it used C++
features not supported by MSVC 2012.

Differential Revision: http://reviews.llvm.org/D6693

llvm-svn: 226238

a1837a34

Jan 15, 2015

Revert r226201 (Add a new pass "inductive range check elimination") · 7f62ac8e

Sanjoy Das authored Jan 15, 2015

The change used C++11 features not supported by MSVC 2012.  I will fix
the change to use things supported MSVC 2012 and recommit shortly.

llvm-svn: 226216

7f62ac8e

InductiveRangeCheckElimination: Remove extra ';' · f1f72c9e
David Majnemer authored Jan 15, 2015
```
This silences a GCC warning.

llvm-svn: 226215
```
f1f72c9e

Add a new pass "inductive range check elimination" · 7059e295

Sanjoy Das authored Jan 15, 2015

IRCE eliminates range checks of the form

  0 <= A * I + B < Length

by splitting a loop's iteration space into three segments in a way
that the check is completely redundant in the middle segment.  As an
example, IRCE will convert

  len = < known positive >
  for (i = 0; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }

to

  len = < known positive >
  limit = smin(n, len)
  // no first segment
  for (i = 0; i < limit; i++) {
    if (0 <= i && i < len) { // this check is fully redundant
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }
  for (i = limit; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }


IRCE can deal with multiple range checks in the same loop (it takes
the intersection of the ranges that will make each of them redundant
individually).

Currently IRCE does not do any profitability analysis.  That is a
TODO.

Please note that the status of this pass is *experimental*, and it is
not part of any default pass pipeline.  Having said that, I will love
to get feedback and general input from people interested in trying
this out.

Differential Revision: http://reviews.llvm.org/D6693

llvm-svn: 226201

7059e295

Replace size method call of containers to empty method where appropriate · 8c0809c7

Alexander Kornienko authored Jan 15, 2015

This patch was generated by a clang tidy checker that is being open sourced.
The documentation of that checker is the following:

/// The emptiness of a container should be checked using the empty method
/// instead of the size method. It is not guaranteed that size is a
/// constant-time function, and it is generally more efficient and also shows
/// clearer intent to use empty. Furthermore some containers may implement the
/// empty method but not implement the size method. Using empty whenever
/// possible makes it easier to switch to another container in the future.

Patch by Gábor Horváth!

llvm-svn: 226161

8c0809c7

[PM] Separate the TargetLibraryInfo object from the immutable pass. · b98f63db

Chandler Carruth authored Jan 15, 2015

The pass is really just a means of accessing a cached instance of the
TargetLibraryInfo object, and this way we can re-use that object for the
new pass manager as its result.

Lots of delta, but nothing interesting happening here. This is the
common pattern that is developing to allow analyses to live in both the
old and new pass manager -- a wrapper pass in the old pass manager
emulates the separation intrinsic to the new pass manager between the
result and pass for analyses.

llvm-svn: 226157

b98f63db

SimplifyIndVar: Remove unused variable · f0982d0a
David Majnemer authored Jan 15, 2015
```
OtherOperandIdx is not used anymore, remove it to silence warnings.

llvm-svn: 226138
```
f0982d0a
Update libdeps since TLI was moved from Target to Analysis in r226078. · 24ebfcb6
NAKAMURA Takumi authored Jan 15, 2015
```
llvm-svn: 226126
```
24ebfcb6

[PM] Move TargetLibraryInfo into the Analysis library. · 62d4215b

Chandler Carruth authored Jan 15, 2015

While the term "Target" is in the name, it doesn't really have to do
with the LLVM Target library -- this isn't an abstraction which LLVM
targets generally need to implement or extend. It has much more to do
with modeling the various runtime libraries on different OSes and with
different runtime environments. The "target" in this sense is the more
general sense of a target of cross compilation.

This is in preparation for porting this analysis to the new pass
manager.

No functionality changed, and updates inbound for Clang and Polly.

llvm-svn: 226078

62d4215b

Fix PR22222 · 8c252bde

Sanjoy Das authored Jan 15, 2015

The bug was introduced in r225282. r225282 assumed that sub X, Y is
the same as add X, -Y. This is not correct if we are going to upgrade
the sub to sub nuw. This change fixes the issue by making the
optimization ignore sub instructions.

Differential Revision: http://reviews.llvm.org/D6979

llvm-svn: 226075

8c252bde

Jan 14, 2015
- InstCombine: Don't take A-B<0 into A<B if A-B has other uses · a0afb55f
  David Majnemer authored Jan 14, 2015
```
This fixes PR22226.

llvm-svn: 226023
```
  a0afb55f
- reapply: SLPVectorizer: Cache results from memory alias checking. · 13c4ab89
  Erik Eckstein authored Jan 14, 2015
```
This speeds up the dependency calculations for blocks with many load/store/call instructions.
Beside the improved runtime, there is no functional change.

Compared to the original commit, this re-applied commit contains a bug fix which ensures that there are
no incorrect collisions in the alias cache.

llvm-svn: 225977
```
  13c4ab89
- Fix a wrong comment in LoopVectorize. · e28d154c
  Hao Liu authored Jan 14, 2015
```
  I.E. more than two -> exactly two
Fix a typo function name in LoopVectorize.
  I.E. collectStrideAcccess() -> collectStrideAccess()

llvm-svn: 225935
```
  e28d154c
- Remove trailing slash from r225924 · e65b0663
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225929
```
  e65b0663
- Utils: Remove unreachable break, NFC · e54cd9a6
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225924
```
  e54cd9a6
- Utils: Handle remapping distinct MDLocations · a5a0f576
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
Part of PR21433.

llvm-svn: 225921
```
  a5a0f576
- Utils: Thread distinct-ness through the cloneMD*() functions, NFC · b84840c0
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
The new logic isn't actually reachable yet, so no functionality change.

llvm-svn: 225918
```
  b84840c0
- Utils: Extract cloneMDNode(), NFC · 7c69c1eb
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225917
```
  7c69c1eb
- Utils: Move cloneMD*() up, NFC · b6515d6a
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225915
```
  b6515d6a
- Utils: Add mapping for uniqued MDLocations · 47d82981
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
Still doesn't handle distinct ones.  Part of PR21433.

llvm-svn: 225914
```
  47d82981
- Utils: Extract cloneMDTuple(), NFC · 4766e012
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225912
```
  4766e012
- Utils: Extract shouldRemapUniquedNode(), NFC · fb9d128a
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225911
```
  fb9d128a
- Utils: Simplify code, NFC · 637e7659
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225906
```
  637e7659
- Utils: Extract mapUniquedNode(), NFC · b557989a
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225905
```
  b557989a
- Utils: MDNode => UniquableMDNode, NFC · 8725ca8c
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
Although this makes the `cast<>` assert more often, the
`assert(Node->isResolved())` on the following line would assert in all
those cases.  So, no functionality change here.

llvm-svn: 225903
```
  8725ca8c
- Utils: Separate out mapDistinctNode(), NFC · 14cc94c1
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225902
```
  14cc94c1
- Utils: Use helper function directly, NFC · 3956a85e
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225901
```
  3956a85e
- Utils: Extract helper function, NFC · 077affdb
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
llvm-svn: 225897
```
  077affdb
- Utils: Use MDTuple::get() directly, NFC · 34651ee2
  Duncan P. N. Exon Smith authored Jan 14, 2015
```
Working towards supporting `MDLocation` in `MapMetadata()`.

llvm-svn: 225896
```
  34651ee2
- [SimplifyLibCalls] Don't try to simplify indirect calls. · 71d7b18e
  Ahmed Bougacha authored Jan 14, 2015
```
It turns out, all callsites of the simplifier are guarded by a check for
CallInst::getCalledFunction (i.e., to make sure the callee is direct).

This check wasn't done when trying to further optimize a simplified fortified
libcall, introduced by a refactoring in r225640.

Fix that, add a testcase, and document the requirement.

llvm-svn: 225895
```
  71d7b18e
Jan 13, 2015

Fix non-determinism issue in SLP · 0473cb5a

Julien Lerouge authored Jan 13, 2015

The issue was introduced in r214638:

+  for (auto &BSIter : BlocksSchedules) {
+    scheduleBlock(BSIter.second.get());
+  }

Because BlocksSchedules is a DenseMap with BasicBlock* keys, blocks are
scheduled in non-deterministic order, resulting in unpredictable IR.

Patch by Daniel Reynaud!

llvm-svn: 225821

0473cb5a

Revert "SLPVectorizer: Cache results from memory alias checking." · a168ef75

Erik Eckstein authored Jan 13, 2015

The alias cache has a problem of incorrect collisions in case a new instruction is allocated at the same address as a previously deleted instruction.

llvm-svn: 225790

a168ef75

SLPVectorizer: Cache results from memory alias checking. · 4a445c04

Erik Eckstein authored Jan 13, 2015

This speeds up the dependency calculations for blocks with many load/store/call instructions.
Beside the improved runtime, there is no functional change.

llvm-svn: 225786

4a445c04

fix {typo, build failure} in r225760 · 181233b2
Ramkumar Ramachandra authored Jan 13, 2015
```
llvm-svn: 225762
```
181233b2

Standardize {pred,succ,use,user}_empty() · 40c3e03e

Ramkumar Ramachandra authored Jan 13, 2015

The functions {pred,succ,use,user}_{begin,end} exist, but many users
have to check *_begin() with *_end() by hand to determine if the
BasicBlock or User is empty. Fix this with a standard *_empty(),
demonstrating a few usecases.

llvm-svn: 225760

40c3e03e

fix typo; NFC · db8e6f47
Sanjay Patel authored Jan 13, 2015
```
llvm-svn: 225753
```
db8e6f47

Jan 12, 2015

80-cols; NFC · 06d5589a
Sanjay Patel authored Jan 12, 2015
```
llvm-svn: 225700
```
06d5589a

IR: Split GenericMDNode into MDTuple and UniquableMDNode · 118632db

Duncan P. N. Exon Smith authored Jan 12, 2015

Split `GenericMDNode` into two classes (with more descriptive names).

  - `UniquableMDNode` will be a common subclass for `MDNode`s that are
    sometimes uniqued like constants, and sometimes 'distinct'.

    This class gets the (short-lived) RAUW support and related API.

  - `MDTuple` is the basic tuple that has always been returned by
    `MDNode::get()`.  This is as opposed to more specific nodes to be
    added soon, which have additional fields, custom assembly syntax,
    and extra semantics.

    This class gets the hash-related logic, since other sublcasses of
    `UniquableMDNode` may need to hash based on other fields.

To keep this diff from getting too big, I've added casts to `MDTuple`
that won't really scale as new subclasses of `UniquableMDNode` are
added, but I'll clean those up incrementally.

(No functionality change intended.)

llvm-svn: 225682

118632db

GVN: propagate equalities for floating point compares · 5f1d9eaa

Sanjay Patel authored Jan 12, 2015

Allow optimizations based on FP comparison values in the same way
as integers. 

This resolves PR17713:
http://llvm.org/bugs/show_bug.cgi?id=17713

Differential Revision: http://reviews.llvm.org/D6911

llvm-svn: 225660

5f1d9eaa