Commits · 04d4e9312c69ef1fa4896e9289beeb3e763e7415 · Roger Ferrer / llvm-epi-0.8

Apr 23, 2013

Move C++ code out of the C headers and into either C++ headers · 04d4e931

Eric Christopher authored Apr 22, 2013

or the C++ files themselves. This enables people to use
just a C compiler to interoperate with LLVM.

llvm-svn: 180063

04d4e931

Apr 22, 2013

Changed back (relative to commit 179786) the operations executed when... · 10260a75

Anat Shemer authored Apr 22, 2013

Changed back (relative to commit 179786) the operations executed when extract(cast) is transformed to cast(extract). It uses the Builder class as before. In addition the result node is added to the Worklist, so all the previous extract users will become the new scalar cast users.

llvm-svn: 180045

10260a75

Clarify that llvm.used can contain aliases. · 74f2e46e

Rafael Espindola authored Apr 22, 2013

Also add a check for llvm.used in the verifier and simplify clients now that
they can assume they have a ConstantArray.

llvm-svn: 180019

74f2e46e

Apr 21, 2013

SROA: Don't crash on a select with two identical operands. · 0212dc27

Benjamin Kramer authored Apr 21, 2013

This is an edge case that can happen if we modify a chain of multiple selects.
Update all operands in that case and remove the assert. PR15805.

llvm-svn: 179982

0212dc27

Revert "SimplifyCFG: If convert single conditional stores" · 6eb32b31

Arnold Schwaighofer authored Apr 21, 2013

There is the temptation to make this tranform dependent on target information as
it is not going to be beneficial on all (sub)targets. Therefore, we should
probably do this in MI Early-Ifconversion.

This reverts commit r179957. Original commit message:

"SimplifyCFG: If convert single conditional stores

This transformation will transform a conditional store with a preceeding
uncondtional store to the same location:

a[i] =
may-alias with a[i] load
if (cond)
    a[i] = Y
into an unconditional store.

a[i] = X
may-alias with a[i] load
tmp = cond ? Y : X;
a[i] = tmp

We assume that on average the cost of a mispredicted branch is going to be
higher than the cost of a second store to the same location, and that the
secondary benefits of creating a bigger basic block for other optimizations to
work on outway the potential case were the branch would be correctly predicted
and the cost of the executing the second store would be noticably reflected in
performance.

hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With
this change we are on par with gcc's performance (gcc also performs this
transformation). There was a 1.2 % performance improvement on a ARM swift chip.
Other tests in the test-suite+external seem to be mostly uninfluenced in my
experiments:
This optimization was triggered on 41 tests such that the executable was
different before/after the patch. Only 1 out of the 40 tests (dealII) was
reproducable below 100% (by about .4%). Given that hmmer benefits so much I
believe this to be a fair trade off.

I am going to watch performance numbers across the builtbots and will revert
this if anything unexpected comes up."

llvm-svn: 179980

6eb32b31

SLPVectorize: Add support for vectorization of casts. · c57af326
Nadav Rotem authored Apr 21, 2013
```
llvm-svn: 179975
```
c57af326
SLPVectorizer: Fix a bug in the code that scans the tree in search of nodes with multiple users. · 98ad5f0f
Nadav Rotem authored Apr 21, 2013
```
We did not terminate the switch case and we executed the search routine twice.

llvm-svn: 179974
```
98ad5f0f

When we strength reduce an objc_retainBlock call to objc_retain, increment... · 3eab2e43

Michael Gottesman authored Apr 21, 2013

When we strength reduce an objc_retainBlock call to objc_retain, increment NumPeeps and make sure that Changed is set to true.

llvm-svn: 179968

3eab2e43

Fixed comment typo. · 1e430042
Michael Gottesman authored Apr 21, 2013
```
llvm-svn: 179967
```
1e430042
[objc-arc] Fixed typo in debug message. · df110ac9
Michael Gottesman authored Apr 21, 2013
```
llvm-svn: 179966
```
df110ac9
[objc-arc] Fixed comment typo. · cdb7c15c
Michael Gottesman authored Apr 21, 2013
```
llvm-svn: 179965
```
cdb7c15c
[objc-arc] Refactored OptimizeReturns so that it uses continue instead of a... · fb9ece9a
Michael Gottesman authored Apr 21, 2013
```
[objc-arc] Refactored OptimizeReturns so that it uses continue instead of a large multi-level nested if statement.

llvm-svn: 179964
```
fb9ece9a

[objc-arc] Added debug statement saying when we are resetting a sequence's progress. · 01338a44

Michael Gottesman authored Apr 20, 2013

This will make it clearer when we are actually resetting a sequence's progress
vs just changing state. This is an important distinction because the former case
clears any pointers that we are tracking while the later does not.

llvm-svn: 179963

01338a44

Fix PR15800. Do not try to vectorize vectors and structs. · 8aca44a6
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179960
```
8aca44a6

Apr 20, 2013

SimplifyCFG: If convert single conditional stores · 3546ccf4

Arnold Schwaighofer authored Apr 20, 2013

This transformation will transform a conditional store with a preceeding
uncondtional store to the same location:

 a[i] =
 may-alias with a[i] load
 if (cond)
   a[i] = Y

into an unconditional store.

 a[i] = X
 may-alias with a[i] load
 tmp = cond ? Y : X;
 a[i] = tmp

We assume that on average the cost of a mispredicted branch is going to be
higher than the cost of a second store to the same location, and that the
secondary benefits of creating a bigger basic block for other optimizations to
work on outway the potential case were the branch would be correctly predicted
and the cost of the executing the second store would be noticably reflected in
performance.

hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With
this change we are on par with gcc's performance (gcc also performs this
transformation). There was a 1.2 % performance improvement on a ARM swift chip.
Other tests in the test-suite+external seem to be mostly uninfluenced in my
experiments:
This optimization was triggered on 41 tests such that the executable was
different before/after the patch. Only 1 out of the 40 tests (dealII) was
reproducable below 100% (by about .4%). Given that hmmer benefits so much I
believe this to be a fair trade off.

I am going to watch performance numbers across the builtbots and will revert
this if anything unexpected comes up.

llvm-svn: 179957

3546ccf4

VecUtils: Clean up uses of dyn_cast. · 519b2e30
Benjamin Kramer authored Apr 20, 2013
```
llvm-svn: 179936
```
519b2e30
SLPVectorizer: Strength reduce SmallVectors to ArrayRefs. · 4600bcc3
Benjamin Kramer authored Apr 20, 2013
```
Avoids a couple of copies and allows more flexibility in the clients.

llvm-svn: 179935
```
4600bcc3

SLPVectorizer: Reduce the compile time by eliminating the search for some of... · ce2660d6

Nadav Rotem authored Apr 20, 2013

SLPVectorizer: Reduce the compile time by eliminating the search for some of the more expensive patterns. After this change will only check basic arithmetic trees that start at cmpinstr.

llvm-svn: 179933

ce2660d6

refactor tryToVectorizePair to a new method that supports vectorization of lists. · 998e035c
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179932
```
998e035c
Fix an unused variable warning. · 89038728
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179931
```
89038728
SLPVectorizer: Improve the cost model for loop invariant broadcast values. · 83c7c41b
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179930
```
83c7c41b
Report the number of stores that were found in the debug message. · dfe1c93c
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179929
```
dfe1c93c
Fix the header comment. · dfd8fcbb
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179928
```
dfd8fcbb
Use 64bit arithmetic for calculating distance between pointers. · 5ed99674
Nadav Rotem authored Apr 20, 2013
```
llvm-svn: 179927
```
5ed99674

MergeFunc: Make pointer and integer types generate the same hash. · 630e6e14

Benjamin Kramer authored Apr 19, 2013

The logic that actually compares the types considers pointers and integers the
same if they are of the same size. This created a strange mismatch between hash
and reality and made the test case for this fail on some platforms (yay,
test cases).

llvm-svn: 179905

630e6e14

Apr 19, 2013
- LoopVectorizer: Use matcher from PatternMatch.h for the min/max patterns · 51469403
  Arnold Schwaighofer authored Apr 19, 2013
```
Also make some static function class functions to avoid having to mention the
class namespace for enums all the time.

No functionality change intended.

llvm-svn: 179886
```
  51469403
- Keep coding stanard. Don't use "else if" after "return". · 99317268
  Jakub Staszak authored Apr 19, 2013
```
llvm-svn: 179826
```
  99317268
- Implement a better fix for PR15185. · 3b21eb69
  Bill Wendling authored Apr 18, 2013
```
If the return type is a pointer and the call returns an integer, then do the
inttoptr convertions. And vice versa.

llvm-svn: 179817
```
  3b21eb69
Apr 18, 2013

Fix a -Wdocumentation warning · d29ea044
Dmitri Gribenko authored Apr 18, 2013
```
llvm-svn: 179789
```
d29ea044

In the function InstCombiner::visitExtractElementInst() removed the limitation... · 5570318f

Anat Shemer authored Apr 18, 2013

In the function InstCombiner::visitExtractElementInst() removed the limitation that extract is promoted over a cast only if the cast has only one use.

llvm-svn: 179786

5570318f

Added a function scalarizePHI() that sclarizes a vector phi instruction if it... · 0c95efad

Anat Shemer authored Apr 18, 2013

Added a function scalarizePHI() that sclarizes a vector phi instruction if it has only 2 uses: one to promote the vector phi in a loop and the other use is an extract operation of one element at a constant location.

llvm-svn: 179783

0c95efad

Fix a comment, PR15777. · 8cf09416
Chris Lattner authored Apr 18, 2013
```
llvm-svn: 179775
```
8cf09416

LoopVectorizer: Recognize min/max reductions · 4cd6aa11

Arnold Schwaighofer authored Apr 18, 2013

A min/max operation is represented by a select(cmp(lt/le/gt/ge, X, Y), X, Y)
sequence in LLVM. If we see such a sequence we can treat it just as any other
commutative binary instruction and reduce it.

This appears to help bzip2 by about 1.5% on an imac12,2.

radar://12960601

llvm-svn: 179773

4cd6aa11

LoopVectorize: Use a set to avoid longer cycles in the reduction chain too. · 8df2cfb8
Benjamin Kramer authored Apr 18, 2013
```
Fixes PR15748.

llvm-svn: 179757
```
8df2cfb8
Revert "Combine bit test + conditional or into simple math" · 81af06e0
David Majnemer authored Apr 18, 2013
```
It is causing stage2 builds to fail, let's get them running again.

llvm-svn: 179750
```
81af06e0

Combine bit test + conditional or into simple math · bdf0caf6

David Majnemer authored Apr 18, 2013

Simplify:
(select (icmp eq (and X, C1), 0), Y, (or Y, C2))

Into:
(or (shl (and X, C1), C3), y)

Where:
C3 = Log(C2) - Log(C1)

If:
C1 and C2 are both powers of two

llvm-svn: 179748

bdf0caf6

[objc-arc] Do not mismatch up retains inside a for loop with releases outside... · 323964ca

Michael Gottesman authored Apr 18, 2013

[objc-arc] Do not mismatch up retains inside a for loop with releases outside said for loop in the presense of differing provenance caused by escaping blocks.

This occurs due to an alloca representing a separate ownership from the
original pointer. Thus consider the following pseudo-IR:

  objc_retain(%a)
  for (...) {
    objc_retain(%a)
    %block <- %a
    F(%block)
    objc_release(%block)
  }
  objc_release(%a)

From the perspective of the optimizer, the %block is a separate
provenance from the original %a. Thus the optimizer pairs up the inner
retain for %a and the outer release from %a, resulting in segfaults.

This is fixed by noting that the signature of a mismatch of
retain/releases inside the for loop is a Use/CanRelease top down with an
None bottom up (since bottom up the Retain-CanRelease-Use-Release
sequence is completed by the inner objc_retain, but top down due to the
differing provenance from the objc_release said sequence is not
completed). In said case in CheckForCFGHazards, we now clear the state
of %a implying that no pairing will occur.

Additionally a test case is included.

rdar://12969722

llvm-svn: 179747

323964ca

Removed trailing whitespace. · 9e518139
Michael Gottesman authored Apr 18, 2013
```
llvm-svn: 179746
```
9e518139

Apr 17, 2013
- [objc-arc] Added annotation option to only emit annotations for a specific ssa identifier. · 4e88ce68
  Michael Gottesman authored Apr 17, 2013
```
llvm-svn: 179729
```
  4e88ce68
- Fixed typo. · adb921af
  Michael Gottesman authored Apr 17, 2013
```
llvm-svn: 179721
```
  adb921af