Commits · bf4db4fe1194560d99d5b968d1478789a610e71f · Roger Ferrer / llvm-epi-0.8

Jan 29, 2013

Unroll again after running BBVectorize · bf4db4fe

Hal Finkel authored Jan 29, 2013

Because BBVectorize may significantly shorten a loop body, unroll
again after vectorization. This is especially important when using
runtime or partial unrolling.

llvm-svn: 173730

bf4db4fe
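
Why a shorter body can justify a second unrolling pass can be seen in a toy model. The sketch below assumes an unroller that picks the largest unroll count whose unrolled body still fits a size budget; `pickUnrollCount`, the budget, and the body sizes are hypothetical and are not taken from LLVM's unroller.

```cpp
#include <cassert>

// Toy model only: choose the largest unroll count whose unrolled body still
// fits under a size budget. The budget and the sizes are hypothetical.
unsigned pickUnrollCount(unsigned bodySize, unsigned sizeBudget) {
  if (bodySize == 0 || bodySize > sizeBudget)
    return 1; // empty body, or the body alone is already over budget
  return sizeBudget / bodySize;
}
```

Under these assumptions, a 40-instruction body with a budget of 100 is unrolled twice; if vectorization shrinks the body to 10 instructions, a second unrolling pass could choose a count of 10.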

Jan 28, 2013
- Vectorization Factor clarification · 12585196
  Renato Golin authored Jan 28, 2013
```
llvm-svn: 173691
```
  12585196
- [msan] Mostly disable msan-handle-icmp-exact. · 6f85ef30
  Evgeniy Stepanov authored Jan 28, 2013
```
It is way too slow. Change the default option value to 0.
Always do exact shadow propagation for unsigned ICmp with constants, it is
cheap (under 1% cpu time) and required for correctness.

llvm-svn: 173682
```
  6f85ef30
- Revert r173678. · 52c7b1b9
  Evgeniy Stepanov authored Jan 28, 2013
```
Broken tests.

llvm-svn: 173679
```
  52c7b1b9
- [msan] Make msan-handle-icmp-exact=0 by default. · 5ec2ff57
  Evgeniy Stepanov authored Jan 28, 2013
```
50% slowdown on one of the specs.

llvm-svn: 173678
```
  5ec2ff57
- Created ObjCARCUtil.cpp for functions which in my humble opinion are too large... · 5ed40afe
  Michael Gottesman authored Jan 28, 2013
```
Created ObjCARCUtil.cpp for functions which in my humble opinion are too large to static inline and place in a header file such as ObjCARC.h.

llvm-svn: 173666
```
  5ed40afe
- Cleaned up includes in various ObjCARC files and removed some whitespace violations. · 9bfcf28d
  Michael Gottesman authored Jan 28, 2013
```
llvm-svn: 173663
```
  9bfcf28d
- Refactor ObjCARCAliasAnalysis into its own file. · 294e7daa
  Michael Gottesman authored Jan 28, 2013
```
llvm-svn: 173662
```
  294e7daa
- Refactored out pass ObjCARCAPElim from ObjCARCOpts.cpp => ObjCARCAPElim.cpp. · fa0939f7
  Michael Gottesman authored Jan 28, 2013
```
llvm-svn: 173654
```
  fa0939f7
- Fixed case insensitive issue. · 283e079f
  Michael Gottesman authored Jan 28, 2013
```
llvm-svn: 173653
```
  283e079f
- Removed extraneous doxygen end module statement. · 0d90b12a
  Michael Gottesman authored Jan 28, 2013
```
llvm-svn: 173652
```
  0d90b12a
- Extracted pass ObjCARCExpand from ObjCARC.cpp => ObjCARCExpand.cpp. · 08904e3b
  Michael Gottesman authored Jan 28, 2013
```
I also added the local header ObjCARC.h for common functions used by the
various passes.

llvm-svn: 173651
```
  08904e3b
- Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation... · 79d8d812
  Michael Gottesman authored Jan 28, 2013
```
Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation for refactoring the ARC Optimizer.

llvm-svn: 173647
```
  79d8d812
Jan 27, 2013

BBVectorize: Better use of TTI->getShuffleCost · 293a41d1

Hal Finkel authored Jan 27, 2013

When flipping the pair of subvectors that form a vector, if the
vector length is 2, we can use the SK_Reverse shuffle kind to get
more-accurate cost information. Also we can use the SK_ExtractSubvector
shuffle kind to get accurate subvector extraction costs.

The current cost model implementations don't yet seem complex enough
for this to make a difference (thus, there are no test cases with this
commit), but it should help in future.

Depending on how the various targets optimize and combine shuffles in
practice, we might be able to get more-accurate costs by combining the
costs of multiple shuffle kinds. For example, the cost of flipping the
subvector pairs could be modeled as two extractions and two subvector
insertions. These changes, however, should probably be motivated
by specific test cases.

llvm-svn: 173621

293a41d1
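
The cost reasoning in this message can be illustrated with a small standalone sketch. `ToyCostModel`, `ShuffleKind`, and `flipCost` are hypothetical stand-ins, not the TTI interface. The length-2 branch mirrors the use of a reverse shuffle described above. The other branch shows the "two extractions and two insertions" modeling that the message only suggests as a possibility, and it is not claimed to be what the commit implements.

```cpp
#include <cassert>

// Hypothetical stand-ins for the shuffle kinds named in the commit message.
enum class ShuffleKind { Reverse, ExtractSubvector, InsertSubvector };

// Hypothetical per-kind costs; a real target would supply its own numbers.
struct ToyCostModel {
  unsigned reverseCost;
  unsigned extractCost;
  unsigned insertCost;

  unsigned shuffleCost(ShuffleKind kind) const {
    switch (kind) {
    case ShuffleKind::Reverse:          return reverseCost;
    case ShuffleKind::ExtractSubvector: return extractCost;
    case ShuffleKind::InsertSubvector:  return insertCost;
    }
    return 0; // not reached for valid enum values
  }
};

// Cost of swapping the two halves of a vector with numElements elements.
unsigned flipCost(const ToyCostModel &m, unsigned numElements) {
  if (numElements == 2)
    return m.shuffleCost(ShuffleKind::Reverse); // swapping two lanes is a reverse
  // Possible finer-grained model: extract both halves, insert them swapped.
  return 2 * m.shuffleCost(ShuffleKind::ExtractSubvector) +
         2 * m.shuffleCost(ShuffleKind::InsertSubvector);
}
```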

Re-revert r173342, without losing the compile time improvements, flat · 329b590e
Chandler Carruth authored Jan 27, 2013
```
out bug fixes, or functionality preserving refactorings.

llvm-svn: 173610
```
329b590e

Renamed function IsPotentialUse to IsPotentialRetainableObjPtr. · 5300cdd8

Michael Gottesman authored Jan 27, 2013

This name change does the following:

1. Causes the function name to use proper ARC terminology.
2. Makes it clear what the function truly does.

llvm-svn: 173609

5300cdd8

Use the AttributeSet instead of AttributeWithIndex. · 3575c8c6

Bill Wendling authored Jan 27, 2013

In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the
internals of the AttributeSet to outside users, which isn't goodness.

llvm-svn: 173602

3575c8c6

Use the AttributeSet instead of AttributeWithIndex. · 37a52df9

Bill Wendling authored Jan 27, 2013

In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the
internals of the AttributeSet to outside users, which isn't goodness.

llvm-svn: 173601

37a52df9

Use the AttributeSet instead of AttributeWithIndex. · 6eaab61b

Bill Wendling authored Jan 27, 2013

In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the
internals of the AttributeSet to outside users, which isn't goodness.

llvm-svn: 173600

6eaab61b

Jan 26, 2013
- BBVectorize: Add an additional comment about the cost computation · 2d443e94
  Hal Finkel authored Jan 26, 2013
```
llvm-svn: 173580
```
  2d443e94
- BBVectorize: Fix anomalous capital letter in comment · 351a75b6
  Hal Finkel authored Jan 26, 2013
```
llvm-svn: 173579
```
  351a75b6
- Convert BuildLibCalls.cpp to using the AttributeSet methods instead of AttributeWithIndex. · 201d7b25
  Bill Wendling authored Jan 26, 2013
```
llvm-svn: 173536
```
  201d7b25
- Remove some introspection functions. · 57625a49
  Bill Wendling authored Jan 25, 2013
```
The 'getSlot' function and its ilk allow introspection into the AttributeSet
class. However, that class should be opaque. Allow access through accessor
methods instead.

llvm-svn: 173522
```
  57625a49
Jan 25, 2013

LoopVectorize: Refactor the code that vectorizes loads/stores to remove duplication. · 69a040d3
Nadav Rotem authored Jan 25, 2013
```
llvm-svn: 173500
```
69a040d3
Use the new 'getSlotIndex' method to retrieve the attribute's slot index. · 8649283e
Bill Wendling authored Jan 25, 2013
```
llvm-svn: 173499
```
8649283e
LoopVectorize: Simplify code. No functionality change. · 21e8da59
Benjamin Kramer authored Jan 25, 2013
```
llvm-svn: 173475
```
21e8da59
added ability to dynamically change the ExportList of an already · b95c98fa
Pedro Artigas authored Jan 25, 2013
```
created InternalizePass (useful for pass reuse)

llvm-svn: 173474
```
b95c98fa
LoopVectorizer: Refactor more code to use the IRBuilder. · 8e9ca2f8
Nadav Rotem authored Jan 25, 2013
```
llvm-svn: 173471
```
8e9ca2f8
Refactor some code to use the IRBuilder. · c8adf3ff
Nadav Rotem authored Jan 25, 2013
```
llvm-svn: 173467
```
c8adf3ff
[msan] A comment on ICmp handling logic. · 2cb0fa10
Evgeniy Stepanov authored Jan 25, 2013
```
llvm-svn: 173453
```
2cb0fa10

[msan] Implement exact shadow propagation for relational ICmp. · fac84032

Evgeniy Stepanov authored Jan 25, 2013

Only for integers, pointers, and vectors of those. No floats.
Instrumentation seems very heavy, and may need to be replaced
with some approximation in the future.

llvm-svn: 173452

fac84032
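
One way to picture exact shadow propagation for a relational comparison is the sketch below. It is an illustration of the general idea, not the MemorySanitizer code, and it makes several assumptions: only the unsigned `<` comparison on 32-bit values is covered, a set shadow bit marks an uninitialized bit, and the names `Shadowed`, `lowest`, `highest`, and `ltResultIsPoisoned` are hypothetical. The result is treated as poisoned only if the uninitialized bits could change the outcome of the comparison.

```cpp
#include <cassert>
#include <cstdint>

// Assumed convention: a set shadow bit marks an uninitialized value bit.
struct Shadowed {
  std::uint32_t value;
  std::uint32_t shadow;
};

// Smallest and largest unsigned values the operand could actually hold.
std::uint32_t lowest(Shadowed x)  { return x.value & ~x.shadow; }
std::uint32_t highest(Shadowed x) { return x.value | x.shadow; }

// True when the outcome of the unsigned comparison a < b depends on at least
// one uninitialized bit of either operand.
bool ltResultIsPoisoned(Shadowed a, Shadowed b) {
  bool bestCase  = lowest(a) < highest(b);  // most favourable for "a < b"
  bool worstCase = highest(a) < lowest(b);  // least favourable for "a < b"
  return bestCase != worstCase;
}
```

For example, an operand known to lie in 0x10..0x1F compared against the fully initialized constant 0x40 yields a defined result, while the same operand compared against 0x18 does not.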

Switch this code away from Value::isUsedInBasicBlock. That code either · ceff222d

Chandler Carruth authored Jan 25, 2013

loops over instructions in the basic block or the use-def list of the
value, neither of which are really efficient when repeatedly querying
about values in the same basic block.

What's more, we already know that the CondBB is small, and so we can do
a much more efficient test by counting the uses in CondBB, and seeing if
those account for all of the uses.

Finally, we shouldn't blanket fail on any such instruction, instead we
should conservatively assume that those instructions are part of the
cost.

Note that this actually fixes a bug in the pass because
isUsedInBasicBlock has a really terrible bug in it. I'll fix that in my
next commit, but the fix for it would make this code suddenly take the
compile time hit I thought it already was taking, so I wanted to go
ahead and migrate this code to a faster & better pattern.

The bug in isUsedInBasicBlock was also causing other tests to test the
wrong thing entirely: for example we weren't actually disabling
speculation for floating point operations as intended (and tested), but
the test passed because we failed to speculate them due to the
isUsedInBasicBlock failure.

llvm-svn: 173417

ceff222d
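
The use-counting test described in this message can be sketched without any LLVM types. The structs below are minimal hypothetical stand-ins, and `allUsesInBlock` is an illustrative name: the idea is to walk the (small) block once, count how many operands refer to the value, and compare that count with the value's total number of uses.

```cpp
#include <cassert>
#include <vector>

// Minimal stand-ins for IR concepts; none of these are LLVM classes.
struct Value { unsigned numUses = 0; };                    // total uses anywhere
struct Instruction { std::vector<const Value *> operands; };
struct Block { std::vector<Instruction> instructions; };

// True when the uses found inside bb account for every use of v.
bool allUsesInBlock(const Value &v, const Block &bb) {
  unsigned usesInBlock = 0;
  for (const Instruction &inst : bb.instructions)
    for (const Value *op : inst.operands)
      if (op == &v)
        ++usesInBlock;
  return usesInBlock == v.numUses;
}
```

The work here is proportional to the size of the block rather than to the length of the value's use list, which is the property the message relies on when it notes that the conditional block is known to be small.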

Jan 24, 2013

Added comment to ObjCARC elaborating what is meant by the term 'Provenance' in... · 12780c2d
Michael Gottesman authored Jan 24, 2013
```
Added comment to ObjCARC elaborating what is meant by the term 'Provenance' in 'Provenance Analysis'.

llvm-svn: 173374
```
12780c2d

Reapply chandlerc's r173342 now that the miscompile it was triggering is fixed. · 1c4e323f

Benjamin Kramer authored Jan 24, 2013

Original commit message:
Plug TTI into the speculation logic, giving it a real cost interface
that can be specialized by targets.

The goal here is not to be more aggressive, but to just be more accurate
with very obvious cases. There are instructions which are known to be
truly free and which were not being modeled as such in this code -- see
the regression test which is distilled from an inner loop of zlib.

Everywhere the TTI cost model is insufficiently conservative I've added
explicit checks with FIXME comments to go add proper modelling of these
cost factors.

If this causes regressions, the likely solution is to make TTI even more
conservative in its cost estimates, but test cases will help here.

llvm-svn: 173357

1c4e323f

Revert r173342 temporarily. It appears to cause a very late miscompile · 321c6a7c
Chandler Carruth authored Jan 24, 2013
```
of stage2 in a bootstrap. Still investigating....

llvm-svn: 173343
```
321c6a7c

Plug TTI into the speculation logic, giving it a real cost interface · 5f451930

Chandler Carruth authored Jan 24, 2013

that can be specialized by targets.

The goal here is not to be more aggressive, but to just be more accurate
with very obvious cases. There are instructions which are known to be
truly free and which were not being modeled as such in this code -- see
the regression test which is distilled from an inner loop of zlib.

Everywhere the TTI cost model is insufficiently conservative I've added
explicit checks with FIXME comments to go add proper modelling of these
cost factors.

If this causes regressions, the likely solution is to make TTI even more
conservative in its cost estimates, but test cases will help here.

llvm-svn: 173342

5f451930

Address a large chunk of this FIXME by accumulating the cost for · 01bffaad
Chandler Carruth authored Jan 24, 2013
```
unfolded constant expressions rather than checking each one
independently.

llvm-svn: 173341
```
01bffaad

Switch the constant expression speculation cost evaluation away from · 8a21005c

Chandler Carruth authored Jan 24, 2013

a cost function that seems both a bit ad-hoc and also poorly suited to
evaluating constant expressions.

Notably, it is missing any support for trivial expressions such as
'inttoptr'. I could fix this routine, but it isn't clear to me all of
the constraints its other users are operating under.

The core protection that seems relevant here is avoiding the formation
of a select instruction with a further chain of select operations in
a constant expression operand. Just explicitly encode that constraint.

Also, update the comments and organization here to make it clear where
this needs to go -- this should be driven off of real cost measurements
which take into account the number of constant expressions and the
depth of the constant expression tree.

llvm-svn: 173340

8a21005c

Rephrase the speculating scan of the conditional BB to be phrased in · 7481ca8f

Chandler Carruth authored Jan 24, 2013

terms of cost rather than hoisting a single instruction.

This does *not* change the cost model! We still set the cost threshold
at 1 here, it's just that we track it by accumulating cost rather than
by storing an instruction.

The primary advantage is that we no longer leave no-op intrinsics in the
basic block. For example, this will now move both debug info intrinsics
and a single instruction, instead of only moving the instruction and
leaving a basic block with nothing but debug info intrinsics in it, and
those intrinsics now no longer ordered correctly with the hoisted value.

Instead, we now splice the entire conditional basic block's instruction
sequence.

This also places the code for checking the safety of hoisting next to
the code computing the cost.

Currently, the only observable side-effect of this change is that debug
info intrinsics are no longer abandoned. I'm not sure how to craft
a test case for this, and my real goal was the refactoring, but I'll
talk to Dave or Eric about how to add a test case for this.

llvm-svn: 173339

7481ca8f
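
Tracking a running cost instead of a single remembered instruction might look roughly like the sketch below. It is a standalone illustration under stated assumptions: `Kind`, `costOf`, and `canSpeculateBlock` are hypothetical names, debug intrinsics are assumed to cost 0, every other instruction is assumed to cost 1, and the safety checks mentioned in the message are left out.

```cpp
#include <cassert>
#include <vector>

// Hypothetical instruction kinds for this sketch.
enum class Kind { DebugIntrinsic, Arithmetic };

// Assumed costs: debug intrinsics are free, everything else costs 1.
unsigned costOf(Kind k) { return k == Kind::DebugIntrinsic ? 0 : 1; }

// Decide whether a whole conditional block may be hoisted: add up the cost of
// every instruction and give up as soon as the total exceeds the threshold.
bool canSpeculateBlock(const std::vector<Kind> &block, unsigned threshold = 1) {
  unsigned cost = 0;
  for (Kind k : block) {
    cost += costOf(k);
    if (cost > threshold)
      return false;
  }
  return true;
}
```

With the threshold of 1, a block holding one real instruction surrounded by debug intrinsics still qualifies as a unit, while a block with two real instructions does not.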

[asan] fix 32-bit builds · e35d59a8
Kostya Serebryany authored Jan 24, 2013
```
llvm-svn: 173338
```
e35d59a8