Commits · 77b2accbcac4bb19f77b97a683904418c5464bd5 · Roger Ferrer / llvm-epi-0.8

Aug 06, 2007

· 77b2accb
David Greene authored Aug 06, 2007
```
Make this code more efficient.

llvm-svn: 40861
```
77b2accb
remove some dead lines · c7ba2257
Chris Lattner authored Aug 06, 2007
```
llvm-svn: 40859
```
c7ba2257

Chris Lattner authored Aug 06, 2007

2. Make domtree printing print dfin/dfout #'s
3. Fix the Transforms/LoopSimplify/2004-04-13-LoopSimplifyUpdateDomFrontier.ll failure from last night (in DominanceFrontier::splitBlock).

w.r.t. #3, my patches last night happened to expose the bug, but this
has been broken since Owen's r35839 patch to LoopSimplify. The code
was subsequently moved over from LoopSimplify into Dominators, carrying
the latent bug. Fun stuff.

llvm-svn: 40858

67f1c333

update for new domtree dump format · 278c9249
Chris Lattner authored Aug 06, 2007
```
llvm-svn: 40857
```
278c9249
Various random cleanups, add two accessors to DomTreeNode: getDFSNumIn/getDFSNumOut · 2eb1e7d1
Chris Lattner authored Aug 06, 2007
```
llvm-svn: 40856
```
2eb1e7d1
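
The DFS in/out numbers exposed by accessors such as `getDFSNumIn`/`getDFSNumOut` are conventionally used for constant-time dominance queries: one node dominates another when the second node's interval is nested inside the first's. A minimal sketch of that idea follows. The `Node` type and both function names are hypothetical stand-ins, not LLVM's `DomTreeNode` API.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a dominator tree node; not LLVM's DomTreeNode.
struct Node {
  std::vector<Node *> children;
  int dfsIn = -1;   // number assigned when the DFS enters the node
  int dfsOut = -1;  // number assigned when the DFS leaves the node
};

// Assign entry/exit numbers with a depth-first walk of the tree.
void assignDFSNumbers(Node *n, int &counter) {
  n->dfsIn = counter++;
  for (Node *c : n->children)
    assignDFSNumbers(c, counter);
  n->dfsOut = counter++;
}

// a dominates b iff b's [in, out] interval is nested inside a's.
bool dominates(const Node *a, const Node *b) {
  return a->dfsIn <= b->dfsIn && b->dfsOut <= a->dfsOut;
}
```

Once the numbers are assigned, each query is two integer comparisons and needs no walk up the tree.
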
Update links to the command guide generated documentation. · ade052e5
Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40855
```
ade052e5

Aug 05, 2007
- Fix minor doxygen nits. · 446282ae
  Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40854
```
  446282ae
- Comment out configuration tags not supported by doxygen 1.3.9 · 5dc3672e
  Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40853
```
  5dc3672e
- Document a missing parameter. · 187b41ac
  Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40852
```
  187b41ac
- Silence some warnings from doxygen about @param argument name not matching the · d959cfc8
  Reid Spencer authored Aug 05, 2007
```
actual argument name of the documented function.

llvm-svn: 40851
```
  d959cfc8
- Escape some escapes that confuse doxygen. · f13bcdc4
  Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40850
```
  f13bcdc4
- Fix a doxygen directive. · 05d55b39
  Reid Spencer authored Aug 05, 2007
```
llvm-svn: 40849
```
  05d55b39
- Long double patch 4 of N: initial x87 implementation. · b1888e73
  Dale Johannesen authored Aug 05, 2007
```
Lots of problems yet but some simple things work.

llvm-svn: 40847
```
  b1888e73
- allow this to pass on ppc hosts. · 39d75105
  Chris Lattner authored Aug 05, 2007
```
llvm-svn: 40846
```
  39d75105
- shorten this name · 6299a452
  Chris Lattner authored Aug 05, 2007
```
llvm-svn: 40843
```
  6299a452
- at the end of instcombine, explicitly clear WorklistMap. · f0da7975
  Chris Lattner authored Aug 05, 2007
```
This shrinks it down to something small.  On the testcase
from PR1432, this speeds up instcombine from 0.7959s to 0.5000s
(59%).

llvm-svn: 40840
```
  f0da7975
- Fix a bug in DenseMap::clear, where we never reset a tombstone · 4515601f
  Chris Lattner authored Aug 05, 2007
```
to EmptyKey.

llvm-svn: 40839
```
  4515601f
- Upgrade BasicAliasAnalysis::getModRefBehavior to not call Value::getName, · 6493fc79
  Chris Lattner authored Aug 05, 2007
```
which dynamically allocates the string result.  This speeds up dse on the
testcase from PR1432 from 0.3781s to 0.1804s (2.1x).

llvm-svn: 40838
```
  6493fc79
- When clearing a SmallPtrSet, if the set had a huge capacity, but the · 44f7d3aa
  Chris Lattner authored Aug 05, 2007
```
contents of the set were small, deallocate and shrink the set.  This
avoids having to memset as much data, significantly speeding up
some pathological cases.  For example, this speeds up the verifier
from 0.3899s to 0.0763s (5.1x) on the testcase from PR1432 in a
release build.

llvm-svn: 40837
```
  44f7d3aa
- Fix an iterator invalidation bug I induced. · d2eb0c96
  Chris Lattner authored Aug 05, 2007
```
llvm-svn: 40830
```
  d2eb0c96
- Switch some std::sets to SmallPtrSet. This speeds up · 0e8f85f8
  Chris Lattner authored Aug 05, 2007
```
domtree by 10% and postdomtree by 17%

llvm-svn: 40829
```
  0e8f85f8
- Switch DomTreeNode::assignDFSNumber from using a std::set to using · 5f5585c4
  Chris Lattner authored Aug 05, 2007
```
a SmallPtrSet.  This speeds up domtree by about 15% and postdomtree by 20%.

llvm-svn: 40828
```
  5f5585c4
- Switch the internal "Info" map from an std::map to a DenseMap. This · 77e05fe2
  Chris Lattner authored Aug 05, 2007
```
speeds up idom by about 45% and postidom by about 33%.

Some extra precautions must be taken not to invalidate DenseMap iterators.

llvm-svn: 40827
```
  77e05fe2
- switch the DomTreeNodes and IDoms maps in idom/postidom to a · bd0fe01d
  Chris Lattner authored Aug 04, 2007
```
DenseMap instead of an std::map.  This speeds up postdomtree
by about 25% and domtree by about 23%.  It also speeds up clients,
for example, domfrontier by 11%, mem2reg by 4% and ADCE by 6%.

llvm-svn: 40826
```
  bd0fe01d
- rewrite the code used to construct pruned SSA form with the IDF method. · edce70d2
  Chris Lattner authored Aug 04, 2007
```
In the old way, we computed and inserted phi nodes for the whole IDF of 
the definitions of the alloca, then computed which ones were dead and
removed them.

In the new method, we first compute the region where the value is live,
and use that information to only insert phi nodes that are live.  This
eliminates the need to compute liveness later, and stops the algorithm
from inserting a bunch of phis which it then later removes.

This speeds up the testcase in PR1432 from 2.00s to 0.15s (14x) in a
release build and from 6.84s to 0.50s (14x) in a debug build.

llvm-svn: 40825
```
  edce70d2
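
The pruned-SSA approach described in the last entry above (compute liveness first, then insert only live phi nodes) can be sketched as follows. This is an illustrative sketch, not the mem2reg implementation: blocks are plain integer ids, the dominance frontiers are assumed to be precomputed and passed in, and the names `Graph`, `computeLiveInBlocks`, and `placeLivePhis` are hypothetical.

```cpp
#include <cassert>
#include <set>
#include <vector>

using Graph = std::vector<std::vector<int>>;  // adjacency lists indexed by block id

// Blocks where the value is live on entry. `useBlocks` must contain only
// blocks that read the value before (re)defining it.
std::set<int> computeLiveInBlocks(const Graph &preds,
                                  const std::set<int> &defBlocks,
                                  const std::set<int> &useBlocks) {
  std::set<int> liveIn;
  std::vector<int> worklist(useBlocks.begin(), useBlocks.end());
  while (!worklist.empty()) {
    int bb = worklist.back();
    worklist.pop_back();
    if (!liveIn.insert(bb).second)
      continue;  // already processed
    for (int p : preds[bb])
      if (!defBlocks.count(p))  // a definition in p ends the backward walk
        worklist.push_back(p);
  }
  return liveIn;
}

// Walk the iterated dominance frontier of the definitions, but only place a
// phi in blocks where the value is live on entry.
std::set<int> placeLivePhis(const Graph &domFrontier,
                            const std::set<int> &defBlocks,
                            const std::set<int> &liveIn) {
  std::set<int> phiBlocks;
  std::vector<int> worklist(defBlocks.begin(), defBlocks.end());
  while (!worklist.empty()) {
    int bb = worklist.back();
    worklist.pop_back();
    for (int y : domFrontier[bb]) {
      if (!liveIn.count(y))
        continue;  // a phi here would be dead, so never create it
      if (phiBlocks.insert(y).second)
        worklist.push_back(y);  // the new phi acts as a definition
    }
  }
  return phiBlocks;
}
```

For a diamond CFG with definitions in both arms, a phi is placed at the join block only when the value is actually used at or after the join; with no such use, no phi is created and nothing has to be removed afterwards.
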
Aug 04, 2007

Factor out a whole bunch of code into its own method. · d91576b0
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40824
```
d91576b0
Use getNumPreds(BB) instead of computing them manually. This is a very small but · 4e1b4140
Chris Lattner authored Aug 04, 2007
```
measurable speedup.

llvm-svn: 40823
```
4e1b4140

Change the rename pass to be "tail recursive", only adding N-1 successors · b6a4ba80

Chris Lattner authored Aug 04, 2007

to the worklist, and handling the last one with a 'tail call'.  This speeds
up PR1432 from 2.0578s to 2.0012s (2.8%)

llvm-svn: 40822

b6a4ba80
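
The "tail recursive" worklist pattern described above can be sketched as follows: all successors except the last are pushed onto the worklist, and the last one is processed directly by looping instead of going through a push and pop. This is a sketch with a hypothetical integer-indexed CFG and a hypothetical `visitBlocks` function, not the actual rename pass.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CFG walk: succs[b] lists the successors of block b.
std::vector<int> visitBlocks(const std::vector<std::vector<int>> &succs,
                             int entry) {
  std::vector<bool> visited(succs.size(), false);
  std::vector<int> order;
  std::vector<int> worklist{entry};
  while (!worklist.empty()) {
    int bb = worklist.back();
    worklist.pop_back();
    // The inner loop stands in for the 'tail call': it keeps going with the
    // last successor without touching the worklist.
    while (!visited[bb]) {
      visited[bb] = true;
      order.push_back(bb);  // placeholder for the per-block work
      const std::vector<int> &s = succs[bb];
      if (s.empty())
        break;
      // Queue the first N-1 successors for later...
      for (std::size_t i = 0; i + 1 < s.size(); ++i)
        worklist.push_back(s[i]);
      // ...and continue immediately with the last one.
      bb = s.back();
    }
  }
  return order;
}
```

On a straight-line chain of blocks the worklist is never touched after the entry block is popped, which is where the saving comes from.
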

cache computation of #preds for a BB. This speeds up · 840259c8
Chris Lattner authored Aug 04, 2007
```
mem2reg from 2.0742s to 2.0522s on PR1432.

llvm-svn: 40821
```
840259c8
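
Caching a per-block predecessor count is a plain memoization. The sketch below assumes a hypothetical integer-indexed CFG and a hypothetical `PredCounter` class; only the name `getNumPreds` is taken from the log, and its real signature is not shown there.

```cpp
#include <cassert>
#include <vector>

// Hypothetical CFG: succs[b] lists the successors of block b.
class PredCounter {
  const std::vector<std::vector<int>> &succs_;
  std::vector<int> cache_;  // -1 means "not computed yet"

public:
  int scans = 0;  // number of full edge scans performed, for illustration

  explicit PredCounter(const std::vector<std::vector<int>> &succs)
      : succs_(succs), cache_(succs.size(), -1) {}

  int getNumPreds(int bb) {
    int &slot = cache_[bb];
    if (slot >= 0)
      return slot;  // cache hit: no scan needed
    ++scans;
    int count = 0;
    for (const std::vector<int> &out : succs_)
      for (int s : out)
        if (s == bb)
          ++count;
    slot = count;
    return slot;
  }
};
```

The cache is only valid while the CFG is unchanged; any pass that adds or removes edges would have to reset it.
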
reserve operand space for phi nodes when we insert them. · 050bac4b
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40820
```
050bac4b
use continue to avoid nesting, no functionality change. · 9318785d
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40819
```
9318785d

Promoting allocas with the 'single store' fastpath is · 6b04ecba

Chris Lattner authored Aug 04, 2007

faster than with the 'local to a block' fastpath.  This speeds
up PR1432 from 2.1232s to 2.0686s (2.6%)

llvm-svn: 40818

6b04ecba

When PromoteLocallyUsedAllocas promoted allocas, it didn't remember · 4a930f94

Chris Lattner authored Aug 04, 2007

to increment NumLocalPromoted, and didn't actually delete the
dead alloca, leading to an extra iteration of mem2reg.

llvm-svn: 40817

4a930f94

std::map -> DenseMap · 63c03978
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40816
```
63c03978
Clean up comments, fix up some confusing code logic. · 20f0811f
Nick Lewycky authored Aug 04, 2007
```
Predsimplify fails llvm-gcc bootstrap.

llvm-svn: 40815
```
20f0811f

fix a logic bug where we wouldn't promote single store allocas if the · 7d382f76

Chris Lattner authored Aug 04, 2007

stored value was a non-instruction value.  Doh.

This increases the number of single-store allocas from 8982 to 9026, and
speeds up mem2reg on the testcase in PR1432 from 2.17 to 2.13s.

llvm-svn: 40813

7d382f76

When we do the single-store optimization, delete both the store · 1b215f06
Chris Lattner authored Aug 04, 2007
```
and the alloca so they don't get reprocessed.

This speeds up PR1432 from 2.20s to 2.17s.

llvm-svn: 40812
```
1b215f06

Three improvements: · 862f1254

Chris Lattner authored Aug 04, 2007

1. Check for revisiting a block before checking domination, which is faster.
2. If the stored value isn't an instruction, we don't have to check for domination.
3. If we have a value used in the same block more than once, make sure to remove the
block from the UsingBlocks vector. Not doing so forces us to go through the slow
path for the alloca.

The combination of these improvements increases the number of allocas on the fastpath
from 8935 to 8982 on PR1432. This speeds it up from 2.90s to 2.20s (31%)

llvm-svn: 40811

862f1254

switch from using a std::set to using a SmallPtrSet. This speeds up the · ae1e00eb
Chris Lattner authored Aug 04, 2007
```
testcase in PR1432 from 6.33s to 2.90s (2.22x)

llvm-svn: 40810
```
ae1e00eb
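
Several entries in this log replace `std::set` with `SmallPtrSet`. The general idea behind such a container is to keep a handful of pointers in inline storage with a linear scan and to switch to a hashed representation only when that buffer overflows. The sketch below illustrates the idea with a hypothetical `TinyPtrSet` built on the standard library; it is not LLVM's implementation.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <unordered_set>

// Hypothetical illustration of the small-set idea; not LLVM's SmallPtrSet.
template <typename T, std::size_t N>
class TinyPtrSet {
  std::array<T *, N> small_{};     // inline storage, no heap allocation
  std::size_t smallSize_ = 0;
  std::unordered_set<T *> large_;  // used only after the inline buffer overflows
  bool isSmall_ = true;

public:
  // Returns true if p was newly inserted.
  bool insert(T *p) {
    if (isSmall_) {
      auto end = small_.begin() + smallSize_;
      if (std::find(small_.begin(), end, p) != end)
        return false;
      if (smallSize_ < N) {
        small_[smallSize_++] = p;
        return true;
      }
      // Overflow: migrate the inline elements to the hashed representation.
      large_.insert(small_.begin(), end);
      isSmall_ = false;
    }
    return large_.insert(p).second;
  }

  bool contains(T *p) const {
    if (isSmall_) {
      auto end = small_.begin() + smallSize_;
      return std::find(small_.begin(), end, p) != end;
    }
    return large_.count(p) != 0;
  }

  std::size_t size() const { return isSmall_ ? smallSize_ : large_.size(); }
};
```

A node-based `std::set` allocates on every insertion, whereas a set like this performs no allocation at all while it stays within its inline capacity.
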

In mem2reg, when handling the single-store case, make sure to remove · 9181801b

Chris Lattner authored Aug 04, 2007

a using block from the list if we handle it.  Not doing this caused us
to not be able to promote (with the fast path) allocas which have uses (whoops).

This increases the number of allocas hitting this fastpath from 4042 to 8935 on the
testcase in PR1432, speeding up mem2reg by 2.6x.

llvm-svn: 40809

9181801b