Commits · ddf58010bd4566e8c225aa6ce2481f2f9485a315 · Roger Ferrer / llvm-epi-0.8

Jan 02, 2011

Allow loop-idiom to run on multiple BB loops, but still only scan the loop · ddf58010

Chris Lattner authored Jan 02, 2011

header for now for memset/memcpy opportunities.  It turns out that loop-rotate
is successfully rotating loops, but *DOESN'T MERGE THE BLOCKS*, turning "for 
loops" into 2 basic block loops that loop-idiom was ignoring.

With this fix, we form many *many* more memcpy and memsets than before, including
on the "history" loops in the viterbi benchmark, which look like this:

        for (j=0; j<MAX_history; ++j) {
          history_new[i][j+1] = history[2*i][j];
        }

Transforming these loops into memcpy's speeds up the viterbi benchmark from
11.98s to 3.55s on my machine.  Woo.

llvm-svn: 122685

ddf58010

Remove the #ifdef'd code for balancing the eval-link data structure. It doesn't · 528511b1

Cameron Zwarich authored Jan 02, 2011

compile, and everyone's tests have shown it to be slower in practice, even for
quite large graphs.

I also hope to do an optimization that is only correct with the simpler data
structure, which would break this even further.

llvm-svn: 122684

528511b1

remove debugging code. · 5b5a043d
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122683
```
5b5a043d
add some -stats output. · 12f91bef
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122682
```
12f91bef

improve loop rotation to use CodeMetrics to analyze the · 679572e5

Chris Lattner authored Jan 02, 2011

size of a loop header instead of its own code size estimator.
This allows it to handle bitcasts etc more precisely.

llvm-svn: 122681

679572e5

Speed up dominator computation some more by optimizing bucket processing. When · a0800337

Cameron Zwarich authored Jan 02, 2011

naively implemented, the Lengauer-Tarjan algorithm requires a separate bucket
for each vertex. However, this is unnecessary, because each vertex is only
placed into a single bucket (that of its semidominator), and each vertex's
bucket is processed before it is added to any bucket itself.

Instead of using a bucket per vertex, we use a single array Buckets that has two
purposes. Before the vertex V with DFS number i is processed, Buckets[i] stores
the index of the first element in V's bucket. After V's bucket is processed,
Buckets[i] stores the index of the next element in the bucket to which V now
belongs, if any.

Reading from the buckets can also be optimized. Instead of processing the bucket
of V's parent at the end of processing V, we process the bucket of V itself at
the beginning of processing V. This means that the case of the root vertex can
be simplified somewhat. It also means that we don't need to look up the DFS
number of the semidominator of every node in the bucket we are processing,
since we know it is the current index being processed.

This is a 6.5% speedup running -domtree on test-suite + SPEC2000/2006, with
larger speedups of around 12% on the larger benchmarks like GCC.

llvm-svn: 122680

a0800337

Add support for passing variables declared to use a xmm register to asm · 47731fe3
Rafael Espindola authored Jan 02, 2011
```
statements using the "x" constraint.

llvm-svn: 122679
```
47731fe3
teach loop idiom recognition to form memcpy's from simple loops. · 85b6d81d
Chris Lattner authored Jan 02, 2011
```
llvm-svn: 122678
```
85b6d81d

Remove functions from the FnSet when one of their callee's is being merged. This · 4e250c82

Nick Lewycky authored Jan 02, 2011

maintains the guarantee that the DenseSet expects two elements it contains to
not go from inequal to equal under its nose.

As a side-effect, this also lets us switch from iterating to a fixed-point to
actually maintaining a work queue of functions to look at again, and we don't
add thunks to our work queue so we don't need to detect and ignore them.

llvm-svn: 122677

4e250c82

Jan 01, 2011
- a missed __builtin_object_size case. · 6c3fc0a5
  Chris Lattner authored Jan 01, 2011
```
llvm-svn: 122676
```
  6c3fc0a5
- various updates. · e5d5a41a
  Chris Lattner authored Jan 01, 2011
```
llvm-svn: 122675
```
  e5d5a41a
- fix a globalopt crash on two Adobe-C++ testcases that the recent · 1903c42b
  Chris Lattner authored Jan 01, 2011
```
loop idiom pass exposed.

llvm-svn: 122674
```
  1903c42b
- Fix darwin bots. · abe3eaa4
  Rafael Espindola authored Jan 01, 2011
```
llvm-svn: 122672
```
  abe3eaa4
- Remove empty directories left behind by git-svn users. · f20ffec1
  Benjamin Kramer authored Jan 01, 2011
```
llvm-svn: 122671
```
  f20ffec1
- Produce a better error message for invalid register names. · 478abcab
  Rafael Espindola authored Jan 01, 2011
```
llvm-svn: 122670
```
  478abcab
- Fix typo and add comment. · 5734edc1
  Rafael Espindola authored Jan 01, 2011
```
llvm-svn: 122669
```
  5734edc1
- More empty directory removal. · d772dbd6
  Benjamin Kramer authored Jan 01, 2011
```
llvm-svn: 122668
```
  d772dbd6
- Add support for the 'H' modifier. · d606e547
  Rafael Espindola authored Jan 01, 2011
```
llvm-svn: 122667
```
  d606e547
- Update the test · 879be84a
  Anton Korobeynikov authored Jan 01, 2011
```
llvm-svn: 122666
```
  879be84a
- Remove empty directories. · 2bd35f1b
  Nick Lewycky authored Jan 01, 2011
```
llvm-svn: 122665
```
  2bd35f1b
- turn on memset idiom recognition by default. Though there are still lots of · 423bef8f
  Chris Lattner authored Jan 01, 2011
```
limitations, this kicks in dozens of times in the 4 specfp2000 benchmarks, 
and hundreds of times in the int part.  It also kicks in hundreds of times 
in multisource.

This kicks in right before loop deletion, which has the pleasant effect of
deleting loops that *just* do a memset.

llvm-svn: 122664
```
  423bef8f
- Model operand restrictions of mul-like instructions on ARMv5 via · 62acecd7
  Anton Korobeynikov authored Jan 01, 2011
```
earlyclobber stuff. This should fix PRs 2313 and 8157.

Unfortunately, no testcase, since it'd be dependent on register
assignments.

llvm-svn: 122663
```
  62acecd7
- add a validity check that was missed, fixing a crash on the · a3514441
  Chris Lattner authored Jan 01, 2011
```
new testcase.

llvm-svn: 122662
```
  a3514441
- Revert commit 122654 at the request of Chris, who reckons that instsimplify · 772749ae
  Duncan Sands authored Jan 01, 2011
```
is the wrong hammer for this nail, and is probably right.

llvm-svn: 122661
```
  772749ae
- improve validity check to handle constant-trip-count loops more · 91a44358
  Chris Lattner authored Jan 01, 2011
```
aggressively.  In practice, this doesn't help anything though,
see the todo.

llvm-svn: 122660
```
  91a44358
- implement the "no aliasing accesses in loop" safety check. This pass · 8b3baf6d
  Chris Lattner authored Jan 01, 2011
```
should be correct now.

llvm-svn: 122659
```
  8b3baf6d
- Fix PR8878. · 36864735
  Rafael Espindola authored Jan 01, 2011
```
llvm-svn: 122658
```
  36864735
- Correct a bunch of mistakes which meant that the example pass didn't · 7f3a6566
  Duncan Sands authored Jan 01, 2011
```
even compile, let alone work.

llvm-svn: 122657
```
  7f3a6566
- I was unable to get the instructions to work if LLVM was built · f5897f8a
  Duncan Sands authored Jan 01, 2011
```
using a separate objects directory.

llvm-svn: 122656
```
  f5897f8a
- Clarify that the loadable module turns up in the top-level directory, · 92ad41f9
  Duncan Sands authored Jan 01, 2011
```
not locally.

llvm-svn: 122655
```
  92ad41f9
- Fix a README item by having InstructionSimplify do a mild form of value · e3c53958
  Duncan Sands authored Jan 01, 2011
```
numbering, in which it considers (for example) "%a = add i32 %x, %y" and
"%b = add i32 %x, %y" to be equal because the operands are equal and the
result of the instructions only depends on the values of the operands.
This has almost no effect (it removes 4 instructions from gcc-as-one-file),
and perhaps slows down compilation: I measured a 0.4% slowdown on the large
gcc-as-one-file testcase, but it wasn't statistically significant.

llvm-svn: 122654
```
  e3c53958
- ptx: remove reg-reg addressing mode and st.const · 5451fc91
  Che-Liang Chiou authored Jan 01, 2011
```
llvm-svn: 122653
```
  5451fc91
- ptx: add store instruction · 15e8d2c5
  Che-Liang Chiou authored Jan 01, 2011
```
llvm-svn: 122652
```
  15e8d2c5
- Add a reference to the OCamlLangImpl8. · 5d556ab1
  Erick Tryzelaar authored Jan 01, 2011
```
llvm-svn: 122651
```
  5d556ab1
- Add an OCaml tutorial page 8 · 619714a6
  Erick Tryzelaar authored Jan 01, 2011
```
llvm-svn: 122650
```
  619714a6
Dec 31, 2010

Add to the list of cmake files the object file, not the asm file. This · a8eb6043
Oscar Fuentes authored Dec 31, 2010
```
is necessary for executing the custom command that runs the
assember. Fixes PR8877.

llvm-svn: 122649
```
a8eb6043

CMake (MSVC): cmake automatically adds the /EHsc and /GR compiler · eb6d7778

Oscar Fuentes authored Dec 31, 2010

options. If we are building with exceptions/rtti disabled, we replace
/EHsc with /EHs-c- and /GR with /GR-, respectively. If we just add the
disabling options we get warnings like this:

cl : Command line warning D9025 : overriding '/EHs' with '/EHs-'

llvm-svn: 122648

eb6d7778

Simplify this pass by using a depth-first iterator to ensure that all · 2c440fa4
Duncan Sands authored Dec 31, 2010
```
operands are visited before the instructions themselves.

llvm-svn: 122647
```
2c440fa4

Remove stray emacs mode markers in all these files that was causing emacs to · 6da90771

Nick Lewycky authored Dec 31, 2010

open them in fundamental-mode instead of c++-mode.
Also twiddle whitespace for consistency in ToolChains.cpp.

llvm-svn: 122646

6da90771

Zap dead instructions harder. · 6cc7126e
Duncan Sands authored Dec 31, 2010
```
llvm-svn: 122645
```
6cc7126e