Commits · b0dd27ee91d9d93e3226e6f3a03700e63bb38fe7 · Lorenzo Albano / LLVM bpEVL

Nov 27, 2007

Make LoopInfoBase more generic, in preparation for having MachineLoopInfo. ... · b0dd27ee

Owen Anderson authored Nov 27, 2007

Make LoopInfoBase more generic, in preparation for having MachineLoopInfo.  This involves a small interface change.

llvm-svn: 44348

b0dd27ee

Nov 09, 2007
- Fix indent · 550b98e1
  Anton Korobeynikov authored Nov 09, 2007
```
llvm-svn: 43941
```
  550b98e1
- Forget to commit users part of value mapper interface · 98638aed
  Anton Korobeynikov authored Nov 09, 2007
```
llvm-svn: 43940
```
  98638aed
- And delete this one · 8eeca1c2
  Anton Korobeynikov authored Nov 09, 2007
```
llvm-svn: 43939
```
  8eeca1c2
Nov 04, 2007
- Finishing initial docs for all transformations in Passes.html. · d568767e
  Gordon Henriksen authored Nov 04, 2007
```
Also cleaned up some comments in source files.

llvm-svn: 43674
```
  d568767e
Nov 02, 2007
- Add std:: to sort calls. · d7917b62
  Dan Gohman authored Nov 02, 2007
```
llvm-svn: 43652
```
  d7917b62
- Change illegal uses of ++ to uses of STLExtra.h's next function. · c981d72d
  Dan Gohman authored Nov 02, 2007
```
llvm-svn: 43651
```
  c981d72d
Nov 01, 2007

Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. · 44b8721d

Duncan Sands authored Nov 01, 2007

The meaning of getTypeSize was not clear - clarifying it is important
now that we have x86 long double and arbitrary precision integers.
The issue with long double is that it requires 80 bits, and this is
not a multiple of its alignment.  This gives a primitive type for
which getTypeSize differed from getABITypeSize.  For arbitrary precision
integers it is even worse: there is the minimum number of bits needed to
hold the type (eg: 36 for an i36), the maximum number of bits that will
be overwriten when storing the type (40 bits for i36) and the ABI size
(i.e. the storage size rounded up to a multiple of the alignment; 64 bits
for i36).

This patch removes getTypeSize (not really - it is still there but
deprecated to allow for a gradual transition).  Instead there is:

(1) getTypeSizeInBits - a number of bits that suffices to hold all
values of the type.  For a primitive type, this is the minimum number
of bits.  For an i36 this is 36 bits.  For x86 long double it is 80.
This corresponds to gcc's TYPE_PRECISION.

(2) getTypeStoreSizeInBits - the maximum number of bits that is
written when storing the type (or read when reading it).  For an
i36 this is 40 bits, for an x86 long double it is 80 bits.  This
is the size alias analysis is interested in (getTypeStoreSize
returns the number of bytes).  There doesn't seem to be anything
corresponding to this in gcc.

(3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded
up to a multiple of the alignment.  For an i36 this is 64, for an
x86 long double this is 96 or 128 depending on the OS.  This is the
spacing between consecutive elements when you form an array out of
this type (getABITypeSize returns the number of bytes).  This is
TYPE_SIZE in gcc.

Since successive elements in a SequentialType (arrays, pointers
and vectors) need to be aligned, the spacing between them will be
given by getABITypeSize.  This means that the size of an array
is the length times the getABITypeSize.  It also means that GEP
computations need to use getABITypeSize when computing offsets.
Furthermore, if an alloca allocates several elements at once then
these too need to be aligned, so the size of the alloca has to be
the number of elements multiplied by getABITypeSize.  Logically
speaking this doesn't have to be the case when allocating just
one element, but it is simpler to also use getABITypeSize in this
case.  So alloca's and mallocs should use getABITypeSize.  Finally,
since gcc's only notion of size is that given by getABITypeSize, if
you want to output assembler etc the same as gcc then getABITypeSize
is the size you want.

Since a store will overwrite no more than getTypeStoreSize bytes,
and a read will read no more than that many bytes, this is the
notion of size appropriate for alias analysis calculations.

In this patch I have corrected all type size uses except some of
those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard
cases).  I will get around to auditing these too at some point,
but I could do with some help.

Finally, I made one change which I think wise but others might
consider pointless and suboptimal: in an unpacked struct the
amount of space allocated for a field is now given by the ABI
size rather than getTypeStoreSize.  I did this because every
other place that reserves memory for a type (eg: alloca) now
uses getABITypeSize, and I didn't want to make an exception
for unpacked structs, i.e. I did it to make things more uniform.
This only effects structs containing long doubles and arbitrary
precision integers.  If someone wants to pack these types more
tightly they can always use a packed struct.

llvm-svn: 43620

44b8721d

Oct 29, 2007
- Fix PR1752 and LoopSimplify/2007-10-28-InvokeCrash.ll: terminators · 4a15e04a
  Chris Lattner authored Oct 29, 2007
```
can have uses too.  Wouldn't it be nice if invoke didn't exist? :)

llvm-svn: 43426
```
  4a15e04a
Oct 22, 2007

Reg2Mem cleanup and optimizations: · 7499a3b0

Anton Korobeynikov authored Oct 21, 2007

 - enable phi instructions demotion to stack
 - create alloca instructions in the entry block

llvm-svn: 43208

7499a3b0

Oct 18, 2007
- Move Split<...>() into DomTreeBase. This should make the #include's of DominatorInternals.h · ca831a82
  Owen Anderson authored Oct 18, 2007
```
in CodeExtractor and LoopSimplify unnecessary.

Hartmut, could you confirm that this fixes the issues you were seeing?

llvm-svn: 43115
```
  ca831a82
Oct 17, 2007
- Fixed linker errors (unresolved externals: split<>(...)) when compiling with VC++. Please review. · 2f842e61
  Hartmut Kaiser authored Oct 17, 2007
```
llvm-svn: 43081
```
  2f842e61
Sep 17, 2007
- Fix comment. · 9d1af9b6
  Devang Patel authored Sep 17, 2007
```
llvm-svn: 42048
```
  9d1af9b6
- Merge DenseMapKeyInfo & DenseMapValueInfo into DenseMapInfo · 0625bd64
  Chris Lattner authored Sep 17, 2007
```
Add a new DenseMapInfo::isEqual method to allow clients to redefine
the equality predicate used when probing the hash table.

llvm-svn: 42042
```
  0625bd64
Sep 04, 2007
- Insert cloned loop basic blocks before original loop header. · f6ef552f
  Devang Patel authored Sep 04, 2007
```
llvm-svn: 41713
```
  f6ef552f
- · c656cbb8
  David Greene authored Sep 04, 2007
```
Update GEP constructors to use an iterator interface to fix
GLIBCXX_DEBUG issues.

llvm-svn: 41697
```
  c656cbb8
Sep 03, 2007
- Silence warning while compiling with gcc 4.2 · 35322d74
  Anton Korobeynikov authored Sep 02, 2007
```
llvm-svn: 41676
```
  35322d74
Aug 27, 2007

· 703623d5

David Greene authored Aug 27, 2007

Update InvokeInst to work like CallInst

llvm-svn: 41506

703623d5

Aug 26, 2007

Don't promote volatile loads/stores. This is needed (for example) to handle... · 24fb6b2f

Anton Korobeynikov authored Aug 26, 2007

Don't promote volatile loads/stores. This is needed (for example) to handle setjmp/longjmp properly.
This fixes PR1520.

llvm-svn: 41461

24fb6b2f

Aug 21, 2007
- Use SmallVector instead of std::vector. · b5933bbb
  Devang Patel authored Aug 21, 2007
```
llvm-svn: 41207
```
  b5933bbb
Aug 17, 2007
- When one branch of condition is eliminated then head of the other · d1fcfcc7
  Devang Patel authored Aug 17, 2007
```
branch is not necessary immediate dominators of merge blcok in all cases.

llvm-svn: 41144
```
  d1fcfcc7
Aug 15, 2007
- Break infinite loop. · 22c7993e
  Devang Patel authored Aug 14, 2007
```
llvm-svn: 41091
```
  22c7993e
Aug 13, 2007
- If NewBB dominates DestBB then DestBB is not part of NewBB's dominance frontier. · da48cf40
  Devang Patel authored Aug 13, 2007
```
llvm-svn: 41051
```
  da48cf40
Aug 10, 2007
- Add utility to clone loops. · aa36a439
  Devang Patel authored Aug 10, 2007
```
llvm-svn: 40997
```
  aa36a439
Aug 06, 2007
- remove some dead lines · c7ba2257
  Chris Lattner authored Aug 06, 2007
```
llvm-svn: 40859
```
  c7ba2257
Aug 05, 2007

rewrite the code used to construct pruned SSA form with the IDF method. · edce70d2

Chris Lattner authored Aug 04, 2007

In the old way, we computed and inserted phi nodes for the whole IDF of 
the definitions of the alloca, then computed which ones were dead and
removed them.

In the new method, we first compute the region where the value is live,
and use that information to only insert phi nodes that are live.  This
eliminates the need to compute liveness later, and stops the algorithm
from inserting a bunch of phis which it then later removes.

This speeds up the testcase in PR1432 from 2.00s to 0.15s (14x) in a
release build and 6.84s->0.50s (14x) in a debug build.

llvm-svn: 40825

edce70d2

Aug 04, 2007

Factor out a whole bunch of code into it's own method. · d91576b0
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40824
```
d91576b0
Use getNumPreds(BB) instead of computing them manually. This is a very small but · 4e1b4140
Chris Lattner authored Aug 04, 2007
```
measurable speedup.

llvm-svn: 40823
```
4e1b4140

Change the rename pass to be "tail recursive", only adding N-1 successors · b6a4ba80

Chris Lattner authored Aug 04, 2007

to the worklist, and handling the last one with a 'tail call'.  This speeds
up PR1432 from 2.0578s to 2.0012s (2.8%)

llvm-svn: 40822

b6a4ba80

cache computation of #preds for a BB. This speeds up · 840259c8
Chris Lattner authored Aug 04, 2007
```
mem2reg from 2.0742->2.0522s on PR1432.

llvm-svn: 40821
```
840259c8
reserve operand space for phi nodes when we insert them. · 050bac4b
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40820
```
050bac4b
use continue to avoid nesting, no functionality change. · 9318785d
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40819
```
9318785d

Promoting allocas with the 'single store' fastpath is · 6b04ecba

Chris Lattner authored Aug 04, 2007

faster than with the 'local to a block' fastpath.  This speeds
up PR1432 from 2.1232 to 2.0686s (2.6%)

llvm-svn: 40818

6b04ecba

When PromoteLocallyUsedAllocas promoted allocas, it didn't remember · 4a930f94

Chris Lattner authored Aug 04, 2007

to increment NumLocalPromoted, and didn't actually delete the
dead alloca, leading to an extra iteration of mem2reg.

llvm-svn: 40817

4a930f94

std::map -> DenseMap · 63c03978
Chris Lattner authored Aug 04, 2007
```
llvm-svn: 40816
```
63c03978

fix a logic bug where we wouldn't promote single store allocas if the · 7d382f76

Chris Lattner authored Aug 04, 2007

stored value was a non-instruction value.  Doh.

This increase the # single store allocas from 8982 to 9026, and
speeds up mem2reg on the testcase in PR1432 from 2.17 to 2.13s.

llvm-svn: 40813

7d382f76

When we do the single-store optimization, delete both the store · 1b215f06
Chris Lattner authored Aug 04, 2007
```
and the alloca so they don't get reprocessed.

This speeds up PR1432 from 2.20s to 2.17s.

llvm-svn: 40812
```
1b215f06

Three improvements: · 862f1254

Chris Lattner authored Aug 04, 2007

1. Check for revisiting a block before checking domination, which is faster.
2. If the stored value isn't an instruction, we don't have to check for domination.
3. If we have a value used in the same block more than once, make sure to remove the
block from the UsingBlocks vector. Not doing so forces us to go through the slow
path for the alloca.

The combination of these improvements increases the number of allocas on the fastpath
from 8935 to 8982 on PR1432. This speeds it up from 2.90s to 2.20s (31%)

llvm-svn: 40811

862f1254

switch from using a std::set to using a SmallPtrSet. This speeds up the · ae1e00eb
Chris Lattner authored Aug 04, 2007
```
testcase in PR1432 from 6.33s to 2.90s (2.22x)

llvm-svn: 40810
```
ae1e00eb

In mem2reg, when handling the single-store case, make sure to remove · 9181801b

Chris Lattner authored Aug 04, 2007

a using block from the list if we handle it.  Not doing this caused us
to not be able to promote (with the fast path) allocas which have uses (whoops).

This increases the # allocas hitting this fastpath from 4042 to 8935 on the
testcase in PR1432, speeding up mem2reg by 2.6x

llvm-svn: 40809

9181801b