Commits · d210e9d8809d5eb1693aa20bdc0d4b64f08768e2 · Roger Ferrer / llvm-epi-0.8

Nov 04, 2007

Change uses of getTypeSize to getABITypeSize, getTypeStoreSize · 399d9798

Duncan Sands authored Nov 04, 2007

or getTypeSizeInBits as appropriate in ScalarReplAggregates.
The right change to make was not always obvious, so it would
be good to have an sroa guru review this.  While there I noticed
some bugs, and fixed them: (1) arrays of x86 long double have
holes due to alignment padding, but this wasn't being spotted
by HasStructPadding (renamed to HasPadding).  The same goes
for arrays of oddly sized ints.  Vectors also suffer from this,
in fact the problem for vectors is much worse because basic
vector assumptions seem to be broken by vectors of types with
alignment padding.   I didn't try to fix any of these vector
problems.  (2) The code for extracting smaller integers from
larger ones (in the "int union" case) was wrong on big-endian
machines for integers with size not a multiple of 8, like i1.
Probably this is impossible to hit via llvm-gcc, but I fixed
it anyway while there and added a testcase.  I also got rid of
some trailing whitespace and changed a function name which
had an obvious typo in it.
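
As a rough illustration of point (1), the padding test for an array element can be thought of as comparing the bits a type needs with the spacing between consecutive elements. This is only a sketch: `ElementSizes` and `elementHasPadding` are hypothetical names, not the code in ScalarReplAggregates, and the sizes are supplied by hand rather than queried from the target.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for two per-type size queries; the caller supplies
// the numbers instead of asking the target description.
struct ElementSizes {
  std::uint64_t SizeInBits;    // bits needed to hold every value of the type
  std::uint64_t ABISizeInBits; // spacing between consecutive array elements
};

// An array of this element type has holes whenever the element spacing is
// larger than the number of bits the element actually uses.
inline bool elementHasPadding(ElementSizes E) {
  assert(E.ABISizeInBits >= E.SizeInBits && "spacing cannot be smaller than the value");
  return E.SizeInBits != E.ABISizeInBits;
}
```

With 80 value bits and a 96-bit spacing (one possible x86 long double layout) the check reports padding; a plain 32-bit integer with 32-bit spacing does not.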

llvm-svn: 43672

399d9798

Disable tail duplication of call instructions. The cost · ce8c6266

Chris Lattner authored Nov 04, 2007

metric is way off for these in general, and this works around
buggy code like that in PR1764.  We'll see if there is a big
performance impact of this.  If so, I'll revert it tomorrow.

llvm-svn: 43668

ce8c6266

Nov 02, 2007
- Add std:: to sort calls. · d7917b62
  Dan Gohman authored Nov 02, 2007
```
llvm-svn: 43652
```
  d7917b62
- Change illegal uses of ++ to uses of STLExtra.h's next function. · c981d72d
  Dan Gohman authored Nov 02, 2007
```
llvm-svn: 43651
```
  c981d72d
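
The kind of change described in the `next` commit can be sketched as follows. This is an assumption about what "illegal uses of ++" refers to (incrementing a temporary iterator, which does not compile when the iterator is a raw pointer), and it uses the standard library's `std::next` as a stand-in for the helper in STLExtras.h; `secondElement` is a made-up function.

```cpp
#include <cassert>
#include <iterator>
#include <vector>

// Returns the second element of V.
// Writing `*++V.begin()` would increment a temporary; that only works when the
// iterator is a class type and is ill-formed for raw pointers. std::next copies
// the iterator and advances the copy, which is valid for any iterator type.
inline int secondElement(const std::vector<int> &V) {
  assert(V.size() >= 2 && "need at least two elements");
  return *std::next(V.begin());
}
```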
Nov 01, 2007

Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. · 44b8721d

Duncan Sands authored Nov 01, 2007

The meaning of getTypeSize was not clear - clarifying it is important
now that we have x86 long double and arbitrary precision integers.
The issue with long double is that it requires 80 bits, and this is
not a multiple of its alignment.  This gives a primitive type for
which getTypeSize differed from getABITypeSize.  For arbitrary precision
integers it is even worse: there is the minimum number of bits needed to
hold the type (eg: 36 for an i36), the maximum number of bits that will
be overwritten when storing the type (40 bits for i36) and the ABI size
(i.e. the storage size rounded up to a multiple of the alignment; 64 bits
for i36).

This patch removes getTypeSize (not really - it is still there but
deprecated to allow for a gradual transition).  Instead there is:

(1) getTypeSizeInBits - a number of bits that suffices to hold all
values of the type.  For a primitive type, this is the minimum number
of bits.  For an i36 this is 36 bits.  For x86 long double it is 80.
This corresponds to gcc's TYPE_PRECISION.

(2) getTypeStoreSizeInBits - the maximum number of bits that is
written when storing the type (or read when reading it).  For an
i36 this is 40 bits, for an x86 long double it is 80 bits.  This
is the size alias analysis is interested in (getTypeStoreSize
returns the number of bytes).  There doesn't seem to be anything
corresponding to this in gcc.

(3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded
up to a multiple of the alignment.  For an i36 this is 64, for an
x86 long double this is 96 or 128 depending on the OS.  This is the
spacing between consecutive elements when you form an array out of
this type (getABITypeSize returns the number of bytes).  This is
TYPE_SIZE in gcc.
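
The relationship between the three sizes can be sketched with a little arithmetic. This is an illustration only: `TypeInfo`, `roundUpTo` and the three free functions are hypothetical and are not the TargetData methods themselves, and the alignments used in the usage note (64 bits for i36, 32 or 128 bits for x86 long double) are assumptions chosen to reproduce the figures quoted above.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: round Value up to the next multiple of Multiple.
inline std::uint64_t roundUpTo(std::uint64_t Value, std::uint64_t Multiple) {
  assert(Multiple > 0 && "cannot round to a multiple of zero");
  return (Value + Multiple - 1) / Multiple * Multiple;
}

// Illustrative description of a primitive type.
struct TypeInfo {
  std::uint64_t SizeInBits;   // minimum number of bits holding all values
  std::uint64_t ABIAlignBits; // ABI alignment in bits (assumed, target specific)
};

// (1) Bits that suffice to hold all values of the type.
inline std::uint64_t typeSizeInBits(TypeInfo T) { return T.SizeInBits; }

// (2) Bits touched by a store or load: the value size rounded up to whole bytes.
inline std::uint64_t typeStoreSizeInBits(TypeInfo T) {
  return roundUpTo(T.SizeInBits, 8);
}

// (3) Spacing between array elements: the store size rounded up to the alignment.
inline std::uint64_t abiTypeSizeInBits(TypeInfo T) {
  return roundUpTo(typeStoreSizeInBits(T), T.ABIAlignBits);
}
```

For `TypeInfo{36, 64}` the three functions give 36, 40 and 64; for `TypeInfo{80, 32}` they give 80, 80 and 96, and with a 128-bit alignment the last value becomes 128.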

Since successive elements in a SequentialType (arrays, pointers
and vectors) need to be aligned, the spacing between them will be
given by getABITypeSize.  This means that the size of an array
is the length times the getABITypeSize.  It also means that GEP
computations need to use getABITypeSize when computing offsets.
Furthermore, if an alloca allocates several elements at once then
these too need to be aligned, so the size of the alloca has to be
the number of elements multiplied by getABITypeSize.  Logically
speaking this doesn't have to be the case when allocating just
one element, but it is simpler to also use getABITypeSize in this
case.  So alloca's and mallocs should use getABITypeSize.  Finally,
since gcc's only notion of size is that given by getABITypeSize, if
you want to output assembler etc the same as gcc then getABITypeSize
is the size you want.
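
A minimal sketch of the array and GEP arithmetic described above, with the ABI size passed in as a plain byte count. The function names are hypothetical, and the 12-byte figure in the usage note assumes an x86 long double with a 96-bit ABI size.

```cpp
#include <cassert>
#include <cstdint>

// Bytes occupied by an array (or a multi-element alloca): element count times
// the ABI size of the element type.
inline std::uint64_t arrayAllocSize(std::uint64_t NumElements,
                                    std::uint64_t ABISizeBytes) {
  return NumElements * ABISizeBytes;
}

// Byte offset of element Index, as a GEP-style computation would produce it:
// consecutive elements are spaced by the ABI size, not by the store size.
inline std::uint64_t elementOffset(std::uint64_t Index,
                                   std::uint64_t ABISizeBytes) {
  return Index * ABISizeBytes;
}
```

With a 12-byte ABI size, an array of four elements occupies 48 bytes and element 3 starts at byte 36, even though each store only writes 10 bytes.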

Since a store will overwrite no more than getTypeStoreSize bytes,
and a read will read no more than that many bytes, this is the
notion of size appropriate for alias analysis calculations.
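
As a sketch of why the store size is the right notion for alias queries, consider an overlap test on two accesses relative to a common base. `Access` and `mayOverlap` are hypothetical and far simpler than a real alias analysis; they only show which size enters the interval check.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical description of one memory access: a byte offset from a common
// base pointer and the store size (in bytes) of the accessed type.
struct Access {
  std::uint64_t Offset;
  std::uint64_t StoreSize;
};

// Two accesses can touch the same byte only if the half-open byte ranges
// [Offset, Offset + StoreSize) intersect.
inline bool mayOverlap(Access A, Access B) {
  return A.Offset < B.Offset + B.StoreSize && B.Offset < A.Offset + A.StoreSize;
}
```

Two 10-byte stores placed 12 bytes apart do not overlap, and a 1-byte access at offset 5 does not overlap a 5-byte i36 store at offset 0, although it would appear to if the 8-byte ABI size were used instead.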

In this patch I have corrected all type size uses except some of
those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard
cases).  I will get around to auditing these too at some point,
but I could do with some help.

Finally, I made one change which I think wise but others might
consider pointless and suboptimal: in an unpacked struct the
amount of space allocated for a field is now given by the ABI
size rather than getTypeStoreSize.  I did this because every
other place that reserves memory for a type (eg: alloca) now
uses getABITypeSize, and I didn't want to make an exception
for unpacked structs, i.e. I did it to make things more uniform.
This only affects structs containing long doubles and arbitrary
precision integers.  If someone wants to pack these types more
tightly they can always use a packed struct.
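
The effect on struct layout can be sketched as below. This is not the StructLayout code from the patch: `FieldInfo`, `Layout` and `layoutStruct` are hypothetical, sizes are in bytes, and the packed branch (store size, alignment 1) is an assumption about how packed structs are laid out.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical per-field description, in bytes.
struct FieldInfo {
  std::uint64_t StoreSize; // bytes written by a store of the field
  std::uint64_t ABIAlign;  // ABI alignment of the field
};

struct Layout {
  std::vector<std::uint64_t> Offsets; // byte offset of each field
  std::uint64_t Size;                 // total struct size
};

inline std::uint64_t roundUpTo(std::uint64_t Value, std::uint64_t Multiple) {
  assert(Multiple > 0 && "cannot round to a multiple of zero");
  return (Value + Multiple - 1) / Multiple * Multiple;
}

// Unpacked: each field is aligned and reserves its ABI size.
// Packed (assumed): fields sit back to back and reserve only their store size.
inline Layout layoutStruct(const std::vector<FieldInfo> &Fields, bool Packed) {
  Layout L;
  L.Size = 0;
  std::uint64_t MaxAlign = 1;
  for (const FieldInfo &F : Fields) {
    std::uint64_t Align = Packed ? 1 : F.ABIAlign;
    std::uint64_t Space = Packed ? F.StoreSize : roundUpTo(F.StoreSize, F.ABIAlign);
    L.Size = roundUpTo(L.Size, Align); // place the field at an aligned offset
    L.Offsets.push_back(L.Size);
    L.Size += Space;
    if (Align > MaxAlign)
      MaxAlign = Align;
  }
  L.Size = roundUpTo(L.Size, MaxAlign); // tail padding
  return L;
}
```

For a 10-byte long double with 4-byte alignment followed by a single byte, the unpacked layout puts the byte at offset 12 (it would have been 10 if the field reserved only its store size), while the packed layout puts it at offset 10.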

llvm-svn: 43620

44b8721d

Fix test/Transforms/DeadStoreElimination/PartialStore.ll, which had been · 2ed651ac
Owen Anderson authored Nov 01, 2007
```
silently failing because of an incorrect run line for some time.

llvm-svn: 43605
```
2ed651ac
Fix InstCombine/2007-10-31-RangeCrash.ll · 74709473
Chris Lattner authored Nov 01, 2007
```
llvm-svn: 43596
```
74709473

Oct 31, 2007
- Fix a typo in a comment. · 54048ec9
  Dan Gohman authored Oct 31, 2007
```
llvm-svn: 43553
```
  54048ec9
- At end of LSR, replace uses of now constant (as result of SplitCriticalEdge)... · 240c1ada
  Evan Cheng authored Oct 30, 2007
```
At end of LSR, replace uses of now constant (as result of SplitCriticalEdge) PHI node with the constant value.

llvm-svn: 43533
```
  240c1ada
Oct 30, 2007

It's not safe to tell SplitCriticalEdge to merge identical edges. It may... · c2dbfee4

Evan Cheng authored Oct 30, 2007

It's not safe to tell SplitCriticalEdge to merge identical edges. It may delete the phi instruction that's being processed.

llvm-svn: 43524

c2dbfee4

Oct 29, 2007
- - Bug fixes. · b024c4c8
  Evan Cheng authored Oct 29, 2007
```
- Allow icmp rewrite using an iv / stride of a smaller integer type.

llvm-svn: 43480
```
  b024c4c8
- Don't bitcast from pointer-to-vector to pointer-to-array when · 2aec186d
  Dan Gohman authored Oct 29, 2007
```
lowering load and store instructions.

llvm-svn: 43468
```
  2aec186d
- Use an array instead of a fixed-length std::vector. · 3bcd5fe9
  Dan Gohman authored Oct 29, 2007
```
llvm-svn: 43467
```
  3bcd5fe9
- Do a real assert if there is an unhandled vector instruction instead · d9911e21
  Dan Gohman authored Oct 29, 2007
```
of just printing to cerr.

llvm-svn: 43466
```
  d9911e21
- Update a comment to reflect the current code. · 7414e21e
  Dan Gohman authored Oct 29, 2007
```
llvm-svn: 43463
```
  7414e21e
- Remove an unused function argument. · f5feb010
  Dan Gohman authored Oct 29, 2007
```
llvm-svn: 43462
```
  f5feb010
- Fix a typo in a comment. · 50d42224
  Dan Gohman authored Oct 29, 2007
```
llvm-svn: 43461
```
  50d42224
- Avoid calling ValidStride when not all uses are addresses. · 8e8adada
  Dan Gohman authored Oct 29, 2007
```
llvm-svn: 43460
```
  8e8adada
- Fix PR1752 and LoopSimplify/2007-10-28-InvokeCrash.ll: terminators · 4a15e04a
  Chris Lattner authored Oct 29, 2007
```
can have uses too.  Wouldn't it be nice if invoke didn't exist? :)

llvm-svn: 43426
```
  4a15e04a
Oct 27, 2007

A number of LSR fixes: · 9dbe99dc

Evan Cheng authored Oct 26, 2007

- ChangeCompareStride only reuses a stride that is larger than the current
  stride. The general reuse mechanism is left to try reusing a smaller stride.
- Watch out for multiplication overflow in ChangeCompareStride.
- Replace std::set with SmallPtrSet.
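
The overflow concern can be illustrated with a guarded multiplication. This is a sketch under assumptions, not the check in ChangeCompareStride: `scaleCompareConstant` is a hypothetical helper that scales a compare constant by a positive stride ratio and refuses when the product would not fit in 64 bits.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Computes Constant * Scale into Result and returns true, or returns false
// (leaving Result untouched) when the product would overflow int64_t.
// Only positive scales are handled in this sketch.
inline bool scaleCompareConstant(std::int64_t Constant, std::int64_t Scale,
                                 std::int64_t &Result) {
  assert(Scale > 0 && "sketch only handles positive scales");
  const std::int64_t Max = std::numeric_limits<std::int64_t>::max();
  const std::int64_t Min = std::numeric_limits<std::int64_t>::min();
  // Compare against the bounds divided by Scale so the check itself cannot overflow.
  if (Constant > Max / Scale || Constant < Min / Scale)
    return false;
  Result = Constant * Scale;
  return true;
}
```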

llvm-svn: 43408

9dbe99dc

Oct 26, 2007

Fix a crash. Make sure TLI is not null. · d78a3e55
Evan Cheng authored Oct 26, 2007
```
llvm-svn: 43384
```
d78a3e55
More fleshing out of docs/Passes.html, plus some typo fixes and · 78c63ac4
Gordon Henriksen authored Oct 26, 2007
```
improved wording in source files.

llvm-svn: 43377
```
78c63ac4

Loosen up iv reuse to allow reuse of the same stride but a larger type when... · 7f3d0247

Evan Cheng authored Oct 26, 2007

Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free.
e.g.
Turns this loop:
LBB1_1: # entry.bb_crit_edge
        xorl    %ecx, %ecx
        xorw    %dx, %dx
        movw    %dx, %si
LBB1_2: # bb
        movl    L_X$non_lazy_ptr, %edi
        movw    %si, (%edi)
        movl    L_Y$non_lazy_ptr, %edi
        movw    %dx, (%edi)
        addw    $4, %dx
        incw    %si
        incl    %ecx
        cmpl    %eax, %ecx
        jne     LBB1_2  # bb
	
into

LBB1_1: # entry.bb_crit_edge
        xorl    %ecx, %ecx
        xorw    %dx, %dx
LBB1_2: # bb
        movl    L_X$non_lazy_ptr, %esi
        movw    %cx, (%esi)
        movl    L_Y$non_lazy_ptr, %esi
        movw    %dx, (%esi)
        addw    $4, %dx
        incl    %ecx
        cmpl    %eax, %ecx
        jne     LBB1_2  # bb
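
At the source level, the reuse shown in the assembly above amounts to dropping a separate 16-bit counter and truncating the 32-bit one instead. The sketch below checks that the two agree, including after the narrow counter wraps; the function name is hypothetical and the C++ is only an analogy for what the pass does on IR.

```cpp
#include <cassert>
#include <cstdint>

// Runs a loop with a 32-bit induction variable and a separate 16-bit one of
// the same stride, and reports whether the 16-bit value always equals the
// truncation of the 32-bit value. If it does, the narrow variable (the
// incw %si in the first listing) is redundant.
inline bool truncatedIVMatches(std::uint32_t TripCount) {
  std::uint16_t Narrow = 0;
  for (std::uint32_t Wide = 0; Wide != TripCount; ++Wide) {
    if (Narrow != static_cast<std::uint16_t>(Wide))
      return false;
    ++Narrow; // wraps modulo 65536, exactly like the truncation
  }
  return true;
}
```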

llvm-svn: 43375

7f3d0247

Do not rewrite compare instruction using iv of a different stride if the new · 29e29e63
Evan Cheng authored Oct 25, 2007
```
stride may be rewritten using the stride of the compare instruction.

llvm-svn: 43367
```
29e29e63

Oct 25, 2007

Remove code that's commented out. · 5a381083
Evan Cheng authored Oct 25, 2007
```
llvm-svn: 43356
```
5a381083

If a loop termination compare instruction is the only use of its stride, · 133694db

Evan Cheng authored Oct 25, 2007

and the comparison is against a constant value, try to eliminate the stride
by moving the compare instruction to another stride and changing its
constant operand accordingly. e.g.

loop:
...
v1 = v1 + 3
v2 = v2 + 1
if (v2 < 10) goto loop
=>
loop:
...
v1 = v1 + 3
if (v1 < 30) goto loop
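
The two loops above can be checked for equal trip counts with a small sketch. It assumes both variables start at zero, which the example does not state, and the function names are made up for illustration.

```cpp
#include <cassert>

// Original form: v1 steps by 3, v2 steps by 1, and the exit test uses v2.
inline int tripCountOriginal() {
  int v1 = 0, v2 = 0, trips = 0;
  do {
    ++trips;
    v1 = v1 + 3;
    v2 = v2 + 1;
  } while (v2 < 10);
  (void)v1; // v1 is only kept to mirror the original loop body
  return trips;
}

// Rewritten form: v2 is gone and the exit test uses v1, with the constant
// scaled by the ratio of the strides (10 * 3 = 30).
inline int tripCountRewritten() {
  int v1 = 0, trips = 0;
  do {
    ++trips;
    v1 = v1 + 3;
  } while (v1 < 30);
  return trips;
}
```

Both functions run the loop body ten times under the zero-start assumption.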

llvm-svn: 43336

133694db

Oct 24, 2007
- Fix off by 1 bug in printf->puts lowering. · 4d06391c
  Dale Johannesen authored Oct 24, 2007
```
llvm-svn: 43309
```
  4d06391c
- simplify some code by using the new isNaN predicate · 55b8302d
  Chris Lattner authored Oct 24, 2007
```
llvm-svn: 43305
```
  55b8302d
- Implement a couple of foldings for ordered and unordered comparisons, · c62877e9
  Chris Lattner authored Oct 24, 2007
```
implementing cases related to PR1738.

llvm-svn: 43289
```
  c62877e9
Oct 22, 2007

Strength reduction improvements. · e0c3d9f3

Dan Gohman authored Oct 22, 2007

 - Avoid attempting stride-reuse in the case that there are users that
   aren't addresses. In that case, there will be places where the
   multiplications won't be folded away, so it's better to try to
   strength-reduce them.

 - Several SSE intrinsics have operands that strength-reduction can
   treat as addresses. The previous item makes this more visible, as
   any non-address use of an IV can inhibit stride-reuse.

 - Make ValidStride aware of whether there's likely to be a base
   register in the address computation. This prevents it from thinking
   that things like stride 9 are valid on x86 when the base register is
   already occupied.

Also, XFAIL the 2007-08-10-LEA16Use32.ll test; the new logic to avoid
stride-reuse eliminates the LEA in the loop, so the test is no longer
testing what it was intended to test.

llvm-svn: 43231

e0c3d9f3

Move the SCEV object factories from being static members of the individual · a37eaf2b
Dan Gohman authored Oct 22, 2007
```
SCEV subclasses to being non-static member functions of the ScalarEvolution
class.

llvm-svn: 43224
```
a37eaf2b

Reg2Mem cleanup and optimizations: · 7499a3b0

Anton Korobeynikov authored Oct 21, 2007

 - enable phi instructions demotion to stack
 - create alloca instructions in the entry block

llvm-svn: 43208

7499a3b0

Oct 18, 2007
- Try again. · df49cf52
  Devang Patel authored Oct 18, 2007
```
Instead of loading small global string from memory, use
integer constant.

llvm-svn: 43148
```
  df49cf52
- Allow GVN to eliminate redundant calls to functions without side effects. · 09b83ba6
  Owen Anderson authored Oct 18, 2007
```
llvm-svn: 43147
```
  09b83ba6
- Fix PR1735 and Transforms/DeadArgElim/2007-10-18-VarargsReturn.ll by · 9715d9fb
  Chris Lattner authored Oct 18, 2007
```
fixing some obviously broken code :(

llvm-svn: 43141
```
  9715d9fb
- Move Split<...>() into DomTreeBase. This should make the #include's of DominatorInternals.h · ca831a82
  Owen Anderson authored Oct 18, 2007
```
in CodeExtractor and LoopSimplify unnecessary.

Hartmut, could you confirm that this fixes the issues you were seeing?

llvm-svn: 43115
```
  ca831a82
- Reverting r43070 for now. It's causing llc test failures. · cdcc1d04
  Evan Cheng authored Oct 17, 2007
```
llvm-svn: 43103
```
  cdcc1d04
Oct 17, 2007
- Do not raise free() call that is called through invoke instruction. · b3dac3f5
  Devang Patel authored Oct 17, 2007
```
llvm-svn: 43083
```
  b3dac3f5
- Fixed linker errors (unresolved externals: split<>(...)) when compiling with VC++. Please review. · 2f842e61
  Hartmut Kaiser authored Oct 17, 2007
```
llvm-svn: 43081
```
  2f842e61
- Apply "Instead of loading small c string constant, use integer constant... · 91ff13ed
  Devang Patel authored Oct 17, 2007
```
Apply "Instead of loading small c string constant, use integer constant directly" transformation while processing load instruction.

llvm-svn: 43070
```
  91ff13ed
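
The idea behind the small-string transformation can be sketched as folding the bytes of a constant string into the integer that a load of those bytes would yield. Everything here is illustrative: `foldStringLoad`, `hostIsLittleEndian` and `loadFromMemory` are hypothetical helpers, the width is fixed at four bytes, and the real transformation works on LLVM IR rather than on C++ values.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Folds four constant bytes into the 32-bit value a load would produce on a
// target with the given byte order.
inline std::uint32_t foldStringLoad(const char (&Bytes)[4], bool LittleEndian) {
  std::uint32_t Value = 0;
  for (unsigned I = 0; I != 4; ++I) {
    unsigned Shift = LittleEndian ? 8 * I : 8 * (3 - I);
    Value |= static_cast<std::uint32_t>(static_cast<unsigned char>(Bytes[I])) << Shift;
  }
  return Value;
}

// Detects the byte order of the machine running this sketch.
inline bool hostIsLittleEndian() {
  const std::uint16_t Probe = 1;
  unsigned char First = 0;
  std::memcpy(&First, &Probe, 1);
  return First == 1;
}

// The load the folded constant replaces: read the four bytes from memory.
inline std::uint32_t loadFromMemory(const char (&Bytes)[4]) {
  std::uint32_t Value = 0;
  std::memcpy(&Value, Bytes, 4);
  return Value;
}
```

For the string "abc" (three characters plus the terminating zero) the little-endian constant is 0x00636261, and on any host the folded value for the host's byte order equals what the load reads from memory.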