  3. Jan 31, 2009
      Simplify and generalize the SROA "convert to scalar" transformation to · ec99c46d
      Chris Lattner authored
      be able to handle *ANY* alloca that is poked by loads and stores of
      bitcasts and GEPs with constant offsets.  Before, the code had a number
      of annoying limitations that caused it to miss cases such as storing into
      holes in structs and complex casts (as in bitfield-sroa) where we had
      unions of bitfields, etc.  This also handles a number of important cases
      that are exposed by the ABI lowering we do to pass aggregates by
      value.
      
      One case that is pretty great is that we compile 
      2006-11-07-InvalidArrayPromote.ll into:
      
      define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind {
      	%tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1)
      	%tmp105 = bitcast <4 x i32> %tmp10 to i128
      	%tmp1056 = zext i128 %tmp105 to i256	
      	%tmp.upgrd.43 = lshr i256 %tmp1056, 96
      	%tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32	
      	ret i32 %tmp.upgrd.44
      }
      
      which turns into:
      
      _func:
      	subl	$28, %esp
      	cvttps2dq	%xmm1, %xmm0
      	movaps	%xmm0, (%esp)
      	movl	12(%esp), %eax
      	addl	$28, %esp
      	ret
      
      which is pretty good code, all things considered :).
      
      One effect of this is that SROA will start generating arbitrary-bitwidth
      integers that are a multiple of 8 bits wide.  In the case above we got a
      256-bit integer, but the codegen folks assure me that they can handle the
      simple and/or/shift/zext operations we emit on them.
      
      This addresses rdar://6532315
      
      llvm-svn: 63469
  7. Jan 08, 2009
      This implements the second half of the fix for PR3290, handling · c518dfd1
      Chris Lattner authored
      loads from allocas that cover the entire aggregate.  This handles
      some memcpy/byval cases that are produced by llvm-gcc.  This triggers
      a few times in kc++ (with std::pair<std::_Rb_tree_const_iterator
      <kc::impl_abstract_phylum*>,bool>) and once in 176.gcc (with %struct..0anon).
      
      llvm-svn: 61915
  14. Jun 22, 2008
      Fix PR2369 by making scalarrepl more careful about promoting · 6ff85681
      Chris Lattner authored
      structures.  Its default threshold is to promote things that are
      smaller than 128 bytes, which is sane.  However, it is not sane
      to do this for things that turn into 128 *registers*.  Add a cap
      on the number of registers introduced, defaulting to 128/4=32.
      
      llvm-svn: 52611
  16. Jun 04, 2008
      Change packed struct layout so that field sizes · fc3c489b
      Duncan Sands authored
      are the same as in unpacked structs, only field
      positions differ.  This only matters for structs
      containing x86 long double or an apint; it may
      cause backwards compatibility problems if someone
      has bitcode containing a packed struct with a
      field of one of those types.
      The issue is that only 10 bytes are needed to
      hold an x86 long double: the store size is 10
      bytes, but the ABI size is 12 or 16 bytes
      (linux/darwin), which comes from rounding the store size
      up by the alignment.  Because it seemed silly not
      to pack an x86 long double into 10 bytes in a
      packed struct, this is what was done.  I now
      think this was a mistake.  Reserving the ABI size
      for an x86 long double field even in a packed
      struct makes things more uniform: the ABI size is
      now always used when reserving space for a type.
      This means that developers are less likely to
      make mistakes.  It also makes life easier for the
      CBE which otherwise could not represent all LLVM
      packed structs (PR2402).
      Front-end people might need to adjust the way
      they create LLVM structs - see following change
      to llvm-gcc.
      
      llvm-svn: 51928
  27. Nov 04, 2007
      Change uses of getTypeSize to getABITypeSize, getTypeStoreSize · 399d9798
      Duncan Sands authored
      or getTypeSizeInBits as appropriate in ScalarReplAggregates.
      The right change to make was not always obvious, so it would
      be good to have an sroa guru review this.  While there I noticed
      some bugs, and fixed them: (1) arrays of x86 long double have
      holes due to alignment padding, but this wasn't being spotted
      by HasStructPadding (renamed to HasPadding).  The same goes
      for arrays of oddly sized ints.  Vectors also suffer from this;
      in fact the problem for vectors is much worse, because basic
      vector assumptions seem to be broken by vectors of a type with
      alignment padding.  I didn't try to fix any of these vector
      problems.  (2) The code for extracting smaller integers from
      larger ones (in the "int union" case) was wrong on big-endian
      machines for integers with size not a multiple of 8, like i1.
      Probably this is impossible to hit via llvm-gcc, but I fixed
      it anyway while there and added a testcase.  I also got rid of
      some trailing whitespace and changed a function name which
      had an obvious typo in it.
      
      llvm-svn: 43672
  29. Sep 04, 2007
      · c656cbb8
      David Greene authored
      Update GEP constructors to use an iterator interface to fix
      GLIBCXX_DEBUG issues.
      
      llvm-svn: 41697