Commits · 43a1161379f7b91b68a249aed23425518aa4b68d · Roger Ferrer / llvm-epi-0.8

Feb 03, 2009

If "optimize for size" attribute is set then block non-trivial loop unswitches... · 43a11613
Devang Patel authored Feb 03, 2009
```
If "optimize for size" attribute is set then block non-trivial loop unswitches but allow trivial loop unswitches.

llvm-svn: 63670
```
43a11613
teach "convert from scalar" to handle loads of fca's. · ef37dc85
Chris Lattner authored Feb 03, 2009
```
llvm-svn: 63659
```
ef37dc85
refactor the interface to ConvertUsesOfLoadToScalar, · f5df53cb
Chris Lattner authored Feb 03, 2009
```
renaming it to ConvertScalar_ExtractValue

llvm-svn: 63658
```
f5df53cb
convert ConvertUsesOfLoadToScalar to use IRBuilder, · 576baa4a
Chris Lattner authored Feb 03, 2009
```
no functionality change.

llvm-svn: 63652
```
576baa4a
switch ConvertScalar_InsertValue to use an IRBuilder, no · c1fb96d3
Chris Lattner authored Feb 03, 2009
```
functionality change.

llvm-svn: 63651
```
c1fb96d3
make scalar conversion handle stores of first class · 18f56c29
Chris Lattner authored Feb 03, 2009
```
aggregate values.  loads are not yet handled (coming
soon to an sroa near you).

llvm-svn: 63649
```
18f56c29

Make SROA produce a vector only when the alloca is actually · 73eff2e6

Chris Lattner authored Feb 03, 2009

accessed at least once as a vector.  This prevents it from
compiling the example in not-a-vector into:

define double @test(double %A, double %B) {
	%tmp4 = insertelement <7 x double> undef, double %A, i32 0
	%tmp = insertelement <7 x double> %tmp4, double %B, i32 4
	%tmp2 = extractelement <7 x double> %tmp, i32 4
	ret double %tmp2
}

instead, producing the integer code.  Producing vectors when they
aren't otherwise in the program is dangerous because a lot of other
code treats them carefully and doesn't want to break them down.
OTOH, many things want to break down tasty i448's.

llvm-svn: 63638

73eff2e6

APInt'fy SimplifyDemandedVectorElts so it can analyze vectors with more than 64 elements. · 8542caa3
Evan Cheng authored Feb 03, 2009
```
llvm-svn: 63631
```
8542caa3
add another case of undefined behavior without crashing, PR3466. · 80810b4c
Chris Lattner authored Feb 03, 2009
```
llvm-svn: 63620
```
80810b4c

Teach ConvertUsesToScalar to handle memset, allowing it to handle · 6aa6b1f2

Chris Lattner authored Feb 03, 2009

crazy cases like:

struct f {  int A, B, C, D, E, F; };
short test4() {
  struct f A;
  A.A = 1;
  memset(&A.B, 2, 12);
  return A.C;
}

llvm-svn: 63596

6aa6b1f2

rearrange how SRoA handles promotion of allocas to vectors. · 09b65ab2

Chris Lattner authored Feb 03, 2009

With the new world order, it can handle cases where the first
store into the alloca is an element of the vector, instead of
requiring the first analyzed store to have the vector type 
itself.  This allows us to un-xfail 
test/CodeGen/X86/vec_ins_extract.ll.

llvm-svn: 63590

09b65ab2

Feb 02, 2009
- inline SROA::ConvertToScalar, no functionality change. · 43cecd7c
  Chris Lattner authored Feb 02, 2009
```
llvm-svn: 63544
```
  43cecd7c
- Fix a bug which caused us to miscompile a couple of Ada · 18eba4f2
  Chris Lattner authored Feb 02, 2009
```
tests.  Thanks for the beautiful reduced testcase Duncan!

llvm-svn: 63529
```
  18eba4f2
- Fix a comment (bytes -> bits), reformat a comment · 6f361ff3
  Duncan Sands authored Feb 02, 2009
```
and remove trailing whitespace.  No functionality
change.

llvm-svn: 63511
```
  6f361ff3
- Fix an obvious thinko. · 33d6e97e
  Duncan Sands authored Feb 02, 2009
```
llvm-svn: 63510
```
  33d6e97e
- reduce indentation, (~XorCST->getValue()).isSignBit() -> isMaxSignedValue() · 1aafe4ce
  Chris Lattner authored Feb 02, 2009
```
llvm-svn: 63500
```
  1aafe4ce
Jan 31, 2009

Reinstate this optimization to fold icmp of xor when possible. Don't try to · f2390815

Nick Lewycky authored Jan 31, 2009

turn icmp eq a+x, b+x into icmp eq a, b if a+x or b+x has other uses. This
may have been increasing register pressure leading to the bzip2 slowdown.

llvm-svn: 63487

f2390815

Fix PR3452 (an infinite loop bootstrapping) by disabling the recent · 9e2b9f32

Chris Lattner authored Jan 31, 2009

improvements to the EvaluateInDifferentType code.  This code works 
by just inserted a bunch of new code and then seeing if it is 
useful.  Instcombine is not allowed to do this: it can only insert
new code if it is useful, and only when it is converging to a more
canonical fixed point.  Now that we iterate when DCE makes progress,
this causes an infinite loop when the code ends up not being used.

llvm-svn: 63483

9e2b9f32

now that all the pieces are in place, teach instcombine's · 76a63ed0

Chris Lattner authored Jan 31, 2009

simplifydemandedbits to simplify instructions with *multiple
uses* in contexts where it can get away with it.  This allows
it to simplify the code in multi-use-or.ll into a single 'add 
double'.

This change is particularly interesting because it will cover
up for some common codegen bugs with large integers created due
to the recent SROA patch.  When working on fixing those bugs,
this should be disabled.

llvm-svn: 63481

76a63ed0

simplify/clarify control flow and improve comments, no functionality change. · 3e2cb66c
Chris Lattner authored Jan 31, 2009
```
llvm-svn: 63480
```
3e2cb66c

make some fairly meaty internal changes to how SimplifyDemandedBits works. · 83c6a141

Chris Lattner authored Jan 31, 2009

Now, if it detects that "V" is the same as some other value, 
SimplifyDemandedBits returns the new value instead of RAUW'ing it immediately.
This has two benefits:
1) simpler code in the recursive SimplifyDemandedBits routine.
2) it allows future fun stuff in instcombine where an operation has multiple
   uses and can be simplified in one context, but not all.

#2 isn't implemented yet, this patch should have no functionality change.

llvm-svn: 63479

83c6a141

minor cleanups · 585cfb2c
Chris Lattner authored Jan 31, 2009
```
llvm-svn: 63477
```
585cfb2c

make sure to set Changed=true when instcombine hacks on the code, · 94cfb281

Chris Lattner authored Jan 31, 2009

not doing so prevents it from properly iterating and prevents it
from deleting the entire body of dce-iterate.ll

llvm-svn: 63476

94cfb281

Simplify and generalize the SROA "convert to scalar" transformation to · ec99c46d

Chris Lattner authored Jan 31, 2009

be able to handle *ANY* alloca that is poked by loads and stores of 
bitcasts and GEPs with constant offsets.  Before the code had a number
of annoying limitations and caused it to miss cases such as storing into
holes in structs and complex casts (as in bitfield-sroa) where we had
unions of bitfields etc.  This also handles a number of important cases
that are exposed due to the ABI lowering stuff we do to pass stuff by
value.

One case that is pretty great is that we compile 
2006-11-07-InvalidArrayPromote.ll into:

define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind {
	%tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1)
	%tmp105 = bitcast <4 x i32> %tmp10 to i128
	%tmp1056 = zext i128 %tmp105 to i256	
	%tmp.upgrd.43 = lshr i256 %tmp1056, 96
	%tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32	
	ret i32 %tmp.upgrd.44
}

which turns into:

_func:
	subl	$28, %esp
	cvttps2dq	%xmm1, %xmm0
	movaps	%xmm0, (%esp)
	movl	12(%esp), %eax
	addl	$28, %esp
	ret

Which is pretty good code all things considering :).

One effect of this is that SROA will start generating arbitrary bitwidth 
integers that are a multiple of 8 bits.  In the case above, we got a 
256 bit integer, but the codegen guys assure me that it can handle the 
simple and/or/shift/zext stuff that we're doing on these operations.

This addresses rdar://6532315

llvm-svn: 63469

ec99c46d

Jan 28, 2009
- Fix some issues with volatility, move "CanConvertToScalar" check · df17987c
  Chris Lattner authored Jan 28, 2009
```
after the others.

llvm-svn: 63227
```
  df17987c
- Rename getAnalysisToUpdate to getAnalysisIfAvailable. · 5a913d61
  Duncan Sands authored Jan 28, 2009
```
llvm-svn: 63198
```
  5a913d61
Jan 26, 2009
- Fixed optimization of combining two shuffles where the first shuffle inputs · 3537a627
  Mon P Wang authored Jan 26, 2009
```
has a different number of elements than the output.

llvm-svn: 62998
```
  3537a627
- Handle single-entry phi nodes gracefully in condprop. · 9449991c
  Chris Lattner authored Jan 26, 2009
```
llvm-svn: 62985
```
  9449991c
- Fix PR3408 by making a non-obvious assumption very obvious, and · 7b6647c1
  Chris Lattner authored Jan 26, 2009
```
handling the flaw inherent in that assumption.  :)

llvm-svn: 62984
```
  7b6647c1
- More cleanups and simplifications, no functionality change. · 57cb472b
  Chris Lattner authored Jan 26, 2009
```
llvm-svn: 62983
```
  57cb472b
- tidy asserts · d67aaa65
  Chris Lattner authored Jan 26, 2009
```
llvm-svn: 62982
```
  d67aaa65
Jan 24, 2009

testcase for PR3381. · f4395ea9
Torok Edwin authored Jan 24, 2009
```
Also it was an empty struct, not a void after all.

llvm-svn: 62920
```
f4395ea9

void* is represented as pointer to empty struct {}. · 73ff9227

Torok Edwin authored Jan 24, 2009

Thus we need to check whether the struct is empty before trying to index into
it. This fixes PR3381.

llvm-svn: 62918

73ff9227

Make InstCombineStoreToCast handle aggregates more aggressively, · 72cd68fe

Chris Lattner authored Jan 24, 2009

handling the case in Transforms/InstCombine/cast-store-gep.ll, which
is a heavily reduced testcase from Clang on x86-64.

llvm-svn: 62904

72cd68fe

Jan 23, 2009

Simplify the logic of getting hold of a PHI predecessor block. · eb61fcf2

Gabor Greif authored Jan 23, 2009

There is now a direct way from value-use-iterator to incoming block in PHINode's API.
This way we avoid the iterator->index->iterator trip, and especially the costly
getOperandNo() invocation. Additionally there is now an assertion that the iterator
really refers to one of the PHI's Uses.

llvm-svn: 62869

eb61fcf2

Jan 21, 2009
- Remove uses of uint32_t in favor of 'unsigned' for better · 77527f58
  Chris Lattner authored Jan 21, 2009
```
compatibility with cygwin.  Patch by Jay Foad!

llvm-svn: 62695
```
  77527f58
- Make special cases (0 inf nan) work for frem. · b5721632
  Dale Johannesen authored Jan 21, 2009
```
Besides APFloat, this involved removing code
from two places that thought they knew the
result of frem(0., x) but were wrong.

llvm-svn: 62645
```
  b5721632
Jan 19, 2009
- improve compatibility with cygwin, patch by Jay Foad! · 73d7fe5a
  Chris Lattner authored Jan 19, 2009
```
llvm-svn: 62535
```
  73d7fe5a
- Fix PR3353, infinitely jump threading an infinite loop make from switches. · 6f34e317
  Chris Lattner authored Jan 19, 2009
```
llvm-svn: 62529
```
  6f34e317
Jan 18, 2009
- Fix rdar://6505632, an llc crash on 483.xalancbmk · 64b7bd7f
  Chris Lattner authored Jan 18, 2009
```
llvm-svn: 62470
```
  64b7bd7f