Commits · 0af861c43aef57e49316e6a4bc326937749953e2 · Roger Ferrer / llvm-epi-0.8

Jan 25, 2008

add a testcase for a bug Duncan pointed out. · 0af861c4
Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46372
```
0af861c4
Test for PR1942. · e5433a90
Duncan Sands authored Jan 25, 2008
```
llvm-svn: 46357
```
e5433a90

DeadStoreElimination can treat byval parameters as if they were allocas for... · 6af19fd1

Owen Anderson authored Jan 25, 2008

DeadStoreElimination can treat byval parameters as if they were allocas for the purpose of removing end-of-function stores.

llvm-svn: 46351

6af19fd1
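As a rough source-level illustration of why such stores are dead, here is a minimal C sketch. It assumes that a struct passed by value is lowered to a byval pointer argument; the names `counter`, `read_then_bump` and `caller_value_after_call` are hypothetical and do not come from the LLVM tree.

```c
struct counter { int hits; };

/* 'c' is the callee's private copy (what byval models at the IR level). */
static int read_then_bump(struct counter c) {
    int seen = c.hits;
    c.hits = seen + 1;  /* never read again before return: a dead store */
    return seen;
}

/* The caller's object is unaffected by the callee's store. */
static int caller_value_after_call(void) {
    struct counter mine = {41};
    (void)read_then_bump(mine);
    return mine.hits;
}
```

Because nothing can observe the copy once the function returns, removing the final store does not change behavior, just as for a store to a local alloca.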

Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows · 84ab724e

Chris Lattner authored Jan 25, 2008

us to compile:

double test(double X) {
  return copysign(0.0, X);
}

into:

_test:
	andpd	LCPI1_0(%rip), %xmm0
	ret

instead of:
_test:
	pxor	%xmm1, %xmm1
	andpd	LCPI1_0(%rip), %xmm1
	movapd	%xmm0, %xmm2
	andpd	LCPI1_1(%rip), %xmm2
	movapd	%xmm1, %xmm0
	orpd	%xmm2, %xmm0
	ret

llvm-svn: 46344

84ab724e
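To see why folding an AND with zero and an OR with zero is enough here, the following C sketch spells out the usual bitwise expansion of copysign. It assumes IEEE-754 binary64 doubles; the helper names are hypothetical and the code is only an illustration, not the combiner itself.

```c
#include <stdint.h>
#include <string.h>

#define SIGN_MASK UINT64_C(0x8000000000000000)

/* Reinterpret a double as its 64-bit pattern (assumes IEEE-754 binary64). */
static uint64_t bits_of(double d) {
    uint64_t u;
    memcpy(&u, &d, sizeof u);
    return u;
}

static double double_of(uint64_t u) {
    double d;
    memcpy(&d, &u, sizeof d);
    return d;
}

/* Generic expansion: magnitude bits of mag OR'ed with the sign bit of sgn. */
static double copysign_generic(double mag, double sgn) {
    return double_of((bits_of(mag) & ~SIGN_MASK) | (bits_of(sgn) & SIGN_MASK));
}

/* With mag == 0.0 the first AND folds to 0 and the OR folds away,
   leaving a single AND with the sign mask. */
static double copysign_zero(double sgn) {
    return double_of(bits_of(sgn) & SIGN_MASK);
}
```

The single `andpd` in the improved output presumably corresponds to the one remaining AND in `copysign_zero`.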

New test. · 0c4e4da6
Devang Patel authored Jan 24, 2008
```
llvm-svn: 46333
```
0c4e4da6

Jan 24, 2008

Teach basicaa that 'byval' arguments define a new memory location that · 9104d712

Chris Lattner authored Jan 24, 2008

can't be aliased to other known objects.  This allows us to know that byval 
pointer args don't alias globals, etc.

llvm-svn: 46315

9104d712
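A small C sketch of the property being exploited, assuming a by-value struct parameter is lowered to a byval pointer argument. The struct and function names are hypothetical.

```c
struct pair { int a, b; };

static struct pair global_pair = {1, 2};

/* 'p' is a fresh copy, so a write to it cannot modify global_pair,
   even when the caller passed global_pair itself. */
static int sum_after_local_write(struct pair p) {
    p.a = 100;                   /* cannot alias global_pair */
    return global_pair.a + p.b;  /* global_pair.a is still 1 */
}
```

Knowing that the two objects are distinct lets an optimizer reorder or eliminate the load of `global_pair.a` relative to the store to `p.a`.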

Significantly simplify and improve handling of FP function results on x86-32. · a91f77ea

Chris Lattner authored Jan 24, 2008

This case returns the value in ST(0) and then has to convert it to an SSE
register.  This causes significant codegen ugliness in some cases.  For 
example in the trivial fp-stack-direct-ret.ll testcase we used to generate:

_bar:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

because we move the result of foo() into an XMM register, then have to
move it back for the return of bar.

Instead of hacking ever-more special cases into the call result lowering code
we take a much simpler approach: on x86-32, fp return is modeled as always 
returning into an f80 register which is then truncated to f32 or f64 as needed.
Similarly for a result, we model it as an extension to f80 + return.

This exposes the truncate and extensions to the dag combiner, allowing target
independent code to hack on them, eliminating them in this case.  This gives 
us this code for the example above:

_bar:
	subl	$12, %esp
	call	L_foo$stub
	addl	$12, %esp
	ret

The nasty aspect of this is that these conversions are not legal, but we want
the second pass of dag combiner (post-legalize) to be able to hack on them.
To handle this, we lie to legalize and say they are legal, then custom expand
them on entry to the isel pass (PreprocessForFPConvert).  This is gross, but
less gross than the code it is replacing :)

This also allows us to generate better code in several other cases.  For 
example on fp-stack-ret-conv.ll, we now generate:

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstps	8(%esp)
	movl	16(%esp), %eax
	cvtss2sd	8(%esp), %xmm0
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

where before we produced (incidentally, the old bad code is identical to what
gcc produces):

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	cvtsd2ss	(%esp), %xmm0
	cvtss2sd	%xmm0, %xmm0
	movl	16(%esp), %eax
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

Note that we generate slightly worse code on pr1505b.ll due to a scheduling 
deficiency that is unrelated to this patch.

llvm-svn: 46307

a91f77ea
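The reason the truncate/extend pair can disappear in the first example can be sketched in C. This is only an analogy, assuming `long double` plays the role of f80 and that the value in ST(0) is already exactly representable as a double (as it is when the callee returns double); the function names are hypothetical.

```c
/* The callee's result as it arrives in ST(0): a double value held in the wide format. */
static long double callee_result_in_st0(double value) {
    return (long double)value;
}

/* What the caller is modeled as doing when it simply forwards the result. */
static long double forward_result(double value) {
    long double st0 = callee_result_in_st0(value);
    double narrowed = (double)st0;  /* modeled truncation, f80 to f64 */
    return (long double)narrowed;   /* modeled extension, f64 to f80, for the return */
}
```

Since the round trip leaves the value unchanged, a combiner that can see both conversions is free to remove them, which matches the shorter `_bar` shown above.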

take these with a pr # · 001d781c
Chris Lattner authored Jan 24, 2008
```
llvm-svn: 46303
```
001d781c

Let each target decide byval alignment. For X86, it's 4-byte unless the... · 35abd840

Evan Cheng authored Jan 23, 2008

Let each target decide byval alignment. For X86, it's 4-byte unless the aggregate contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type.

llvm-svn: 46286

35abd840
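The x86-64 rule as stated in the message can be written as a one-line helper. This is a hypothetical sketch with alignments expressed in bytes, not the actual target hook.

```c
/* Hypothetical helper: byval alignment on x86-64 is the larger of 8
   and the natural alignment of the type, per the commit message. */
static unsigned byval_align_x86_64(unsigned type_align) {
    return type_align > 8 ? type_align : 8;
}
```

For instance, a struct with 4-byte alignment would be placed at 8-byte alignment, while a 16-byte-aligned aggregate keeps its 16-byte alignment.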

Jan 23, 2008
- SSE varargs arguments are passed in memory. · 1e0d4d2a
  Evan Cheng authored Jan 22, 2008
```
llvm-svn: 46262
```
  1e0d4d2a
Jan 22, 2008
- update this test to pass with duncan's change. · 2b2f10fb
  Chris Lattner authored Jan 22, 2008
```
llvm-svn: 46246
```
  2b2f10fb
- Multiply can be evaluated in a different type, so long as the target type has · 78712e5b
  Nick Lewycky authored Jan 22, 2008
```
a smaller bitwidth.

llvm-svn: 46244
```
  78712e5b
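The arithmetic fact behind this can be checked with a short C sketch: the low bits of a product depend only on the low bits of the operands. The function names are hypothetical, and unsigned types are used so that wraparound is well defined.

```c
#include <stdint.h>

/* Multiply in the wide type, then truncate to 8 bits. */
static uint8_t mul_wide_then_trunc(uint32_t a, uint32_t b) {
    return (uint8_t)(a * b);
}

/* Truncate the operands first, then multiply and truncate the result. */
static uint8_t mul_narrow(uint32_t a, uint32_t b) {
    return (uint8_t)((uint8_t)a * (uint8_t)b);
}
```

Both functions return the same value for every input, which is why the multiply may be evaluated directly in the narrower type.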
Jan 21, 2008
- New test. · 5ce024f5
  Devang Patel authored Jan 21, 2008
```
llvm-svn: 46220
```
  5ce024f5
- New test. · 57b2a041
  Devang Patel authored Jan 21, 2008
```
llvm-svn: 46209
```
  57b2a041
Jan 18, 2008
- Implement flt_rounds for PowerPC. · 5c94cb35
  Dale Johannesen authored Jan 18, 2008
```
llvm-svn: 46174
```
  5c94cb35
- remove extraneous &&'s from tests, as Scott is apparently not going to. · 1b35211f
  Chris Lattner authored Jan 18, 2008
```
llvm-svn: 46173
```
  1b35211f
- Test is correct again for the moment. · 4768c3c9
  Dale Johannesen authored Jan 18, 2008
```
llvm-svn: 46172
```
  4768c3c9
- Fix a latent bug exposed by my truncstore patch. We compiled stfiwx-2.ll to: · f5b46f7d
  Chris Lattner authored Jan 18, 2008
```
_test:
	fctiwz f0, f1
	stfiwx f0, 0, r4
	blr 

instead of:

_test:
	fctiwz f0, f1
	stfd f0, -8(r1)
	nop
	nop
	lwz r2, -4(r1)
	stb r2, 0(r4)
	blr 

The former is not correct (stores 4 bytes, not 1).

llvm-svn: 46161
```
  f5b46f7d
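The difference between the two sequences can be mimicked in C: a one-byte truncating store touches one byte, while a four-byte store (as `stfiwx` performs) overwrites its neighbors too. The helper names below are hypothetical and serve only as an illustration.

```c
#include <stdint.h>
#include <string.h>

/* Correct i8 truncating store: writes exactly one byte. */
static void store_low_byte(unsigned char *dst, int32_t value) {
    dst[0] = (unsigned char)value;
}

/* A full 4-byte store: overwrites dst[0..3]. */
static void store_four_bytes(unsigned char *dst, int32_t value) {
    memcpy(dst, &value, sizeof value);
}

/* Count how many of 4 sentinel bytes a store routine modifies. */
static int bytes_clobbered(void (*store)(unsigned char *, int32_t)) {
    unsigned char buf[4] = {0xAA, 0xAA, 0xAA, 0xAA};
    int changed = 0;
    store(buf, 0x01020304);  /* no byte of this value equals the sentinel */
    for (int i = 0; i < 4; i++)
        changed += (buf[i] != 0xAA);
    return changed;
}
```

Passing `store_low_byte` to `bytes_clobbered` reports one modified byte, while `store_four_bytes` reports four, which is the kind of overwrite the fix avoids.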
Jan 17, 2008

Forward progress: crtbegin.c now compiles successfully! · e4d3e3c0

Scott Michel authored Jan 17, 2008

Fixed CellSPU's A-form (local store) address mode, so that all globals,
externals, constant pool and jump table symbols are now wrapped within
a SPUISD::AFormAddr pseudo-instruction. This now identifies all local
store memory addresses, although it requires a bit of legerdemain during
instruction selection to properly select loads to and stores from local
store, properly generating "LQA" instructions.

Also added mul_ops.ll test harness for exercising integer multiplication.

llvm-svn: 46142

e4d3e3c0

This commit changes: · 1ea55cf8

Chris Lattner authored Jan 17, 2008

1. Legalize now always promotes truncstore of i1 to i8. 
2. Remove patterns and gunk related to truncstore i1 from targets.
3. Rename the StoreXAction stuff to TruncStoreAction in TLI.
4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions.
5. Mark a wide variety of invalid truncstores as such in various targets, e.g.
   X86 currently doesn't support truncstore of any of its integer types.
6. Add legalize support for truncstores with invalid value input types.
7. Add a dag combine transform to turn store(truncate) into truncstore when
   safe.

The latter allows us to compile CodeGen/X86/storetrunc-fp.ll to:

_foo:
	fldt	20(%esp)
	fldt	4(%esp)
	faddp	%st(1)
	movl	36(%esp), %eax
	fstps	(%eax)
	ret

instead of:

_foo:
	subl	$4, %esp
	fldt	24(%esp)
	fldt	8(%esp)
	faddp	%st(1)
	fstps	(%esp)
	movl	40(%esp), %eax
	movss	(%esp), %xmm0
	movss	%xmm0, (%eax)
	addl	$4, %esp
	ret

llvm-svn: 46140

1ea55cf8

new testcase. · 9f7fed1c
Chris Lattner authored Jan 17, 2008
```
llvm-svn: 46139
```
9f7fed1c
Test case for varargs parameter attribute issue I just fixed. · 9a93dc95
Evan Cheng authored Jan 17, 2008
```
llvm-svn: 46127
```
9a93dc95
add testcase that has been sitting in my tree for a while. · 89126bde
Chris Lattner authored Jan 17, 2008
```
llvm-svn: 46124
```
89126bde

When a live virtual register is being clobbered by an implicit def, it is spilled · 54c20b55

Evan Cheng authored Jan 17, 2008

and the spill is its kill. However, if the local allocator has determined the
register has not been modified (possible when its value was reloaded), it would
not issue a restore. In that case, mark the last use of the virtual register as
kill.

llvm-svn: 46111

54c20b55

Fix arg promotion to propagate the correct attrs on the calls to · 5630c4f2

Chris Lattner authored Jan 17, 2008

promoted functions.  This is important for varargs calls in 
particular.  Thanks to duncan for providing a great testcase.

llvm-svn: 46108

5630c4f2

Fixes a nasty dag combiner bug that causes a bunch of tests to fail at -O0. · 7be15280

Evan Cheng authored Jan 16, 2008

It's not safe to use the two value CombineTo variant to combine away a dead load.
e.g. 
v1, chain2 = load chain1, loc
v2, chain3 = load chain2, loc
v3         = add v2, c 
Now we replace use of v1 with undef, use of chain2 with chain1.
ReplaceAllUsesWith() will iterate through uses of the first load and update operands:
v1, chain2 = load chain1, loc
v2, chain3 = load chain1, loc
v3         = add v2, c 
Now the second load is the same as the first load, SelectionDAG cse will ensure
the use of second load is replaced with the first load.
v1, chain2 = load chain1, loc
v3         = add v1, c
Then v1 is replaced with undef and bad things happen.

llvm-svn: 46099

7be15280

Jan 16, 2008

Trampoline support for x86-64. This looks like · 32b0ff68

Duncan Sands authored Jan 16, 2008

it should work, but I have no machine to test
it on.  Committed because it will at least
cause no harm, and maybe someone can test it
for me!

llvm-svn: 46098

32b0ff68

add testcase for regression · aebbe470
Chris Lattner authored Jan 16, 2008
```
llvm-svn: 46073
```
aebbe470
make sure to use a cpu that has sse. · 6e3379c0
Chris Lattner authored Jan 16, 2008
```
llvm-svn: 46060
```
6e3379c0

My previous commit had an incomplete message, it should have been: · 8f7cec85

Chris Lattner authored Jan 16, 2008

make the 'fp return in ST(0)' optimization smart enough to
look through token factor nodes.  This allows us to compile
testcases like CodeGen/X86/fp-stack-retcopy.ll into:

_carg:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	fldl	(%esp)
	addl	$12, %esp
	ret

instead of:

_carg:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

Still not optimal, but much better and this is a trivial patch.  Fixing 
the rest requires invasive surgery that is not llvm 2.2 material.

llvm-svn: 46054

8f7cec85

Do not strip llvm.used values. · b3696e4f
Devang Patel authored Jan 16, 2008
```
llvm-svn: 46045
```
b3696e4f

Jan 15, 2008
- add a test to ensure that argpromote of one argument doesn't · f3e1155c
  Chris Lattner authored Jan 15, 2008
```
break the byval attr on some other argument.

llvm-svn: 46025
```
  f3e1155c
- verify x86 generates ud2 for llvm.trap · 915ec140
  Chris Lattner authored Jan 15, 2008
```
llvm-svn: 46023
```
  915ec140
- new testcase for llvm.trap. · 50baecd3
  Chris Lattner authored Jan 15, 2008
```
llvm-svn: 46020
```
  50baecd3
- Testcase for gimplify_expr crash caused by an · 60bd7160
  Duncan Sands authored Jan 15, 2008
```
unexpected placeholder_expr.

llvm-svn: 46006
```
  60bd7160
Jan 14, 2008
- I noticed that the trampoline straightening transformation could · b5ca2e9f
  Duncan Sands authored Jan 14, 2008
```
drop attributes on varargs call arguments.  Also, it could generate
invalid IR if the transformed call already had the 'nest' attribute
somewhere (this can never happen for code coming from llvm-gcc,
but it's a theoretical possibility).  Fix both problems.

llvm-svn: 45973
```
  b5ca2e9f
- This test is now the same as byval-1.ll, so remove it. · ae8c041b
  Duncan Sands authored Jan 14, 2008
```
llvm-svn: 45960
```
  ae8c041b
- Test that byval cannot be used with pointers to · 4e079479
  Duncan Sands authored Jan 14, 2008
```
types with no size.

llvm-svn: 45959
```
  4e079479
- We now allow byval on fairly general pointer types. · 1f5340c0
  Duncan Sands authored Jan 14, 2008
```
llvm-svn: 45956
```
  1f5340c0
- Fix the miscompilation of MiBench/consumer-lame that was exposed by Evan's · 26fe7ebc
  Chris Lattner authored Jan 14, 2008
```
byval work.  This miscompilation is due to the program indexing an array out
of range and us doing a transformation that broke this.

llvm-svn: 45949
```
  26fe7ebc