- Mar 10, 2008
-
Evan Cheng authored
- Fix a subtle bug in RemoveCopyByCommutingDef. ALR is the live range where the source is defined; BLR is the live range which is defined by the copy. If ALR and BLR overlap and the end of BLR extends beyond the end of ALR, e.g.
    A = or A, B
    ...
    B = A
    ...
    C = A<kill>
    ...
      = B
  then do not add kills of A to the newly created B interval.
- Also fix a kill info update bug.
llvm-svn: 48141
-
Evan Cheng authored
llvm-svn: 48140
-
Owen Anderson authored
Move StrongPHIElimination after live interval analysis. This will make things happier down the road. llvm-svn: 48138
-
Evan Cheng authored
Avoid creating a BUILD_VECTOR of all-zero elements with a "non-normalized" type (e.g. v8i16 on x86) after the legalizer. Instruction selection does not expect to see them. In all likelihood this can only be an issue in a bugpoint-reduced test case. llvm-svn: 48136
-
Christopher Lamb authored
Change insert/extract subreg instructions so they can be used in TableGen patterns. Use these features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130
-
Nick Lewycky authored
llvm-svn: 48128
-
Nick Lewycky authored
llvm-svn: 48123
-
Dale Johannesen authored
field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract the interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in the gcc testsuite (they were ICE'ing on ppc32 and quietly producing wrong code on x86-32). llvm-svn: 48122
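The kind of source that exercises this path looks roughly like the C++ sketch below; it is a hypothetical reproduction (the struct name and size are invented), not the gcc testsuite code.

  // Hypothetical reproduction: a by-value aggregate larger than 0x1ffff
  // bytes. Passing it by value exercises the byval lowering whose size
  // field was previously too narrow to hold sizeof(Big).
  #include <cstring>

  struct Big { char data[0x20000]; };   // 0x20000 bytes > 0x1ffff

  int firstByte(Big b) {                // passed by value -> byval argument
    return b.data[0];
  }

  int main() {
    Big b;
    std::memset(b.data, 7, sizeof b.data);
    return firstByte(b) == 7 ? 0 : 1;   // exits 0 when lowering is correct
  }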
-
- Mar 09, 2008
-
Chris Lattner authored
two regression tests:
  test/CodeGen/PowerPC/2007-10-21-LocalRegAllocAssert.ll
  test/CodeGen/PowerPC/2007-10-21-LocalRegAllocAssert2.ll
llvm-svn: 48120
-
Chris Lattner authored
cell really does support cross-regclass moves, because R3 is in lots of different regclasses, and the code is not consistent when it comes to value tracking. llvm-svn: 48119
-
Chris Lattner authored
llvm-svn: 48118
-
Chris Lattner authored
llvm-svn: 48117
-
Ted Kremenek authored
- "Redefinition of I" (iterator masks previous definition) - include missing header file Patch by Argiris Kirtzidis! llvm-svn: 48115
-
Nick Lewycky authored
llvm-svn: 48112
-
Nick Lewycky authored
llvm-svn: 48111
-
Nick Lewycky authored
llvm-svn: 48109
-
Chris Lattner authored
they are produced by calls (which are known exact) and by cross-block copies which are known to be produced by extends. This improves:

  define double @test2() {
    %tmp85 = call double asm sideeffect "fld0", "={st(0)}"()
    ret double %tmp85
  }

from:

  _test2:
    subl $20, %esp
    # InlineAsm Start
    fld0
    # InlineAsm End
    fstpl 8(%esp)
    movsd 8(%esp), %xmm0
    movsd %xmm0, (%esp)
    fldl (%esp)
    addl $20, %esp
    #FP_REG_KILL
    ret

to:

  _test2:
    # InlineAsm Start
    fld0
    # InlineAsm End
    #FP_REG_KILL
    ret

by avoiding an f64 <-> f80 trip.
llvm-svn: 48108
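At the C++ level the same shape can be written with GCC-style extended asm, as in the sketch below (an illustration, not the original test; "=t" is GCC's constraint for st(0), and fldz is used since it is a real instruction that pushes +0.0). x86 only.

  // Inline asm whose result is returned on the top of the x87 stack.
  double loadZero() {
    double R;
    __asm__("fldz" : "=t"(R));   // fldz pushes +0.0 onto st(0)
    return R;                    // no f64 <-> f80 round trip needed
  }

  int main() { return loadZero() == 0.0 ? 0 : 1; }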
-
Chris Lattner authored
an RFP register class. Teach ScheduleDAG how to handle CopyToReg with different src/dst reg classes. This allows us to compile trivial inline asms that expect stuff on the top of x87-fp stack. llvm-svn: 48107
-
Nick Lewycky authored
llvm-svn: 48106
-
Chris Lattner authored
in different register classes, e.g. copy of ST(0) to RFP*. This gets some really trivial inline asm working that plops things on the top of stack (PR879) llvm-svn: 48105
-
Chris Lattner authored
RST -> RFP{32/64/80}. We only handle ST(0) for now. llvm-svn: 48104
-
Chris Lattner authored
llvm-svn: 48101
-
Chris Lattner authored
llvm-svn: 48100
-
Nick Lewycky authored
Secondly, we have to check whether the branch is actually pointing to the block with the unwind in it. We could have gotten here because of the unwind_to alone. llvm-svn: 48099
-
Chris Lattner authored
codegen yet because these can't be spilled (they don't exist until after RA). llvm-svn: 48098
-
Chris Lattner authored
llvm-svn: 48097
-
Nick Lewycky authored
at all. llvm-svn: 48096
-
Chris Lattner authored
llvm-svn: 48094
-
Chris Lattner authored
isel'ing value-preserving FP roundings from one fp stack reg to another into a noop, instead of stack traffic. llvm-svn: 48093
-
Chris Lattner authored
into a vector of zeros or undef, and when the top part is obviously zero, we can just use movd + shuffle. This allows us to compile vec_set-B.ll into:

  _test3:
    movl $1234567, %eax
    andl 4(%esp), %eax
    movd %eax, %xmm0
    ret

instead of:

  _test3:
    subl $28, %esp
    movl $1234567, %eax
    andl 32(%esp), %eax
    movl %eax, (%esp)
    movl $0, 4(%esp)
    movq (%esp), %xmm0
    addl $28, %esp
    ret

llvm-svn: 48090
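A C++-level approximation of the vec_set-B.ll pattern is sketched below (the actual test is LLVM IR; the function and mask here merely mirror the asm above): masking with a small constant makes the upper 32 bits of the i64 obviously zero.

  #include <emmintrin.h>

  // The mask proves the top half is zero, so the i64 insert into a zero
  // vector can become a single movd (+ shuffle) instead of stack traffic.
  __m128i test3(unsigned long long X) {
    unsigned long long Masked = X & 1234567ULL;  // upper 32 bits are zero
    return _mm_set_epi64x(0, Masked);            // i64 into the low lane
  }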
-
Nick Lewycky authored
and also update the cloning interface's major user, the loop optimizations. llvm-svn: 48088
-
Nick Lewycky authored
llvm-svn: 48086
-
Nick Lewycky authored
Add the ability to remove just one instance of a BB from a phi node. This fixes the compile error in the tree now. llvm-svn: 48085
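Against today's LLVM C++ API, removing a single instance looks roughly like the sketch below (the helper name is invented and the 2008 interface differed; PHINode::removeIncomingValue is the current entry point).

  #include "llvm/IR/BasicBlock.h"
  #include "llvm/IR/Instructions.h"
  using namespace llvm;

  // Drop exactly one incoming edge from Pred, leaving any duplicate
  // edges from the same predecessor in place.
  void removeOneIncoming(PHINode &Phi, BasicBlock *Pred) {
    for (unsigned i = 0, e = Phi.getNumIncomingValues(); i != e; ++i)
      if (Phi.getIncomingBlock(i) == Pred) {
        Phi.removeIncomingValue(i, /*DeletePHIIfEmpty=*/false);
        return;                        // stop after the first match
      }
  }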
-
Nick Lewycky authored
removal of invoke, PR1269. llvm-svn: 48084
-
Chris Lattner authored
llvm-svn: 48064
-
Chris Lattner authored
  #include <xmmintrin.h>
  __m128i doload64(short x) {return _mm_set_epi16(0,0,0,0,0,0,0,1);}

into:

  movl $1, %eax
  movd %eax, %xmm0
  ret

instead of a constant pool load.
llvm-svn: 48063
-
Chris Lattner authored
of BUILD_VECTORS that only have two unique elements:

1. The previous code was nondeterministic, because it walked a map in SDOperand order, which isn't deterministic.
2. The previous code didn't handle the case when one element was undef very well. Now we ensure that the generated shuffle mask has the undef vector on the RHS (instead of potentially being on the LHS) and that any elements that refer to it are themselves undef.

This allows us to compile CodeGen/X86/vec_set-9.ll into:

  _test3:
    movd %rdi, %xmm0
    punpcklqdq %xmm0, %xmm0
    ret

instead of:

  _test3:
    movd %rdi, %xmm1
    #IMPLICIT_DEF %xmm0
    punpcklqdq %xmm1, %xmm0
    ret

... saving a register.
llvm-svn: 48060
-
Chris Lattner authored
  _test3:
    movd %rdi, %xmm1
    #IMPLICIT_DEF %xmm0
    punpcklqdq %xmm1, %xmm0
    ret

instead of:

  _test3:
    #IMPLICIT_DEF %rax
    movd %rax, %xmm0
    movd %rdi, %xmm1
    punpcklqdq %xmm1, %xmm0
    ret

This is still not ideal. There is no reason to use two xmm regs.
llvm-svn: 48058
-
- Mar 08, 2008
-
Chris Lattner authored
2) Don't try to insert an i64 value into the low part of a vector with movq on an x86-32 target. This allows us to compile:

  __m128i doload64(short x) {return _mm_set_epi16(0,0,0,0,0,0,0,1);}

into:

  _doload64:
    movaps LCPI1_0, %xmm0
    ret

instead of:

  _doload64:
    subl $28, %esp
    movl $0, 4(%esp)
    movl $1, (%esp)
    movq (%esp), %xmm0
    addl $28, %esp
    ret

llvm-svn: 48057
-
Chris Lattner authored
SCALAR_TO_VECTOR on paths that end up not using it. llvm-svn: 48056
-