Commits · 8ea81e8ba46486f512dd60d8334d5495e450655b · Roger Ferrer / llvm-epi-0.8

Jan 28, 2008
- Handle some more combinations of extend and icmp. Fixes PR1940. · 8ea81e8b
  Nick Lewycky authored Jan 28, 2008
```
llvm-svn: 46431
```
  8ea81e8b
- Fix PR1932 by disabling an xform invalid for fdiv. · 710b4411
  Chris Lattner authored Jan 28, 2008
```
llvm-svn: 46429
```
  710b4411
- Fix PR1938 by forcing the code that uses an undefined value to branch one · 1b706dd6
  Chris Lattner authored Jan 28, 2008
```
way or the other.  Rewriting the code itself prevents subsequent analysis
passes from making contradictory conclusions about the code that could 
cause an infeasible path to be made feasible.

llvm-svn: 46427
```
  1b706dd6
- Fix PowerPC/./2007-10-18-PtrArithmetic.ll · 2ee91f43
  Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46424
```
  2ee91f43
- fix a crash on CodeGen/X86/vector-rem.ll · d0496d04
  Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46422
```
  d0496d04
Jan 27, 2008

Reg alloc doesn't really need LiveVariables. · 9a8c890c
Owen Anderson authored Jan 27, 2008
```
llvm-svn: 46420
```
9a8c890c
Be more careful modifying the use_list while also iterating through it. · efb16f70
Nick Lewycky authored Jan 27, 2008
```
llvm-svn: 46417
```
efb16f70
Revert r46393: readonly/readnone functions are no · 053c9871
Duncan Sands authored Jan 27, 2008
```
longer allowed to write through byval arguments.

llvm-svn: 46416
```
053c9871

Implement some dag combines that allow doing fneg/fabs/fcopysign in integer · 888560d6

Chris Lattner authored Jan 27, 2008

registers if used by a bitconvert or using a bitconvert.  This allows us to
avoid constant pool loads and use cheaper integer instructions when the
values come from or end up in integer regs anyway.  For example, we now 
compile CodeGen/X86/fp-in-intregs.ll to:

_test1:
	movl	$2147483648, %eax
	xorl	4(%esp), %eax
	ret
_test2:
	movl	$1065353216, %eax
	orl	4(%esp), %eax
	andl	$3212836864, %eax
	ret

Instead of:
_test1:
	movss	4(%esp), %xmm0
	xorps	LCPI2_0, %xmm0
	movd	%xmm0, %eax
	ret
_test2:
	movss	4(%esp), %xmm0
	andps	LCPI3_0, %xmm0
	movss	LCPI3_1, %xmm1
	andps	LCPI3_2, %xmm1
	orps	%xmm0, %xmm1
	movd	%xmm1, %eax
	ret

bitconverts can happen due to various calling conventions that require
fp values to passed in integer regs in some cases, e.g. when returning
a complex.

llvm-svn: 46414

888560d6

add a note · 2e4719ec
Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46413
```
2e4719ec
Use fldz and fld1 for long double constants instead of a constant pool load. · d05d2011
Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46411
```
d05d2011
The CorrelatedExpressionElimination pass is known to be buggy. Remove it. · 60361a16
Bill Wendling authored Jan 27, 2008
```
This fixes PR1769.

llvm-svn: 46408
```
60361a16
For long double constants, print an approximation of their value to the .s... · f1a6c9fe
Chris Lattner authored Jan 27, 2008
```
For long double constants, print an approximation of their value to the .s file to make it easier to read.

llvm-svn: 46407
```
f1a6c9fe
Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. · fa1e7eef
Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46406
```
fa1e7eef

Jan 26, 2008

Add some notes. · 2dd23b9f
Chris Lattner authored Jan 26, 2008
```
llvm-svn: 46405
```
2dd23b9f
Remove some code for inferring alignment info from the x86 backend · 250789f1
Chris Lattner authored Jan 26, 2008
```
now that the dag combiner does it.

llvm-svn: 46404
```
250789f1

Infer alignment of loads and increase their alignment when we can tell they are · e30e33af

Chris Lattner authored Jan 26, 2008

from the stack.  This allows us to compile stack-align.ll to:

_test:
	movsd	LCPI1_0, %xmm0
	movapd	%xmm0, %xmm1
***	andpd	4(%esp), %xmm1
	andpd	_G, %xmm0
	addsd	%xmm1, %xmm0
	movl	20(%esp), %eax
	movsd	%xmm0, (%eax)
	ret

instead of:

_test:
	movsd	LCPI1_0, %xmm0
**	movsd	4(%esp), %xmm1
**	andpd	%xmm0, %xmm1
	andpd	_G, %xmm0
	addsd	%xmm1, %xmm0
	movl	20(%esp), %eax
	movsd	%xmm0, (%eax)
	ret

llvm-svn: 46401

e30e33af

If there's no instructions being emitted on X86 for a function, emit a · 1a17ef02
Bill Wendling authored Jan 26, 2008
```
nop. Emit the nop directly for PPC.

llvm-svn: 46398
```
1a17ef02

If there are no machine instructions emitted for a function, then insert · 50794839

Bill Wendling authored Jan 26, 2008

a "nop" instruction so that we don't have the function's label associated
with something that it's not supposed to be associated with.

llvm-svn: 46394

50794839

Create an explicit copy for byval parameters even · c4dc3dc3
Duncan Sands authored Jan 26, 2008
```
when inlining a readonly function.

llvm-svn: 46393
```
c4dc3dc3

If we have a function like this: · 0862e342

Bill Wendling authored Jan 26, 2008

void bork() {
  int *address = 0;
  *address = 0;
}

It's compiled into LLVM code that looks like this:

define void @bork() noreturn nounwind  {
entry:
        unreachable
}

This is bad on some platforms (like PPC) because it will generate the label for
the function but no body. The label could end up being associated with some
non-code related stuff, like a section. This places a "trap" instruction if the
SimplifyCFG pass removed all code from the function leaving only one
"unreachable" instruction.

llvm-svn: 46387

0862e342

Fix some bugs in SimplifyNodeWithTwoResults where it would call deletenode to · 31e9edce

Chris Lattner authored Jan 26, 2008

delete a node even if it was not dead in some cases.  Instead, just add it to
the worklist.  Also, make sure to use the CombineTo methods, as it was doing
things that were unsafe: the top level combine loop could touch dangling memory.

This fixes CodeGen/Generic/2008-01-25-dag-combine-mul.ll

llvm-svn: 46384

31e9edce

don't bother making x&-1 only to simplify it in dag combine. This commonly... · 720d8999
Chris Lattner authored Jan 26, 2008
```
don't bother making x&-1 only to simplify it in dag combine.  This commonly occurs expanding i64 ops.

llvm-svn: 46383
```
720d8999
reduce indentation · cb3cf546
Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46377
```
cb3cf546

Jan 25, 2008
- Do this more neatly. · f52faf9a
  Duncan Sands authored Jan 25, 2008
```
llvm-svn: 46369
```
  f52faf9a
- fix long lines. · fc80996a
  Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46355
```
  fc80996a
- JITEmitter.cpp was trying to sync the icache for function stubs, but · 919ad97c
  Chris Lattner authored Jan 25, 2008
```
was actually passing a completely incorrect size to sys_icache_invalidate.
Instead of having the JITEmitter do this (which doesn't have the correct 
size), just make the target sync its own stubs.

llvm-svn: 46354
```
  919ad97c
- DeadStoreElimination can treat byval parameters as if there were alloca's for... · 6af19fd1
  Owen Anderson authored Jan 25, 2008
```
DeadStoreElimination can treat byval parameters as if there were alloca's for the purpose of removing end-of-function stores.

llvm-svn: 46351
```
  6af19fd1
- Add skeletal code to increase the alignment of loads and stores when · 2d7a830f
  Chris Lattner authored Jan 25, 2008
```
we can infer it.  This will eventually help stuff, though it doesn't
do much right now because all fixed FI's have an alignment of 1.

llvm-svn: 46349
```
  2d7a830f
- move MachineFrameInfo::CreateFixedObject out of line, give MachineFrameInfo · 6068832d
  Chris Lattner authored Jan 25, 2008
```
a reference to TargetFrameInfo.  Rearrange order of fields in StackObject to
save a word.

llvm-svn: 46348
```
  6068832d
- include alignment and volatility information in -view-*-dags output · da52d9e0
  Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46347
```
  da52d9e0
- optimize fxor like for · f4523c35
  Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46345
```
  f4523c35
- Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows · 84ab724e
  Chris Lattner authored Jan 25, 2008
```
us to compile:

double test(double X) {
  return copysign(0.0, X);
}

into:

_test:
	andpd	LCPI1_0(%rip), %xmm0
	ret

instead of:
_test:
	pxor	%xmm1, %xmm1
	andpd	LCPI1_0(%rip), %xmm1
	movapd	%xmm0, %xmm2
	andpd	LCPI1_1(%rip), %xmm2
	movapd	%xmm1, %xmm0
	orpd	%xmm2, %xmm0
	ret

llvm-svn: 46344
```
  84ab724e
- Provide correct DWARF register numbering for debug information emission on x86-32/Darwin. · fcde6168
  Anton Korobeynikov authored Jan 25, 2008
```
This should fix bunch of issues.

llvm-svn: 46337
```
  fcde6168
Jan 24, 2008

Don't dump the function! · 8d83271b
Chris Lattner authored Jan 24, 2008
```
llvm-svn: 46320
```
8d83271b
getUnderlyingObject can return null, handle this. · 23dd0551
Chris Lattner authored Jan 24, 2008
```
llvm-svn: 46318
```
23dd0551

Teach basicaa that 'byval' arguments define a new memory location that · 9104d712

Chris Lattner authored Jan 24, 2008

can't be aliased to other known objects.  This allows us to know that byval 
pointer args don't alias globals, etc.

llvm-svn: 46315

9104d712

Add hasByValAttr() and hasNoAliasAttr() methods to the Argument class. · e30f09d0
Chris Lattner authored Jan 24, 2008
```
llvm-svn: 46314
```
e30f09d0
clarify a comment, thanks Duncan. · 34ed27c4
Chris Lattner authored Jan 24, 2008
```
llvm-svn: 46313
```
34ed27c4

Significantly simplify and improve handling of FP function results on x86-32. · a91f77ea

Chris Lattner authored Jan 24, 2008

This case returns the value in ST(0) and then has to convert it to an SSE
register.  This causes significant codegen ugliness in some cases.  For 
example in the trivial fp-stack-direct-ret.ll testcase we used to generate:

_bar:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

because we move the result of foo() into an XMM register, then have to
move it back for the return of bar.

Instead of hacking ever-more special cases into the call result lowering code
we take a much simpler approach: on x86-32, fp return is modeled as always 
returning into an f80 register which is then truncated to f32 or f64 as needed.
Similarly for a result, we model it as an extension to f80 + return.

This exposes the truncate and extensions to the dag combiner, allowing target
independent code to hack on them, eliminating them in this case.  This gives 
us this code for the example above:

_bar:
	subl	$12, %esp
	call	L_foo$stub
	addl	$12, %esp
	ret

The nasty aspect of this is that these conversions are not legal, but we want
the second pass of dag combiner (post-legalize) to be able to hack on them.
To handle this, we lie to legalize and say they are legal, then custom expand
them on entry to the isel pass (PreprocessForFPConvert).  This is gross, but
less gross than the code it is replacing :)

This also allows us to generate better code in several other cases.  For 
example on fp-stack-ret-conv.ll, we now generate:

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstps	8(%esp)
	movl	16(%esp), %eax
	cvtss2sd	8(%esp), %xmm0
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

where before we produced (incidentally, the old bad code is identical to what
gcc produces):

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	cvtsd2ss	(%esp), %xmm0
	cvtss2sd	%xmm0, %xmm0
	movl	16(%esp), %eax
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

Note that we generate slightly worse code on pr1505b.ll due to a scheduling 
deficiency that is unrelated to this patch.

llvm-svn: 46307

a91f77ea