Commits · 8df7cc1119f6613336b5281921341c6f83a90c6f · Roger Ferrer / llvm-epi-0.8

Jan 31, 2008

Revert 46556 and 46585. Dan please fix the PseudoSourceValue problem and re-commit. · 27b32b87
Evan Cheng authored Jan 31, 2008
```
llvm-svn: 46623
```
27b32b87
Avoid unnecessarily casting away const. · ed346f2e
Dan Gohman authored Jan 31, 2008
```
llvm-svn: 46590
```
ed346f2e
Rename ISD::FLT_ROUNDS to ISD::FLT_ROUNDS_ to avoid conflicting · 9ba4d768
Dan Gohman authored Jan 31, 2008
```
with the real FLT_ROUNDS (defined in <float.h>).

llvm-svn: 46587
```
9ba4d768

Create a new class, MemOperand, for describing memory references · 3646fdda

Dan Gohman authored Jan 31, 2008

in the backend. Introduce a new SDNode type, MemOperandSDNode, for
holding a MemOperand in the SelectionDAG IR, and add a MemOperand
list to MachineInstr, and code to manage them. Remove the offset
field from SrcValueSDNode; uses of SrcValueSDNode that were using
it are all all using MemOperandSDNode now.

Also, begin updating some getLoad and getStore calls to use the
PseudoSourceValue objects.

Most of this was written by Florian Brander, some
reorganization and updating to TOT by me.

llvm-svn: 46585

3646fdda

Jan 30, 2008

Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper... · 29cfb67e

Evan Cheng authored Jan 30, 2008

Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert
instruction at the end.

llvm-svn: 46562

29cfb67e

Jan 29, 2008

Work in progress. This patch *fixes* x86-64 calls which are modelled as... · 084a1cdc

Evan Cheng authored Jan 29, 2008

Work in progress. This patch *fixes* x86-64 calls which are modelled as StructRet but really should be return in registers, e.g. _Complex long double, some 128-bit aggregates. This is a short term solution that is necessary only because llvm, for now, cannot model i128 nor call's with multiple results.
Status: This only works for direct calls, and only the caller side is done. Disabled for now.

llvm-svn: 46527

084a1cdc

Handle 'X' constraint in asm's better. · 2b3bc304
Dale Johannesen authored Jan 29, 2008
```
llvm-svn: 46485
```
2b3bc304

Jan 27, 2008
- Use fldz and fld1 for long double constants instead of a constant pool load. · d05d2011
  Chris Lattner authored Jan 27, 2008
```
llvm-svn: 46411
```
  d05d2011
Jan 26, 2008
- Remove some code for inferring alignment info from the x86 backend · 250789f1
  Chris Lattner authored Jan 26, 2008
```
now that the dag combiner does it.

llvm-svn: 46404
```
  250789f1
Jan 25, 2008

optimize fxor like for · f4523c35
Chris Lattner authored Jan 25, 2008
```
llvm-svn: 46345
```
f4523c35

Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows · 84ab724e

Chris Lattner authored Jan 25, 2008

us to compile:

double test(double X) {
  return copysign(0.0, X);
}

into:

_test:
	andpd	LCPI1_0(%rip), %xmm0
	ret

instead of:
_test:
	pxor	%xmm1, %xmm1
	andpd	LCPI1_0(%rip), %xmm1
	movapd	%xmm0, %xmm2
	andpd	LCPI1_1(%rip), %xmm2
	movapd	%xmm1, %xmm0
	orpd	%xmm2, %xmm0
	ret

llvm-svn: 46344

84ab724e

Jan 24, 2008

Significantly simplify and improve handling of FP function results on x86-32. · a91f77ea

Chris Lattner authored Jan 24, 2008

This case returns the value in ST(0) and then has to convert it to an SSE
register.  This causes significant codegen ugliness in some cases.  For 
example in the trivial fp-stack-direct-ret.ll testcase we used to generate:

_bar:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

because we move the result of foo() into an XMM register, then have to
move it back for the return of bar.

Instead of hacking ever-more special cases into the call result lowering code
we take a much simpler approach: on x86-32, fp return is modeled as always 
returning into an f80 register which is then truncated to f32 or f64 as needed.
Similarly for a result, we model it as an extension to f80 + return.

This exposes the truncate and extensions to the dag combiner, allowing target
independent code to hack on them, eliminating them in this case.  This gives 
us this code for the example above:

_bar:
	subl	$12, %esp
	call	L_foo$stub
	addl	$12, %esp
	ret

The nasty aspect of this is that these conversions are not legal, but we want
the second pass of dag combiner (post-legalize) to be able to hack on them.
To handle this, we lie to legalize and say they are legal, then custom expand
them on entry to the isel pass (PreprocessForFPConvert).  This is gross, but
less gross than the code it is replacing :)

This also allows us to generate better code in several other cases.  For 
example on fp-stack-ret-conv.ll, we now generate:

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstps	8(%esp)
	movl	16(%esp), %eax
	cvtss2sd	8(%esp), %xmm0
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

where before we produced (incidentally, the old bad code is identical to what
gcc produces):

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	cvtsd2ss	(%esp), %xmm0
	cvtss2sd	%xmm0, %xmm0
	movl	16(%esp), %eax
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

Note that we generate slightly worse code on pr1505b.ll due to a scheduling 
deficiency that is unrelated to this patch.

llvm-svn: 46307

a91f77ea

Let each target decide byval alignment. For X86, it's 4-byte unless the... · 35abd840

Evan Cheng authored Jan 23, 2008

Let each target decide byval alignment. For X86, it's 4-byte unless the aggregare contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type.

llvm-svn: 46286

35abd840

Jan 23, 2008

The last pieces needed for loading arbitrary · 95d46ef8

Duncan Sands authored Jan 23, 2008

precision integers.  This won't actually work
(and most of the code is dead) unless the new
legalization machinery is turned on.  While
there, I rationalized the handling of i1, and
removed some bogus (and unused) sextload patterns.
For i1, this could result in microscopically
better code for some architectures (not X86).
It might also result in worse code if annotating
with AssertZExt nodes turns out to be more harmful
than helpful.

llvm-svn: 46280

95d46ef8

Jan 17, 2008

This commit changes: · 1ea55cf8

Chris Lattner authored Jan 17, 2008

1. Legalize now always promotes truncstore of i1 to i8. 
2. Remove patterns and gunk related to truncstore i1 from targets.
3. Rename the StoreXAction stuff to TruncStoreAction in TLI.
4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions.
5. Mark a wide variety of invalid truncstores as such in various targets, e.g.
   X86 currently doesn't support truncstore of any of its integer types.
6. Add legalize support for truncstores with invalid value input types.
7. Add a dag combine transform to turn store(truncate) into truncstore when
   safe.

The later allows us to compile CodeGen/X86/storetrunc-fp.ll to:

_foo:
	fldt	20(%esp)
	fldt	4(%esp)
	faddp	%st(1)
	movl	36(%esp), %eax
	fstps	(%eax)
	ret

instead of:

_foo:
	subl	$4, %esp
	fldt	24(%esp)
	fldt	8(%esp)
	faddp	%st(1)
	fstps	(%esp)
	movl	40(%esp), %eax
	movss	(%esp), %xmm0
	movss	%xmm0, (%eax)
	addl	$4, %esp
	ret

llvm-svn: 46140

1ea55cf8

* Introduce a new SelectionDAG::getIntPtrConstant method · 72733e57

Chris Lattner authored Jan 17, 2008

  and switch various codegen pieces and the X86 backend over
  to using it.

* Add some comments to SelectionDAGNodes.h

* Introduce a second argument to FP_ROUND, which indicates
  whether the FP_ROUND changes the value of its input. If
  not it is safe to xform things like fp_extend(fp_round(x)) -> x.

llvm-svn: 46125

72733e57

Jan 16, 2008

Trampoline support for x86-64. This looks like · 32b0ff68

Duncan Sands authored Jan 16, 2008

it should work, but I have no machine to test
it on.  Committed because it will at least
cause no harm, and maybe someone can test it
for me!

llvm-svn: 46098

32b0ff68

make it more clear that this predicate only applies to scalar FP types. · e8bb9f21
Chris Lattner authored Jan 16, 2008
```
llvm-svn: 46058
```
e8bb9f21
introduce a isTypeInSSEReg predicate, which allows us to simplify · 14e616ef
Chris Lattner authored Jan 16, 2008
```
some code.  No functionality change.

llvm-svn: 46055
```
14e616ef

My previous commit had an incomplete message, it should have been: · 8f7cec85

Chris Lattner authored Jan 16, 2008

make the 'fp return in ST(0)' optimization smart enough to
look through token factor nodes.  THis allows us to compile 
testcases like CodeGen/X86/fp-stack-retcopy.ll into:

_carg:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	fldl	(%esp)
	addl	$12, %esp
	ret

instead of:

_carg:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

Still not optimal, but much better and this is a trivial patch.  Fixing 
the rest requires invasive surgery that is is not llvm 2.2 material.

llvm-svn: 46054

8f7cec85

make the 'fp return in ST(0)' optimization smart enough to · ea001f1d
Chris Lattner authored Jan 16, 2008
```
look through token factor

llvm-svn: 46053
```
ea001f1d
various whitespace cleanups, no functionality change. · de5c74f1
Chris Lattner authored Jan 16, 2008
```
llvm-svn: 46052
```
de5c74f1

Jan 15, 2008
- no need to expand ISD::TRAP to X86ISD::TRAP, just match ISD::TRAP. · 3c3fefde
  Chris Lattner authored Jan 15, 2008
```
llvm-svn: 46015
```
  3c3fefde
- For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed · 6bbbc4cb
  Anton Korobeynikov authored Jan 15, 2008
```
as well as PPC codegen

llvm-svn: 46001
```
  6bbbc4cb
Jan 13, 2008
- Whitespace tweak. · 51fe7bbc
  Duncan Sands authored Jan 13, 2008
```
llvm-svn: 45940
```
  51fe7bbc
Jan 12, 2008
- Code clean up. · 7411b510
  Evan Cheng authored Jan 12, 2008
```
llvm-svn: 45898
```
  7411b510
Jan 11, 2008

hrm - correct spelling. · 06da9e2d

Arnold Schwaighofer authored Jan 11, 2008

Actually were not riding any arguments. Sadly there is no semantic spell checker that is going to safe you from such a mistake.

llvm-svn: 45868

06da9e2d

Improve tail call optimized call's argument lowering. Before this · 6cf72fbb

Arnold Schwaighofer authored Jan 11, 2008

commit all arguments where moved to the stack slot where they would
reside on a normal function call before the lowering to the tail call
stack slot. This was done to prevent arguments overwriting each other.
Now only arguments sourcing from a FORMAL_ARGUMENTS node or a
CopyFromReg node with virtual register (could also be a caller's
argument) are lowered indirectly.

 --This line, and those below, will be ignored--

M    X86/X86ISelLowering.cpp
M    X86/README.txt

llvm-svn: 45867

6cf72fbb

Correct a copy and paste error. · bf1816ea
Arnold Schwaighofer authored Jan 11, 2008
```
llvm-svn: 45865
```
bf1816ea

Jan 10, 2008
- Mark byval parameter stack objects mutable for now. · a2655249
  Evan Cheng authored Jan 10, 2008
```
llvm-svn: 45813
```
  a2655249
- Do not use the stack pointer directly, issue a copyfromreg instead. Otherwise... · fead113f
  Evan Cheng authored Jan 10, 2008
```
Do not use the stack pointer directly, issue a copyfromreg instead. Otherwise we can end up with something like ADD32ri %esp, x which two-address pass won't like.

llvm-svn: 45798
```
  fead113f
- Remove comments that do not correspond to anything after recent refactoring. · 73d10178
  Evan Cheng authored Jan 10, 2008
```
llvm-svn: 45792
```
  73d10178
Jan 08, 2008
- Unbreak x86-64. · 8242168e
  Evan Cheng authored Jan 07, 2008
```
llvm-svn: 45725
```
  8242168e
Jan 05, 2008
- Remove an incorrect optimization that is performed correctly by · 22950d26
  Nate Begeman authored Jan 05, 2008
```
the target independent legalizer.

llvm-svn: 45631
```
  22950d26
- Refactoring the x86 and x86-64 calling convention implementations, · 92319583
  Gordon Henriksen authored Jan 05, 2008
```
unifying the copied algorithms and saving over 500 LOC. There should
be no functionality change, but please test on your favorite x86
target.

llvm-svn: 45627
```
  92319583
Jan 03, 2008
- First steps in in X86 calling convention cleanup. · f066fc47
  Gordon Henriksen authored Jan 03, 2008
```
llvm-svn: 45536
```
  f066fc47
Dec 31, 2007

Rename SSARegMap -> MachineRegisterInfo in keeping with the idea · a10fff51

Chris Lattner authored Dec 31, 2007

that "machine" classes are used to represent the current state of
the code being compiled.  Given this expanded name, we can start 
moving other stuff into it.  For now, move the UsedPhysRegs and
LiveIn/LoveOuts vectors from MachineFunction into it.

Update all the clients to match.

This also reduces some needless #includes, such as MachineModuleInfo
from MachineFunction.

llvm-svn: 45467

a10fff51

Add new shorter predicates for testing machine operands for various types: · a5bb370a

Chris Lattner authored Dec 30, 2007

e.g. MO.isMBB() instead of MO.isMachineBasicBlock().  I don't plan on 
switching everything over, so new clients should just start using the 
shorter names.

Remove old long accessors, switching everything over to use the short
accessor: getMachineBasicBlock() -> getMBB(), 
getConstantPoolIndex() -> getIndex(), setMachineBasicBlock -> setMBB(), etc.

llvm-svn: 45464

a5bb370a

Dec 29, 2007

Remove attribution from file headers, per discussion on llvmdev. · f3ebc3f3
Chris Lattner authored Dec 29, 2007
```
llvm-svn: 45418
```
f3ebc3f3

Codegen: · 07ccbfa6

Chris Lattner authored Dec 29, 2007

as:

_bar:
	pushl	%esi
	subl	$8, %esp
	movl	16(%esp), %esi
	call	L_foo$stub
	fstps	(%esi)
	addl	$8, %esp
	popl	%esi
	#FP_REG_KILL
	ret

instead of:

_bar:
	pushl	%esi
	subl	$8, %esp
	movl	16(%esp), %esi
	call	L_foo$stub
	fstpl	(%esi)
	cvtsd2ss	(%esi), %xmm0
	movss	%xmm0, (%esi)
	addl	$8, %esp
	popl	%esi
	#FP_REG_KILL
	ret

llvm-svn: 45401

07ccbfa6