  1. Jan 29, 2008
    • Evan Cheng · 084a1cdc
      Work in progress. This patch *fixes* x86-64 calls which are modelled as StructRet but really should be returned in registers, e.g. _Complex long double and some 128-bit aggregates. This is a short-term solution that is necessary only because llvm, for now, cannot model i128 nor calls with multiple results.
      Status: This only works for direct calls, and only the caller side is done. Disabled for now.
      
      llvm-svn: 46527
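
      A hypothetical C-level reduction (not the actual testcase) of the return kinds
      the patch is about. Under the x86-64 SysV ABI these values come back in
      registers -- _Complex long double in the x87 stack (ST0/ST1), a 16-byte
      two-integer aggregate in RAX:RDX -- rather than through a hidden sret pointer.
      The function names below are invented, and _Complex in C++ is a GNU extension.

      	struct TwoWords { long long lo, hi; };      // a 128-bit aggregate

      	extern "C" TwoWords get128(void);           // hypothetical externals
      	extern "C" _Complex long double cld(void);

      	long long use(void) {
      	  TwoWords t = get128();                    // returned in RAX:RDX
      	  _Complex long double c = cld();           // returned in ST0/ST1
      	  return t.lo + (long long)(__real__ c);    // __real__: GNU extension
      	}
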
    • Duncan Sands · 05837eda
      Use getPreferredAlignmentLog or getPreferredAlignment
      to get the alignment of global variables, rather than
      using hand-made versions.
      
      llvm-svn: 46495
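
      A minimal sketch of the preferred query, assuming the LLVM 2.x-era TargetData
      interface named in the commit; the wrapper function and include paths here are
      illustrative only.

      	#include "llvm/GlobalVariable.h"
      	#include "llvm/Target/TargetData.h"

      	// Ask the target for a global's preferred alignment instead of
      	// re-deriving it by hand from the type and any explicit alignment.
      	static unsigned preferredGlobalAlign(const llvm::TargetData &TD,
      	                                     const llvm::GlobalVariable *GV) {
      	  return TD.getPreferredAlignment(GV);  // getPreferredAlignmentLog(GV)
      	                                        // returns the same value as log2
      	}
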
    • Dale Johannesen · 2b3bc304
      Handle the 'X' constraint in asms better.
      llvm-svn: 46485
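
      For reference, the GCC-style 'X' constraint accepts any operand at all
      (register, memory, or immediate). A tiny hypothetical use, unrelated to
      whatever testcase motivated the fix:

      	// The empty asm body just forces v through an 'X' operand; the
      	// compiler must be able to lower wherever it chose to place it.
      	double keep_alive(double v) {
      	  __asm__ volatile("" : : "X"(v));
      	  return v;
      	}
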
    • Scott Michel · ceae3bbf
      Overhaul Cell SPU's addressing mode internals so that there are now
      only two addressing mode nodes, SPUaform and SPUindirect (instead of the
      three previous ones, SPUaform, SPUdform and SPUxform). This improves the
      generated code somewhat because reg+reg addressing is now avoided whenever
      possible. It also simplifies the address selection logic, which was the
      main point of doing this.
      
      Also, for various global variables that would be loaded using SPU's
      A-form addressing, prefer D-form offs[reg] addressing, keeping the
      base in a register if the variable is used more than once.
      
      llvm-svn: 46483
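
      A hypothetical sketch (invented name, not the actual SPU selection code) of
      the reuse rule described above: once a global is referenced more than once,
      its base address is kept in a register and each access becomes a D-form
      offs[reg] off that base.

      	// Multiple uses amortize the one-time cost of materializing the base.
      	bool shouldKeepBaseInRegister(unsigned NumUses) {
      	  return NumUses > 1;
      	}
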
  2. Jan 24, 2008
    • Chris Lattner · a91f77ea
      Significantly simplify and improve handling of FP function results on x86-32.
      This case returns the value in ST(0) and then has to convert it to an SSE
      register.  This causes significant codegen ugliness in some cases.  For 
      example in the trivial fp-stack-direct-ret.ll testcase we used to generate:
      
      _bar:
      	subl	$28, %esp
      	call	L_foo$stub
      	fstpl	16(%esp)
      	movsd	16(%esp), %xmm0
      	movsd	%xmm0, 8(%esp)
      	fldl	8(%esp)
      	addl	$28, %esp
      	ret
      
      because we move the result of foo() into an XMM register, then have to
      move it back for the return of bar.
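
      The testcase boils down to roughly this shape (a hypothetical C-level
      equivalent; the real test is LLVM IR), so no SSE round trip should be
      needed at all:

      	extern "C" double foo(void);   // hypothetical external

      	extern "C" double bar(void) {
      	  return foo();                // result is already in ST(0) on x86-32
      	}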
      
      Instead of hacking ever more special cases into the call result lowering code,
      we take a much simpler approach: on x86-32, an fp call result is modeled as
      always being returned in an f80 register, which is then truncated to f32 or
      f64 as needed. Similarly, when returning a result ourselves, we model it as an
      extension to f80 followed by the return.
      
      This exposes the truncates and extensions to the dag combiner, allowing
      target-independent code to hack on them, eliminating them in this case. This
      gives us this code for the example above:
      
      _bar:
      	subl	$12, %esp
      	call	L_foo$stub
      	addl	$12, %esp
      	ret
      
      The nasty aspect of this is that these conversions are not legal, but we want
      the second pass of the dag combiner (post-legalize) to be able to hack on them.
      To handle this, we lie to legalize and say they are legal, then custom expand
      them on entry to the isel pass (PreprocessForFPConvert).  This is gross, but
      less gross than the code it is replacing :)
      
      This also allows us to generate better code in several other cases.  For 
      example on fp-stack-ret-conv.ll, we now generate:
      
      _test:
      	subl	$12, %esp
      	call	L_foo$stub
      	fstps	8(%esp)
      	movl	16(%esp), %eax
      	cvtss2sd	8(%esp), %xmm0
      	movsd	%xmm0, (%eax)
      	addl	$12, %esp
      	ret
      
      where before we produced (incidentally, the old bad code is identical to what
      gcc produces):
      
      _test:
      	subl	$12, %esp
      	call	L_foo$stub
      	fstpl	(%esp)
      	cvtsd2ss	(%esp), %xmm0
      	cvtss2sd	%xmm0, %xmm0
      	movl	16(%esp), %eax
      	movsd	%xmm0, (%eax)
      	addl	$12, %esp
      	ret
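
      For reference, a hypothetical C-level shape of this case, inferred from the
      assembly above (the real testcase is LLVM IR): a float return value is
      widened to double and stored through a pointer argument, so one store plus
      one cvtss2sd is all that is needed.

      	extern "C" float foo(void);    // hypothetical external

      	extern "C" void test(double *P) {
      	  *P = foo();
      	}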
      
      Note that we generate slightly worse code on pr1505b.ll due to a scheduling 
      deficiency that is unrelated to this patch.
      
      llvm-svn: 46307
    • Evan Cheng · 35abd840
      Let each target decide byval alignment. For X86, it's 4 bytes unless the aggregate contains SSE vector(s). For x86-64, it's the max of 8 and the alignment of the type.
      
      llvm-svn: 46286
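
      A hypothetical sketch of the per-target rule stated above (an invented helper,
      not the actual LLVM target hook). The x86-32 behavior when SSE vectors are
      present is an assumption here: the type's own, larger alignment is used.

      	#include <algorithm>

      	unsigned byValAlignment(bool IsX86_64, bool ContainsSSEVector,
      	                        unsigned TypeAlign) {
      	  if (IsX86_64)
      	    return std::max(8u, TypeAlign);  // x86-64: max of 8 and type alignment
      	  // x86-32: 4 bytes, unless the aggregate contains SSE vectors.
      	  return ContainsSSEVector ? TypeAlign : 4;
      	}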