- Mar 09, 2008
-
Chris Lattner authored
llvm-svn: 48117
-
Chris Lattner authored
they are produced by calls (which are known exact) and by cross-block copies which are known to be produced by extends. This improves:

define double @test2() {
	%tmp85 = call double asm sideeffect "fld0", "={st(0)}"()
	ret double %tmp85
}

from:

_test2:
	subl $20, %esp
	# InlineAsm Start
	fld0
	# InlineAsm End
	fstpl 8(%esp)
	movsd 8(%esp), %xmm0
	movsd %xmm0, (%esp)
	fldl (%esp)
	addl $20, %esp
	#FP_REG_KILL
	ret

to:

_test2:
	# InlineAsm Start
	fld0
	# InlineAsm End
	#FP_REG_KILL
	ret

by avoiding an f64 <-> f80 round trip.

llvm-svn: 48108
-
Chris Lattner authored
an RFP register class. Teach ScheduleDAG how to handle CopyToReg with different src/dst reg classes. This allows us to compile trivial inline asms that expect stuff on top of the x87 FP stack. llvm-svn: 48107
-
Chris Lattner authored
in different register classes, e.g. a copy of ST(0) to RFP*. This gets some really trivial inline asm working that plops things on the top of the stack (PR879). llvm-svn: 48105
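For context, a user-level illustration of the pattern these three changes enable; this is a sketch, not taken from the commits, and the "=t" constraint is GCC/Clang syntax for "output lands in ST(0), the top of the x87 stack":

	// Inline asm that leaves its result on the top of the x87 stack,
	// the PR879 shape.  "fld1" pushes the constant 1.0 onto the stack;
	// the "=t" constraint tells the compiler to take the output from
	// ST(0), so it emits the copy out of the FP stack itself.
	double read_st0() {
	  double v;
	  __asm__("fld1" : "=t"(v));
	  return v;
	}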
-
Chris Lattner authored
llvm-svn: 48100
-
Chris Lattner authored
llvm-svn: 48097
-
Chris Lattner authored
of BUILD_VECTORs that only have two unique elements:

1. The previous code was nondeterministic, because it walked a map in SDOperand order, which isn't deterministic.
2. The previous code didn't handle the case when one element was undef very well. Now we ensure that the generated shuffle mask has the undef vector on the RHS (instead of potentially being on the LHS) and that any elements that refer to it are themselves undef.

This allows us to compile CodeGen/X86/vec_set-9.ll into:

_test3:
	movd %rdi, %xmm0
	punpcklqdq %xmm0, %xmm0
	ret

instead of:

_test3:
	movd %rdi, %xmm1
	#IMPLICIT_DEF %xmm0
	punpcklqdq %xmm1, %xmm0
	ret

... saving a register.

llvm-svn: 48060
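A minimal sketch of the RHS canonicalization described in point 2, written over a plain integer mask rather than the real SelectionDAG types (all names here are hypothetical):

	#include <vector>

	// Given a shuffle of two inputs where the LHS input is entirely
	// undef, move it to the RHS: mask entries that pointed into the
	// undef LHS become explicit undef lanes (-1), and entries that
	// pointed into the RHS are rebased so they refer to the new LHS.
	// Callers would swap the two input operands to match.
	void putUndefOnRHS(std::vector<int> &Mask, int NumElts) {
	  for (int &M : Mask) {
	    if (M < 0)
	      continue;          // already an undef lane
	    if (M < NumElts)
	      M = -1;            // pointed into the undef LHS -> undef lane
	    else
	      M -= NumElts;      // pointed into the RHS -> now the LHS
	  }
	}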
-
Chris Lattner authored
_test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret instead of: _test3: #IMPLICIT_DEF %rax movd %rax, %xmm0 movd %rdi, %xmm1 punpcklqdq %xmm1, %xmm0 ret This is still not ideal. There is no reason to two xmm regs. llvm-svn: 48058
-
- Mar 08, 2008
-
Evan Cheng authored
Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0|1|2} and prefetchnta instructions. llvm-svn: 48042
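A small example of how this surfaces in source code (a sketch; __builtin_prefetch lowers to @llvm.prefetch, and on x86 the locality argument selects between prefetchnta and prefetcht{0|1|2}):

	// Walk an array while prefetching ahead.  rw=0 requests a read
	// prefetch; locality 3 asks for maximum temporal locality
	// (prefetcht0 on x86), while locality 0 would request a
	// non-temporal prefetchnta.
	long sum(const int *a, int n) {
	  long s = 0;
	  for (int i = 0; i < n; ++i) {
	    __builtin_prefetch(&a[i + 16], /*rw=*/0, /*locality=*/3);
	    s += a[i];
	  }
	  return s;
	}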
-
- Mar 06, 2008
-
Evan Cheng authored
llvm-svn: 47998
-
Evan Cheng authored
llvm-svn: 47992
-
- Mar 05, 2008
-
Dale Johannesen authored
and add some protection against creating such. llvm-svn: 47957
-
Chris Lattner authored
except ppc long double. This allows us to shrink constant pool entries for x86 long double constants, which in turn allows us to use flds/fldl instead of fldt. llvm-svn: 47938
-
Chris Lattner authored
all the way to float, not stopping at double. llvm-svn: 47937
-
Dan Gohman authored
bug in r47928 (Int64Ty is the correct type for the constant pool entry here) and removes the asserts, now that the code is capable of handling i128. llvm-svn: 47932
-
Evan Cheng authored
For x86, if SSE2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constants since fldt is very expensive. llvm-svn: 47931
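The feasibility test behind this kind of shrinking can be sketched with LLVM's APFloat; this assumes the real code performs the equivalent convert-and-check-exactness step, and the helper name is made up for the example:

	#include "llvm/ADT/APFloat.h"
	using namespace llvm;

	// Can constant V be stored in the narrower format Smaller and
	// reloaded with an extending load without changing its value?
	bool shrinksExactly(const APFloat &V, const fltSemantics &Smaller) {
	  bool losesInfo = false;
	  APFloat Tmp = V;
	  Tmp.convert(Smaller, APFloat::rmNearestTiesToEven, &losesInfo);
	  return !losesInfo;
	}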
-
Andrew Lenharth authored
llvm-svn: 47929
-
Dan Gohman authored
llvm-svn: 47928
-
- Mar 04, 2008
-
Roman Levenstein authored
The basic idea is that all these algorithms are computing the longest path from the root node or to the exit node. Therefore the existing implementation, which used an iterative and potentially exponential algorithm, was changed to a well-known graph algorithm based on dynamic programming with a linear run time. llvm-svn: 47884
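The linear-time scheme can be sketched as follows, using a hypothetical minimal graph representation rather than the actual SUnit data structures: process nodes in reverse topological order so every edge is examined exactly once.

	#include <algorithm>
	#include <vector>

	struct Node { std::vector<int> Succs; };

	// Longest path (in edges) from each node to the exit of a DAG.
	// RevTopo lists nodes so that every successor appears before its
	// predecessors; one pass over the edges then suffices.
	std::vector<int> longestPathToExit(const std::vector<Node> &DAG,
	                                   const std::vector<int> &RevTopo) {
	  std::vector<int> Height(DAG.size(), 0);
	  for (int N : RevTopo)
	    for (int S : DAG[N].Succs)
	      Height[N] = std::max(Height[N], Height[S] + 1);
	  return Height;
	}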
-
Evan Cheng authored
Refactor ExpandConstantFP so it can optimize load from constpool of types larger than f64 into extload from smaller types. llvm-svn: 47883
-
Evan Cheng authored
llvm-svn: 47872
-
Dan Gohman authored
llvm-svn: 47869
-
Dan Gohman authored
llvm-svn: 47868
-
- Mar 03, 2008
-
Dan Gohman authored
llvm-svn: 47867
-
Dan Gohman authored
llvm-svn: 47866
-
Dan Gohman authored
llvm-svn: 47864
-
- Mar 01, 2008
-
Andrew Lenharth authored
llvm-svn: 47798
-
Dale Johannesen authored
unaligned load/store code using them. Per review of unaligned load/store vector patch. llvm-svn: 47782
-
Evan Cheng authored
llvm-svn: 47779
-
- Feb 29, 2008
-
Dan Gohman authored
llvm-svn: 47746
-
Dan Gohman authored
which allows more of the surrounding arithmetic to be done with APInt instead of uint64_t. llvm-svn: 47745
-
Dan Gohman authored
an APInt into a uint64_t to call getConstant. llvm-svn: 47742
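The conversion pattern involved here looks roughly like this with LLVM's APInt (a sketch; the surrounding getConstant call sites are not shown, and the helper name is made up):

	#include "llvm/ADT/APInt.h"
	using namespace llvm;

	// Resize an APInt of arbitrary width to exactly 64 bits, then
	// extract the raw word for an API that still takes uint64_t.
	// zextOrTrunc handles both wider and narrower inputs.
	uint64_t toWord64(const APInt &V) {
	  return V.zextOrTrunc(64).getZExtValue();
	}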
-
- Feb 28, 2008
-
Dale Johannesen authored
generic & x86 versions; change generic to follow x86 and improve comments. Add PPC version (not right for non-Darwin). llvm-svn: 47734
-
Dale Johannesen authored
llvm-svn: 47722
-
Evan Cheng authored
llvm-svn: 47710
-
Chris Lattner authored
llvm-svn: 47708
-
Evan Cheng authored
llvm-svn: 47703
-
- Feb 27, 2008
-
Dale Johannesen authored
same size as an int type by doing a bitconvert of load/store of the int type (same algorithm as floating point). This makes them work for ppc Altivec. There was some code that purported to handle loads of (some) vectors by splitting them into two smaller vectors, but getExtLoad rejects subvector loads, so this could never have worked; the patch removes it. llvm-svn: 47696
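The transformation can be pictured in plain C++ (a sketch only; memcpy stands in for the BITCONVERT node, and uint64_t for an integer type of the same width as the vector):

	#include <cstdint>
	#include <cstring>

	struct V2f { float e[2]; };  // stands in for a v2f32 value

	// Load an 8-byte vector by loading an i64 and bit-converting it;
	// the store direction is symmetric.  This works even when the
	// target has no legal load for the vector type but does have
	// integer loads of the same width.
	V2f loadViaInt(const void *P) {
	  std::uint64_t Bits;
	  std::memcpy(&Bits, P, sizeof(Bits));          // the integer load
	  V2f Result;
	  std::memcpy(&Result, &Bits, sizeof(Result));  // BITCONVERT i64 -> v2f32
	  return Result;
	}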
-
Dan Gohman authored
llvm-svn: 47686
-
Duncan Sands authored
llvm-svn: 47676