Commits · e081d902335fa6db44cb218870d4533444ec3e2d · Roger Ferrer / llvm-epi-0.8

Jun 27, 2009

pull @GOT, @GOTOFF, @GOTPCREL handling into isel from the asmprinter. · ae0acfce
Chris Lattner authored Jun 27, 2009
```
llvm-svn: 74378
```
ae0acfce

Reimplement rip-relative addressing in the X86-64 backend. The new · fea81da4

Chris Lattner authored Jun 27, 2009

implementation primarily differs from the former in that the asmprinter
doesn't make a zillion decisions about whether or not something will be
RIP relative or not.  Instead, those decisions are made by isel lowering
and propagated through to the asm printer.  To achieve this, we:

1. Represent RIP relative addresses by setting the base of the X86 addr
   mode to X86::RIP.
2. When ISel Lowering decides that it is safe to use RIP, it lowers to
   X86ISD::WrapperRIP.  When it is unsafe to use RIP, it lowers to
   X86ISD::Wrapper as before.
3. This removes isRIPRel from X86ISelAddressMode, representing it with
   a basereg of RIP instead.
4. The addressing mode matching logic in isel is greatly simplified.
5. The asmprinter is greatly simplified, notably the "NotRIPRel" predicate
   passed through various printoperand routines is gone now.
6. The various symbol printing routines in asmprinter now no longer infer
   when to emit (%rip), they just print the symbol.

I think this is a big improvement over the previous situation.  It does have
two small caveats though: 1. I implemented a horrible "no-rip" modifier for
the inline asm "P" constraint modifier.  This is a short term hack, there is
a much better, but more involved, solution.  2. I had to xfail an 
-aggressive-remat testcase because it isn't handling the use of RIP in the
constant-pool reading instruction.  This specific test is easy to fix without
-aggressive-remat, which I intend to do next.

llvm-svn: 74372

fea81da4

Jun 26, 2009
- Move all the TLS processing logic into isel, don't do it in asmprinter at all. · 49ed726e
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74327
```
  49ed726e
- move magic for PIC constantpool references from asmprinter to isel. · 2ed6a9d7
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74313
```
  2ed6a9d7
- start adding logic in isel to determine asm printer semantics, step N of M. · 2aaad91b
  Chris Lattner authored Jun 26, 2009
```
llvm-svn: 74246
```
  2aaad91b
Jun 21, 2009
- indentation fix · a3da048c
  Chris Lattner authored Jun 21, 2009
```
llvm-svn: 73840
```
  a3da048c
Jun 16, 2009
- Misc accumulated tweaks to legalization logic for various targets. · 48021d15
  Eli Friedman authored Jun 16, 2009
```
llvm-svn: 73476
```
  48021d15
Jun 15, 2009
- I got J and K backward, many thanks to Eli for spotting this! · c68a564c
  Chris Lattner authored Jun 15, 2009
```
llvm-svn: 73372
```
  c68a564c
- implement support for the 'K' asm constraint, PR4347 · ea3621a6
  Chris Lattner authored Jun 15, 2009
```
llvm-svn: 73366
```
  ea3621a6
Jun 12, 2009

Fix Bug 4278: X86-64 with -tailcallopt calling convention · e3a018d7

Arnold Schwaighofer authored Jun 12, 2009

out of sync with regular cc.

The only difference between the tail call cc and the normal
cc was that one parameter register - R9 - was reserved for
calling functions through a function pointer. After time the
tail call cc has gotten out of sync with the regular cc. 

We can use R11 which is also caller saved but not used as
parameter register for potential function pointers and
remove the special tail call cc on x86-64.

llvm-svn: 73233

e3a018d7

Jun 10, 2009
- Silence a warning · 06039d11
  Anton Korobeynikov authored Jun 09, 2009
```
llvm-svn: 73152
```
  06039d11
Jun 07, 2009

Get rid of some unnecessary code. · 0d423441
Eli Friedman authored Jun 07, 2009
```
llvm-svn: 73017
```
0d423441

Slightly generalize the code that handles shuffles of consecutive loads · 32345872

Eli Friedman authored Jun 07, 2009

on x86 to handle more cases.  Fix a bug in said code that would cause it 
to read past the end of an object.  Rewrite the code in 
SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. 
Remove PerformBuildVectorCombine, which is no longer necessary with 
these changes.  In addition to simplifying the code, with this change, 
we can now catch a few more cases of consecutive loads.

llvm-svn: 73012

32345872

Jun 06, 2009
- Avoid crashing on a variable-index insertelement with element type i16. · 75c496f9
  Eli Friedman authored Jun 06, 2009
```
llvm-svn: 72991
```
  75c496f9
- Get rid of some bogus patterns for X86vzmovl. Don't create VZEXT_MOVL · 1b1844ad
  Eli Friedman authored Jun 06, 2009
```
nodes for vectors with an i16 element type.  Add an optimization for 
building a vector which is all zeros/undef except for the bottom 
element, where the bottom element is an i8 or i16.

llvm-svn: 72988
```
  1b1844ad
- PR2598: make sure to expand illegal forms of integer/floating-point · b45e8ce6
  Eli Friedman authored Jun 06, 2009
```
conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> 
<4 x float>.

llvm-svn: 72983
```
  b45e8ce6
Jun 05, 2009

Add new function attribute - noimplicitfloat · d1c7d349

Devang Patel authored Jun 05, 2009

Update code generator to use this attribute and remove NoImplicitFloat target option.
Update llc to set this attribute when -no-implicit-float command line option is used.

llvm-svn: 72959

d1c7d349

Adapt the x86 build_vector dagcombine to the current state of the legalizer. · 624690c6

Nate Begeman authored Jun 05, 2009

build vectors with i64 elements will only appear on 32b x86 before legalize.
Since vector widening occurs during legalize, and produces i64 build_vector 
elements, the dag combiner is never run on these before legalize splits them
into 32b elements.

Teach the build_vector dag combine in x86 back end to recognize consecutive 
loads producing the low part of the vector.

Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes
since that was required implicitly.

Add a testcase for the transform.

Old:
	subl	$28, %esp
	movl	32(%esp), %eax
	movl	4(%eax), %ecx
	movl	%ecx, 4(%esp)
	movl	(%eax), %eax
	movl	%eax, (%esp)
	movaps	(%esp), %xmm0
	pmovzxwd	%xmm0, %xmm0
	movl	36(%esp), %eax
	movaps	%xmm0, (%eax)
	addl	$28, %esp
	ret

New:
	movl	4(%esp), %eax
	pmovzxwd	(%eax), %xmm0
	movl	8(%esp), %eax
	movaps	%xmm0, (%eax)
	ret

llvm-svn: 72957

624690c6

Evan thinks NoImplicitFloat check is not required here. · 54707b42
Devang Patel authored Jun 05, 2009
```
llvm-svn: 72954
```
54707b42

Jun 03, 2009
- Remove unnecessary #includes. · 11231d0c
  Dan Gohman authored Jun 03, 2009
```
llvm-svn: 72782
```
  11231d0c
Jun 02, 2009

Revert 72707 and 72709, for the moment. · 5234d379
Dale Johannesen authored Jun 02, 2009
```
llvm-svn: 72712
```
5234d379

Make the implicit inputs and outputs of target-independent · 0b8ca792

Dale Johannesen authored Jun 01, 2009

ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to)
instead of MVT::Flag.  Remove CARRY_FALSE in favor of 0; adjust
all target-independent code to use this format.

Most targets will still produce a Flag-setting target-dependent
version when selection is done.  X86 is converted to use i32
instead, which means TableGen needs to produce different code
in xxxGenDAGISel.inc.  This keys off the new supportsHasI1 bit
in xxxInstrInfo, currently set only for X86; in principle this
is temporary and should go away when all other targets have
been converted.  All relevant X86 instruction patterns are
modified to represent setting and using EFLAGS explicitly.  The
same can be done on other targets.

The immediate behavior change is that an ADC/ADD pair are no
longer tightly coupled in the X86 scheduler; they can be
separated by instructions that don't clobber the flags (MOV).
I will soon add some peephole optimizations based on using
other instructions that set the flags to feed into ADC.

llvm-svn: 72707

0b8ca792

May 30, 2009
- Untabification. · 09f17a84
  Bill Wendling authored May 30, 2009
```
llvm-svn: 72604
```
  09f17a84
May 28, 2009

Added optimization that narrow load / op / store and the 'op' is a bit... · a9cda8ab

Evan Cheng authored May 28, 2009

Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code.
e.g.
orl     $65536, 8(%rax)
=>
orb     $1, 10(%rax)

Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization.

llvm-svn: 72507

a9cda8ab

May 27, 2009
- Ger rid of some dead code. · a56159b7
  Eli Friedman authored May 27, 2009
```
llvm-svn: 72494
```
  a56159b7
- Don't abuse the quirky behavior of LegalizeDAG for XINT_TO_FP and · acb851a8
  Eli Friedman authored May 27, 2009
```
FP_TO_XINT.  Necessary for some cleanups I'm working on.  Updated 
from the previous version (r72431) to fix a bug and make some things a 
bit clearer.

llvm-svn: 72445
```
  acb851a8
May 26, 2009
- Back out r72431, it is causing a number of compilation crashes with clang. · d96b1178
  Daniel Dunbar authored May 26, 2009
```
llvm-svn: 72436
```
  d96b1178
- Don't abuse the quirky behavior of LegalizeDAG for XINT_TO_FP and · 8c7bff96
  Eli Friedman authored May 26, 2009
```
FP_TO_XINT.  Necessary for some cleanups I'm working on. 

llvm-svn: 72431
```
  8c7bff96
May 24, 2009
- Make the X86 backend mark EXTRACT_SUBVECTOR as Expand, at least for the · 2199ed39
  Eli Friedman authored May 23, 2009
```
moment.

llvm-svn: 72350
```
  2199ed39
May 23, 2009

Make the x86 backend custom-lower UINT_TO_FP and FP_TO_UINT on 32-bit · dfe4f253

Eli Friedman authored May 23, 2009

systems instead of attempting to promote them to a 64-bit SINT_TO_FP or 
FP_TO_SINT.  This is in preparation for removing the type legalization 
code from LegalizeDAG: once type legalization is gone from LegalizeDAG, 
it won't be able to handle the i64 operand/result correctly.

This isn't quite ideal, but I don't think any other operation for any 
target ends up in this situation, so treating this case specially seems 
reasonable.

llvm-svn: 72324

dfe4f253

May 13, 2009
- Run code placement optimization for targets that want it (arm and x86 for now). · ab0d2339
  Evan Cheng authored May 13, 2009
```
llvm-svn: 71726
```
  ab0d2339
May 08, 2009
- Fix PR4152: asm constraint validation happens before dag combine, so we · f1d9b914
  Chris Lattner authored May 08, 2009
```
need to work a bit to combine things like (x+c1+c2) into x+c3.

llvm-svn: 71232
```
  f1d9b914
Apr 30, 2009
- Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. · 7e6e3527
  Nate Begeman authored Apr 29, 2009
```
llvm-svn: 70425
```
  7e6e3527
Apr 29, 2009
- Implement review feedback for vector shuffle work. · 5f829d89
  Nate Begeman authored Apr 29, 2009
```
llvm-svn: 70372
```
  5f829d89
Apr 27, 2009

2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. · 8d6d4b92

Nate Begeman authored Apr 27, 2009

PR2957

ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle
mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes
as the shuffle mask.  A value of -1 represents UNDEF.

In addition to eliminating the creation of illegal BUILD_VECTORS just to 
represent shuffle masks, we are better about canonicalizing the shuffle mask,
resulting in substantially better code for some classes of shuffles.

llvm-svn: 70225

8d6d4b92

Apr 24, 2009

Fix PR 4004 by including the call to __tls_get_addr in X86tlsaddr. This is not · c1396a23
Rafael Espindola authored Apr 24, 2009
```
very elegant, but neither is the tls specification :-(

llvm-svn: 69968
```
c1396a23
Revert 69952. Causes testsuite failures on linux x86-64. · b93db668
Rafael Espindola authored Apr 24, 2009
```
llvm-svn: 69967
```
b93db668

PR2957 · bb881d66

Nate Begeman authored Apr 24, 2009

ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle
mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes
as the shuffle mask. A value of -1 represents UNDEF.

In addition to eliminating the creation of illegal BUILD_VECTORS just to
represent shuffle masks, we are better about canonicalizing the shuffle mask,
resulting in substantially better code for some classes of shuffles.

A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next.

llvm-svn: 69952

bb881d66

Apr 21, 2009
- Get rid of what looks like a copy-and-pasted typo. · 7ce5cc6b
  Duncan Sands authored Apr 21, 2009
```
Spotted by gcc-4.5.

llvm-svn: 69673
```
  7ce5cc6b
Apr 20, 2009

Move duplicated AddLiveIn function from X86 and ARM backends to be a method · f8b85477

Bob Wilson authored Apr 20, 2009

in the MachineFunction class, renaming it to addLiveIn for consistency with
the same method in MachineBasicBlock.  Thanks for Anton for suggesting this.

llvm-svn: 69615

f8b85477