Commits · 868bd6ab52e08049e315c1d51a62f6fe6d424afb · Roger Ferrer / llvm-epi-0.8

Jun 06, 2009
- Fix an obvious typo. · 868bd6ab
  Eli Friedman authored Jun 06, 2009
```
llvm-svn: 72987
```
  868bd6ab
- x86_64 now uses the correct ELF e_machine type · 7531e923
  Bruno Cardoso Lopes authored Jun 06, 2009
```
llvm-svn: 72986
```
  7531e923
- Get rid of a bogus pattern that interferes with optimization. · 6c101ebf
  Eli Friedman authored Jun 06, 2009
```
llvm-svn: 72985
```
  6c101ebf
- PR2598: make sure to expand illegal forms of integer/floating-point · b45e8ce6
  Eli Friedman authored Jun 06, 2009
```
conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> 
<4 x float>.

llvm-svn: 72983
```
  b45e8ce6
- Add explicit keywords. · d185a7a6
  Dan Gohman authored Jun 05, 2009
```
llvm-svn: 72969
```
  d185a7a6
Jun 05, 2009

Add new function attribute - noimplicitfloat · d1c7d349

Devang Patel authored Jun 05, 2009

Update code generator to use this attribute and remove NoImplicitFloat target option.
Update llc to set this attribute when -no-implicit-float command line option is used.

llvm-svn: 72959

d1c7d349

Adapt the x86 build_vector dagcombine to the current state of the legalizer. · 624690c6

Nate Begeman authored Jun 05, 2009

build vectors with i64 elements will only appear on 32b x86 before legalize.
Since vector widening occurs during legalize, and produces i64 build_vector 
elements, the dag combiner is never run on these before legalize splits them
into 32b elements.

Teach the build_vector dag combine in x86 back end to recognize consecutive 
loads producing the low part of the vector.

Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes
since that was required implicitly.

Add a testcase for the transform.

Old:
	subl	$28, %esp
	movl	32(%esp), %eax
	movl	4(%eax), %ecx
	movl	%ecx, 4(%esp)
	movl	(%eax), %eax
	movl	%eax, (%esp)
	movaps	(%esp), %xmm0
	pmovzxwd	%xmm0, %xmm0
	movl	36(%esp), %eax
	movaps	%xmm0, (%eax)
	addl	$28, %esp
	ret

New:
	movl	4(%esp), %eax
	pmovzxwd	(%eax), %xmm0
	movl	8(%esp), %eax
	movaps	%xmm0, (%eax)
	ret

llvm-svn: 72957

624690c6
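The equivalence that the build_vector combine in 624690c6 relies on can be sketched in Python. This is an illustration only, not the dag combiner's code: the helper names and the sample buffer are hypothetical, and a little-endian layout is assumed, as on x86.

```python
import struct

def load_i32(mem, addr):
    """Load one little-endian 32-bit element."""
    return struct.unpack_from("<I", mem, addr)[0]

def load_i64(mem, addr):
    """Load one little-endian 64-bit element."""
    return struct.unpack_from("<Q", mem, addr)[0]

def build_from_halves(mem, addr):
    """Two consecutive 32-bit loads, combined the way the legalizer splits an i64."""
    lo = load_i32(mem, addr)
    hi = load_i32(mem, addr + 4)
    return lo | (hi << 32)

def zext_4xi16_to_4xi32(mem, addr):
    """Model of pmovzxwd with a memory operand: read 4 words, zero-extend each lane."""
    return list(struct.unpack_from("<4H", mem, addr))

# Hypothetical sample data: four 16-bit lanes followed by padding.
buf = struct.pack("<4H", 1, 2, 0xFFFF, 4) + bytes(8)
same = build_from_halves(buf, 0) == load_i64(buf, 0)
lanes = zext_4xi16_to_4xi32(buf, 0)
```

Because the two halves come from adjacent addresses, they can be read with a single 64-bit access, which is why the "New" sequence can fold the load directly into `pmovzxwd`.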

Evan thinks NoImplicitFloat check is not required here. · 54707b42
Devang Patel authored Jun 05, 2009
```
llvm-svn: 72954
```
54707b42

The DWARF unwind info was incorrect. While compiling with · 5f0d6c44

Bill Wendling authored Jun 04, 2009

`-fomit-frame-pointer', we would lack the DW_CFA_advance_loc information for a
lot of functions, and then they would be `0'. The linker (at least on Darwin)
needs to encode the stack size. In some cases, the stack size is too large to
directly encode. So the linker checks to see if there is a "subl $xxx,%esp"
instruction at the point where the `DW_CFA_def_cfa_offset' says the pc was. If
so, the compact encoding records the offset in the function to where the stack
size is embedded. But because the `DW_CFA_advance_loc' instructions are missing,
it looks before the function and dies.

So, instead of emitting the EH debug label before the stack adjustment
operations, emit it afterwards, right before the frame move stuff.

llvm-svn: 72898

5f0d6c44

Add new function attribute - noredzone. · 72a4d2fe

Devang Patel authored Jun 04, 2009

Update code generator to use this attribute and remove DisableRedZone target option.
Update llc to set this attribute when -disable-red-zone command line option is used.

llvm-svn: 72894

72a4d2fe

Jun 04, 2009
- PR3739, part 2: Use an explicit store to spill XMM registers. (Previously, · 63488f1f
  Eli Friedman authored Jun 04, 2009
```
the code tried to use "push", which doesn't exist for XMM registers.)

llvm-svn: 72836
```
  63488f1f
- PR3739, part 1: Disable the red zone on Win64. · 0cb0c78a
  Eli Friedman authored Jun 04, 2009
```
llvm-svn: 72830
```
  0cb0c78a
- Evan says it's wrong; back out 72808. · 2797e7a4
  Stuart Hastings authored Jun 03, 2009
```
llvm-svn: 72817
```
  2797e7a4
Jun 03, 2009
- Recognize another euphemism for MOVDQ2Q. · 679ec691
  Stuart Hastings authored Jun 03, 2009
```
llvm-svn: 72808
```
  679ec691
- For Darwin / x86_64, override -relocation-model=static to pic if the output is... · ad6f3ff2
  Evan Cheng authored Jun 03, 2009
```
For Darwin / x86_64, override -relocation-model=static to pic if the output is assembly since the Darwin assembler does not really support -static codegen.

I view this as a temporary workaround until the assembler / linker changes.

llvm-svn: 72806
```
  ad6f3ff2
- Remove the redundant TM member from X86DAGToDAGISel; replace it · 4751bb9e
  Dan Gohman authored Jun 03, 2009
```
with an accessor method which simply casts the parent class
SelectionDAGISel's TM to the target-specific type.

llvm-svn: 72801
```
  4751bb9e
- Remove unnecessary #includes. · 11231d0c
  Dan Gohman authored Jun 03, 2009
```
llvm-svn: 72782
```
  11231d0c
- Avoid a warning "'U' might be used uninitialized in · c66ad73e
  Duncan Sands authored Jun 03, 2009
```
this function" when using a not-too-smart compiler.

llvm-svn: 72768
```
  c66ad73e
- Revert r72734. The Darwin assembler doesn't support the static · fc262bab
  Dan Gohman authored Jun 03, 2009
```
relocation model on x86-64. Higher level logic should override
the relocation model to PIC on x86_64-apple-darwin.

llvm-svn: 72746
```
  fc262bab
Jun 02, 2009

On Darwin x86_64 small code model doesn't guarantee code address fits in 32-bit. · 448641d8
Evan Cheng authored Jun 02, 2009
```
llvm-svn: 72734
```
448641d8
Revert 72707 and 72709, for the moment. · 5234d379
Dale Johannesen authored Jun 02, 2009
```
llvm-svn: 72712
```
5234d379
Add missing file. · 7fde88cc
Dale Johannesen authored Jun 01, 2009
```
llvm-svn: 72709
```
7fde88cc

Make the implicit inputs and outputs of target-independent · 0b8ca792

Dale Johannesen authored Jun 01, 2009

ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to)
instead of MVT::Flag.  Remove CARRY_FALSE in favor of 0; adjust
all target-independent code to use this format.

Most targets will still produce a Flag-setting target-dependent
version when selection is done.  X86 is converted to use i32
instead, which means TableGen needs to produce different code
in xxxGenDAGISel.inc.  This keys off the new supportsHasI1 bit
in xxxInstrInfo, currently set only for X86; in principle this
is temporary and should go away when all other targets have
been converted.  All relevant X86 instruction patterns are
modified to represent setting and using EFLAGS explicitly.  The
same can be done on other targets.

The immediate behavior change is that an ADC/ADD pair is no
longer tightly coupled in the X86 scheduler; they can be
separated by instructions that don't clobber the flags (MOV).
I will soon add some peephole optimizations based on using
other instructions that set the flags to feed into ADC.

llvm-svn: 72707

0b8ca792
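As a rough illustration of what 0b8ca792 describes, the following Python sketch models ADDC and ADDE with the carry passed as an ordinary 1-bit value rather than an implicit flag. The function names mirror the node names, but the code is a hypothetical model and not LLVM's implementation.

```python
MASK32 = 0xFFFFFFFF

def addc(a, b):
    """ADDC: 32-bit add returning (sum, carry_out) with an explicit carry bit."""
    total = (a & MASK32) + (b & MASK32)
    return total & MASK32, total >> 32

def adde(a, b, carry_in):
    """ADDE: 32-bit add that consumes an explicit carry and produces a new one."""
    total = (a & MASK32) + (b & MASK32) + carry_in
    return total & MASK32, total >> 32

def add64(x, y):
    """64-bit add built from a 32-bit ADDC/ADDE pair (result wraps modulo 2**64)."""
    lo, carry = addc(x, y)                      # low halves
    hi, _ = adde(x >> 32, y >> 32, carry)       # high halves plus the carry
    return (hi << 32) | lo

result = add64(0x00000001FFFFFFFF, 1)
```

Since the carry is a plain value here, nothing forces the two operations to be adjacent. That matches the scheduling freedom the commit message mentions.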

Jun 01, 2009
- Fix new CodeEmitter stuff to follow LLVM coding style. Patch by Aaron Gray · 9fd794be
  Bruno Cardoso Lopes authored Jun 01, 2009
```
llvm-svn: 72697
```
  9fd794be
May 31, 2009
- Fix a grammaro and clarify a comment. · c1c2c689
  Dan Gohman authored May 31, 2009
```
llvm-svn: 72668
```
  c1c2c689
May 30, 2009
- First patch in the direction of splitting MachineCodeEmitter in two subclasses: · a194c3a6
  Bruno Cardoso Lopes authored May 30, 2009
```
JITCodeEmitter and ObjectCodeEmitter. No functional changes yet. Patch by Aaron Gray

llvm-svn: 72631
```
  a194c3a6
- (i64 (zext (srl GR32 8))) -> movzbl AH is not safe since srl 8 only clears the top 8 bits. · 7142ad75
  Evan Cheng authored May 30, 2009
```
llvm-svn: 72618
```
  7142ad75
- Untabification. · 09f17a84
  Bill Wendling authored May 30, 2009
```
llvm-svn: 72604
```
  09f17a84
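The reasoning behind 7142ad75 above can be checked with a small Python model. It is only a sketch: the two helper functions are hypothetical stand-ins for the DAG pattern and for the instruction, not code from the patch.

```python
MASK32 = 0xFFFFFFFF

def zext_srl8(x):
    """(i64 (zext (srl GR32 8))): logical shift right by 8, then zero-extend."""
    return (x & MASK32) >> 8

def movzbl_ah(x):
    """What movzbl of AH yields: bits 8..15 of the register, zero-extended."""
    return (x >> 8) & 0xFF

small = 0x0000AB12   # only bits 0..15 set: both forms agree
large = 0x12345678   # bits above 15 set: the forms differ
agree_small = zext_srl8(small) == movzbl_ah(small)
agree_large = zext_srl8(large) == movzbl_ah(large)
```

The shift leaves 24 significant bits, while AH holds only 8, so the rewrite is valid only when bits 16..31 are known to be zero.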
May 29, 2009

More h-registers tricks: folding zext nodes. · 716e688f
Evan Cheng authored May 29, 2009
```
llvm-svn: 72558
```
716e688f

The MONITOR and MWAIT instructions have insufficient information for · 2e09bd3d

Bill Wendling authored May 28, 2009

decoding. Essentially, they both map to the same column in the "opcode
extensions for one- and two-byte opcodes" table in the x86 manual. The RawFrm
complicates decoding this.

Instead, use opcode 0x01, prefix 0x01, and form MRM1r. Then have the code
emitter special case these, a la [SML]FENCE.

llvm-svn: 72556

2e09bd3d
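To make the MRM1r form in 2e09bd3d concrete, here is a minimal Python sketch of how a ModRM byte is assembled. It assumes the standard x86 ModRM layout (mod in bits 7..6, reg in bits 5..3, rm in bits 2..0). The rm values 0 and 1 for MONITOR and MWAIT are taken from the published encodings 0F 01 C8 and 0F 01 C9, not from the commit message, and the helper names are hypothetical.

```python
def modrm(mod, reg, rm):
    """Pack the three ModRM fields into one byte."""
    if not (0 <= mod < 4 and 0 <= reg < 8 and 0 <= rm < 8):
        raise ValueError("ModRM field out of range")
    return (mod << 6) | (reg << 3) | rm

def encode_0f01_mrm1r(rm):
    """Two-byte opcode 0F 01 followed by a register-form ModRM with reg = 1."""
    return bytes([0x0F, 0x01, modrm(0b11, 1, rm)])

monitor = encode_0f01_mrm1r(0)
mwait = encode_0f01_mrm1r(1)
```

Both instructions share the opcode and the reg field and differ only in rm, which is why a decoder needs the full ModRM byte to tell them apart.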

May 28, 2009

Fix MOVMSKPDrr encoding. · cc3ae1f2
Evan Cheng authored May 28, 2009
```
llvm-svn: 72535
```
cc3ae1f2
Fix PSIGND encoding bug. Patch by Sean Callanan. · 60618fe4
Evan Cheng authored May 28, 2009
```
llvm-svn: 72534
```
60618fe4

"The instructions MMX_PSADBWrm and MMX_PSADBWrr have opcode 0b11100000 (e0), but · 0feb0e60

Bill Wendling authored May 28, 2009

the Intel manual (screenshot) says it should be 0b11110110 (f6).  The existing
encoding causes a disassembly conflict with MMX_PAVGBrm, which really should be
0f e0."

Patch by Sean Callanan!

llvm-svn: 72508

0feb0e60

Added optimization that narrow load / op / store and the 'op' is a bit... · a9cda8ab

Evan Cheng authored May 28, 2009

Added an optimization that narrows a load / op / store sequence when the 'op' is a bit twiddling instruction and its second operand is an immediate. If the bits touched by 'op' can be handled with a narrower instruction, the width of the load and store is reduced as well. This happens a lot with bitfield manipulation code.
e.g.
orl     $65536, 8(%rax)
=>
orb     $1, 10(%rax)

Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization.

llvm-svn: 72507

a9cda8ab
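The arithmetic behind the `orl $65536, 8(%rax)` => `orb $1, 10(%rax)` example in a9cda8ab can be sketched as follows. This is a simplified, hypothetical model: it only narrows an OR to a single byte, assumes little-endian memory, and leaves out the target query the commit message describes.

```python
def narrow_or_immediate(offset, imm, width_bytes=4):
    """Return (new_offset, new_imm) for a byte-wide OR, or None if not narrowable.

    Little-endian assumption: byte i of the value lives at offset + i.
    """
    touched = [i for i in range(width_bytes) if (imm >> (8 * i)) & 0xFF]
    if len(touched) != 1:
        return None  # zero or several bytes are affected; keep the wide op
    i = touched[0]
    return offset + i, (imm >> (8 * i)) & 0xFF

def apply_or(mem, offset, imm, width_bytes):
    """Model a load / or / store of the given width on a bytearray."""
    value = int.from_bytes(mem[offset:offset + width_bytes], "little") | imm
    mem[offset:offset + width_bytes] = value.to_bytes(width_bytes, "little")

narrowed = narrow_or_immediate(8, 65536)

# Check that the wide and narrow forms leave memory in the same state.
wide_mem, narrow_mem = bytearray(16), bytearray(16)
apply_or(wide_mem, 8, 65536, 4)
apply_or(narrow_mem, narrowed[0], narrowed[1], 1)
```

65536 is 0x10000, so only byte 2 of the 32-bit value is touched, which gives offset 8 + 2 = 10 and immediate 1.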

May 27, 2009
- Get rid of some dead code. · a56159b7
  Eli Friedman authored May 27, 2009
```
llvm-svn: 72494
```
  a56159b7
- Fix sfence jit encoding. Patch by Sean Callanan. · 4db1631a
  Evan Cheng authored May 27, 2009
```
llvm-svn: 72488
```
  4db1631a
- Don't abuse the quirky behavior of LegalizeDAG for XINT_TO_FP and · acb851a8
  Eli Friedman authored May 27, 2009
```
FP_TO_XINT.  Necessary for some cleanups I'm working on.  Updated 
from the previous version (r72431) to fix a bug and make some things a 
bit clearer.

llvm-svn: 72445
```
  acb851a8
May 26, 2009

Back out r72431, it is causing a number of compilation crashes with clang. · d96b1178
Daniel Dunbar authored May 26, 2009
```
llvm-svn: 72436
```
d96b1178

Update CPU capabilities for AMD machines · 96180b53

Stefanus Du Toit authored May 26, 2009

- added processors k8-sse3, opteron-sse3, athlon64-sse3, amdfam10, and
  barcelona with appropriate sse3/4a levels
- added FeatureSSE4A for amdfam10 processors

in X86Subtarget:
- added hasSSE4A
- updated AutoDetectSubtargetFeatures to detect SSE4A
- updated GetCurrentX86CPU to detect family 15 with sse3 as k8-sse3 and
  family 10h as amdfam10

New processor names match those used by gcc.

Patch by Paul Redmond!

llvm-svn: 72434

96180b53
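For readers unfamiliar with how "family 15" and "family 10h" in 96180b53 are derived, the sketch below decodes the family and model from the EAX value returned by CPUID leaf 1, using the AMD rule that the extended fields apply when the base family is 0xF. It is a simplified illustration and not the code in GetCurrentX86CPU: the function names, the sample EAX values, and the "k8" and "generic" fallbacks are assumptions made for the example.

```python
def decode_family_model(eax):
    """Decode (family, model) from CPUID leaf 1 EAX, following the AMD rule."""
    base_family = (eax >> 8) & 0xF
    base_model = (eax >> 4) & 0xF
    ext_family = (eax >> 20) & 0xFF
    ext_model = (eax >> 16) & 0xF
    if base_family == 0xF:
        return base_family + ext_family, (ext_model << 4) | base_model
    return base_family, base_model

def amd_cpu_name(eax, has_sse3):
    """Map a decoded family to a processor name (simplified, hypothetical)."""
    family, _ = decode_family_model(eax)
    if family == 0x10:
        return "amdfam10"
    if family == 0xF:
        return "k8-sse3" if has_sse3 else "k8"
    return "generic"

# Illustrative EAX values, not taken from the commit.
fam10_name = amd_cpu_name(0x00100F22, has_sse3=True)
k8_name = amd_cpu_name(0x00000F48, has_sse3=True)
```

Family 10h is the base family 0xF plus an extended family of 1, which is why the extended field has to be read before the two AMD generations can be told apart.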

Don't abuse the quirky behavior of LegalizeDAG for XINT_TO_FP and · 8c7bff96
Eli Friedman authored May 26, 2009
```
FP_TO_XINT.  Necessary for some cleanups I'm working on. 

llvm-svn: 72431
```
8c7bff96