Commits · b41d838d28a64369f8a804bfd2bc0899115e60cc · Roger Ferrer / llvm-epi-0.8

Dec 07, 2007
- Add comment. · b41d838d
  Evan Cheng authored Dec 07, 2007
```
llvm-svn: 44686
```
  b41d838d
- Much improved v8i16 shuffles. (Step 1). · bfd373a5
  Evan Cheng authored Dec 07, 2007
```
llvm-svn: 44676
```
  bfd373a5
Dec 06, 2007
- Remove a bogus optimization. It's not possible to do a move to low element to... · c829e5cd
  Evan Cheng authored Dec 06, 2007
```
Remove a bogus optimization. It's not possible to do a move to low element to a <8 x i16> or <16 x i8> vector.

llvm-svn: 44669
```
  c829e5cd
Dec 05, 2007
- add a note · ad05e174
  Chris Lattner authored Dec 05, 2007
```
llvm-svn: 44637
```
  ad05e174
- Add a argument to storeRegToStackSlot and storeRegToAddr to specify whether · bb263018
  Evan Cheng authored Dec 05, 2007
```
the stored register is killed.

llvm-svn: 44600
```
  bb263018
Dec 02, 2007
- Remove redundant foldMemoryOperand variants and other code clean up. · f45a1d62
  Evan Cheng authored Dec 02, 2007
```
llvm-svn: 44517
```
  f45a1d62
Dec 01, 2007
- Allow some reloads to be folded in multi-use cases. Specifically testl r, r -> cmpl [mem], 0. · 69fda0a7
  Evan Cheng authored Dec 01, 2007
```
llvm-svn: 44479
```
  69fda0a7
Nov 27, 2007

Support returning non-power-of-2 vectors to unblock some work · 6f026a65
Nate Begeman authored Nov 27, 2007
```
llvm-svn: 44371
```
6f026a65

Fix PR1146: parameter attributes are longer part of · ad0ea2d4

Duncan Sands authored Nov 27, 2007

the function type, instead they belong to functions
and function calls.  This is an updated and slightly
corrected version of Reid Spencer's original patch.
The only known problem is that auto-upgrading of
bitcode files doesn't seem to work properly (see
test/Bitcode/AutoUpgradeIntrinsics.ll).  Hopefully
a bitcode guru (who might that be? :) ) will fix it.

llvm-svn: 44359

ad0ea2d4

Nov 25, 2007

Fix a long standing deficiency in the X86 backend: we would · 5728bdd4

Chris Lattner authored Nov 25, 2007

sometimes emit "zero" and "all one" vectors multiple times,
for example:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M2
	ret

instead of:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	movq	%mm0, _M2
	ret

This patch fixes this by always arranging for zero/one vectors
to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be
any random type.  This ensures they get trivially CSE'd on the dag.
This fix is also important for LegalizeDAGTypes, as it gets unhappy
when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when
'i64' isn't legal.

This patch makes the following changes:

1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into
   their canonical types.
2) The now-dead patterns are removed from the SSE/MMX .td files.
3) All the patterns in the .td file that referred to immAllOnesV or
   immAllZerosV in the wrong form now use *_bc to match them with a
   bitcast wrapped around them.
4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle 
   bitcast'd zero vectors, which simplifies the code actually.
5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that
   is legal, instead of generating one that is illegal and expecting
   a later legalize pass to clean it up.
6) isZeroShuffle is generalized to handle bitcast of zeros.
7) several other minor tweaks.

This patch is definite goodness, but has the potential to cause random
code quality regressions.  Please be on the lookout for these and let 
me know if they happen.

llvm-svn: 44310

5728bdd4

Nov 24, 2007

remove bogus assertion that broke CodeGen/Generic/cast-fp.ll on x86 · f72ad162
Chris Lattner authored Nov 24, 2007
```
among others.

llvm-svn: 44302
```
f72ad162

Several changes: · f81d5886

Chris Lattner authored Nov 24, 2007

1) Change the interface to TargetLowering::ExpandOperationResult to 
   take and return entire NODES that need a result expanded, not just
   the value.  This allows us to handle things like READCYCLECOUNTER,
   which returns two values.
2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES.
3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new
   ExpandOperationResult.  This makes the result simpler and fully 
   general.
4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes.
5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM
   i64 shifts, allowing them to work with LegalizeDAGTypes.
6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT,
   allowing them to work with LegalizeDAGTypes.

LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when
type legalization in LegalizeDAG is ifdef'd out.

llvm-svn: 44300

f81d5886

add a note · ab98c413
Chris Lattner authored Nov 24, 2007
```
llvm-svn: 44299
```
ab98c413

Nov 21, 2007
- Fix .eh table linkage issues on Darwin. Some EH support · 763e110a
  Dale Johannesen authored Nov 20, 2007
```
for Darwin PPC, but it's not fully working yet.

llvm-svn: 44258
```
  763e110a
Nov 17, 2007
- Add support for vectors to int <-> float casts. · d4d45c26
  Nate Begeman authored Nov 17, 2007
```
llvm-svn: 44204
```
  d4d45c26
Nov 16, 2007
- Implement codegen for flt_rounds on x86 · 91460e43
  Anton Korobeynikov authored Nov 16, 2007
```
llvm-svn: 44183
```
  91460e43
Nov 14, 2007

Oops. Debugging code shouldn't have been checked in. · 0cbe920d
Evan Cheng authored Nov 14, 2007
```
llvm-svn: 44128
```
0cbe920d
Fix PIC jump table codegen on x86-32/linux. In fact, such thing should be applied · 2c638780
Anton Korobeynikov authored Nov 14, 2007
```
to all targets uses GOT-relative offsets for PIC (Alpha?)

llvm-svn: 44108
```
2c638780

Eliminate the recently introduced CCAssignToStackABISizeAlign · e2287ed5

Duncan Sands authored Nov 14, 2007

in favour of teaching CCAssignToStack that size 0 and/or align
0 means to use the ABI values.  This seems a neater solution.
It is safe since no legal value type has size 0.

llvm-svn: 44107

e2287ed5

Clean up sub-register implementation by moving subReg information back to · 7f02cfa5

Evan Cheng authored Nov 14, 2007

MachineOperand auxInfo. Previous clunky implementation uses an external map
to track sub-register uses. That works because register allocator uses
a new virtual register for each spilled use. With interval splitting (coming
soon), we may have multiple uses of the same register some of which are
of using different sub-registers from others. It's too fragile to constantly
update the information.

llvm-svn: 44104

7f02cfa5

Nov 13, 2007
- Revert previous; these files aren't ready to go in yet. · 79047083
  Dale Johannesen authored Nov 13, 2007
```
llvm-svn: 44057
```
  79047083
- Add parameter to getDwarfRegNum to permit targets · 7a7085f6
  Dale Johannesen authored Nov 13, 2007
```
to use different mappings for EH and debug info;
no functional change yet.
Fix warning in X86CodeEmitter.

llvm-svn: 44056
```
  7a7085f6
- Fix x86-64 jit: remove reliance on Dwarf numbers. · c891ae92
  Evan Cheng authored Nov 13, 2007
```
llvm-svn: 44048
```
  c891ae92
- Unifacalize the CALLSEQ{START,END} stuff. · 77b13af9
  Bill Wendling authored Nov 13, 2007
```
llvm-svn: 44045
```
  77b13af9
- Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack · f359fed9
  Bill Wendling authored Nov 13, 2007
```
adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in
the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If
not, then there is the potential for the stack to be changed while the stack's
being used by another instruction (like a call).

This can only result in tears...

llvm-svn: 44037
```
  f359fed9
Nov 12, 2007

Add a flag for indirect branch instructions. · 933b5b7e

Owen Anderson authored Nov 12, 2007

Target maintainers: please check that the instructions for your target are correctly marked.

llvm-svn: 44012

933b5b7e

Nov 11, 2007

Use TableGen to emit information for dwarf register numbers. · 4edfea43

Anton Korobeynikov authored Nov 11, 2007

This makes DwarfRegNum to accept list of numbers instead.
Added three different "flavours", but only slightly tested on x86-32/linux.
Please check another subtargets if possible,

llvm-svn: 43997

4edfea43

Nov 10, 2007
- Add CCAssignToStackABISizeAlign for convenience in · b988e7e8
  Dale Johannesen authored Nov 10, 2007
```
dealing with types whose size & alignment are
different on different subtargets.  Use it for x86 f80.

llvm-svn: 43988
```
  b988e7e8
- Update tailcall code to include inline attribute operand for memcpy. · d2c16ff9
  Arnold Schwaighofer authored Nov 10, 2007
```
llvm-svn: 43978
```
  d2c16ff9
Nov 09, 2007

Unbreak x86-64 jumptable. · fb13fd6f
Evan Cheng authored Nov 09, 2007
```
llvm-svn: 43955
```
fb13fd6f
Revert previous rewrite per chris's comments. · dfb85c78
Dale Johannesen authored Nov 09, 2007
```
llvm-svn: 43950
```
dfb85c78

Much improved pic jumptable codegen: · 797d56ff

Evan Cheng authored Nov 09, 2007

Then:
        call    "L1$pb"
"L1$pb":
        popl    %eax
		...
LBB1_1: # entry
        imull   $4, %ecx, %ecx
        leal    LJTI1_0-"L1$pb"(%eax), %edx
        addl    LJTI1_0-"L1$pb"(%ecx,%eax), %edx
        jmpl    *%edx

        .align  2
        .set L1_0_set_3,LBB1_3-LJTI1_0
        .set L1_0_set_2,LBB1_2-LJTI1_0
        .set L1_0_set_5,LBB1_5-LJTI1_0
        .set L1_0_set_4,LBB1_4-LJTI1_0
LJTI1_0:
        .long    L1_0_set_3
        .long    L1_0_set_2

Now:
        call    "L1$pb"
"L1$pb":
        popl    %eax
		...
LBB1_1: # entry
        addl    LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax
        jmpl    *%eax

		.align  2
		.set L1_0_set_3,LBB1_3-"L1$pb"
		.set L1_0_set_2,LBB1_2-"L1$pb"
		.set L1_0_set_5,LBB1_5-"L1$pb"
		.set L1_0_set_4,LBB1_4-"L1$pb"
LJTI1_0:
        .long    L1_0_set_3
        .long    L1_0_set_2

llvm-svn: 43924

797d56ff

Rewrite Dwarf number handling per review comments. · 04fd8208
Dale Johannesen authored Nov 09, 2007
```
llvm-svn: 43918
```
04fd8208

Nov 07, 2007
- Complete conditionalization of Dwarf reg numbers. · 1b9de4dd
  Dale Johannesen authored Nov 07, 2007
```
Would somebody not on Darwin please make sure this
doesn't break anything.  Exception handling failures
would be the most likely symptom.

llvm-svn: 43844
```
  1b9de4dd
- Interchange Dwarf numbers of ESP and EBP on x86 Darwin. · fbe69d2c
  Dale Johannesen authored Nov 07, 2007
```
Much improvement in exception handling.

llvm-svn: 43794
```
  fbe69d2c
Nov 06, 2007
- Move the LowerMEMCPY and LowerMEMCPYCall to a common place. · fa0df55b
  Rafael Espindola authored Nov 05, 2007
```
Thanks for the suggestions Bill :-)

llvm-svn: 43742
```
  fa0df55b
Nov 05, 2007

Use movups to spill / restore SSE registers on targets where stacks alignment is · 9337929a
Evan Cheng authored Nov 05, 2007
```
less than 16. This is a temporary solution until dynamic stack alignment is
implemented.

llvm-svn: 43703
```
9337929a

Eliminate the remaining uses of getTypeSize. This · 283207a7

Duncan Sands authored Nov 05, 2007

should only effect x86 when using long double.  Now
12/16 bytes are output for long double globals (the
exact amount depends on the alignment).  This brings
globals in line with the rest of LLVM: the space
reserved for an object is now always the ABI size.
One tricky point is that only 10 bytes should be
output for long double if it is a field in a packed
struct, which is the reason for the additional
argument to EmitGlobalConstant.

llvm-svn: 43688

283207a7

Nov 04, 2007
- Fix PR1761 by not printing (rip) suffix when in -static mode. · 9329e780
  Chris Lattner authored Nov 04, 2007
```
Evan, please review this.

llvm-svn: 43680
```
  9329e780
- Fix PR1763 by allowing the 'q' constraint to work with 64-bit · 296160d4
  Chris Lattner authored Nov 04, 2007
```
regs on x86-64.

llvm-svn: 43669
```
  296160d4