Commits · afa12db8a631e9b7a58c1baa8ce5fd2c711971ad · Roger Ferrer / llvm-epi-0.8

Sep 02, 2009
- Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. · 68c5f477
  Sandeep Patel authored Sep 02, 2009
```
llvm-svn: 80773
```
  68c5f477
- refactor select 'sched insertion' out to its own method. · d5f4fcce
  Chris Lattner authored Sep 02, 2009
```
llvm-svn: 80764
```
  d5f4fcce
Aug 31, 2009
- Revert commit 80428. It completely broke exception · 9cf8bcb6
  Duncan Sands authored Aug 31, 2009
```
handling on x86-32 linux.

llvm-svn: 80592
```
  9cf8bcb6
Aug 29, 2009

- Add target lowering methods to get the preferred format for the FDE and LSDA · 39bb29f7

Bill Wendling authored Aug 29, 2009

  encodings.
- Make some of the values emitted by the FDEs dependent upon the pointer
  size. This is in line with how GCC does things. And it has the benefit of
  working for Darwin in 64-bit mode now.

llvm-svn: 80428

39bb29f7

Aug 19, 2009

Implement sse4.2 string/text processing instructions: · 9fe912de

Eric Christopher authored Aug 18, 2009

Add patterns and instruction encoding information.
Add custom lowering to deal with hardwired return register of
uncertain type (xmm0).

llvm-svn: 79377

9fe912de

Aug 15, 2009
- Reapply r79127. It was fixed by d0k. · bae6b2cc
  Bill Wendling authored Aug 15, 2009
```
llvm-svn: 79136
```
  bae6b2cc
- Revert r79127. It was causing compilation errors. · d3fade65
  Bill Wendling authored Aug 15, 2009
```
llvm-svn: 79135
```
  d3fade65
- Change allowsUnalignedMemoryAccesses to take type argument since some targets · 52d4e647
  Evan Cheng authored Aug 15, 2009
```
support unaligned mem access only for certain types. (Should it be size
instead?)

ARM v7 supports unaligned access for i16 and i32, some v6 variants support it
as well.

llvm-svn: 79127
```
  52d4e647
- On x86-64, for a varargs function, don't store the xmm registers to · 0700a568
  Dan Gohman authored Aug 15, 2009
```
the register save area if %al is 0. This avoids touching xmm
regsiters when they aren't actually used.

llvm-svn: 79061
```
  0700a568
Aug 11, 2009
- Split EVT into MVT and EVT, the former representing _just_ a primitive type, while · 9f94459d
  Owen Anderson authored Aug 11, 2009
```
the latter is capable of representing either a primitive or an extended type.

llvm-svn: 78713
```
  9f94459d
- Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. · 53aa7a96
  Owen Anderson authored Aug 10, 2009
```
llvm-svn: 78610
```
  53aa7a96
Aug 10, 2009

Start moving TargetLowering away from using full MVTs and towards... · c30530d1

Owen Anderson authored Aug 10, 2009

Start moving TargetLowering away from using full MVTs and towards SimpleValueType, which will simplify the privatization of IntegerType in the future.

llvm-svn: 78584

c30530d1

Aug 06, 2009
- Better handle kernel code model. Also, generalize the things and fix one · 741ea0d7
  Anton Korobeynikov authored Aug 05, 2009
```
subtle bug with small code model.

llvm-svn: 78255
```
  741ea0d7
Aug 05, 2009

Major calling convention code refactoring. · f9bbcd1a

Dan Gohman authored Aug 05, 2009

Instead of awkwardly encoding calling-convention information with ISD::CALL,
ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering
provides three virtual functions for targets to override:
LowerFormalArguments, LowerCall, and LowerRet, which replace the custom
lowering done on the special nodes. They provide the same information, but
in a more immediately usable format.

This also reworks much of the target-independent tail call logic. The
decision of whether or not to perform a tail call is now cleanly split
between target-independent portions, and the target dependent portion
in IsEligibleForTailCallOptimization.

This also synchronizes all in-tree targets, to help enable future
refactoring and feature work.

llvm-svn: 78142

f9bbcd1a

Aug 01, 2009
- Fix typos in comments. · 9139b02c
  Dan Gohman authored Aug 01, 2009
```
llvm-svn: 77806
```
  9139b02c
Jul 30, 2009

Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch()... · e62288fd

Evan Cheng authored Jul 30, 2009

Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch.

When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix.

This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection.

Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix.

llvm-svn: 77582

e62288fd

Jul 29, 2009
- Add support for gcc __builtin_ia32_ptest{z,c,nzc} intrinsics. Lower · f7802a33
  Eric Christopher authored Jul 29, 2009
```
to ptest instruction plus setcc. Revamp ptest instruction. Add test.

llvm-svn: 77407
```
  f7802a33
Jul 20, 2009
- Copy ExpandInlineAsm to TargetLowering from TargetAsmInfo. · 5849d22b
  Chris Lattner authored Jul 20, 2009
```
llvm-svn: 76441
```
  5849d22b
Jul 09, 2009
- change a few methods to be static functions. · 88765d48
  Chris Lattner authored Jul 09, 2009
```
llvm-svn: 75089
```
  88765d48
Jul 01, 2009
- Update comments to make it clear that the function alignment is the Log2 of the · 512ff735
  Bill Wendling authored Jul 01, 2009
```
bytes and not bytes.

llvm-svn: 74624
```
  512ff735
- Add an "alignment" field to the MachineFunction object. It makes more sense to · 31ceb1bc
  Bill Wendling authored Jun 30, 2009
```
have the alignment be calculated up front, and have the back-ends obey whatever
alignment is decided upon.

This allows for future work that would allow for precise no-op placement and the
like.

llvm-svn: 74564
```
  31ceb1bc
Jun 05, 2009

Add new function attribute - noimplicitfloat · d1c7d349

Devang Patel authored Jun 05, 2009

Update code generator to use this attribute and remove NoImplicitFloat target option.
Update llc to set this attribute when -no-implicit-float command line option is used.

llvm-svn: 72959

d1c7d349

Jun 02, 2009

Revert 72707 and 72709, for the moment. · 5234d379
Dale Johannesen authored Jun 02, 2009
```
llvm-svn: 72712
```
5234d379

Make the implicit inputs and outputs of target-independent · 0b8ca792

Dale Johannesen authored Jun 01, 2009

ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to)
instead of MVT::Flag.  Remove CARRY_FALSE in favor of 0; adjust
all target-independent code to use this format.

Most targets will still produce a Flag-setting target-dependent
version when selection is done.  X86 is converted to use i32
instead, which means TableGen needs to produce different code
in xxxGenDAGISel.inc.  This keys off the new supportsHasI1 bit
in xxxInstrInfo, currently set only for X86; in principle this
is temporary and should go away when all other targets have
been converted.  All relevant X86 instruction patterns are
modified to represent setting and using EFLAGS explicitly.  The
same can be done on other targets.

The immediate behavior change is that an ADC/ADD pair are no
longer tightly coupled in the X86 scheduler; they can be
separated by instructions that don't clobber the flags (MOV).
I will soon add some peephole optimizations based on using
other instructions that set the flags to feed into ADC.

llvm-svn: 72707

0b8ca792

May 28, 2009

Added optimization that narrow load / op / store and the 'op' is a bit... · a9cda8ab

Evan Cheng authored May 28, 2009

Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code.
e.g.
orl     $65536, 8(%rax)
=>
orb     $1, 10(%rax)

Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization.

llvm-svn: 72507

a9cda8ab

May 23, 2009

Make the x86 backend custom-lower UINT_TO_FP and FP_TO_UINT on 32-bit · dfe4f253

Eli Friedman authored May 23, 2009

systems instead of attempting to promote them to a 64-bit SINT_TO_FP or 
FP_TO_SINT.  This is in preparation for removing the type legalization 
code from LegalizeDAG: once type legalization is gone from LegalizeDAG, 
it won't be able to handle the i64 operand/result correctly.

This isn't quite ideal, but I don't think any other operation for any 
target ends up in this situation, so treating this case specially seems 
reasonable.

llvm-svn: 72324

dfe4f253

Apr 29, 2009
- Implement review feedback for vector shuffle work. · 5f829d89
  Nate Begeman authored Apr 29, 2009
```
llvm-svn: 70372
```
  5f829d89
Apr 27, 2009

2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. · 8d6d4b92

Nate Begeman authored Apr 27, 2009

PR2957

ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle
mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes
as the shuffle mask.  A value of -1 represents UNDEF.

In addition to eliminating the creation of illegal BUILD_VECTORS just to 
represent shuffle masks, we are better about canonicalizing the shuffle mask,
resulting in substantially better code for some classes of shuffles.

llvm-svn: 70225

8d6d4b92

Apr 24, 2009

Revert 69952. Causes testsuite failures on linux x86-64. · b93db668
Rafael Espindola authored Apr 24, 2009
```
llvm-svn: 69967
```
b93db668

PR2957 · bb881d66

Nate Begeman authored Apr 24, 2009

ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle
mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes
as the shuffle mask. A value of -1 represents UNDEF.

In addition to eliminating the creation of illegal BUILD_VECTORS just to
represent shuffle masks, we are better about canonicalizing the shuffle mask,
resulting in substantially better code for some classes of shuffles.

A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next.

llvm-svn: 69952

bb881d66

Apr 08, 2009

Re-apply 68552. · 3b2df10c

Rafael Espindola authored Apr 08, 2009

Tested by bootstrapping llvm-gcc and using that to build llvm.

llvm-svn: 68645

3b2df10c

Implement support for using modeling implicit-zero-extension on x86-64 · ad3e549a

Dan Gohman authored Apr 08, 2009

with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce
SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG
instructions), and teach the DAGCombiner to take advantage of this on
targets which support it. This eliminates many redundant
zero-extension operations on x86-64.

This adds a new TargetLowering hook, isZExtFree. It's similar to
isTruncateFree, except it only applies to actual definitions, and not
no-op truncates which may not zero the high bits.

Also, this adds a new optimization to SimplifyDemandedBits: transform
operations like x+y into (zext (add (trunc x), (trunc y))) on targets
where all the casts are no-ops. In contexts where the high part of the
add is explicitly masked off, this allows the mask operation to be
eliminated. Fix the DAGCombiner to avoid undoing these transformations
to eliminate casts on targets where the casts are no-ops.

Also, this adds a new two-address lowering heuristic. Since
two-address lowering runs before coalescing, it helps to be able to
look through copies when deciding whether commuting and/or
three-address conversion are profitable.

Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle
the case that a clobber range extended both before and beyond an
existing live range. In that case, multiple live ranges need to be
added. This was exposed by the new subreg coalescing code.

Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the
spiller behavior it was looking for no longer occurrs with the new
instruction selection.

llvm-svn: 68576

ad3e549a

Temporarily revert r68552. This was causing a failure in the self-hosting LLVM · 4aa25b79

Bill Wendling authored Apr 07, 2009

builds.

--- Reverse-merging (from foreign repository) r68552 into '.':
U    test/CodeGen/X86/tls8.ll
U    test/CodeGen/X86/tls10.ll
U    test/CodeGen/X86/tls2.ll
U    test/CodeGen/X86/tls6.ll
U    lib/Target/X86/X86Instr64bit.td
U    lib/Target/X86/X86InstrSSE.td
U    lib/Target/X86/X86InstrInfo.td
U    lib/Target/X86/X86RegisterInfo.cpp
U    lib/Target/X86/X86ISelLowering.cpp
U    lib/Target/X86/X86CodeEmitter.cpp
U    lib/Target/X86/X86FastISel.cpp
U    lib/Target/X86/X86InstrInfo.h
U    lib/Target/X86/X86ISelDAGToDAG.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp
U    lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h
U    lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h
U    lib/Target/X86/X86ISelLowering.h
U    lib/Target/X86/X86InstrInfo.cpp
U    lib/Target/X86/X86InstrBuilder.h
U    lib/Target/X86/X86RegisterInfo.td

llvm-svn: 68560

4aa25b79

Apr 07, 2009

Reduce code duplication on the TLS implementation. · 1edda067

Rafael Espindola authored Apr 07, 2009

This introduces a small regression on the generated code
quality in the case we are just computing addresses, not
loading values.

Will work on it and on X86-64 support.

llvm-svn: 68552

1edda067

Mar 30, 2009

When optimzing a mul by immediate into two, the resulting mul's should get a... · a84a3188

Evan Cheng authored Mar 30, 2009

When optimzing a mul by immediate into two, the resulting mul's should get a x86 specific node to avoid dag combiner from hacking on them further.

llvm-svn: 68066

a84a3188

Mar 26, 2009
- Doxygen-ify comments. · 189d6718
  Bill Wendling authored Mar 26, 2009
```
llvm-svn: 67727
```
  189d6718
Mar 23, 2009
- Correct some comments. Operand numbers start at 0. · 4a683478
  Dan Gohman authored Mar 23, 2009
```
llvm-svn: 67518
```
  4a683478
Mar 12, 2009
- improve comment. · a492d29c
  Chris Lattner authored Mar 12, 2009
```
llvm-svn: 66778
```
  a492d29c
Mar 07, 2009

Arithmetic instructions don't set EFLAGS bits OF and CF bits · ff659b5b

Dan Gohman authored Mar 07, 2009

the same say the "test" instruction does in overflow cases,
so eliminating the test is only safe when those bits aren't
needed, as is the case for COND_E and COND_NE, or if it
can be proven that no overflow will occur. For now, just
restrict the optimization to COND_E and COND_NE and don't
do any overflow analysis.

llvm-svn: 66318

ff659b5b

Mar 04, 2009
- Re-apply 66008, now that the unfoldMemoryOperand bug is fixed. · 55d7b2ac
  Dan Gohman authored Mar 04, 2009
```
llvm-svn: 66058
```
  55d7b2ac