Commits · db30612fc4c61a72b30fe4c3dee92ba436bc483b · Roger Ferrer / llvm-epi-0.8

Oct 25, 2008
- Generate code for TLS instructions. · db30612f
  Nicolas Geoffray authored Oct 25, 2008
```
llvm-svn: 58141
```
  db30612f
- CMake: lib/Target/ARM/AsmPrinter/CMakeLists.txt added. · 9ba4650b
  Oscar Fuentes authored Oct 25, 2008
```
llvm-svn: 58133
```
  9ba4650b
- Mark MFCR as reading all condition code registers. · 71f361e7
  Dale Johannesen authored Oct 24, 2008
```
Prevents some more overzealous deletions (mostly
in AltiVec code).

llvm-svn: 58121
```
  71f361e7
Oct 24, 2008

Rewrite logic to figure out whether LR needs to · 3863f8e7

Dale Johannesen authored Oct 24, 2008

be saved/restored in the prolog/epilog.  We need
to do this iff something in the function stores
into it.

llvm-svn: 58116

3863f8e7

move the note to the correct README · 33986d8f
Torok Edwin authored Oct 24, 2008
```
llvm-svn: 58104
```
33986d8f
add note about va_arg code on x86 and x86-64 · fcaae546
Torok Edwin authored Oct 24, 2008
```
llvm-svn: 58103
```
fcaae546

Fix translateX86CC: if SetCCOpcode is SETULE and · 014f5bba

Duncan Sands authored Oct 24, 2008

LHS is a foldable load, then LHS and RHS are swapped
and SetCCOpcode is changed to SETUGT.  But the later
code is expecting operands to be the wrong way round
for SETUGT, but they are not in this case, resulting
in an inverted compare.  The solution is to move the
load normalization before the correction for SETUGT.
This bug was tickled by LegalizeTypes which happened
to legalize the testcase slightly differently to
LegalizeDAG.

llvm-svn: 58092

014f5bba

Fix constant-offset emission for x86-64 absolute addresses. This · 712886f5
Dan Gohman authored Oct 24, 2008
```
fixes a bunch of test-suite JIT failures on x86-64 in
-relocation-model=static mode.

llvm-svn: 58066
```
712886f5

Oct 23, 2008
- Mark defs and uses of CTR and LR correctly. · e395d786
  Dale Johannesen authored Oct 23, 2008
```
Prevents DeadMachineInstructionElim from thinking
things like MTCTR are dead (fixes massive
testsuite breakage at -O0).

llvm-svn: 58043
```
  e395d786
- remove extraneous #ifdef's · 1ecf1fd5
  Jim Grosbach authored Oct 22, 2008
```
llvm-svn: 58006
```
  1ecf1fd5
Oct 22, 2008
- Remove allocation of unused stack slot. · f6655a9e
  Dale Johannesen authored Oct 22, 2008
```
llvm-svn: 57987
```
  f6655a9e
- Get this working with LegalizeTypes: (1) don't · 5ee1dde8
  Duncan Sands authored Oct 22, 2008
```
assume that i64 has been turned into a BUILD_PAIR
node (when called from LegalizeTypes this hasn't
happened yet) and don't use a vector shuffle mask
with an illegal element type.

llvm-svn: 57972
```
  5ee1dde8
- Fix PR2907 by digging through constant expressions to find FP constants that · 35b40f8c
  Chris Lattner authored Oct 22, 2008
```
are their operands.

llvm-svn: 57956
```
  35b40f8c
- CMake: Turned some libraries into partially linked objects. Corrected · f3c03b02
  Oscar Fuentes authored Oct 22, 2008
```
names of LLVMCore and ARMCodeGen.

llvm-svn: 57943
```
  f3c03b02
- Adjust comments for pedantic satisfaction. · cf4607fc
  Dale Johannesen authored Oct 22, 2008
```
llvm-svn: 57940
```
  cf4607fc
- Add comments to explain uint64->f64 algorithm, · 3d7ece1a
  Dale Johannesen authored Oct 21, 2008
```
well, sort of.  (Algorithm by Ian Ollmann.)

llvm-svn: 57932
```
  3d7ece1a
Oct 21, 2008

Add an SSE2 algorithm for uint64->f64 conversion. · 28929589

Dale Johannesen authored Oct 21, 2008

The same one Apple gcc uses, faster.  Also gets the
extreme case in gcc.c-torture/execute/ieee/rbug.c
correct which we weren't before; this is not
sufficient to get the test to pass though, there
is another bug.

llvm-svn: 57926

28929589

Implement the optimized FCMP_OEQ/FCMP_UNE code for x86 fast-isel. · 4ddf7a4c
Dan Gohman authored Oct 21, 2008
```
llvm-svn: 57915
```
4ddf7a4c
use pre-UAL mnemonics for push/pop for compilaton callback function · cfebc18d
Jim Grosbach authored Oct 21, 2008
```
llvm-svn: 57911
```
cfebc18d
Disable constant-offset folding for PowerPC, as the PowerPC target · c14e5227
Dan Gohman authored Oct 21, 2008
```
isn't yet prepared for it.

llvm-svn: 57886
```
c14e5227

Don't create TargetGlobalAddress nodes with offsets that don't fit · 269246b0

Dan Gohman authored Oct 21, 2008

in the 32-bit signed offset field of addresses. Even though this
may be intended, some linkers refuse to relocate code where the
relocated address computation overflows.

Also, fix the sign-extension of constant offsets to use the
actual pointer size, rather than the size of the GlobalAddress
node, which may be different, for example on x86-64 where MVT::i32
is used when the address is being fit into the 32-bit displacement
field.

llvm-svn: 57885

269246b0

Optimized FCMP_OEQ and FCMP_UNE for x86. · 97d95d6d

Dan Gohman authored Oct 21, 2008

Where previously LLVM might emit code like this:

        ucomisd %xmm1, %xmm0
        setne   %al
        setp    %cl
        orb     %al, %cl
        jne     .LBB4_2

it now emits this:

        ucomisd %xmm1, %xmm0
        jne     .LBB4_2
        jp      .LBB4_2

It has fewer instructions and uses fewer registers, but it does
have more branches. And in the case that this code is followed by
a non-fallthrough edge, it may be followed by a jmp instruction,
resulting in three branch instructions in sequence. Some effort
is made to avoid this situation.

To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and
FCMP_UNE in lowered form, and replace them with code that emits
two branches, except in the case where it would require converting
a fall-through edge to an explicit branch.

Also, X86InstrInfo.cpp's branch analysis and transform code now
knows now to handle blocks with multiple conditional branches. It
uses loops instead of having fixed checks for up to two
instructions. It can now analyze and transform code generated
from FCMP_OEQ and FCMP_UNE.

llvm-svn: 57873

97d95d6d

When the coalescer is doing rematerializing, have it remove · c835458d

Dan Gohman authored Oct 21, 2008

the copy instruction from the instruction list before asking the
target to create the new instruction. This gets the old instruction
out of the way so that it doesn't interfere with the target's
rematerialization code. In the case of x86, this helps it find
more cases where EFLAGS is not live.

Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check
to see if it reached the end of the block after scanning each
instruction, instead of just before. This lets it notice when the
end of the block is only two instructions away, without doing any
additional scanning.

These changes allow rematerialization to clobber EFLAGS in more
cases, for example using xor instead of mov to set the return value
to zero in the included testcase.

llvm-svn: 57872

c835458d

Oct 20, 2008

Update the stub and callback code to handle lazy compilation. The stub · 9396051e

Jim Grosbach authored Oct 20, 2008

is re-written by the callback to branch directly to the compiled code
in future invocations.

Added back in range-based memory permission functions for the updating of
the stub on Darwin.

llvm-svn: 57846

9396051e

Have X86 custom lowering for LegalizeTypes use · 1d20ab57

Duncan Sands authored Oct 20, 2008

LowerOperation if it doesn't know what else to do.
This methods should probably be factorized some,
but this is good enough for the moment.  Have
LowerATOMIC_BINARY_64 use EXTRACT_ELEMENT rather
than assuming the operand is a BUILD_PAIR (if it
is then getNode will automagically simplify the
EXTRACT_ELEMENT).  This way LowerATOMIC_BINARY_64
usable from LegalizeTypes.

llvm-svn: 57831

1d20ab57

Oct 18, 2008

Teach DAGCombine to fold constant offsets into GlobalAddress nodes, · 2fe6bee5

Dan Gohman authored Oct 18, 2008

and add a TargetLowering hook for it to use to determine when this
is legal (i.e. not in PIC mode, etc.)

This allows instruction selection to emit folded constant offsets
in more cases, such as the included testcase, eliminating the need
for explicit arithmetic instructions.

This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp
that attempted to achieve the same effect, but wasn't as effective.

Also, fix handling of offsets in GlobalAddressSDNodes in several
places, including changing GlobalAddressSDNode's offset from
int to int64_t.

The Mips, Alpha, Sparc, and CellSPU targets appear to be
unaware of GlobalAddress offsets currently, so set the hook to
false on those targets.

llvm-svn: 57748

2fe6bee5

Oct 17, 2008

This is now partly done. · 209fc264
Dan Gohman authored Oct 17, 2008
```
llvm-svn: 57734
```
209fc264
This is done. · b1d8d6ec
Dan Gohman authored Oct 17, 2008
```
llvm-svn: 57733
```
b1d8d6ec

Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is... · 0fcc89b5

Evan Cheng authored Oct 17, 2008

Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file.

llvm-svn: 57723

0fcc89b5

add support for 128 bit inputs on both x86-64 and x86-32. · 8e2ef196
Chris Lattner authored Oct 17, 2008
```
llvm-svn: 57709
```
8e2ef196

Fix a bug where the x86 backend would reject 64-bit r constraints when · c7e65f43

Chris Lattner authored Oct 17, 2008

in 32-bit mode instead of assigning a register pair.  This has nothing to
do with PR2356, but I happened to notice it while working on it.

llvm-svn: 57704

c7e65f43

Fix lfence and mfence encoding. These look like MRM5r and MRM6r instructions... · 27c37022

Evan Cheng authored Oct 17, 2008

Fix lfence and mfence encoding. These look like MRM5r and MRM6r instructions except they do not have any operands. The RegModRM byte is encoded with register number 0.

llvm-svn: 57692

27c37022

getX86RegNum has long been moved to X86RegisterInfo. · 9e23d746
Evan Cheng authored Oct 17, 2008
```
llvm-svn: 57691
```
9e23d746
add some simple hacky long double support for the CBE. This · 7e9e3b3d
Chris Lattner authored Oct 17, 2008
```
should work for intel long double, but ppc long double aborts
in convert.

llvm-svn: 57672
```
7e9e3b3d

Fun x86 encoding tricks: when adding an immediate value of 128, · ca0546fa

Dan Gohman authored Oct 17, 2008

use a SUB instruction instead of an ADD, because -128 can be
encoded in an 8-bit signed immediate field, while +128 can't be.
This avoids the need for a 32-bit immediate field in this case.

A similar optimization applies to 64-bit adds with 0x80000000,
with the 32-bit signed immediate field.

To support this, teach tablegen how to handle 64-bit constants.

llvm-svn: 57663

ca0546fa

Define patterns for shld and shrd that match immediate · a39b0a1f

Dan Gohman authored Oct 17, 2008

shift counts, and patterns that match dynamic shift counts
when the subtract is obscured by a truncate node.

Add DAGCombiner support for recognizing rotate patterns
when the shift counts are defined by truncate nodes.

Fix and simplify the code for commuting shld and shrd
instructions to work even when the given instruction doesn't
have a parent, and when the caller needs a new instruction.

These changes allow LLVM to use the shld, shrd, rol, and ror
instructions on x86 to replace equivalent code using two
shifts and an or in many more cases.

llvm-svn: 57662

a39b0a1f

Oct 16, 2008
- Trim #includes. · e33afda4
  Dan Gohman authored Oct 16, 2008
```
llvm-svn: 57649
```
  e33afda4
- fix typo noticed by sdt · ba88d86a
  Chris Lattner authored Oct 16, 2008
```
llvm-svn: 57644
```
  ba88d86a
- Fix warnings about mb/me being potentially used · dc845111
  Duncan Sands authored Oct 16, 2008
```
uninitialized in these functions with gcc-4.3.

llvm-svn: 57635
```
  dc845111
- add some notes · 305fb0a7
  Chris Lattner authored Oct 16, 2008
```
llvm-svn: 57631
```
  305fb0a7