Commits · 64c996271ef310e594ff1edc8b701c9ca5cb4c6e · Roger Ferrer / llvm-epi-0.8

Oct 22, 2008

Privatize PrintModulePass and PrintFunctionPass and add · 54d5b9ea

Daniel Dunbar authored Oct 21, 2008

createPrintModulePass and createPrintFunctionPass.
 - So clients who compile w/o RTTI can use them.

llvm-svn: 57933

54d5b9ea

Oct 21, 2008

Add an SSE2 algorithm for uint64->f64 conversion. · 28929589

Dale Johannesen authored Oct 21, 2008

The same one Apple gcc uses, faster.  Also gets the
extreme case in gcc.c-torture/execute/ieee/rbug.c
correct which we weren't before; this is not
sufficient to get the test to pass though, there
is another bug.

llvm-svn: 57926

28929589

Fix SelectionDAGBuild lowering of Select instructions to · 8b44b88e

Dan Gohman authored Oct 21, 2008

handle first-class aggregate values. Also, fix a bug in
the Ret handling for empty aggregates.

llvm-svn: 57925

8b44b88e

Don't create TargetGlobalAddress nodes with offsets that don't fit · 269246b0

Dan Gohman authored Oct 21, 2008

in the 32-bit signed offset field of addresses. Even though this
may be intended, some linkers refuse to relocate code where the
relocated address computation overflows.

Also, fix the sign-extension of constant offsets to use the
actual pointer size, rather than the size of the GlobalAddress
node, which may be different, for example on x86-64 where MVT::i32
is used when the address is being fit into the 32-bit displacement
field.

llvm-svn: 57885

269246b0

Optimized FCMP_OEQ and FCMP_UNE for x86. · 97d95d6d

Dan Gohman authored Oct 21, 2008

Where previously LLVM might emit code like this:

        ucomisd %xmm1, %xmm0
        setne   %al
        setp    %cl
        orb     %al, %cl
        jne     .LBB4_2

it now emits this:

        ucomisd %xmm1, %xmm0
        jne     .LBB4_2
        jp      .LBB4_2

It has fewer instructions and uses fewer registers, but it does
have more branches. And in the case that this code is followed by
a non-fallthrough edge, it may be followed by a jmp instruction,
resulting in three branch instructions in sequence. Some effort
is made to avoid this situation.

To achieve this, X86ISelLowering.cpp now recognizes FCMP_OEQ and
FCMP_UNE in lowered form, and replace them with code that emits
two branches, except in the case where it would require converting
a fall-through edge to an explicit branch.

Also, X86InstrInfo.cpp's branch analysis and transform code now
knows now to handle blocks with multiple conditional branches. It
uses loops instead of having fixed checks for up to two
instructions. It can now analyze and transform code generated
from FCMP_OEQ and FCMP_UNE.

llvm-svn: 57873

97d95d6d

When the coalescer is doing rematerializing, have it remove · c835458d

Dan Gohman authored Oct 21, 2008

the copy instruction from the instruction list before asking the
target to create the new instruction. This gets the old instruction
out of the way so that it doesn't interfere with the target's
rematerialization code. In the case of x86, this helps it find
more cases where EFLAGS is not live.

Also, in the X86InstrInfo.cpp, teach isSafeToClobberEFLAGS to check
to see if it reached the end of the block after scanning each
instruction, instead of just before. This lets it notice when the
end of the block is only two instructions away, without doing any
additional scanning.

These changes allow rematerialization to clobber EFLAGS in more
cases, for example using xor instead of mov to set the return value
to zero in the included testcase.

llvm-svn: 57872

c835458d

Make the NaN test come second, heuristically assuming · 97d3f6cf
Dan Gohman authored Oct 21, 2008
```
that NaNs are less common.

llvm-svn: 57871
```
97d3f6cf
CMake: updated lib/CodeGen/CMakeLists.txt · 0e12e5b1
Oscar Fuentes authored Oct 21, 2008
```
llvm-svn: 57869
```
0e12e5b1

Fix gcc.c-torture/compile/920520-1.c by inserting bitconverts · 4396e0d2

Chris Lattner authored Oct 21, 2008

for strange asm conditions earlier.  In this case, we have a
double being passed in an integer reg class.  Convert to like
sized integer register so that we allocate the right number 
for the class (two i32's for the f64 in this case).

llvm-svn: 57862

4396e0d2

Oct 20, 2008
- Add skeleton for the pre-register allocation live interval splitting pass. · 7e721ecd
  Evan Cheng authored Oct 20, 2008
```
llvm-svn: 57847
```
  7e721ecd
- Fast-isel no longer an experiment. · 1a59b3b9
  Dan Gohman authored Oct 20, 2008
```
llvm-svn: 57845
```
  1a59b3b9
- Add a register class -> virtual registers map. · bc623eda
  Evan Cheng authored Oct 20, 2008
```
llvm-svn: 57844
```
  bc623eda
- Support operations like fp_to_uint with a vector · aac74a90
  Duncan Sands authored Oct 20, 2008
```
result type when the result type is legal but
not the operand type.  Add additional support
for EXTRACT_SUBVECTOR and CONCAT_VECTORS,
needed to handle such cases.

llvm-svn: 57840
```
  aac74a90
- LegalizeTypes support for atomic operation promotion. · e0fb87ac
  Duncan Sands authored Oct 20, 2008
```
llvm-svn: 57838
```
  e0fb87ac
- Use DAG.getIntPtrConstant rather than DAG.getConstant · 840143fc
  Duncan Sands authored Oct 20, 2008
```
with TLI.getPointerTy for a small simplification.

llvm-svn: 57837
```
  840143fc
- Always use either MVT::i1 or getSetCCResultType for · 5805334d
  Duncan Sands authored Oct 20, 2008
```
the condition of a SELECT node.  Make sure that the
correct extension type (any-, sign- or zero-extend)
is used.

llvm-svn: 57836
```
  5805334d
- Formatting - no functional change. · fe9b5550
  Duncan Sands authored Oct 20, 2008
```
llvm-svn: 57834
```
  fe9b5550
- Don't use a random type for the select condition, · 3ed8b29a
  Duncan Sands authored Oct 20, 2008
```
use an MVT::i1 and simplify the code while there.

llvm-svn: 57833
```
  3ed8b29a
Oct 19, 2008
- Set N->OperandList to 0 after deletion. Otherwise, it's possible that it will · 8ec2a4a9
  Bill Wendling authored Oct 19, 2008
```
be either deleted or referenced afterwards.

llvm-svn: 57786
```
  8ec2a4a9
- Fix comment. Other formatting changes. No functionality changes. · 6c87bfc6
  Bill Wendling authored Oct 19, 2008
```
llvm-svn: 57785
```
  6c87bfc6
- Vector shuffle mask elements may be "undef". Handle · 8d11adca
  Duncan Sands authored Oct 19, 2008
```
this everywhere in LegalizeTypes.

llvm-svn: 57783
```
  8d11adca
- Use a legal integer type for vector shuffle mask · c6d12bd6
  Duncan Sands authored Oct 19, 2008
```
elements.  Otherwise LegalizeTypes will, reasonably
enough, legalize the mask, which may result in it
no longer being a BUILD_VECTOR node (LegalizeDAG
simply ignores the legality or not of vector masks).

llvm-svn: 57782
```
  c6d12bd6
Oct 18, 2008

Reapply r57699 with a fix to not crash on asms with multiple results. Unlike · 160e8abd

Chris Lattner authored Oct 18, 2008

the previous patch this one actually passes make check.

"Fix PR2356 on PowerPC: if we have an input and output that are tied together
that have different sizes (e.g. i32 and i64) make sure to reserve registers for
the bigger operand."

llvm-svn: 57771

160e8abd

Don't truncate GlobalAddress offsets to int in debug output. · 727a9406
Dan Gohman authored Oct 18, 2008
```
llvm-svn: 57770
```
727a9406
By min, I mean max. · 2dadd3bb
Evan Cheng authored Oct 18, 2008
```
llvm-svn: 57766
```
2dadd3bb
When creating intervals, leave min(1, numdefs) holes after each instruction. · ac4e70d9
Evan Cheng authored Oct 18, 2008
```
llvm-svn: 57765
```
ac4e70d9

Teach DAGCombine to fold constant offsets into GlobalAddress nodes, · 2fe6bee5

Dan Gohman authored Oct 18, 2008

and add a TargetLowering hook for it to use to determine when this
is legal (i.e. not in PIC mode, etc.)

This allows instruction selection to emit folded constant offsets
in more cases, such as the included testcase, eliminating the need
for explicit arithmetic instructions.

This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp
that attempted to achieve the same effect, but wasn't as effective.

Also, fix handling of offsets in GlobalAddressSDNodes in several
places, including changing GlobalAddressSDNode's offset from
int to int64_t.

The Mips, Alpha, Sparc, and CellSPU targets appear to be
unaware of GlobalAddress offsets currently, so set the hook to
false on those targets.

llvm-svn: 57748

2fe6bee5

Revert r57699. It's causing regressions in · 6de25562

Dan Gohman authored Oct 18, 2008

test/CodeGen/X86/2008-09-17-inline-asm-1.ll
and a few others, and it breaks the llvm-gcc build.

llvm-svn: 57747

6de25562

Oct 17, 2008
- Factor out the code for mapping LLVM IR condition opcodes to · d01ddb51
  Dan Gohman authored Oct 17, 2008
```
ISD condition opcodes into helper functions.

llvm-svn: 57726
```
  d01ddb51
- Fix PR2898. Spiller delete a store for reuse before it knows for sure the reuse happened. · 94169f10
  Evan Cheng authored Oct 17, 2008
```
Patch by Lang Hames!

llvm-svn: 57720
```
  94169f10
- add support for 128 bit aggregates. · aadf7414
  Chris Lattner authored Oct 17, 2008
```
llvm-svn: 57715
```
  aadf7414
- The Dwarf writer was comparing mangled and unmangled names for C++ code when we · fe9e2c58
  Bill Wendling authored Oct 17, 2008
```
have an unreachable block in a function. This was triggering the assert. This is
a horrid hack to cover this up.

Oh! for a good debug info architecture!

llvm-svn: 57714
```
  fe9e2c58
- Added MemIntrinsicNode which is useful to represent target intrinsics that · 85f48ade
  Mon P Wang authored Oct 17, 2008
```
touches memory and need an associated MemOperand

llvm-svn: 57712
```
  85f48ade
- Factor out the code for mapping LLVM IR condition opcodes to · 293abcc9
  Dan Gohman authored Oct 17, 2008
```
ISD condition opcodes into helper functions.

llvm-svn: 57710
```
  293abcc9
- Fix PR2356 on PowerPC: if we have an input and output that are tied together · 052092bf
  Chris Lattner authored Oct 17, 2008
```
that have different sizes (e.g. i32 and i64) make sure to reserve registers for
the bigger operand.

llvm-svn: 57699
```
  052092bf
- refactor some code into a helper method, no functionality change. · 3b1833c9
  Chris Lattner authored Oct 17, 2008
```
llvm-svn: 57690
```
  3b1833c9
- Keep track of *which* input constraint matches an output · 860df6e8
  Chris Lattner authored Oct 17, 2008
```
constraint.  Reject asms where an output has multiple
input constraints tied to it.

llvm-svn: 57687
```
  860df6e8
- add an assert so that PR2356 explodes instead of running off an · ef890172
  Chris Lattner authored Oct 17, 2008
```
array.  Improve some minor comments, refactor some helpers in
AsmOperandInfo.  No functionality change for valid code.

llvm-svn: 57686
```
  ef890172
- Fix a very subtle spiller bug: UpdateKills should not forget to track defs of aliases. · 08acb242
  Evan Cheng authored Oct 17, 2008
```
llvm-svn: 57673
```
  08acb242
- Define patterns for shld and shrd that match immediate · a39b0a1f
  Dan Gohman authored Oct 17, 2008
```
shift counts, and patterns that match dynamic shift counts
when the subtract is obscured by a truncate node.

Add DAGCombiner support for recognizing rotate patterns
when the shift counts are defined by truncate nodes.

Fix and simplify the code for commuting shld and shrd
instructions to work even when the given instruction doesn't
have a parent, and when the caller needs a new instruction.

These changes allow LLVM to use the shld, shrd, rol, and ror
instructions on x86 to replace equivalent code using two
shifts and an or in many more cases.

llvm-svn: 57662
```
  a39b0a1f