Commits · 5a3eecdfd8e677396e381326182e310266a24ed1 · Roger Ferrer / llvm-epi-0.8

May 07, 2008
- Fix a bug in the ComputeMaskedBits logic for multiply. · 5a3eecdf
  Dan Gohman authored May 07, 2008
```
llvm-svn: 50793
```
  5a3eecdf
- Make StripPointerCast a common function (should we mak it method of Value instead?) · 82c02b28
  Anton Korobeynikov authored May 06, 2008
```
llvm-svn: 50775
```
  82c02b28
May 06, 2008
- Make several variable declarations static. · 6a2da37c
  Dan Gohman authored May 06, 2008
```
llvm-svn: 50696
```
  6a2da37c
- Remove uses of llvm/System/IncludeFile.h that are no longer needed. · a8b7e78f
  Dan Gohman authored May 06, 2008
```
llvm-svn: 50695
```
  a8b7e78f
- Instead of enumerating each opcode that isn't handled that · 38dc08f3
  Dan Gohman authored May 06, 2008
```
ComputeMaskedBits handles, just use a 'default:'. This avoids
TargetLowering's list getting out of date with SelectionDAG's.

llvm-svn: 50693
```
  38dc08f3
- Correct the value of LowBits in srem and urem handling in · cf0e3acf
  Dan Gohman authored May 06, 2008
```
ComputeMaskedBits.

llvm-svn: 50692
```
  cf0e3acf
- Fix a broken doxygen comment, and reword it for clarity. · 72a0bc14
  Dan Gohman authored May 06, 2008
```
llvm-svn: 50687
```
  72a0bc14
May 05, 2008
- Added addition atomic instrinsics and, or, xor, min, and max. · 3e58393c
  Mon P Wang authored May 05, 2008
```
llvm-svn: 50663
```
  3e58393c
- Fix a bug in the ELF writer that caused it to produce malformed · e3a63ba3
  Dan Gohman authored May 05, 2008
```
ELF headers. The ELF writer still isn't generally usable though.

llvm-svn: 50652
```
  e3a63ba3
- Add AsmPrinter support for emitting a directive to declare that · bcde1722
  Dan Gohman authored May 05, 2008
```
the code being generated does not require an executable stack.

Also, add target-specific code to make use of this on Linux
on x86. 

llvm-svn: 50634
```
  bcde1722
May 02, 2008
- Fix a mistake in the computation of leading zeros for udiv. · 1962c2be
  Dan Gohman authored May 02, 2008
```
llvm-svn: 50591
```
  1962c2be
- Fix a typo in a comment. · 2f83b478
  Dan Gohman authored May 02, 2008
```
llvm-svn: 50562
```
  2f83b478
- Use push_back(...) instead of resize(1, ...), per review feedback. · ea635782
  Dan Gohman authored May 02, 2008
```
llvm-svn: 50561
```
  ea635782
- Fix uninitialized uses of the FPC variable. · 752ce50b
  Dan Gohman authored May 01, 2008
```
llvm-svn: 50558
```
  752ce50b
May 01, 2008

don't randomly miscompile seto/setuo just because we are in · d4b2a67c

Chris Lattner authored May 01, 2008

ffastmath mode.  This fixes rdar://5902801, a miscompilation
of gcc.dg/builtins-8.c.

Bill, please pull this into Tak.

llvm-svn: 50523

d4b2a67c

Apr 30, 2008

Tail call optimization improvements: · be0de34e

Arnold Schwaighofer authored Apr 30, 2008

Move platform independent code (lowering of possibly overwritten
arguments, check for tail call optimization eligibility) from
target X86ISelectionLowering.cpp to TargetLowering.h and
SelectionDAGISel.cpp.

Initial PowerPC tail call implementation:

Support ppc32 implemented and tested (passes my tests and
test-suite llvm-test).  
Support ppc64 implemented and half tested (passes my tests).
On ppc tail call optimization is performed if 
  caller and callee are fastcc
  call is a tail call (in tail call position, call followed by ret)
  no variable argument lists or byval arguments
  option -tailcallopt is enabled
Supported:
 * non pic tail calls on linux/darwin
 * module-local tail calls on linux(PIC/GOT)/darwin(PIC)
 * inter-module tail calls on darwin(PIC)
If constraints are not met a normal call will be emitted.

A test checking the argument lowering behaviour on x86-64 was added.

llvm-svn: 50477

be0de34e

Add comments for previous patch as requested. · c110c4a5
Dale Johannesen authored Apr 30, 2008
```
llvm-svn: 50463
```
c110c4a5
Fix custom target lowering for zero/any/sign_extend: make sure that · be940424
Scott Michel authored Apr 30, 2008
```
DAG.UpdateNodeOperands() is called before (not after) the call to
TLI.LowerOperation().

llvm-svn: 50461
```
be940424
Make eh_frame objects by 8-byte aligned on 64-bit · fc3e3ad7
Dale Johannesen authored Apr 29, 2008
```
targets.

llvm-svn: 50451
```
fc3e3ad7

Apr 29, 2008

Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. · 6b371145

Roman Levenstein authored Apr 29, 2008

This removes the existing bottleneck related to the removal of elements from 
the middle of the queue.

Also fixes a subtle bug in ScheduleDAGRRList::CapturePred:
It was updating the state of the SUnit before removing it. As a result, the
comparison operators were working incorrectly and this SUnit could not be removed 
from the queue properly.

Reviewed by Evan and Dan. Approved by Dan.

llvm-svn: 50412

6b371145

make the vector conversion magic handle multiple results. · 5c88f7b1

Chris Lattner authored Apr 29, 2008

We now compile test2/test3 to:

_test2:
	## InlineAsm Start
	set %xmm0, %xmm1
	## InlineAsm End
	addps	%xmm1, %xmm0
	ret
_test3:
	## InlineAsm Start
	set %xmm0, %xmm1
	## InlineAsm End
	paddd	%xmm1, %xmm0
	ret

as expected.

llvm-svn: 50389

5c88f7b1

add support for multiple return values in inline asm. This is a step · f9a49c43

Chris Lattner authored Apr 29, 2008

towards PR2094.  It now compiles the attached .ll file to:

_sad16_sse2:
	movslq	%ecx, %rax
	## InlineAsm Start
	%ecx %rdx %rax %rax %r8d %rdx %rsi
	## InlineAsm End
	## InlineAsm Start
	set %eax
	## InlineAsm End
	ret

which is pretty decent for a 3 output, 4 input asm.

llvm-svn: 50386

f9a49c43

Another extract_subreg coalescing bug. · 11b98b66

Evan Cheng authored Apr 29, 2008

e.g.
vr1024<2> extract_subreg vr1025, 2
If vr1024 do not have the same register class as vr1025, it's not safe to coalesce this away. For example, vr1024 might be a GPR32 while vr1025 might be a GPR64.

llvm-svn: 50385

11b98b66

Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling... · b96782ec

Evan Cheng authored Apr 28, 2008

Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for it's chain outputs:

c1, f1 = CopyToReg                                                                                                                                                                                             
c2, f2 = CopyToReg                                                                                                                                                                                             
c3     = TokenFactor c1, c2                                                                                                                                                                                    
 ...                                                                                                                                                                                                                      
       = user c3, ..., f2

Now that the two CopyToReg's and the user are "flagged" together. They effectively forms a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes.

llvm-svn: 50376

b96782ec

Apr 28, 2008
- Evan pointed out that folding sext to zext may not be correct · c968c1f5
  Dan Gohman authored Apr 28, 2008
```
if the zext is not legal.

llvm-svn: 50368
```
  c968c1f5
- Delete an unused constructor. · 77ce6da3
  Dan Gohman authored Apr 28, 2008
```
llvm-svn: 50367
```
  77ce6da3
- Add a comment to CreateRegForValue that clarifies the handling of · d961d30b
  Dan Gohman authored Apr 28, 2008
```
aggregate types.

llvm-svn: 50366
```
  d961d30b
- Rewrite the comments for RegsForValue and its members, and · 80c692d4
  Dan Gohman authored Apr 28, 2008
```
reorder some of the members for clarity.

llvm-svn: 50365
```
  80c692d4
- Don't call size() on each iteration of the loop. · 14a05df9
  Dan Gohman authored Apr 28, 2008
```
llvm-svn: 50361
```
  14a05df9
- Fix the SVOffset values for loads and stores produced by · da440548
  Dan Gohman authored Apr 28, 2008
```
memcpy/memset expansion. It was a bug for the SVOffset value
to be used in the actual address calculations.

llvm-svn: 50359
```
  da440548
- Teach InstCombine's ComputeMaskedBits what SelectionDAG's · 72ec3f45
  Dan Gohman authored Apr 28, 2008
```
ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach
SelectionDAG's ComputeMaskedBits what InstCombine's knows
about SRem. And teach them both some things about high bits
in Mul, UDiv, URem, and Sub. This allows instcombine and
dagcombine to eliminate sign-extension operations in
several new cases.

llvm-svn: 50358
```
  72ec3f45
- Teach DAGCombine to convert (sext x) to (zext x) when the · 3eb10f75
  Dan Gohman authored Apr 28, 2008
```
sign-bit of x is known to be zero.

llvm-svn: 50357
```
  3eb10f75
- Another collection of random cleanups. No functionality change. · c9e280c7
  Chris Lattner authored Apr 28, 2008
```
llvm-svn: 50341
```
  c9e280c7
- Remove the SmallVector ctor that converts from a SmallVectorImpl. This · 52504e78
  Chris Lattner authored Apr 28, 2008
```
conversion open the door for many nasty implicit conversion issues, and
can be easily solved by initializing with (V.begin(), V.end()) when 
needed.

This patch includes many small cleanups for sdisel also.

llvm-svn: 50340
```
  52504e78
- switch RegsForValue::Regs to be a SmallVector to avoid · 8c7f5ad9
  Chris Lattner authored Apr 28, 2008
```
heap thrash on tiny (usually single-element) vectors.

llvm-svn: 50335
```
  8c7f5ad9
- move static function out of anon namespace, no functionality change. · d04b818a
  Chris Lattner authored Apr 27, 2008
```
llvm-svn: 50330
```
  d04b818a
- Another step to getting multiple result inline asm to work. · 12272184
  Chris Lattner authored Apr 27, 2008
```
llvm-svn: 50329
```
  12272184
Apr 27, 2008

typo · 58b9ece3
Chris Lattner authored Apr 27, 2008
```
llvm-svn: 50316
```
58b9ece3

Implement a signficant optimization for inline asm: · 22379734

Chris Lattner authored Apr 27, 2008

When choosing between constraints with multiple options,
like "ir", test to see if we can use the 'i' constraint and
go with that if possible.  This produces more optimal ASM in
all cases (sparing a register and an instruction to load it),
and fixes inline asm like this:

void test () {
  asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14));
}

Previously we would dump "42" into a memory location (which
is ok for the 'm' constraint) which would cause a problem
because the 'c' modifier is not valid on memory operands.

Isn't it great how inline asm turns 'missed optimization'
into 'compile failed'??

Incidentally, this was the todo in 
PowerPC/2007-04-24-InlineAsm-I-Modifier.ll

Please do NOT pull this into Tak.

llvm-svn: 50315

22379734

isa+cast -> dyn_cast · a937baeb
Chris Lattner authored Apr 27, 2008
```
llvm-svn: 50314
```
a937baeb