Commits · be0de34ede6658914f90eb35d6ee2a5270f8ea8c · Roger Ferrer / llvm-epi-0.8

Apr 30, 2008
- Tail call optimization improvements: · be0de34e
  Arnold Schwaighofer authored Apr 30, 2008
```
Move platform independent code (lowering of possibly overwritten
arguments, check for tail call optimization eligibility) from
target X86ISelectionLowering.cpp to TargetLowering.h and
SelectionDAGISel.cpp.

Initial PowerPC tail call implementation:

ppc32 support is implemented and tested (passes my tests and the
llvm-test test-suite).
ppc64 support is implemented and partially tested (passes my tests).
On ppc, tail call optimization is performed if:
 * caller and callee are fastcc
 * the call is a tail call (in tail call position, call followed by ret)
 * there are no variable argument lists or byval arguments
 * the option -tailcallopt is enabled
Supported:
 * non pic tail calls on linux/darwin
 * module-local tail calls on linux(PIC/GOT)/darwin(PIC)
 * inter-module tail calls on darwin(PIC)
If constraints are not met a normal call will be emitted.

A test checking the argument lowering behaviour on x86-64 was added.

llvm-svn: 50477
```
  be0de34e
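The eligibility rules listed in this commit message can be read as a single predicate. The sketch below is only an illustration of that reading: `CallConv`, `CallSite`, and `isEligibleForTailCall` are hypothetical names, not LLVM's API, and the real check in the backend may consider more than these fields.

```cpp
// Hypothetical model of the conditions in the commit message; none of these
// names come from LLVM.
enum class CallConv { C, Fast };

struct CallSite {
  CallConv callerCC;
  CallConv calleeCC;
  bool inTailPosition;  // the call is immediately followed by ret
  bool isVarArg;        // the callee takes a variable argument list
  bool hasByValArg;     // at least one argument is passed byval
};

// True only when every listed condition holds. When this is false, the
// message says a normal call is emitted instead.
constexpr bool isEligibleForTailCall(const CallSite &cs,
                                     bool tailCallOptEnabled) {
  return tailCallOptEnabled &&
         cs.callerCC == CallConv::Fast &&
         cs.calleeCC == CallConv::Fast &&
         cs.inTailPosition &&
         !cs.isVarArg &&
         !cs.hasByValArg;
}
```

Under this model, a fastcc-to-fastcc call in tail position with no varargs or byval arguments qualifies only when the `-tailcallopt` flag is on. Flipping any single field makes the predicate false.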
- fcntl.h is pretty standard on unix (without the sys/) · 659e5c4b
  Gabor Greif authored Apr 30, 2008
```
llvm-svn: 50475
```
  659e5c4b
- Move this test to LoopDeletion, where it now passes. · 0255998d
  Owen Anderson authored Apr 30, 2008
```
llvm-svn: 50474
```
  0255998d
- This condition got inverted accidentally. · 0ced13cc
  Owen Anderson authored Apr 30, 2008
```
llvm-svn: 50473
```
  0ced13cc
- move lowering of llvm.memset -> store from simplify libcalls · 2dc44266
  Chris Lattner authored Apr 30, 2008
```
to instcombine.

llvm-svn: 50472
```
  2dc44266
- no reason for simplifylibcalls to simplify intrinsics, instcombine does · f5f944ae
  Chris Lattner authored Apr 30, 2008
```
a fine job.

llvm-svn: 50470
```
  f5f944ae
- remove redundant check. · 4b20032b
  Chris Lattner authored Apr 30, 2008
```
llvm-svn: 50469
```
  4b20032b
- add missing #include · 9caa8c0d
  Chris Lattner authored Apr 30, 2008
```
llvm-svn: 50468
```
  9caa8c0d
- add a method for comparing to see if a value has a specified name. · a1d850ee
  Chris Lattner authored Apr 30, 2008
```
llvm-svn: 50465
```
  a1d850ee
- use string length computation to generalize several xforms. · 438e35c4
  Chris Lattner authored Apr 30, 2008
```
llvm-svn: 50464
```
  438e35c4
- Add comments for previous patch as requested. · c110c4a5
  Dale Johannesen authored Apr 30, 2008
```
llvm-svn: 50463
```
  c110c4a5
- Bug fixes and updates for CellSPU, syncing up with trunk. Most notable · c3a1910a
  Scott Michel authored Apr 30, 2008
```
fixes are target-specific lowering of frame indices, corrected constants
generated for the FSMBI instruction, and a fix for
SPUTargetLowering::computeMaskedBitsForTargetNode().

llvm-svn: 50462
```
  c3a1910a
- Fix custom target lowering for zero/any/sign_extend: make sure that · be940424
  Scott Michel authored Apr 30, 2008
```
DAG.UpdateNodeOperands() is called before (not after) the call to
TLI.LowerOperation().

llvm-svn: 50461
```
  be940424
- Make eh_frame objects be 8-byte aligned on 64-bit · fc3e3ad7
  Dale Johannesen authored Apr 29, 2008
```
targets.

llvm-svn: 50451
```
  fc3e3ad7
- Minor spelling and typo fixes. · 663f5fcc
  John Criswell authored Apr 29, 2008
```
llvm-svn: 50448
```
  663f5fcc
Apr 29, 2008

Revert r50441. The original code was correct. Add some more comments so that... · ad5367f8

Owen Anderson authored Apr 29, 2008

Revert r50441.  The original code was correct.  Add some more comments so that I don't make the same mistake in the future.

llvm-svn: 50446

ad5367f8

Fix a bug in memcpyopt where the memcpy-memcpy transform was never being applied because · ff7d7b18

Owen Anderson authored Apr 29, 2008

we were checking for it in the wrong order.  This caused a miscompilation because the
return slot optimization assumes that the call it is dealing with is NOT a memcpy.

llvm-svn: 50444

ff7d7b18

We should be returning true here since we've changed the function. · f07de734
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50442
```
f07de734
A lot of cleanups and documentation improvements, as well as a few corner case fixes. Most · e6746002
Owen Anderson authored Apr 29, 2008
```
of this was suggested by Chris.

llvm-svn: 50441
```
e6746002
Rename DeadLoopElimination to LoopDeletion, part 2. · 2306a1e0
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50437
```
2306a1e0
Rename DeadLoopElimination to LoopDeletion, part one. · e9f05bd1
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50436
```
e9f05bd1
Don't do stupid things: doInitialization(Module&) is not applicable to ModulePass :) · 0acc7398
Anton Korobeynikov authored Apr 29, 2008
```
llvm-svn: 50433
```
0acc7398
don't eliminate load from volatile value on paths where the load is dead. · d9e3b5c5
Chris Lattner authored Apr 29, 2008
```
This fixes the second half of PR2262

llvm-svn: 50430
```
d9e3b5c5
make this test reduced and *valid* · 53bcf360
Chris Lattner authored Apr 29, 2008
```
llvm-svn: 50429
```
53bcf360
fix a subtle volatile handling bug. · 9233c124
Chris Lattner authored Apr 29, 2008
```
llvm-svn: 50428
```
9233c124

Use std::set instead of std::priority_queue for the RegReductionPriorityQueue. · 6b371145

Roman Levenstein authored Apr 29, 2008

This removes the existing bottleneck related to the removal of elements from 
the middle of the queue.

Also fixes a subtle bug in ScheduleDAGRRList::CapturePred:
It was updating the state of the SUnit before removing it. As a result, the
comparison operators were working incorrectly and this SUnit could not be removed 
from the queue properly.

Reviewed by Evan and Dan. Approved by Dan.

llvm-svn: 50412

6b371145
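The two points in this message (cheap removal from the middle of the queue, and removing an element before changing the state that orders it) can be illustrated with a small ordered-set queue. This is a minimal sketch under assumptions: `Unit`, `ByPriority`, and `ReadyQueue` are hypothetical stand-ins, not the actual `SUnit` or `RegReductionPriorityQueue` code.

```cpp
#include <cassert>
#include <cstddef>
#include <set>

// Hypothetical stand-in for a scheduling unit.
struct Unit {
  int id;
  int priority;
};

// Orders by priority, then by id, so two distinct units never compare equal.
struct ByPriority {
  bool operator()(const Unit *a, const Unit *b) const {
    if (a->priority != b->priority) return a->priority < b->priority;
    return a->id < b->id;
  }
};

class ReadyQueue {
  std::set<const Unit *, ByPriority> units;

public:
  void push(const Unit *u) { units.insert(u); }

  // Removes u from anywhere in the queue in O(log n). The lookup uses the
  // comparator, so u's ordering state must be unchanged since insertion.
  bool remove(const Unit *u) { return units.erase(u) == 1; }

  // Returns the highest-priority unit; the queue must not be empty.
  const Unit *top() const {
    assert(!units.empty());
    return *units.rbegin();
  }

  std::size_t size() const { return units.size(); }

  // Safe update order: remove under the old key, then mutate, then reinsert.
  // Mutating first would make the comparator disagree with the stored order.
  void reprioritize(Unit *u, int newPriority) {
    bool wasQueued = remove(u);
    u->priority = newPriority;
    if (wasQueued) push(u);
  }
};
```

With three units of priority 10, 20, and 30 queued, `remove` on the middle one leaves two entries, and `reprioritize` on the lowest one with a new priority of 40 makes it the new `top()`.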

Implement more aggressive support for analyzing string length. This · 92f47022

Chris Lattner authored Apr 29, 2008

generalizes the previous code to handle the case when the string is not
an immediate to the strlen call (for example, crazy stuff like 
strlen(c ? "foo" : "bart"+1) -> 3).  This implements 
gcc.c-torture/execute/builtins/strlen-2.c.  I will generalize other
cases in simplifylibcalls to use the same routine later.

llvm-svn: 50408

92f47022
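The `strlen(c ? "foo" : "bart"+1) -> 3` example works because both arms of the select have the same known length, so the condition does not matter. The sketch below models that idea on a toy expression tree. `StrExpr` and `knownLength` are hypothetical names, and the real analysis operates on LLVM IR values rather than on a structure like this.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical expression model: either a pointer into a constant string
// (with any offset already applied) or a select between two expressions.
struct StrExpr {
  enum Kind { Literal, Select } kind;
  const char *text;        // used when kind == Literal
  const StrExpr *ifTrue;   // used when kind == Select
  const StrExpr *ifFalse;  // used when kind == Select
};

// Returns the statically known string length, or -1 when it is unknown.
// A select has a known length only if both arms agree on it.
long knownLength(const StrExpr &e) {
  if (e.kind == StrExpr::Literal) {
    assert(e.text != nullptr);
    return static_cast<long>(std::strlen(e.text));
  }
  assert(e.ifTrue != nullptr && e.ifFalse != nullptr);
  long t = knownLength(*e.ifTrue);
  long f = knownLength(*e.ifFalse);
  return (t >= 0 && t == f) ? t : -1;
}
```

Building a select over `"foo"` and `&"bart"[1]` (the string `"art"`) yields a known length of 3, matching the example in the message. A select over `"foo"` and `"ab"` reports -1 because the arms disagree.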

Clarify what we mean by a dead loop. · 304ef22f
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50406
```
304ef22f
new testcase for PR2094. The inline asms should not pin allocas to the · 141d2dfd
Chris Lattner authored Apr 29, 2008
```
stack anymore.

llvm-svn: 50397
```
141d2dfd
don't delete the last store to an alloca if the store is volatile. · e331a65c
Chris Lattner authored Apr 29, 2008
```
llvm-svn: 50390
```
e331a65c

make the vector conversion magic handle multiple results. · 5c88f7b1

Chris Lattner authored Apr 29, 2008

We now compile test2/test3 to:

_test2:
	## InlineAsm Start
	set %xmm0, %xmm1
	## InlineAsm End
	addps	%xmm1, %xmm0
	ret
_test3:
	## InlineAsm Start
	set %xmm0, %xmm1
	## InlineAsm End
	paddd	%xmm1, %xmm0
	ret

as expected.

llvm-svn: 50389

5c88f7b1

add support for multiple return values in inline asm. This is a step · f9a49c43

Chris Lattner authored Apr 29, 2008

towards PR2094.  It now compiles the attached .ll file to:

_sad16_sse2:
	movslq	%ecx, %rax
	## InlineAsm Start
	%ecx %rdx %rax %rax %r8d %rdx %rsi
	## InlineAsm End
	## InlineAsm Start
	set %eax
	## InlineAsm End
	ret

which is pretty decent for a 3 output, 4 input asm.

llvm-svn: 50386

f9a49c43

Another extract_subreg coalescing bug. · 11b98b66

Evan Cheng authored Apr 29, 2008

e.g.
vr1024<2> extract_subreg vr1025, 2
If vr1024 does not have the same register class as vr1025, it's not safe to coalesce this away. For example, vr1024 might be a GPR32 while vr1025 might be a GPR64.

llvm-svn: 50385

11b98b66
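The safety condition named in this message can be written as a guard on the two virtual registers. This sketch covers only that one condition. `RegClass`, `VirtReg`, and `canCoalesceExtractSubreg` are hypothetical names, and the actual coalescer may apply further checks that the message does not describe.

```cpp
// Hypothetical register model; not LLVM's data structures.
enum class RegClass { GPR32, GPR64 };

struct VirtReg {
  unsigned number;
  RegClass regClass;
};

// For "dst = extract_subreg src, idx": coalescing the copy away is ruled out
// when the two registers belong to different register classes.
constexpr bool canCoalesceExtractSubreg(const VirtReg &dst,
                                        const VirtReg &src) {
  return dst.regClass == src.regClass;
}
```

For the example in the message, a GPR32 `vr1024` and a GPR64 `vr1025` fail the guard, so the `extract_subreg` has to stay.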

Add some more comments. · 586216e5
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50384
```
586216e5
Remove debugging code. · 41377175
Owen Anderson authored Apr 29, 2008
```
llvm-svn: 50383
```
41377175
Add dead loop elimination, which removes dead loops for which we can compute · 94ad7024
Owen Anderson authored Apr 29, 2008
```
the trip count.

llvm-svn: 50382
```
94ad7024
Add -march=x86. · 73c3b474
Evan Cheng authored Apr 28, 2008
```
llvm-svn: 50380
```
73c3b474

Update and_ops.ll according to the recent dagcombiner changes. · c73845b3

Dan Gohman authored Apr 28, 2008

Add a new test, and_ops_more.ll, which is XFAIL'd, to
record the parts of and_ops.ll that were affected by this
change.

llvm-svn: 50379

c73845b3

Test case. · 315e3cb9
Evan Cheng authored Apr 28, 2008
```
llvm-svn: 50377
```
315e3cb9

Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling... · b96782ec

Evan Cheng authored Apr 28, 2008

Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for its chain outputs:

c1, f1 = CopyToReg
c2, f2 = CopyToReg
c3     = TokenFactor c1, c2
 ...
       = user c3, ..., f2

Now the two CopyToReg nodes and the user are "flagged" together, so they effectively form a single scheduling unit. The TokenFactor is then both an operand and a successor of the flagged nodes.

llvm-svn: 50376

b96782ec
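The cycle described in this message appears once flagged nodes are collapsed into one scheduling unit. The sketch below models that with plain integers for nodes and a depth-first search over the collapsed graph. All names are hypothetical. It also assumes that the first CopyToReg's flag feeds the second, which the message implies but does not state.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Hypothetical model: nodes are integers, and unitOf[n] is the scheduling
// unit that node n belongs to after flagged nodes are merged.
using Edge = std::pair<int, int>;  // (producer node, consumer node)

// Depth-first search over units: state 1 = on the current path, 2 = done.
static bool reachesCycle(int u, const std::vector<std::vector<int>> &succ,
                         std::vector<int> &state) {
  state[u] = 1;
  for (int v : succ[u]) {
    if (state[v] == 1) return true;  // back edge: cycle found
    if (state[v] == 0 && reachesCycle(v, succ, state)) return true;
  }
  state[u] = 2;
  return false;
}

// Collapses node edges into unit edges and reports whether they form a cycle.
bool unitGraphHasCycle(const std::vector<int> &unitOf, int numUnits,
                       const std::vector<Edge> &edges) {
  std::vector<std::vector<int>> succ(numUnits);
  for (const Edge &e : edges) {
    int from = unitOf[e.first], to = unitOf[e.second];
    assert(from >= 0 && from < numUnits && to >= 0 && to < numUnits);
    if (from != to) succ[from].push_back(to);  // edges inside a unit vanish
  }
  std::vector<int> state(numUnits, 0);
  for (int u = 0; u < numUnits; ++u)
    if (state[u] == 0 && reachesCycle(u, succ, state)) return true;
  return false;
}
```

Numbering the nodes 0 and 1 for the two CopyToReg nodes, 2 for the TokenFactor, and 3 for the user, the flagged nodes share one unit and the TokenFactor sits in another. The edges into and out of the TokenFactor then run in both directions between the two units, which is the cycle. Dropping the TokenFactor and chaining the user directly to the second CopyToReg leaves no edge between units.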