  4. Mar 08, 2009
    • Evan Cheng authored · de22116f
    • implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. · ab5a4431
      Chris Lattner authored
      For 2009-03-07-FPConstSelect.ll we now produce:
      
      _f:
      	xorl	%eax, %eax
      	testl	%edi, %edi
      	movl	$4, %ecx
      	cmovne	%rax, %rcx
      	leaq	LCPI1_0(%rip), %rax
      	movss	(%rcx,%rax), %xmm0
      	ret
      
      previously we produced:
      
      _f:
      	subl	$4, %esp
      	cmpl	$0, 8(%esp)
      	movss	LCPI1_0, %xmm0
      	je	LBB1_2	## entry
      LBB1_1:	## entry
      	movss	LCPI1_1, %xmm0
      LBB1_2:	## entry
      	movss	%xmm0, (%esp)
      	flds	(%esp)
      	addl	$4, %esp
      	ret
      
      on PPC the code also improves to:
      
      _f:
      	cntlzw r2, r3
      	srwi r2, r2, 5
      	li r3, lo16(LCPI1_0)
      	slwi r2, r2, 2
      	addis r3, r3, ha16(LCPI1_0)
      	lfsx f1, r3, r2
      	blr 
      
      from:
      
      _f:
      	li r2, lo16(LCPI1_1)
      	cmplwi cr0, r3, 0
      	addis r2, r2, ha16(LCPI1_1)
      	beq cr0, LBB1_2	; entry
      LBB1_1:	; entry
      	li r2, lo16(LCPI1_0)
      	addis r2, r2, ha16(LCPI1_0)
      LBB1_2:	; entry
      	lfs f1, 0(r2)
      	blr 
      
      This also improves the existing pic-cpool case from:
      
      foo:
      	subl	$12, %esp
      	call	.Lllvm$1.$piclabel
      .Lllvm$1.$piclabel:
      	popl	%eax
      	addl	$_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax
      	cmpl	$0, 16(%esp)
      	movsd	.LCPI1_0@GOTOFF(%eax), %xmm0
      	je	.LBB1_2	# entry
      .LBB1_1:	# entry
      	movsd	.LCPI1_1@GOTOFF(%eax), %xmm0
      .LBB1_2:	# entry
      	movsd	%xmm0, (%esp)
      	fldl	(%esp)
      	addl	$12, %esp
      	ret
      
      to:
      
      foo:
      	call	.Lllvm$1.$piclabel
      .Lllvm$1.$piclabel:
      	popl	%eax
      	addl	$_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax
      	xorl	%ecx, %ecx
      	cmpl	$0, 4(%esp)
      	movl	$8, %edx
      	cmovne	%ecx, %edx
      	fldl	.LCPI1_0@GOTOFF(%eax,%edx)
      	ret
      
      This triggers a few dozen times in SPEC FP 2000.
      
      llvm-svn: 66358
    • random cleanups. · 21cf4bf2
      Chris Lattner authored
      llvm-svn: 66357
  5. Mar 07, 2009
    • Introduce new linkage types linkonce_odr, weak_odr, common_odr · 12da8ce3
      Duncan Sands authored
      and extern_weak_odr.  These are the same as the non-odr versions,
      except that they indicate that the global will only be overridden
      by an *equivalent* global.  In C, a function with weak linkage can
      be overridden by a function that behaves completely differently,
      so IP passes have to skip weak functions: any deductions made from
      the function definition might be wrong, because the definition
      could be replaced by something completely different at link time.
      This is not allowed in C++, thanks to the ODR (One Definition
      Rule): if a function is replaced by another at link time, then the
      new function must be the same as the original function.  If a
      language knows that a function or other global can only be
      overridden by an equivalent global, it can give it the weak_odr
      linkage type, and the optimizers will understand that it is all
      right to make deductions based on the function body.  The code
      generators, on the other hand, map weak and weak_odr linkage to
      the same thing.
      
      llvm-svn: 66339
  10. Mar 02, 2009
    • Fix a problem with DAGCombine on 64b targets where folding · a9e98122
      Nate Begeman authored
      extracts + build_vector into a shuffle would fail because the
      type of the new build_vector would not be legal.  Try harder to
      create a legal build_vector type.  Note: this will be totally
      irrelevant once vector_shuffle no longer takes a build_vector for
      the shuffle mask.
      
      New:
      _foo:
      	xorps	%xmm0, %xmm0
      	xorps	%xmm1, %xmm1
      	subps	%xmm1, %xmm1
      	mulps	%xmm0, %xmm1
      	addps	%xmm0, %xmm1
      	movaps	%xmm1, 0
      
      Old:
      _foo:
      	xorps	%xmm0, %xmm0
      	movss	%xmm0, %xmm1
      	xorps	%xmm2, %xmm2
      	unpcklps	%xmm1, %xmm2
      	pshufd	$80, %xmm1, %xmm1
      	unpcklps	%xmm1, %xmm2
      	pslldq	$16, %xmm2
      	pshufd	$57, %xmm2, %xmm1
      	subps	%xmm0, %xmm1
      	mulps	%xmm0, %xmm1
      	addps	%xmm0, %xmm1
      	movaps	%xmm1, 0
      
      llvm-svn: 65791