  1. Mar 14, 2009
  2. Mar 13, 2009
    • Fix PR3784: If the source of a phi comes from a bb ended with an invoke, make... · 94419d6f
      Evan Cheng authored
      Fix PR3784: If the source of a phi comes from a bb ended with an invoke, make sure the copy is inserted before the try range (unless it's used as an input to the invoke, in which case insert it after the last use), not at the end of the bb.
      
      Also re-apply r66140 which was disabled as a workaround.
      
      llvm-svn: 66976
    • Fix FastISel's assumption that i1 values are always zero-extended · c0bb9595
      Dan Gohman authored
      by inserting explicit zero extensions where necessary. Included
      is a testcase where SelectionDAG produces a virtual register
      holding an i1 value which FastISel previously mistakenly assumed
      to be zero-extended.
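As a hedged illustration (not FastISel's actual code): the fix corresponds to masking the low bit of the wider register instead of trusting that the producer left the upper bits zero.

```c
#include <stdint.h>

/* Hypothetical sketch: an i1 value living in a wider virtual register
 * may carry garbage in its upper bits. An explicit zero extension
 * masks everything except bit 0, rather than assuming the producer
 * already zero-extended the value. */
uint32_t zext_i1(uint32_t reg_with_garbage) {
    return reg_with_garbage & 1u; /* explicit zext of the i1 */
}
```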
      
      llvm-svn: 66941
    • Fix some significant problems with constant pools that resulted in unnecessary... · 1fb8aedd
      Evan Cheng authored
      Fix some significant problems with constant pools that resulted in unnecessary padding between constant pool entries, larger-than-necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues.
      
      1. ConstantPoolSDNode's alignment field holds the log2 value of the alignment requirement. This is not consistent with other SDNode variants.
      2. MachineConstantPool's alignment field is also a log2 value.
      3. However, some places create ConstantPoolSDNode with the raw alignment value rather than its log2. This creates entries with artificially large alignments, e.g. 256 for SSE vector values.
      4. Constant pool entry offsets are computed when the entries are created. However, the asm printer groups them by section, so the offsets are no longer valid; nevertheless, the asm printer uses them to determine the size of the padding between entries.
      5. The asm printer uses an expensive multimap data structure to track constant pool entries by section.
      6. The asm printer iterates over a SmallPtrSet when emitting constant pool entries. This is non-deterministic.
      
      
      Solutions:
      1. ConstantPoolSDNode's alignment field is changed to keep the non-log2 value.
      2. MachineConstantPool's alignment field is also changed to keep the non-log2 value.
      3. Functions that create ConstantPool nodes now pass in non-log2 alignments.
      4. MachineConstantPoolEntry no longer keeps an offset field; it's replaced with an alignment field. Offsets are no longer computed when constant pool entries are created; they are computed on the fly in the asm printer and JIT.
      5. The asm printer uses a cheaper data structure to group constant pool entries.
      6. The asm printer computes entry offsets after grouping is done.
      7. The JIT code is changed to compute entry offsets on the fly.
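The on-the-fly offset computation in solutions 4, 6, and 7 boils down to rounding a running offset up to each entry's (non-log2) alignment as it is emitted. A minimal sketch in C, with hypothetical helper names rather than LLVM's actual code:

```c
#include <stddef.h>

/* Round offset up to the next multiple of alignment
 * (alignment must be a power of two). */
static size_t align_up(size_t offset, size_t alignment) {
    return (offset + alignment - 1) & ~(alignment - 1);
}

/* Compute each constant pool entry's offset on the fly, after
 * grouping, from its size and raw alignment; returns total size. */
size_t layout_entries(const size_t *sizes, const size_t *aligns,
                      int count, size_t *offsets) {
    size_t offset = 0;
    for (int i = 0; i < count; ++i) {
        offset = align_up(offset, aligns[i]); /* pad to entry alignment */
        offsets[i] = offset;
        offset += sizes[i];
    }
    return offset;
}
```

Passing a raw alignment where a log2 value is expected (problem 3 above) would inflate the padding enormously, which is exactly the bug being fixed.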
      
      llvm-svn: 66875
    • Convert VirtRegMap to a MachineFunctionPass. · d37ddf5b
      Owen Anderson authored
      llvm-svn: 66870
    • Oops...I committed too much. · fa54bc20
      Bill Wendling authored
      llvm-svn: 66867
    • Temporarily XFAIL this test. · b02eadf6
      Bill Wendling authored
      llvm-svn: 66866
    • Fix a typo in a comment. · a19c662a
      Dan Gohman authored
      llvm-svn: 66843
  3. Mar 12, 2009
    • Reorganize some #include's. · 36a99378
      Owen Anderson authored
      llvm-svn: 66780
    • Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" · 4147f08e
      Chris Lattner authored
      related transformations out of target-specific dag combine into the
      ARM backend.  These were added by Evan in r37685 with no testcases
      and only seem to help ARM (e.g. test/CodeGen/ARM/select_xform.ll).
      
      Add some simple X86-specific (for now) DAG combines that turn things
      like cond ? 8 : 0  -> (zext(cond) << 3).  This happens frequently
      with the recently added cp constant select optimization, but is a
      very general xform.  For example, we now compile the second example
      in const-select.ll to:
      
      _test:
              movsd   LCPI2_0, %xmm0
              ucomisd 8(%esp), %xmm0
              seta    %al
              movzbl  %al, %eax
              movl    4(%esp), %ecx
              movsbl  (%ecx,%eax,4), %eax
              ret
      
      instead of:
      
      _test:
              movl    4(%esp), %eax
              leal    4(%eax), %ecx
              movsd   LCPI2_0, %xmm0
              ucomisd 8(%esp), %xmm0
              cmovbe  %eax, %ecx
              movsbl  (%ecx), %eax
              ret
      
      This passes multisource and dejagnu.
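The equivalence behind the new combine can be checked in plain C, where the zext is the implicit bool-to-int conversion (an illustration of the transform, not the DAG combine code itself):

```c
/* cond ? 8 : 0 is equivalent to zext(cond != 0) << 3: the branch or
 * cmov disappears into a shift of the zero-extended condition flag. */
int select_form(int cond) { return cond ? 8 : 0; }
int shift_form(int cond)  { return (int)(cond != 0) << 3; }
```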
      
      llvm-svn: 66779
    • Enable Chris' value propagation change. It makes available known sign, zero,... · 44659546
      Evan Cheng authored
      Enable Chris' value propagation change. It makes known sign, zero, and one bit information available for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, and truncate instructions on values that are live-in to blocks. This does not handle PHI nodes yet.
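As a hedged illustration of the known-bits idea (not LLVM's implementation): once one block computes `x & 0xFF`, the upper bits of the result are known zero, so a zero extension from 8 bits in a consuming block is redundant and can be removed.

```c
#include <stdint.h>

/* The producing block computes v = x & 0xFF, so bits 8..31 of v are
 * known zero; the consuming block's zext-from-8-bits is then a no-op. */
uint32_t producer(uint32_t x)       { return x & 0xFFu; }
uint32_t redundant_zext(uint32_t v) { return v & 0xFFu; /* zext i8 -> i32 */ }
```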
      
      llvm-svn: 66777
  4. Mar 11, 2009
  5. Mar 10, 2009
  6. Mar 09, 2009
  7. Mar 08, 2009
    • Evan Cheng · de22116f
    • implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. · ab5a4431
      Chris Lattner authored
      For 2009-03-07-FPConstSelect.ll we now produce:
      
      _f:
      	xorl	%eax, %eax
      	testl	%edi, %edi
      	movl	$4, %ecx
      	cmovne	%rax, %rcx
      	leaq	LCPI1_0(%rip), %rax
      	movss	(%rcx,%rax), %xmm0
      	ret
      
      previously we produced:
      
      _f:
      	subl	$4, %esp
      	cmpl	$0, 8(%esp)
      	movss	LCPI1_0, %xmm0
      	je	LBB1_2	## entry
      LBB1_1:	## entry
      	movss	LCPI1_1, %xmm0
      LBB1_2:	## entry
      	movss	%xmm0, (%esp)
      	flds	(%esp)
      	addl	$4, %esp
      	ret
      
      on PPC the code also improves to:
      
      _f:
      	cntlzw r2, r3
      	srwi r2, r2, 5
      	li r3, lo16(LCPI1_0)
      	slwi r2, r2, 2
      	addis r3, r3, ha16(LCPI1_0)
      	lfsx f1, r3, r2
      	blr 
      
      from:
      
      _f:
      	li r2, lo16(LCPI1_1)
      	cmplwi cr0, r3, 0
      	addis r2, r2, ha16(LCPI1_1)
      	beq cr0, LBB1_2	; entry
      LBB1_1:	; entry
      	li r2, lo16(LCPI1_0)
      	addis r2, r2, ha16(LCPI1_0)
      LBB1_2:	; entry
      	lfs f1, 0(r2)
      	blr 
      
      This also improves the existing pic-cpool case from:
      
      foo:
      	subl	$12, %esp
      	call	.Lllvm$1.$piclabel
      .Lllvm$1.$piclabel:
      	popl	%eax
      	addl	$_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax
      	cmpl	$0, 16(%esp)
      	movsd	.LCPI1_0@GOTOFF(%eax), %xmm0
      	je	.LBB1_2	# entry
      .LBB1_1:	# entry
      	movsd	.LCPI1_1@GOTOFF(%eax), %xmm0
      .LBB1_2:	# entry
      	movsd	%xmm0, (%esp)
      	fldl	(%esp)
      	addl	$12, %esp
      	ret
      
      to:
      
      foo:
      	call	.Lllvm$1.$piclabel
      .Lllvm$1.$piclabel:
      	popl	%eax
      	addl	$_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax
      	xorl	%ecx, %ecx
      	cmpl	$0, 4(%esp)
      	movl	$8, %edx
      	cmovne	%ecx, %edx
      	fldl	.LCPI1_0@GOTOFF(%eax,%edx)
      	ret
      
      This triggers a few dozen times in spec FP 2000.
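In C terms, the optimization replaces the branch with an indexed load from a two-entry constant table, with the false value at offset 0 and the true value at offset 4 (a sketch of the idea, not the codegen itself):

```c
/* c ? 1.0f : 2.0f becomes a load from { 2.0f, 1.0f } at offset c*4:
 * index 0 (condition false) holds 2.0f, index 1 (true) holds 1.0f. */
static const float table[2] = { 2.0f, 1.0f };

float select_branchy(int c)  { return c ? 1.0f : 2.0f; }
float select_tablized(int c) { return table[c != 0]; }
```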
      
      llvm-svn: 66358
    • random cleanups. · 21cf4bf2
      Chris Lattner authored
      llvm-svn: 66357
  8. Mar 07, 2009
    • Introduce new linkage types linkonce_odr, weak_odr, common_odr · 12da8ce3
      Duncan Sands authored
      and extern_weak_odr.  These are the same as the non-odr versions,
      except that they indicate that the global will only be overridden
      by an *equivalent* global.  In C, a function with weak linkage can
      be overridden by a function which behaves completely differently.
      This means that IP passes have to skip weak functions, since any
      deductions made from the function definition might be wrong: the
      definition could be replaced by something completely different
      at link time.  This is not allowed in C++, thanks to the ODR
      (One Definition Rule): if a function is replaced by another at
      link-time, then the new function must be the same as the original
      function.  If a language knows that a function or other global can
      only be overridden by an equivalent global, it can give it the
      weak_odr linkage type, and the optimizers will understand that it
      is alright to make deductions based on the function body.  The
      code generators on the other hand map weak and weak_odr linkage
      to the same thing.
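A hedged C-level illustration of the difference, using the GCC/Clang `weak` attribute rather than LLVM IR: with plain weak linkage the optimizer must treat the call as opaque, while weak_odr semantics would permit folding it.

```c
/* Under plain 'weak', this definition may be replaced at link time by
 * one that behaves completely differently, so interprocedural passes
 * must not fold calls based on its body. Under weak_odr semantics any
 * replacement is guaranteed equivalent, so folding would be safe.
 * (GCC/Clang extension on ELF targets; hypothetical example names.) */
__attribute__((weak)) int answer(void) { return 42; }

int caller(void) {
    /* With 'weak', this call stays opaque to IPO; with weak_odr
     * semantics it could legitimately be folded to 43. */
    return answer() + 1;
}
```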
      
      llvm-svn: 66339