Commits · c6089fda207626b47d66c1a79b658819ac6c9a39 · Roger Ferrer / llvm-epi-0.8

Dec 03, 2009
- fix a build problem with VC++, PR5664, patch by Alp Toker! · c831fac0
  Chris Lattner authored Dec 03, 2009
```
llvm-svn: 90419
```
  c831fac0
- Recognize canonical forms of vector shuffles where the same vector is used for · 0bbd3077
  Bob Wilson authored Dec 03, 2009
```
both source operands.  In the canonical form, the 2nd operand is changed to an
undef and the shuffle mask is adjusted to only reference elements from the 1st
operand.  Radar 7434842.

llvm-svn: 90417
```
  0bbd3077
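A minimal sketch of the canonical form described in 0bbd3077, assuming the usual shuffle convention in which mask indices `0..n-1` select from the first operand and `n..2n-1` from the second. The names `canonicalize_shuffle` and `apply_shuffle` are hypothetical, `None` stands in for undef, and this illustrates the idea only; it is not the code from the commit.

```python
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[int]
Mask = List[Optional[int]]  # None models an undef mask element


def canonicalize_shuffle(v1: Vector, v2: Optional[Vector],
                         mask: Mask) -> Tuple[Vector, Optional[Vector], Mask]:
    """If both operands are the same vector, drop the 2nd operand (None models
    undef) and remap mask indices n..2n-1 onto 0..n-1."""
    if v2 is None or v1 is not v2:
        return v1, v2, list(mask)
    n = len(v1)
    new_mask = [m if m is None or m < n else m - n for m in mask]
    return v1, None, new_mask


def apply_shuffle(v1: Vector, v2: Optional[Vector],
                  mask: Mask) -> List[Optional[int]]:
    """Reference semantics: index i < n selects v1[i], otherwise v2[i - n]."""
    n = len(v1)
    out: List[Optional[int]] = []
    for m in mask:
        if m is None:
            out.append(None)          # undef mask element
        elif m < n:
            out.append(v1[m])
        else:
            out.append(None if v2 is None else v2[m - n])
    return out
```

With `v = [10, 20, 30, 40]`, the mask `[0, 4, 1, 5]` over `(v, v)` becomes `[0, 0, 1, 1]` over `(v, undef)` and selects the same elements.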
- Revert r90371. It was causing build failures. · aba7d487
  Bill Wendling authored Dec 03, 2009
```
llvm-svn: 90383
```
  aba7d487
- Further improvements: refactoring code that does the same thing into one · 693969eb
  Bill Wendling authored Dec 03, 2009
```
function, converting "dyn_cast" to "cast", asserting the correct things, and
other general cleanups.

llvm-svn: 90371
```
  693969eb
- yay for case insensitive file systems (?) · 765ac33a
  Chris Lattner authored Dec 03, 2009
```
llvm-svn: 90370
```
  765ac33a
- remove some dead std::ostream using code. · 73570673
  Chris Lattner authored Dec 03, 2009
```
llvm-svn: 90366
```
  73570673
- improve portability to avoid conflicting with std::next in c++'0x. · a48f44d9
  Chris Lattner authored Dec 03, 2009
```
Patch by Howard Hinnant!

llvm-svn: 90365
```
  a48f44d9
- This initial code is meant to convert TargetData to use an AbstractTypesUser so · 1ed59c63
  Bill Wendling authored Dec 03, 2009
```
that it doesn't have dangling pointers when abstract types are resolved. This
modifies it somewhat to address comments: making the "StructLayoutMap" an
anonymous structure, calling "removeAbstractTypeUser" when appropriate, and
adding asserts where helpful.

llvm-svn: 90362
```
  1ed59c63
Dec 02, 2009
- Factor the stack alignment calculations out into a target independent pass. · 2c3a6c65
  Jim Grosbach authored Dec 02, 2009
```
No functionality change.

llvm-svn: 90336
```
  2c3a6c65
Dec 01, 2009
- Thumb1 exception handling setjmp · 36d4dec2
  Jim Grosbach authored Dec 01, 2009
```
llvm-svn: 90246
```
  36d4dec2
- For VLDM/VSTM (Advanced SIMD), set encoding bits Inst{11-8} to 0b1011. · 86fc9207
  Johnny Chen authored Dec 01, 2009
```
llvm-svn: 90243
```
  86fc9207
- For VMOV (immediate), make some of the encoding bits (cmode and op) unspecified. · ee536b0e
  Johnny Chen authored Dec 01, 2009
```
For VMOVv*i[16,32], op bit is don't care, and some cmode bits vary depending on
the immediate values.

Ref: Table A7-15 Modified immediate values for Advanced SIMD instructions.
llvm-svn: 90173
```
  ee536b0e
- Minor whitespace fixes. · 3ee8bc9b
  Dan Gohman authored Nov 30, 2009
```
llvm-svn: 90166
```
  3ee8bc9b
- Fix a minor inconsistency. · 6f513090
  Dan Gohman authored Nov 30, 2009
```
llvm-svn: 90165
```
  6f513090
Nov 30, 2009

- Remove isProfitableToDuplicateIndirectBranch target hook. It is profitable · 505ddaa4
  Bob Wilson authored Nov 30, 2009
```
for all the processors where I have tried it, and even when it might not help
performance, the cost is quite low.  The opportunities for duplicating
indirect branches are limited by other factors so code size does not change
much due to tail duplicating indirect branches aggressively.

llvm-svn: 90144
```
  505ddaa4

- Fix some more ARM unified syntax warnings. · c168a526
  Bob Wilson authored Nov 30, 2009
```
llvm-svn: 90141
```
  c168a526
- Added support to allow clients to custom widen. For X86, custom widen vectors for · 32f8bb9e
  Mon P Wang authored Nov 30, 2009
```
divide/remainder since these operations can trap, by unrolling them and adding undefs
for the resulting vector.

llvm-svn: 90108
```
  32f8bb9e

Nov 29, 2009
- update and consolidate the load pre notes. · 58ccf88c
  Chris Lattner authored Nov 29, 2009
```
llvm-svn: 90050
```
  58ccf88c
Nov 27, 2009

- add a deadargelim note. · 83a4a986
  Chris Lattner authored Nov 27, 2009
```
llvm-svn: 90009
```
  83a4a986
- This testcase is actually only partially redundant, and requires · ca9e0e83
  Chris Lattner authored Nov 27, 2009
```
the FIXME I added yesterday to be implemented.

llvm-svn: 90008
```
  ca9e0e83
- this (and probably several others) are now done. · cc6d2928
  Chris Lattner authored Nov 27, 2009
```
llvm-svn: 89982
```
  cc6d2928

- Teach memdep to phi translate bitcasts. This allows us to compile · 9bd2136c
  Chris Lattner authored Nov 26, 2009
```
the example in GCC PR16799 to:

LBB1_2:                                                     ## %bb1
	movl	%eax, %eax
	subq	%rax, %rdi
	movq	%rdi, (%rcx)
	movl	(%rdi), %eax
	testl	%eax, %eax
	je	LBB1_2

instead of:

LBB1_2:                                                     ## %bb1
	movl	(%rdi), %ecx
	subq	%rcx, %rdi
	movq	%rdi, (%rax)
	cmpl	$0, (%rdi)
	je	LBB1_2

llvm-svn: 89978
```
  9bd2136c

Nov 26, 2009

- Teach basicaa that x|c == x+c when the c bits of x are clear. This · 29bc8a91
  Chris Lattner authored Nov 26, 2009
```
allows us to compile the example in readme.txt into:

LBB1_1:                                                     ## %bb
	movl	4(%rdx,%rax), %ecx
	movl	%ecx, %esi
	imull	(%rdx,%rax), %esi
	imull	%esi, %ecx
	movl	%esi, 8(%rdx,%rax)
	imull	%ecx, %esi
	movl	%ecx, 12(%rdx,%rax)
	movl	%esi, 16(%rdx,%rax)
	imull	%ecx, %esi
	movl	%esi, 20(%rdx,%rax)
	addq	$16, %rax
	cmpq	$4000, %rax
	jne	LBB1_1

instead of:

LBB1_1:
	movl	(%rdx,%rax), %ecx
	imull	4(%rdx,%rax), %ecx
	movl	%ecx, 8(%rdx,%rax)
	imull	4(%rdx,%rax), %ecx
	movl	%ecx, 12(%rdx,%rax)
	imull	8(%rdx,%rax), %ecx
	movl	%ecx, 16(%rdx,%rax)
	imull	12(%rdx,%rax), %ecx
	movl	%ecx, 20(%rdx,%rax)
	addq	$16, %rax
	cmpq	$4000, %rax
	jne	LBB1_1

GCC (4.2) doesn't seem to be able to eliminate the loads in this
testcase either, it generates:

L2:
	movl	(%rdx), %eax
	imull	4(%rdx), %eax
	movl	%eax, 8(%rdx)
	imull	4(%rdx), %eax
	movl	%eax, 12(%rdx)
	imull	8(%rdx), %eax
	movl	%eax, 16(%rdx)
	imull	12(%rdx), %eax
	movl	%eax, 20(%rdx)
	addl	$4, %ecx
	addq	$16, %rdx
	cmpl	$1002, %ecx
	jne	L2

llvm-svn: 89952
```
  29bc8a91
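The identity behind 29bc8a91 can be checked with a small sketch: if every set bit of `c` is known to be zero in `x`, the addition produces no carries, so `x | c` and `x + c` agree. The helper names and the `known_zero` bitmask representation are assumptions made for illustration; this is not the basicaa implementation.

```python
def or_acts_as_add(known_zero: int, c: int) -> bool:
    """True if every set bit of c lies in the mask of bits known to be 0 in x."""
    return (c & ~known_zero) == 0


def holds_for_all(known_zero: int, c: int, bits: int = 8) -> bool:
    """Exhaustively compare x | c with x + c for every `bits`-wide x whose
    known-zero bits really are zero."""
    return all((x | c) == (x + c)
               for x in range(1 << bits)
               if (x & known_zero) == 0)
```

For example, when `x` is a multiple of 4 (`known_zero = 0b11`), `c = 1` satisfies the condition, while `c = 4` does not: `4 | 4` is 4 but `4 + 4` is 8.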

- teach basicaa that A[i] != A[i+1]. · 12dacdd3
  Chris Lattner authored Nov 26, 2009
```
llvm-svn: 89951
```
  12dacdd3
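A rough sketch of the reasoning in 12dacdd3, assuming two accesses that share the same base and the same variable index and differ only by constant element offsets. The function names and parameters are hypothetical, and this is not the basicaa code: the accesses cannot alias when their byte ranges do not overlap.

```python
def ranges_overlap(off1: int, size1: int, off2: int, size2: int) -> bool:
    """True if the half-open byte ranges [off, off + size) intersect."""
    return off1 < off2 + size2 and off2 < off1 + size1


def may_alias_same_base(k1: int, k2: int, elem_size: int,
                        access_size: int) -> bool:
    """Accesses to A[i + k1] and A[i + k2], each access_size bytes wide.
    The common term i * elem_size cancels, so only k1 and k2 matter."""
    return ranges_overlap(k1 * elem_size, access_size,
                          k2 * elem_size, access_size)
```

With 4-byte elements and 4-byte accesses, `A[i]` and `A[i+1]` do not overlap. An 8-byte access at `A[i]` would overlap `A[i+1]`, so the access size has to be part of the check.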
- update some notes slightly · 8e09ad6f
  Chris Lattner authored Nov 26, 2009
```
llvm-svn: 89913
```
  8e09ad6f

Nov 25, 2009
- Rollback changes r89516: Added two SubtargetFeatures::AddFeatures methods,... · 8981b3ab
  Viktor Kutuzov authored Nov 25, 2009
```
Rollback changes r89516: Added two SubtargetFeatures::AddFeatures methods, which accept a comma-separated string or already parsed command line parameters as input, and some code re-factoring to use these new methods.

llvm-svn: 89893
```
  8981b3ab
- Tail duplicate indirect branches for PowerPC, too. · 4419301d
  Bob Wilson authored Nov 25, 2009
```
With the testcase for pr3120, the "threaded interpreter" runtime decreases
from 1788 to 1413 with this change.

llvm-svn: 89877
```
  4419301d
- Avoid some possibly unsafe uses of StringRef::data(). · 4cd30817
  Benjamin Kramer authored Nov 25, 2009
```
llvm-svn: 89873
```
  4cd30817
- Use StringRef (again) in DebugInfo interface. · 2d9caf9f
  Devang Patel authored Nov 25, 2009
```
llvm-svn: 89866
```
  2d9caf9f
- Based on the testcase for pr3120, running on my MacPro with Xeon processors, · 120f729e
  Bob Wilson authored Nov 25, 2009
```
it is definitely profitable to tail duplicate indirect branches for x86.
This is likely to be true to various degrees for all modern x86 processors.

llvm-svn: 89865
```
  120f729e
- Support PIC loading of constant pool entries · 2db07581
  Bruno Cardoso Lopes authored Nov 25, 2009
```
llvm-svn: 89863
```
  2db07581
- Sketch structure for X86 disassembler. · 900f2ce3
  Daniel Dunbar authored Nov 25, 2009
```
llvm-svn: 89850
```
  900f2ce3
- Use endianness-dependent offsets for load/store of doubles when · 2c6d498c
  Bruno Cardoso Lopes authored Nov 25, 2009
```
using two swc/lwc instead of sdc/ldc.

llvm-svn: 89826
```
  2c6d498c
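To illustrate why the offsets in 2c6d498c depend on byte order: when a 64-bit double is written as two 32-bit words, the low word sits at the lower address on little-endian targets and at the higher address on big-endian ones. The sketch below uses hypothetical helper names and Python's `struct` module; it models only the memory layout, not the MIPS backend or its register pairing.

```python
import struct
from typing import Tuple


def word_offsets(little_endian: bool) -> Tuple[int, int]:
    """Return (low_word_offset, high_word_offset) within the 8-byte slot."""
    return (0, 4) if little_endian else (4, 0)


def store_double_as_words(value: float, little_endian: bool) -> bytes:
    """Write a double into an 8-byte slot using two 32-bit stores."""
    bits = struct.unpack("<Q", struct.pack("<d", value))[0]
    lo, hi = bits & 0xFFFFFFFF, bits >> 32
    lo_off, hi_off = word_offsets(little_endian)
    fmt = "<I" if little_endian else ">I"   # byte order of each word store
    buf = bytearray(8)
    struct.pack_into(fmt, buf, lo_off, lo)
    struct.pack_into(fmt, buf, hi_off, hi)
    return bytes(buf)
```

In both byte orders the resulting slot matches what a single 64-bit store of the same double would produce, which is the property the two-word sequence has to preserve.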
- Fix compiler warnings. · e0eb3365
  Dale Johannesen authored Nov 25, 2009
```
llvm-svn: 89824
```
  e0eb3365
- Only include in the callee saved regs the sub registers to avoid · fa2741e0
  Bruno Cardoso Lopes authored Nov 25, 2009
```
unnecessary save/restore.

llvm-svn: 89823
```
  fa2741e0
- Add proper emission of load/store double to stack slots for mips1 targets! · dce6f66c
  Bruno Cardoso Lopes authored Nov 25, 2009
```
llvm-svn: 89821
```
  dce6f66c
- Revert r89803. · d23ea6a3
  Devang Patel authored Nov 25, 2009
```
llvm-svn: 89819
```
  d23ea6a3
- Refactor target hook for tail duplication as requested by Chris. · d4d40670
  Bob Wilson authored Nov 24, 2009
```
Make tail duplication of indirect branches much more aggressive (for targets
that indicate that it is profitable), based on further experience with
this transformation.  I compiled 3 large applications with and without
this more aggressive tail duplication and measured minimal changes in code
size.  ("size" on Darwin seems to round the text size up to the nearest
page boundary, so I can only say that any code size increase was less than
one 4k page.) Radar 7421267.

llvm-svn: 89814
```
  d4d40670
Nov 24, 2009

- Do not store R31 into the caller's link area on PPC. · 5ece8f0a
  Dale Johannesen authored Nov 24, 2009
```
This violates the ABI (that area is "reserved"), and
while it is safe if all code is generated with current
compilers, there is some very old code around that uses
that slot for something else, and breaks if it is stored
into.  Adjust testcases looking for current behavior.
I've verified that the stack frame size is right in all
testcases, whether it changed or not.  7311323.

llvm-svn: 89811
```
  5ece8f0a

- Enable debug info for ppc-darwin. · 29c9b709
  Devang Patel authored Nov 24, 2009
```
llvm-svn: 89803
```
  29c9b709