Feb 24, 2009

- Owen Anderson authored
  llvm-svn: 65375
- Bill Wendling authored
  a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode.
  - Add plumbing in to pass the "Fast" flag to places that need it.
  - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I still need to investigate.
  llvm-svn: 65367
- Dan Gohman authored
  ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364
- Dan Gohman authored
  handling non-constant strides. No functionality change. llvm-svn: 65363
- Dan Gohman authored
  llvm-svn: 65359
- Devang Patel authored
  This fixes the objc.dg/dwarf-prototypes.m scan-assembler DW_AT_prototyped test from the llvmgcc42 test suite. llvm-svn: 65357
- Devang Patel authored
  While folding an unconditional return, move the DbgRegionEndInst into the predecessor instead of removing it. This fixes the following tests from the llvmgcc42 testsuite:
  gcc.c-torture/execute/20000605-3.c
  gcc.c-torture/execute/20020619-1.c
  gcc.c-torture/execute/20030920-1.c
  gcc.c-torture/execute/loop-ivopts-1.c
  llvm-svn: 65353
- Devang Patel authored
  If there is no debug info available for any global variables or subprograms, then there is no debug info to emit. llvm-svn: 65352
- Dan Gohman authored
  trip counts that use signed comparisons. It's not obviously the best approach for preserving trip count information, and at any rate there isn't anything in the tree right now that makes use of that, so for now always using zero-extensions is preferable. llvm-svn: 65347

Feb 23, 2009

- Dan Gohman authored
  if it sees TLS addresses. llvm-svn: 65341
- Dan Gohman authored
  so that ScalarEvolution doesn't hang onto a dangling Loop*, which could be a problem if another Loop happens to get allocated at the same address. llvm-svn: 65323
- Dan Gohman authored
  -std-compile-opts sequence; this avoids the need for ScalarEvolution to be rerun before LoopDeletion. llvm-svn: 65318
- Zhou Sheng authored
  llvm-svn: 65314
- Evan Cheng authored
  llvm-svn: 65313
- Nate Begeman authored
  Generate better code for v16i8 shuffles on SSE2 (avoids stack)
  Generate pshufb for v8i16 and v16i8 shuffles on SSSE3 where it is fewer uops.
  Document the shuffle matching logic and add some FIXMEs for later further cleanups.
  New tests that test the above.
  Examples:
  New:
  _shuf2:
          pextrw $7, %xmm0, %eax
          punpcklqdq %xmm1, %xmm0
          pshuflw $128, %xmm0, %xmm0
          pinsrw $2, %eax, %xmm0
  Old:
  _shuf2:
          pextrw $2, %xmm0, %eax
          pextrw $7, %xmm0, %ecx
          pinsrw $2, %ecx, %xmm0
          pinsrw $3, %eax, %xmm0
          movd %xmm1, %eax
          pinsrw $4, %eax, %xmm0
          ret
  =========
  New:
  _shuf4:
          punpcklqdq %xmm1, %xmm0
          pshufb LCPI1_0, %xmm0
  Old:
  _shuf4:
          pextrw $3, %xmm0, %eax
          movsd %xmm1, %xmm0
          pextrw $3, %xmm1, %ecx
          pinsrw $4, %ecx, %xmm0
          pinsrw $5, %eax, %xmm0
  ========
  New:
  _shuf1:
          pushl %ebx
          pushl %edi
          pushl %esi
          pextrw $1, %xmm0, %eax
          rolw $8, %ax
          movd %xmm0, %ecx
          rolw $8, %cx
          pextrw $5, %xmm0, %edx
          pextrw $4, %xmm0, %esi
          pextrw $3, %xmm0, %edi
          pextrw $2, %xmm0, %ebx
          movaps %xmm0, %xmm1
          pinsrw $0, %ecx, %xmm1
          pinsrw $1, %eax, %xmm1
          rolw $8, %bx
          pinsrw $2, %ebx, %xmm1
          rolw $8, %di
          pinsrw $3, %edi, %xmm1
          rolw $8, %si
          pinsrw $4, %esi, %xmm1
          rolw $8, %dx
          pinsrw $5, %edx, %xmm1
          pextrw $7, %xmm0, %eax
          rolw $8, %ax
          movaps %xmm1, %xmm0
          pinsrw $7, %eax, %xmm0
          popl %esi
          popl %edi
          popl %ebx
          ret
  Old:
  _shuf1:
          subl $252, %esp
          movaps %xmm0, (%esp)
          movaps %xmm0, 16(%esp)
          movaps %xmm0, 32(%esp)
          movaps %xmm0, 48(%esp)
          movaps %xmm0, 64(%esp)
          movaps %xmm0, 80(%esp)
          movaps %xmm0, 96(%esp)
          movaps %xmm0, 224(%esp)
          movaps %xmm0, 208(%esp)
          movaps %xmm0, 192(%esp)
          movaps %xmm0, 176(%esp)
          movaps %xmm0, 160(%esp)
          movaps %xmm0, 144(%esp)
          movaps %xmm0, 128(%esp)
          movaps %xmm0, 112(%esp)
          movzbl 14(%esp), %eax
          movd %eax, %xmm1
          movzbl 22(%esp), %eax
          movd %eax, %xmm2
          punpcklbw %xmm1, %xmm2
          movzbl 42(%esp), %eax
          movd %eax, %xmm1
          movzbl 50(%esp), %eax
          movd %eax, %xmm3
          punpcklbw %xmm1, %xmm3
          punpcklbw %xmm2, %xmm3
          movzbl 77(%esp), %eax
          movd %eax, %xmm1
          movzbl 84(%esp), %eax
          movd %eax, %xmm2
          punpcklbw %xmm1, %xmm2
          movzbl 104(%esp), %eax
          movd %eax, %xmm1
          punpcklbw %xmm1, %xmm0
          punpcklbw %xmm2, %xmm0
          movaps %xmm0, %xmm1
          punpcklbw %xmm3, %xmm1
          movzbl 127(%esp), %eax
          movd %eax, %xmm0
          movzbl 135(%esp), %eax
          movd %eax, %xmm2
          punpcklbw %xmm0, %xmm2
          movzbl 155(%esp), %eax
          movd %eax, %xmm0
          movzbl 163(%esp), %eax
          movd %eax, %xmm3
          punpcklbw %xmm0, %xmm3
          punpcklbw %xmm2, %xmm3
          movzbl 188(%esp), %eax
          movd %eax, %xmm0
          movzbl 197(%esp), %eax
          movd %eax, %xmm2
          punpcklbw %xmm0, %xmm2
          movzbl 217(%esp), %eax
          movd %eax, %xmm4
          movzbl 225(%esp), %eax
          movd %eax, %xmm0
          punpcklbw %xmm4, %xmm0
          punpcklbw %xmm2, %xmm0
          punpcklbw %xmm3, %xmm0
          punpcklbw %xmm1, %xmm0
          addl $252, %esp
          ret
  llvm-svn: 65311
- Mon P Wang authored
  inline-threshold option is used by the inliner. llvm-svn: 65309
- Chris Lattner authored
  llvm-svn: 65306
- Bill Wendling authored
  llvm-svn: 65298
- Scott Michel authored
  instruction. The class also consolidates the code for detecting constant splats that's shared across the PowerPC and CellSPU backends (and might be useful for other backends). Also introduces SelectionDAG::getBUILD_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296

Feb 22, 2009

- Dan Gohman authored
  memcpy to match the alignment of the destination. It isn't necessary for making loads and stores handled like the SSE loadu/storeu intrinsics, and it was causing a performance regression in MultiSource/Applications/JM/lencod. The problem appears to have been a memcpy that copies from some highly aligned array into an alloca; the alloca was then being assigned a large alignment, which required codegen to perform dynamic stack-pointer re-alignment, which forced the enclosing function to have a frame pointer, which led to increased spilling. llvm-svn: 65289
- Dan Gohman authored
  -full-lsr code, as well as a GCC warning. llvm-svn: 65288
- Evan Cheng authored
  llvm-svn: 65279
- Evan Cheng authored
  llvm-svn: 65275
- Evan Cheng authored
  llvm-svn: 65274
- Evan Cheng authored
  Do not consider MMX_MOVD64rr a move instruction. The source register is in GR32, the destination is in VR64. They are not compatible. llvm-svn: 65273
- Evan Cheng authored
  Only try to sink an immediate when TLI is not null; it is needed to check whether the immediate would fit in the target addressing field. llvm-svn: 65268

Feb 21, 2009

- Nick Lewycky authored
  load(bitcast(char[4] to i32*)) evaluation. llvm-svn: 65246
- Richard Pennington authored
  llvm-svn: 65239
- Anton Korobeynikov authored
  Now we're using one gross but quite robust hack :) (previous ones did not work, for example, when an ext_weak symbol was used deep inside a constant expression in the initializer). The proper fix for this problem will require some fairly large asmprinter changes, which is why it was postponed. This fixes PR3629, by the way :) llvm-svn: 65230
- Evan Cheng authored
  llvm-svn: 65228
- Evan Cheng authored
  If a two-address def is dead and the instruction does not define other registers and does not produce side effects, just delete the instruction. llvm-svn: 65218
- Evan Cheng authored
  Teach LSR sinking to sink the immediate portion of the common expression back into uses if it fits in the address modes of all the uses. llvm-svn: 65215
- Bill Wendling authored
  llvm-svn: 65213
- Bill Wendling authored
  llvm-svn: 65211
- Chris Lattner authored
  as legality. Make load sinking and gep sinking more careful: we only do it when it won't pessimize loads from the stack. This has the added benefit of not producing code that is unanalyzable by SROA. llvm-svn: 65209
- Bill Wendling authored
  llvm-svn: 65207
- Bill Wendling authored
  prologue/epilogue. llvm-svn: 65206
- Dan Gohman authored
  that checks whether it's safe to transform a store of a bitcast value into a store of the original value (a rough sketch of that rewrite follows below). llvm-svn: 65201
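As a rough illustration of the store-of-bitcast rewrite mentioned in the entry above, here is a minimal sketch in pre-opaque-pointer LLVM IR. The function and value names are invented for the example, and the exact safety conditions are whatever the extracted helper checks, so treat this as an assumed shape rather than code from the commit.

```llvm
; Before: the stored value is a bitcast of %f.
define void @before(float %f, i32* %p) {
entry:
  %v = bitcast float %f to i32
  store i32 %v, i32* %p
  ret void
}

; After (only when the helper says it is safe): store the original value
; through a correspondingly bitcast pointer instead.
define void @after(float %f, i32* %p) {
entry:
  %q = bitcast i32* %p to float*
  store float %f, float* %q
  ret void
}
```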
Feb 20, 2009

- Evan Cheng authored
  addresses, part 1. This fixes an obvious logic bug: previously, if the only in-loop use was a PHI, it would return AllUsesAreAddresses as true. llvm-svn: 65178
- Dan Gohman authored
  llvm-svn: 65167