  1. Feb 23, 2009
      Fast-isel can't do TLS yet, so it should fall back to SDISel · 318d7376
      Dan Gohman authored
      if it sees TLS addresses.
      
      llvm-svn: 65341
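      
      The fall-back pattern described here is simple: the fast path declines any
      construct it cannot handle yet and hands the instruction to the slower,
      fully general selector. A minimal C++ sketch of that shape, using
      hypothetical names rather than LLVM's actual FastISel interface:
      
      	#include <iostream>
      	
      	// Hypothetical illustration (not LLVM's FastISel): a fast selector
      	// declines anything it cannot handle, and the driver falls back to
      	// the slower, fully general selector.
      	struct Instr { bool ReferencesTLS; };
      	
      	bool fastSelect(const Instr &I) {
      	  if (I.ReferencesTLS)
      	    return false;            // can't do TLS yet: decline, don't miscompile
      	  // ... emit machine code directly ...
      	  return true;
      	}
      	
      	void sdagSelect(const Instr &) {
      	  // ... fully general (slower) selection path ...
      	}
      	
      	void selectInstr(const Instr &I) {
      	  if (!fastSelect(I))
      	    sdagSelect(I);           // fall back for anything the fast path declined
      	}
      	
      	int main() {
      	  selectInstr(Instr{true});  // TLS reference: takes the fallback path
      	  selectInstr(Instr{false}); // handled on the fast path
      	  std::cout << "selected both instructions\n";
      	}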
      LoopDeletion needs to inform ScalarEvolution when a loop is deleted, · e591411f
      Dan Gohman authored
      so that ScalarEvolution doesn't hang onto a dangling Loop*, which
      could be a problem if another Loop happens to get allocated at the
      same address.
      
      llvm-svn: 65323
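      
      The hazard being avoided is generic: an analysis that caches results keyed
      by pointer can be fooled once the pointed-to object is freed and a new one
      lands at the same address. A hedged C++ sketch of the invalidation pattern,
      with made-up names rather than ScalarEvolution's real interface:
      
      	#include <iostream>
      	#include <map>
      	#include <string>
      	
      	struct Loop { /* loop structure elided */ };
      	
      	// Hypothetical analysis cache keyed by Loop* (not ScalarEvolution's API).
      	class LoopAnalysisCache {
      	  std::map<const Loop *, std::string> Results;
      	public:
      	  void record(const Loop *L, std::string R) { Results[L] = std::move(R); }
      	  bool has(const Loop *L) const { return Results.count(L) != 0; }
      	  // The transform must call this before freeing L; otherwise a new Loop
      	  // later allocated at the same address would inherit the stale entry.
      	  void forgetLoop(const Loop *L) { Results.erase(L); }
      	};
      	
      	void deleteLoop(Loop *L, LoopAnalysisCache &Cache) {
      	  Cache.forgetLoop(L);   // tell the analysis first...
      	  delete L;              // ...then free the loop
      	}
      	
      	int main() {
      	  LoopAnalysisCache Cache;
      	  Loop *L = new Loop;
      	  Cache.record(L, "backedge-taken count = 10");
      	  deleteLoop(L, Cache);
      	  Loop *M = new Loop;    // may land at the same address as the old L
      	  std::cout << (Cache.has(M) ? "stale result reused!\n" : "no stale result\n");
      	  delete M;
      	}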
      IndVarSimplify preserves ScalarEvolution. In the · 42987f52
      Dan Gohman authored
      -std-compile-opts sequence, this avoids the need for ScalarEvolution to
      be rerun before LoopDeletion.
      
      llvm-svn: 65318
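      
      The point of preserving an analysis is to spare later passes a
      recomputation. A toy, self-contained C++ sketch of the bookkeeping
      (illustrative only, not LLVM's PassManager): because the first pass
      declares ScalarEvolution preserved, the second pass finds it still valid.
      
      	#include <iostream>
      	#include <string>
      	#include <vector>
      	
      	// Toy pass pipeline (not LLVM's PassManager): an analysis is recomputed
      	// before a pass only if some earlier pass failed to preserve it.
      	struct Pass {
      	  std::string Name;
      	  bool NeedsSCEV;       // requires ScalarEvolution before running
      	  bool PreservesSCEV;   // leaves ScalarEvolution valid afterwards
      	};
      	
      	int main() {
      	  bool SCEVValid = false;
      	  std::vector<Pass> Pipeline = {
      	      {"IndVarSimplify", true, true},   // now preserves ScalarEvolution
      	      {"LoopDeletion",   true, false},
      	  };
      	
      	  for (const Pass &P : Pipeline) {
      	    if (P.NeedsSCEV && !SCEVValid) {
      	      std::cout << "recomputing ScalarEvolution before " << P.Name << "\n";
      	      SCEVValid = true;
      	    }
      	    std::cout << "running " << P.Name << "\n";
      	    if (!P.PreservesSCEV)
      	      SCEVValid = false;   // invalidated for whatever runs next
      	  }
      	}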
      Should reset DBI_Prev if DBI_Next == 0. · 3a86bcf1
      Zhou Sheng authored
      llvm-svn: 65314
      Only v1i16 (i.e. __m64) is returned via RAX / RDX. · 9f8fddee
      Evan Cheng authored
      llvm-svn: 65313
      Generate better code for v8i16 shuffles on SSE2 · e684da3e
      Nate Begeman authored
      Generate better code for v16i8 shuffles on SSE2 (avoids stack)
      Generate pshufb for v8i16 and v16i8 shuffles on SSSE3 where it takes fewer uops.
      Document the shuffle-matching logic and add some FIXMEs for later cleanups.
      Add new tests that exercise the above.
      
      Examples:
      
      New:
      _shuf2:
      	pextrw	$7, %xmm0, %eax
      	punpcklqdq	%xmm1, %xmm0
      	pshuflw	$128, %xmm0, %xmm0
      	pinsrw	$2, %eax, %xmm0
      
      Old:
      _shuf2:
      	pextrw	$2, %xmm0, %eax
      	pextrw	$7, %xmm0, %ecx
      	pinsrw	$2, %ecx, %xmm0
      	pinsrw	$3, %eax, %xmm0
      	movd	%xmm1, %eax
      	pinsrw	$4, %eax, %xmm0
      	ret
      
      =========
      
      New:
      _shuf4:
      	punpcklqdq	%xmm1, %xmm0
      	pshufb	LCPI1_0, %xmm0
      
      Old:
      _shuf4:
      	pextrw	$3, %xmm0, %eax
      	movsd	%xmm1, %xmm0
      	pextrw	$3, %xmm1, %ecx
      	pinsrw	$4, %ecx, %xmm0
      	pinsrw	$5, %eax, %xmm0
      
      ========
      
      New:
      _shuf1:
      	pushl	%ebx
      	pushl	%edi
      	pushl	%esi
      	pextrw	$1, %xmm0, %eax
      	rolw	$8, %ax
      	movd	%xmm0, %ecx
      	rolw	$8, %cx
      	pextrw	$5, %xmm0, %edx
      	pextrw	$4, %xmm0, %esi
      	pextrw	$3, %xmm0, %edi
      	pextrw	$2, %xmm0, %ebx
      	movaps	%xmm0, %xmm1
      	pinsrw	$0, %ecx, %xmm1
      	pinsrw	$1, %eax, %xmm1
      	rolw	$8, %bx
      	pinsrw	$2, %ebx, %xmm1
      	rolw	$8, %di
      	pinsrw	$3, %edi, %xmm1
      	rolw	$8, %si
      	pinsrw	$4, %esi, %xmm1
      	rolw	$8, %dx
      	pinsrw	$5, %edx, %xmm1
      	pextrw	$7, %xmm0, %eax
      	rolw	$8, %ax
      	movaps	%xmm1, %xmm0
      	pinsrw	$7, %eax, %xmm0
      	popl	%esi
      	popl	%edi
      	popl	%ebx
      	ret
      
      Old:
      _shuf1:
      	subl	$252, %esp
      	movaps	%xmm0, (%esp)
      	movaps	%xmm0, 16(%esp)
      	movaps	%xmm0, 32(%esp)
      	movaps	%xmm0, 48(%esp)
      	movaps	%xmm0, 64(%esp)
      	movaps	%xmm0, 80(%esp)
      	movaps	%xmm0, 96(%esp)
      	movaps	%xmm0, 224(%esp)
      	movaps	%xmm0, 208(%esp)
      	movaps	%xmm0, 192(%esp)
      	movaps	%xmm0, 176(%esp)
      	movaps	%xmm0, 160(%esp)
      	movaps	%xmm0, 144(%esp)
      	movaps	%xmm0, 128(%esp)
      	movaps	%xmm0, 112(%esp)
      	movzbl	14(%esp), %eax
      	movd	%eax, %xmm1
      	movzbl	22(%esp), %eax
      	movd	%eax, %xmm2
      	punpcklbw	%xmm1, %xmm2
      	movzbl	42(%esp), %eax
      	movd	%eax, %xmm1
      	movzbl	50(%esp), %eax
      	movd	%eax, %xmm3
      	punpcklbw	%xmm1, %xmm3
      	punpcklbw	%xmm2, %xmm3
      	movzbl	77(%esp), %eax
      	movd	%eax, %xmm1
      	movzbl	84(%esp), %eax
      	movd	%eax, %xmm2
      	punpcklbw	%xmm1, %xmm2
      	movzbl	104(%esp), %eax
      	movd	%eax, %xmm1
      	punpcklbw	%xmm1, %xmm0
      	punpcklbw	%xmm2, %xmm0
      	movaps	%xmm0, %xmm1
      	punpcklbw	%xmm3, %xmm1
      	movzbl	127(%esp), %eax
      	movd	%eax, %xmm0
      	movzbl	135(%esp), %eax
      	movd	%eax, %xmm2
      	punpcklbw	%xmm0, %xmm2
      	movzbl	155(%esp), %eax
      	movd	%eax, %xmm0
      	movzbl	163(%esp), %eax
      	movd	%eax, %xmm3
      	punpcklbw	%xmm0, %xmm3
      	punpcklbw	%xmm2, %xmm3
      	movzbl	188(%esp), %eax
      	movd	%eax, %xmm0
      	movzbl	197(%esp), %eax
      	movd	%eax, %xmm2
      	punpcklbw	%xmm0, %xmm2
      	movzbl	217(%esp), %eax
      	movd	%eax, %xmm4
      	movzbl	225(%esp), %eax
      	movd	%eax, %xmm0
      	punpcklbw	%xmm4, %xmm0
      	punpcklbw	%xmm2, %xmm0
      	punpcklbw	%xmm3, %xmm0
      	punpcklbw	%xmm1, %xmm0
      	addl	$252, %esp
      	ret
      
      llvm-svn: 65311
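      
      For context, the kind of shuffle these lowerings target can be written
      directly with SSE/SSSE3 intrinsics. The assumed example below (not the
      actual test case) swaps the bytes within each 16-bit lane, the same
      permutation the rolw $8 sequence in _shuf1 performs; on SSSE3 it compiles
      to a single pshufb. Build with -mssse3.
      
      	#include <tmmintrin.h>   // SSSE3: _mm_shuffle_epi8 (pshufb)
      	#include <cstdint>
      	#include <cstdio>
      	
      	// Assumed illustration: byte-swap each 16-bit lane of a v8i16 vector.
      	static __m128i byteswap_words(__m128i v) {
      	  const __m128i mask = _mm_setr_epi8(1, 0, 3, 2, 5, 4, 7, 6,
      	                                     9, 8, 11, 10, 13, 12, 15, 14);
      	  return _mm_shuffle_epi8(v, mask);   // one pshufb, no stack traffic
      	}
      	
      	int main() {
      	  __m128i v = _mm_setr_epi16(0x0102, 0x0304, 0x0506, 0x0708,
      	                             0x090A, 0x0B0C, 0x0D0E, 0x0F10);
      	  uint16_t out[8];
      	  _mm_storeu_si128(reinterpret_cast<__m128i *>(out), byteswap_words(v));
      	  for (uint16_t w : out)
      	    std::printf("%04x ", w);          // expect 0201 0403 ... 100f
      	  std::printf("\n");
      	}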
      Changed option name from inline-threshold to basic-inline-threshold because · dccfa0b2
      Mon P Wang authored
      the inline-threshold option is used by the inliner.
      
      llvm-svn: 65309
      fix some typos that Duncan noticed · d5420f09
      Chris Lattner authored
      llvm-svn: 65306
      Propagate debug loc info through prologue/epilogue. · 9ee052bc
      Bill Wendling authored
      llvm-svn: 65298
      Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR · 9d31aca6
      Scott Michel authored
      instruction. The class also consolidates the constant-splat detection code
      that's shared across the PowerPC and CellSPU backends (and might be
      useful for other backends). Also introduces SelectionDAG::getBUILD_VECTOR() for
      generating new BUILD_VECTOR nodes.
      
      llvm-svn: 65296
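      
      The splat detection being consolidated reduces to one question: is every
      element of the BUILD_VECTOR the same constant? A standalone C++ sketch of
      the simplest form of that check (illustrative only; the real
      BuildVectorSDNode logic also has to cope with undef elements and different
      splat element sizes):
      
      	#include <cstdint>
      	#include <iostream>
      	#include <vector>
      	
      	// Illustrative splat check, not the real BuildVectorSDNode interface.
      	static bool isConstantSplat(const std::vector<uint64_t> &Elts,
      	                            uint64_t &SplatValue) {
      	  if (Elts.empty())
      	    return false;
      	  for (uint64_t E : Elts)
      	    if (E != Elts[0])
      	      return false;              // mixed elements: not a splat
      	  SplatValue = Elts[0];
      	  return true;
      	}
      	
      	int main() {
      	  uint64_t V = 0;
      	  std::cout << isConstantSplat({7, 7, 7, 7}, V) << " " << V << "\n";  // 1 7
      	  std::cout << isConstantSplat({7, 7, 8, 7}, V) << "\n";              // 0
      	}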
  2. Feb 22, 2009
      Revert the part of 64623 that attempted to align the source in a · 648c5e9c
      Dan Gohman authored
      memcpy to match the alignment of the destination. It isn't necessary
      for getting loads and stores handled like the SSE loadu/storeu
      intrinsics, and it was causing a performance regression in
      MultiSource/Applications/JM/lencod.
      
      The problem appears to have been a memcpy that copies from some
      highly aligned array into an alloca; the alloca was then being
      assigned a large alignment, which required codegen to perform
      dynamic stack-pointer re-alignment, which forced the enclosing
      function to have a frame pointer, which led to increased spilling.
      
      llvm-svn: 65289
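      
      A sketch of the scenario described above, with assumed code and an assumed
      32-byte source alignment (this is not the lencod source): copying from a
      highly aligned array into an ordinary local. Raising the local's alignment
      to match the source is what forced dynamic stack realignment, a frame
      pointer, and the extra spills.
      
      	#include <cstdio>
      	#include <cstring>
      	
      	// Assumed illustration of the pattern in the commit message.
      	alignas(32) static const char Table[256] = {1, 2, 3};   // highly aligned source
      	
      	int f() {
      	  char buf[256];   // ordinary alloca; needs no special alignment of its own
      	  // Giving buf 32-byte alignment just because Table has it would require
      	  // dynamically realigning the stack in f(), which in turn pins a frame
      	  // pointer and increases register pressure -- the regression reverted here.
      	  std::memcpy(buf, Table, sizeof buf);
      	  return buf[0] + buf[255];
      	}
      	
      	int main() { std::printf("%d\n", f()); }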
      Properly parenthesize this expression, fixing a real bug in the new · f394e58a
      Dan Gohman authored
      -full-lsr code, as well as a GCC warning.
      
      llvm-svn: 65288
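      
      The class of bug being fixed is the usual precedence trap that GCC's
      -Wparentheses flags. A hypothetical example of the shape (not the actual
      -full-lsr expression):
      
      	#include <iostream>
      	
      	int main() {
      	  unsigned Flags = 0x6, Mask = 0x2, Expected = 0x2;
      	
      	  // Intended: compare the masked bits against Expected.
      	  bool wrong = Flags & Mask == Expected;    // parses as Flags & (Mask == Expected)
      	  bool right = (Flags & Mask) == Expected;  // what was meant
      	
      	  std::cout << wrong << " vs " << right << "\n";   // prints "0 vs 1"
      	}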