- Apr 22, 2013
-
Akira Hatanaka authored
llvm-svn: 180040
-
Akira Hatanaka authored
shifted by the same amount and the shift amount is smaller than the element size. llvm-svn: 180039
-
Chad Rosier authored
now taken care of by the frontend, which allows us to parse arbitrary C/C++ variables. Part of rdar://13663589 llvm-svn: 180037
-
Reid Kleckner authored
Summary:
This is http://llvm.org/PR15802. Backslashes preceding double quotes in arguments must be escaped. The interesting bit is that all other backslashes should *not* be escaped, because the un-escaping logic is only triggered by the presence of a double quote character.

Reviewers: Bigcheese

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D705

llvm-svn: 180035
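To make the rule concrete, here is a minimal C++ sketch of quoting a single argument this way. It is illustrative only (the function name and structure are hypothetical, not taken from the patch): backslashes are doubled only when they end up preceding a double quote (including the closing quote we append), and all other backslashes are copied verbatim.

    #include <string>

    // Quote one argument for a Windows-style command line.
    // Backslashes are only escaped when followed by '"' (or by the closing
    // quote we add); everything else is emitted as-is.
    static std::string quoteWindowsArg(const std::string &Arg) {
      std::string Out = "\"";
      size_t Backslashes = 0;
      for (char C : Arg) {
        if (C == '\\') {
          ++Backslashes;                      // defer until we see what follows
        } else if (C == '"') {
          // Double the pending backslashes and escape the quote itself.
          Out.append(Backslashes * 2 + 1, '\\');
          Out.push_back('"');
          Backslashes = 0;
        } else {
          // Backslashes not followed by a quote are copied unchanged.
          Out.append(Backslashes, '\\');
          Out.push_back(C);
          Backslashes = 0;
        }
      }
      // Trailing backslashes precede our closing quote, so double them.
      Out.append(Backslashes * 2, '\\');
      Out.push_back('"');
      return Out;
    }

For example, quoteWindowsArg("C:\path\") would yield "C:\path\\" (trailing backslash doubled before the closing quote), while a backslash in the middle of the argument stays single.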
-
Peter Collingbourne authored
Differential Revision: http://llvm-reviews.chandlerc.com/D700 llvm-svn: 180034
-
Stephen Lin authored
Extra paranoid test for r179925 (verify that tail calls are not generated to 'this'-returning constructors of objects with different 'this' pointers than the caller) llvm-svn: 180032
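A hedged C++ illustration of the situation this test guards against (the class names are made up, not taken from the actual test): with multiple inheritance, a non-primary base subobject sits at a non-zero offset, so the callee constructor's 'this' differs from the caller's.

    // Hypothetical classes showing why the caller's 'this' can differ
    // from the callee constructor's 'this'.
    struct A { int a; A() : a(1) {} };
    struct B { int b; B() : b(2) {} };

    // The B subobject lives at a non-zero offset inside C, so C::C() invokes
    // B::B() with an adjusted 'this'. On ABIs where constructors return
    // 'this' (e.g. ARM), that call must not become a tail call that simply
    // forwards C's own incoming 'this' pointer.
    struct C : A, B {
      C();
    };
    C::C() {}  // implicitly calls A::A() on 'this' and B::B() on 'this' + offset

    int main() {
      C c;
      return c.b;  // 2
    }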
-
Eli Bendersky authored
llvm-svn: 180031
-
Eli Bendersky authored
llvm-svn: 180030
-
Chad Rosier authored
change intended. Part of rdar://13663589 llvm-svn: 180028
-
Jia Liu authored
llvm-svn: 180025
-
Jia Liu authored
llvm-svn: 180023
-
Benjamin Kramer authored
Found by -Wdocumentation. llvm-svn: 180021
-
Rafael Espindola authored
llvm-svn: 180020
-
Rafael Espindola authored
Also add a check for llvm.used in the verifier and simplify clients now that they can assume they have a ConstantArray. llvm-svn: 180019
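A minimal sketch (illustrative, not code from the patch) of the simplification this enables on the client side: with the verifier guaranteeing that @llvm.used's initializer is a ConstantArray, a client can cast unconditionally instead of handling other constant forms.

    #include "llvm/ADT/SmallPtrSet.h"
    #include "llvm/IR/Constants.h"
    #include "llvm/IR/GlobalVariable.h"
    #include "llvm/IR/Module.h"

    using namespace llvm;

    // Walk @llvm.used, relying on the invariant the new verifier check
    // enforces: if @llvm.used exists, its initializer is a ConstantArray of
    // (possibly bitcast) pointers to the used globals.
    static void collectUsedGlobals(Module &M,
                                   SmallPtrSet<GlobalValue *, 8> &Used) {
      GlobalVariable *GV = M.getGlobalVariable("llvm.used");
      if (!GV || !GV->hasInitializer())
        return;
      ConstantArray *Init = cast<ConstantArray>(GV->getInitializer());
      for (unsigned I = 0, E = Init->getNumOperands(); I != E; ++I)
        if (GlobalValue *G =
                dyn_cast<GlobalValue>(Init->getOperand(I)->stripPointerCasts()))
          Used.insert(G);
    }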
-
Eric Christopher authored
set below. llvm-svn: 180015
-
Eric Christopher authored
llvm-svn: 180014
-
Eric Christopher authored
llvm-svn: 180013
-
Stepan Dyatkovskiy authored
-- C.4 and C.5 statements, when NSAA is not equal to SP.
-- C.1.cp statement for VA functions. Note: there are no VFP CPRCs in a variadic procedure.

Before this patch, "NSAA != 0" meant "don't use GPRs anymore". But there are some exceptions in the AAPCS:
1. For a non-VA function: allocate all VFP regs for CPRCs. When all VFPs are allocated, CPRCs are sent to the stack, while non-CPRCs may still be allocated in GPRs.
2. Check that for VA functions all params use GPRs and then the stack. No exceptions, no CPRCs here.

llvm-svn: 180011
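As a hedged source-level illustration of the distinction (example code, not from the patch): in a non-variadic call the double arguments are VFP CPRC candidates and travel in d registers, while in a variadic call they are passed like integers, in GPRs and on the stack.

    #include <cstdarg>

    // Illustrative only: parameter passing per the AAPCS VFP variant.
    // In the non-variadic callee, a and b arrive in d0/d1 as VFP CPRCs.
    double fixed_args(double a, double b) { return a + b; }

    // In a variadic callee there are no VFP CPRCs: the anonymous doubles are
    // passed in GPR pairs / on the stack, which is what this patch enforces.
    double variadic_args(int n, ...) {
      va_list ap;
      va_start(ap, n);
      double sum = 0;
      for (int i = 0; i < n; ++i)
        sum += va_arg(ap, double);
      va_end(ap);
      return sum;
    }

    double caller(double x, double y) {
      return fixed_args(x, y) + variadic_args(2, x, y);
    }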
-
Eric Christopher authored
and then dumping as tests. llvm-svn: 180010
-
Eric Christopher authored
other mach-o object file as well. TODO: One interface to rule them all. llvm-svn: 180009
-
Eric Christopher authored
llvm-svn: 180008
-
Nico Rieck authored
llvm-svn: 180007
-
Nico Rieck authored
llvm-svn: 180006
-
Nico Rieck authored
llvm-svn: 180005
-
Arnaud A. de Grandmaison authored
llvm-svn: 180003
-
Eric Christopher authored
llvm-svn: 180000
-
Eric Christopher authored
llvm-svn: 179999
-
David Blaikie authored
This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll. I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996
-
Craig Topper authored
llvm-svn: 179995
-
Craig Topper authored
llvm-svn: 179994
-
Craig Topper authored
llvm-svn: 179993
-
Craig Topper authored
llvm-svn: 179991
-
Jim Grosbach authored
Rather than just splitting the input type and hoping for the best, apply a bit more cleverness. Just splitting the types until the source is legal often leads to an illegal result type, which is then widened and a scalarization step is introduced, which leads to truly horrible code generation. With the loop vectorizer, these sorts of operations are much more common, and so it's worth extra effort to do them well.

Add a legalization hook for the operands of a TRUNCATE node, which will be encountered after the result type has been legalized, but if the operand type is still illegal. If simple splitting of both types ends up with the result type of each half still being legal, just do that (v16i16 -> v16i8 on ARM, for example). If, however, that would result in an illegal result type (v8i32 -> v8i8 on ARM, for example), we can get more clever with power-two vectors. Specifically, split the input type, but also widen the result element size, then concatenate the halves and truncate again. For example on ARM, to perform a "%res = v8i8 trunc v8i32 %in" we transform to:

    %inlo = v4i32 extract_subvector %in, 0
    %inhi = v4i32 extract_subvector %in, 4
    %lo16 = v4i16 trunc v4i32 %inlo
    %hi16 = v4i16 trunc v4i32 %inhi
    %in16 = v8i16 concat_vectors v4i16 %lo16, v4i16 %hi16
    %res = v8i8 trunc v8i16 %in16

This allows instruction selection to generate three VMOVN instructions instead of a sequence of moves, stores and loads.

Update the ARMTargetTransformInfo to take this improved legalization into account.

Consider the simplified IR:

    define <16 x i8> @test1(<16 x i32>* %ap) {
      %a = load <16 x i32>* %ap
      %tmp = trunc <16 x i32> %a to <16 x i8>
      ret <16 x i8> %tmp
    }

    define <8 x i8> @test2(<8 x i32>* %ap) {
      %a = load <8 x i32>* %ap
      %tmp = trunc <8 x i32> %a to <8 x i8>
      ret <8 x i8> %tmp
    }

Previously, we would generate the truly hideous:

        .syntax unified
        .section __TEXT,__text,regular,pure_instructions
        .globl  _test1
        .align  2
    _test1:                                 @ @test1
    @ BB#0:
        push    {r7}
        mov     r7, sp
        sub     sp, sp, #20
        bic     sp, sp, #7
        add     r1, r0, #48
        add     r2, r0, #32
        vld1.64 {d24, d25}, [r0:128]
        vld1.64 {d16, d17}, [r1:128]
        vld1.64 {d18, d19}, [r2:128]
        add     r1, r0, #16
        vmovn.i32 d22, q8
        vld1.64 {d16, d17}, [r1:128]
        vmovn.i32 d20, q9
        vmovn.i32 d18, q12
        vmov.u16 r0, d22[3]
        strb    r0, [sp, #15]
        vmov.u16 r0, d22[2]
        strb    r0, [sp, #14]
        vmov.u16 r0, d22[1]
        strb    r0, [sp, #13]
        vmov.u16 r0, d22[0]
        vmovn.i32 d16, q8
        strb    r0, [sp, #12]
        vmov.u16 r0, d20[3]
        strb    r0, [sp, #11]
        vmov.u16 r0, d20[2]
        strb    r0, [sp, #10]
        vmov.u16 r0, d20[1]
        strb    r0, [sp, #9]
        vmov.u16 r0, d20[0]
        strb    r0, [sp, #8]
        vmov.u16 r0, d18[3]
        strb    r0, [sp, #3]
        vmov.u16 r0, d18[2]
        strb    r0, [sp, #2]
        vmov.u16 r0, d18[1]
        strb    r0, [sp, #1]
        vmov.u16 r0, d18[0]
        strb    r0, [sp]
        vmov.u16 r0, d16[3]
        strb    r0, [sp, #7]
        vmov.u16 r0, d16[2]
        strb    r0, [sp, #6]
        vmov.u16 r0, d16[1]
        strb    r0, [sp, #5]
        vmov.u16 r0, d16[0]
        strb    r0, [sp, #4]
        vldmia  sp, {d16, d17}
        vmov    r0, r1, d16
        vmov    r2, r3, d17
        mov     sp, r7
        pop     {r7}
        bx      lr

        .globl  _test2
        .align  2
    _test2:                                 @ @test2
    @ BB#0:
        push    {r7}
        mov     r7, sp
        sub     sp, sp, #12
        bic     sp, sp, #7
        vld1.64 {d16, d17}, [r0:128]
        add     r0, r0, #16
        vld1.64 {d20, d21}, [r0:128]
        vmovn.i32 d18, q8
        vmov.u16 r0, d18[3]
        vmovn.i32 d16, q10
        strb    r0, [sp, #3]
        vmov.u16 r0, d18[2]
        strb    r0, [sp, #2]
        vmov.u16 r0, d18[1]
        strb    r0, [sp, #1]
        vmov.u16 r0, d18[0]
        strb    r0, [sp]
        vmov.u16 r0, d16[3]
        strb    r0, [sp, #7]
        vmov.u16 r0, d16[2]
        strb    r0, [sp, #6]
        vmov.u16 r0, d16[1]
        strb    r0, [sp, #5]
        vmov.u16 r0, d16[0]
        strb    r0, [sp, #4]
        ldm     sp, {r0, r1}
        mov     sp, r7
        pop     {r7}
        bx      lr

Now, however, we generate the much more straightforward:

        .syntax unified
        .section __TEXT,__text,regular,pure_instructions
        .globl  _test1
        .align  2
    _test1:                                 @ @test1
    @ BB#0:
        add     r1, r0, #48
        add     r2, r0, #32
        vld1.64 {d20, d21}, [r0:128]
        vld1.64 {d16, d17}, [r1:128]
        add     r1, r0, #16
        vld1.64 {d18, d19}, [r2:128]
        vld1.64 {d22, d23}, [r1:128]
        vmovn.i32 d17, q8
        vmovn.i32 d16, q9
        vmovn.i32 d18, q10
        vmovn.i32 d19, q11
        vmovn.i16 d17, q8
        vmovn.i16 d16, q9
        vmov    r0, r1, d16
        vmov    r2, r3, d17
        bx      lr

        .globl  _test2
        .align  2
    _test2:                                 @ @test2
    @ BB#0:
        vld1.64 {d16, d17}, [r0:128]
        add     r0, r0, #16
        vld1.64 {d18, d19}, [r0:128]
        vmovn.i32 d16, q8
        vmovn.i32 d17, q9
        vmovn.i16 d16, q8
        vmov    r0, r1, d16
        bx      lr

llvm-svn: 179989
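For context (an assumed example, not from the commit), the kind of source that makes the loop vectorizer produce exactly these wide-to-narrow vector truncates is a simple narrowing copy loop:

    // After vectorization by a factor of 8 or 16, the i32 -> i8 conversion in
    // this loop becomes a <8 x i32> -> <8 x i8> (or <16 x ...>) trunc node,
    // which is the case the new operand-splitting legalization handles.
    void narrow_copy(unsigned char *dst, const int *src, int n) {
      for (int i = 0; i < n; ++i)
        dst[i] = (unsigned char)src[i];
    }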
-
Jim Grosbach authored
They had a separate RUN line already, so may as well be in a separate file. llvm-svn: 179988
-
- Apr 21, 2013
-
Jakob Stoklund Olesen authored
Arguments after the fixed arguments never use the floating point registers. llvm-svn: 179987
-
Jim Grosbach authored
llvm-svn: 179986
-
Jakob Stoklund Olesen authored
Don't ignore the high 32 bits of the immediate. llvm-svn: 179985
-
Stephen Lin authored
llvm-svn: 179983
-
Benjamin Kramer authored
This is an edge case that can happen if we modify a chain of multiple selects. Update all operands in that case and remove the assert. PR15805. llvm-svn: 179982
-
Arnold Schwaighofer authored
There is the temptation to make this transform dependent on target information, as it is not going to be beneficial on all (sub)targets. Therefore, we should probably do this in MI Early-Ifconversion.

This reverts commit r179957. Original commit message:

"SimplifyCFG: If convert single conditional stores

This transformation will transform a conditional store with a preceding unconditional store to the same location:

    a[i] =
    may-alias with a[i] load
    if (cond)
      a[i] = Y

into an unconditional store.

    a[i] = X
    may-alias with a[i] load
    tmp = cond ? Y : X;
    a[i] = tmp

We assume that on average the cost of a mispredicted branch is going to be higher than the cost of a second store to the same location, and that the secondary benefits of creating a bigger basic block for other optimizations to work on outweigh the potential case where the branch would be correctly predicted and the cost of executing the second store would be noticeably reflected in performance.

hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With this change we are on par with gcc's performance (gcc also performs this transformation). There was a 1.2% performance improvement on an ARM Swift chip. Other tests in the test-suite+external seem to be mostly uninfluenced in my experiments: this optimization was triggered on 41 tests such that the executable was different before/after the patch. Only 1 out of the 40 tests (dealII) was reproducible below 100% (by about .4%). Given that hmmer benefits so much I believe this to be a fair trade off. I am going to watch performance numbers across the buildbots and will revert this if anything unexpected comes up."

llvm-svn: 179980
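A hedged source-level sketch of the (now reverted) transform, assuming no aliasing complications beyond the intervening load: the conditional second store is made unconditional by selecting between the new value and the value already stored.

    // Before: the second store to a[i] only happens when 'cond' is true.
    void before(int *a, const int *b, int i, int x, int y, bool cond) {
      a[i] = x;
      int t = b[i];        // load that may alias a[i], so the first store stays
      if (cond)
        a[i] = y;
      (void)t;
    }

    // After (what r179957 produced, conceptually): the branch is gone and the
    // second store executes unconditionally, trading a possibly mispredicted
    // branch for a select plus an extra write to the same location.
    void after(int *a, const int *b, int i, int x, int y, bool cond) {
      a[i] = x;
      int t = b[i];
      a[i] = cond ? y : x;
      (void)t;
    }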
-