Commits · 03e43f8e686fc3140d912940d426409463160a07 · Roger Ferrer / llvm-epi

Aug 20, 2014

[PeepholeOptimizer] Refactor the advanced copy optimization to take advantage of · 03e43f8e

Quentin Colombet authored Aug 20, 2014

the isRegSequence property.

This is a follow-up of r215394 and r215404, which respectively introduces the
isRegSequence property and uses it for ARM.

Thanks to the property introduced by the previous commits, this patch is able
to optimize the following sequence:
vmov	d0, r2, r3
vmov	d1, r0, r1
vmov	r0, s0
vmov	r1, s2
udiv	r0, r1, r0
vmov	r1, s1
vmov	r2, s3
udiv	r1, r2, r1
vmov.32	d16[0], r0
vmov.32	d16[1], r1
vmov	r0, r1, d16
bx	lr

into:
udiv	r0, r0, r2
udiv	r1, r1, r3
vmov.32	d16[0], r0
vmov.32	d16[1], r1
vmov	r0, r1, d16
bx	lr

This patch refactors how the copy optimizations are done in the peephole
optimizer. Prior to this patch, we had one copy-related optimization that
replaced a copy or bitcast by a generic, more suitable (in terms of register
file), copy.

With this patch, the peephole optimizer features two copy-related optimizations:
1. One for rewriting generic copies to generic copies:
PeepholeOptimizer::optimizeCoalescableCopy.
2. One for replacing non-generic copies with generic copies:
PeepholeOptimizer::optimizeUncoalescableCopy.

The goals of these two optimizations are slightly different: one rewrite the
operand of the instruction (#1), the other kills off the non-generic instruction
and replace it by a (sequence of) generic instruction(s).

Both optimizations rely on the ValueTracker introduced in r212100.

The ValueTracker has been refactored to use the information from the
TargetInstrInfo for non-generic instruction. As part of the refactoring, we
switched the tracking from the index of the definition to the actual register
(virtual or physical). This one change is to provide better consistency with
register related APIs and to ease the use of the TargetInstrInfo.

Moreover, this patch introduces a new helper class CopyRewriter used to ease the
rewriting of generic copies (i.e., #1).

Finally, this patch adds a dead code elimination pass right after the peephole
optimizer to get rid of dead code that may appear after rewriting.

This is related to <rdar://problem/12702965>.

Review: http://reviews.llvm.org/D4874
llvm-svn: 216088

03e43f8e

Tweak CFGPrinter to wrap very long names. · 2223f8ed

Andrew Trick authored Aug 20, 2014

I added wrapping to the CFGPrinter a while back so the -view-cfg
output is actually viewable. I've since enountered very long mangled
names with the same problem, so I'm slightly tweaking this code to
work in that case.

llvm-svn: 216087

2223f8ed

Remove unused field. · 1c509715
Rafael Espindola authored Aug 20, 2014
```
llvm-svn: 216086
```
1c509715

Test: CoverageMapping: use "RUN: FileCheck" command instead of "RUN: cat | Filecheck". · e3b04a9f

Alex Lorenz authored Aug 20, 2014

Change the lit RUN commands for 3 tests to use the following pattern
"FileCheck -input-file ..." instead of "cat ... | FileCheck ..." as
suggested by Justin Bogner.

llvm-svn: 216085

e3b04a9f

[Fix] isl usage errors in ScheduleOptimizer · 495dd053
Johannes Doerfert authored Aug 20, 2014
```
llvm-svn: 216084
```
495dd053
Fix latent bug in try_compile macro and use CMAKE_EXE_LINKER_FLAGS · 6d2022b2
Alexey Samsonov authored Aug 20, 2014
```
when testing for supported architectures, as suggested by Andy Gibbs.

llvm-svn: 216083
```
6d2022b2
Coverage mapping: fix mapping for objective-c for statement · fdd769e8
Alex Lorenz authored Aug 20, 2014
```
llvm-svn: 216082
```
fdd769e8
Coverage mapping: fix mapping for objective-c message expression · 01a0d062
Alex Lorenz authored Aug 20, 2014
```
llvm-svn: 216081
```
01a0d062

Avoid global contstructors and place static variables inside classes as static... · 051fd7cc

Greg Clayton authored Aug 20, 2014

Avoid global contstructors and place static variables inside classes as static local variables and remove the static ivars. Subclasses should use the accessor functions.

llvm-svn: 216080

051fd7cc

[analyzer] UnixAPI: Check that the third argument to open(2) (if present) is an integer. · ba129af6
Jordan Rose authored Aug 20, 2014
```
Patch by Daniel Fahlgren.

llvm-svn: 216079
```
ba129af6
[analyzer] UnixAPI: Check when open(2) is called with more than three arguments. · cd4db5c6
Jordan Rose authored Aug 20, 2014
```
Patch by Daniel Fahlgren.

llvm-svn: 216078
```
cd4db5c6
Fix warnings about overloaded virtual functions. · b6fd112b
Greg Clayton authored Aug 20, 2014
```
llvm-svn: 216077
```
b6fd112b
[analyzer] IdenticalExpr: don't try to compare integer literals with different widths. · f3544e91
Jordan Rose authored Aug 20, 2014
```
PR20659. Patch by Anders Rönnholm.

llvm-svn: 216076
```
f3544e91
[analyzer] IdenticalExpr: use getBytes rather than getString to compare string literals. · b6100301
Jordan Rose authored Aug 20, 2014
```
PR20693. Patch by Anders Rönnholm.

llvm-svn: 216075
```
b6100301

Move Host::GetArchitecture to HostInfo::GetArchitecture. · 13b18261

Zachary Turner authored Aug 20, 2014

As a side effect, this patch also eliminates all of the
preprocessor conditionals previously used to implement
GetArchitecture().

llvm-svn: 216074

13b18261

[FastISel][AArch64] Don't fold the sign-/zero-extend from i1 into the compare. · e1bb055e

Juergen Ributzka authored Aug 20, 2014

This fixes a bug I introduced in a previous commit (r216033). Sign-/Zero-
extension from i1 cannot be folded into the ADDS/SUBS instructions. Instead both
operands have to be sign-/zero-extended with separate instructions.

Related to <rdar://problem/17913111>.

llvm-svn: 216073

e1bb055e

[clang-tidy] Allow /**/ comments on #endifs when checking header guards. · 208faaaa
Benjamin Kramer authored Aug 20, 2014
```
Turning block comments into line comments just creates unecessary churn.

llvm-svn: 216072
```
208faaaa
Quick fix for an use after free. · 061beab6
Rafael Espindola authored Aug 20, 2014
```
llvm-svn: 216071
```
061beab6
Add note to LangRef about how function arguments can be unnamed and · 2661dfc7
Dan Liew authored Aug 20, 2014
```
how this affects the numbering of unnamed temporaries.

llvm-svn: 216070
```
2661dfc7

vload/vstore: Use casts instead of scalarizing everything in CLC version · f991505d

Aaron Watry authored Aug 20, 2014



This generates bitcode which is indistinguishable from what was
hand-written for int32 types in v[load|store]_impl.ll.

v4: Use vec2+scalar for vec3 load/stores to prevent corruption (per Tom)
v3: Also remove unused generic/lib/shared/v[load|store]_impl.ll
v2: (Per Matt Arsenault) Fix alignment issues with vector load stores

Signed-off-by: Aaron Watry <awatry@gmail.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
CC: Matt Arsenault <Matthew.Arsenault@amd.com>
CC: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 216069

f991505d

Silencing a -Wcast-qual warning. NFC. · 47497258
Aaron Ballman authored Aug 20, 2014
```
llvm-svn: 216068
```
47497258

Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly... · bf6ee221

Aaron Ballman authored Aug 20, 2014

Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). NFC.

llvm-svn: 216067

bf6ee221

Optimize ZERO_EXTEND and SIGN_EXTEND in both SelectionDAG Builder and type · f841b3b7

Jiangning Liu authored Aug 20, 2014

legalization stage. With those two optimizations, fewer signed/zero extension
instructions can be inserted, and then we can expose more opportunities to
Machine CSE pass in back-end.

llvm-svn: 216066

f841b3b7

[x32] Fix FrameIndex check in SelectLEA64_32Addr · 01a4e0a1

Pavel Chupin authored Aug 20, 2014

Summary:
Fixes http://llvm.org/bugs/show_bug.cgi?id=20016 reproducible on new
lea-5.ll case.
Also use RSP/RBP for x32 lea to save 1 byte used for 0x67 prefix in
ESP/EBP case.

Test Plan: lea tests modified to include x32/nacl and new test added

Reviewers: nadav, dschuff, t.p.northover

Subscribers: llvm-commits, zinovy.nis

Differential Revision: http://reviews.llvm.org/D4929

llvm-svn: 216065

01a4e0a1

ARM: Fix codegen for rbit intrinsic · c655f0c8

Yi Kong authored Aug 20, 2014

LLVM generates illegal `rbit r0, #352` instruction for rbit intrinsic.
According to ARM ARM, rbit only takes register as argument, not immediate.
The correct instruction should be rbit <Rd>, <Rm>.

The bug was originally introduced in r211057.

Differential Revision: http://reviews.llvm.org/D4980

llvm-svn: 216064

c655f0c8

Update projects lists. · 5d536073
Bill Wendling authored Aug 20, 2014
```
llvm-svn: 216048
```
5d536073
Add libcxxabi to the projects. · e971d6fd
Bill Wendling authored Aug 20, 2014
```
llvm-svn: 216047
```
e971d6fd

InstCombine: Annotate sub with nuw when we prove it's safe · 42158f3e

David Majnemer authored Aug 20, 2014

We can prove that a 'sub' can be a 'sub nuw' if the left-hand side is
negative and the right-hand side is non-negative.

llvm-svn: 216045

42158f3e

Fix an off by 1 bug that prevented SmallPtrSet from using all of its 'small'... · 298f6380

Craig Topper authored Aug 20, 2014

Fix an off by 1 bug that prevented SmallPtrSet from using all of its 'small' capacity. Then fix the early return in the move constructor that prevented 'small' moves from clearing the NumElements in the moved from object. The directed test missed this because it was always testing large moves due to the off by 1 bug.

llvm-svn: 216044

298f6380

Constants.h: Fix possible typo in r216015. [-Wdocumentation] · b8083476
NAKAMURA Takumi authored Aug 20, 2014
```
llvm-svn: 216043
```
b8083476

[dfsan] Treat vararg custom functions like unimplemented functions. · f39430bd

Peter Collingbourne authored Aug 20, 2014

Because declarations of these functions can appear in places like autoconf
checks, they have to be handled somehow, even though we do not support
vararg custom functions. We do so by printing a warning and calling the
uninstrumented function, as we do for unimplemented functions.

llvm-svn: 216042

f39430bd

Revert rL215947: "[clang-rename] revert r215839" · de23726d
Manuel Klimek authored Aug 20, 2014
```
Make tests not depend on grep supporting -bo.

llvm-svn: 216041
```
de23726d

[FastISel][AArch64] Use the proper FMOV instruction to materialize a +0.0. · 0781b860

Juergen Ributzka authored Aug 20, 2014

Use FMOVWSr/FMOVXDr instead of FMOVSr/FMOVDr, which have the proper register
class to be used with the zero register. This makes the MachineInstruction
verifier happy again.

This is related to <rdar://problem/18027157>.

llvm-svn: 216040

0781b860

[PECOFF] Emit PE+ idata tables. · a0b988cb

Rui Ueyama authored Aug 20, 2014

Import tables in the PE+ format is an array of 64 bit numbers,
although the executable size is still limited to 4GB in PE+.

llvm-svn: 216039

a0b988cb

Objective-C [qoi]. Provide fix-it hint when sending · 19c2e2fa
Fariborz Jahanian authored Aug 19, 2014
```
class method to an object receiver. rdar://16263395

llvm-svn: 216038
```
19c2e2fa

InstCombine: Annotate sub with nsw when we prove it's safe · 57d5bc88

David Majnemer authored Aug 19, 2014

We can prove that a 'sub' can be a 'sub nsw' under certain conditions:
- The sign bits of the operands is the same.
- Both operands have more than 1 sign bit.

The subtraction cannot be a signed overflow in either case.

llvm-svn: 216037

57d5bc88

BumpPtrAllocator: don't accept 0 for the alignment parameter · fd1f0f17

Hans Wennborg authored Aug 19, 2014

It seems unnecessary to have to use an extra branch to check for this special case.

http://reviews.llvm.org/D4945

llvm-svn: 216036

fd1f0f17

Changes uint to uint32_t. · e5c1e31d
Zachary Turner authored Aug 19, 2014
```
This fixes the build broken as a result of r216026.

llvm-svn: 216034
```
e5c1e31d

[FastISel][AArch64] Factor out ADDS/SUBS instruction emission and add support... · c0886dd5

Juergen Ributzka authored Aug 19, 2014

[FastISel][AArch64] Factor out ADDS/SUBS instruction emission and add support for extensions and shift folding.

Factor out the ADDS/SUBS instruction emission code into helper functions and
make the helper functions more clever to support most of the different ADDS/SUBS
instructions the architecture support. This includes better immedediate support,
shift folding, and sign-/zero-extend folding.

This fixes <rdar://problem/17913111>.

llvm-svn: 216033

c0886dd5

Add an accessor to ValueObject that determines if the object represents a base... · a3c8f042

Enrico Granata authored Aug 19, 2014

Add an accessor to ValueObject that determines if the object represents a base class, and also returns the depth of base-class-ness. For instance if one has class C : public B {} class B : public A {}, the value for A nested in B nested in C would be a base class of depth 2

llvm-svn: 216032

a3c8f042