Commits · ad34d91343bba205397955cc8e3e82f6ad99b2d8 · Lorenzo Albano / LLVM bpEVL

Jan 19, 2015

[PM] Relax asserts and always try to reconstruct loop simplify form when · ad34d913

Chandler Carruth authored Jan 19, 2015

we can while splitting critical edges.

The only code which called this and didn't require simplified loops to
be preserved is polly, and the code behaves correctly there anyways.
Without this change, it becomes really hard to share this code with the
new pass manager where things like preserving loop simplify form don't
make any sense.

If anyone discovers this code behaving incorrectly, what it *should* be
testing for is whether the loops it needs to be in simplified form are
in fact in that form. It should always be trying to preserve that form
when it exists.

llvm-svn: 226443

ad34d913

SLPVectorizer: limit the number of alias checks to reduce the runtime. · 76cb53a8

Erik Eckstein authored Jan 19, 2015

In case of blocks with many memory-accessing instructions, alias checking can take lot of time
(because calculating the memory dependencies has quadratic complexity).
I chose a limit which resulted in no changes when running the benchmarks.

llvm-svn: 226439

76cb53a8

[PowerPC] Minor correction to r226432 · c3168129

Hal Finkel authored Jan 19, 2015

We don't need to exclude patchpoints from the implicit r2 dependence in
FastISel because it is added as an implicit operand and, thus, should not
confuse that StackMap code.

By inspection / no test case.

llvm-svn: 226434

c3168129

[MIScheduler] Slightly better handling of constrainLocalCopy when both source and dest are local · 54c61ede
Michael Kuperstein authored Jan 19, 2015
```
This fixes PR21792.

Differential Revision: http://reviews.llvm.org/D6823

llvm-svn: 226433
```
54c61ede

[PowerPC] Add r2 as an operand for all calls under both PPC64 ELF V1 and V2 · af51993e

Hal Finkel authored Jan 19, 2015

Our PPC64 ELF V2 call lowering logic added r2 as an operand to all direct call
instructions in order to represent the dependency on the TOC base pointer
value. Restricting this to ELF V2, however, does not seem to make sense: calls
under ELF V1 have the same dependence, and indirect calls have an r2 dependence
just as direct ones. Make sure the dependence is noted for all calls under both
ELF V1 and ELF V2.

llvm-svn: 226432

af51993e

[x86] Change AVX512 intrinsics to take a 8-bit immediate for the comparision... · f4bf9119

Craig Topper authored Jan 19, 2015

[x86] Change AVX512 intrinsics to take a 8-bit immediate for the comparision kind instead of a 32-bit immediate. This better aligns with the emitted instruction. It also matches SSE and AVX1 equivalents. Also add auto upgrade support.

llvm-svn: 226430

f4bf9119

[PM] Lift the analyses into the interface for · 0eae1120

Chandler Carruth authored Jan 19, 2015

SplitLandingPadPredecessors and remove the Pass argument from its
interface.

Another step to the utilities being usable with both old and new pass
managers.

llvm-svn: 226426

0eae1120

Jan 18, 2015

unique_ptrify the RelInfo parameter to TargetRegistry::createMCSymbolizer · 186db431
David Blaikie authored Jan 18, 2015
```
llvm-svn: 226416
```
186db431
std::unique_ptrify the MCStreamer argument to createAsmPrinter · 9459832e
David Blaikie authored Jan 18, 2015
```
llvm-svn: 226414
```
9459832e

[PowerPC] Don't hard-code R2 as register when processing TOC relocations · 58884f9f

Hal Finkel authored Jan 18, 2015

Instructions that have high-order TOC relocations always carry R2 as their base
register, so it does not matter whether we take the register from the
instruction or just hard-code it in PPCAsmPrinter. In the future, however, we
might want to apply these relocations to instructions using a different
register, so taking the register from the instruction is a better thing to do.
No change in functionality here, however.

llvm-svn: 226403

58884f9f

[PowerPC] Add some FIXMEs for fastcc and FPR <-> GPR moves · 8ea446b6

Hal Finkel authored Jan 18, 2015

So we don't forget, once we support FPR <-> GPR moves on the P8, we'll likely
want to re-visit this part of the calling convention.

llvm-svn: 226401

8ea446b6

[PowerPC] Initial PPC64 calling-convention changes for fastcc · f81b6dd7

Hal Finkel authored Jan 18, 2015

The default calling convention specified by the PPC64 ELF (V1 and V2) ABI is
designed to work with both prototyped and non-prototyped/varargs functions. As
a result, GPRs and stack space are allocated for every argument, even those
that are passed in floating-point or vector registers.

GlobalOpt::OptimizeFunctions will transform local non-varargs functions (that
do not have their address taken) to use the 'fast' calling convention.

When functions are using the 'fast' calling convention, don't allocate GPRs for
arguments passed in other types of registers, and don't allocate stack space for
arguments passed in registers. Other changes for the fast calling convention
may be added in the future.

llvm-svn: 226399

f81b6dd7

[PM] Pull the analyses used for another utility routine into its API · b5797b65

Chandler Carruth authored Jan 18, 2015

rather than relying on the pass object.

This one is a bit annoying, but will pay off. First, supporting this one
will make the next one much easier, and for utilities like LoopSimplify,
this is moving them (slowly) closer to not having to pass the pass
object around throughout their APIs.

llvm-svn: 226396

b5797b65

[PM] Sink the specific analyses preserved by SplitBlock into its · 32c52c7e

Chandler Carruth authored Jan 18, 2015

interface, removing Pass from its interface.

This also makes those analyses optional so that passes which don't even
preserve these (or use them) can skip the logic entirely.

llvm-svn: 226394

32c52c7e

[PM] Replace another Pass argument with specific analyses that are · b5c11535

Chandler Carruth authored Jan 18, 2015

optionally updated by MergeBlockIntoPredecessors.

No functionality changed, just refactoring to clear the way for the new
pass manager.

llvm-svn: 226392

b5c11535

[PM] Refactor how the LoopRotation pass access the DominatorTree. · 94209094

Chandler Carruth authored Jan 18, 2015

Instead of querying the pass every where we need to, do that once and
cache a pointer in the pass object. This is both simpler and I'm about
to add yet another place where we need to dig out that pointer.

llvm-svn: 226391

94209094

[PM] Lift the actual analyses used into the inferface rather than · 5eee895c

Chandler Carruth authored Jan 18, 2015

accepting a Pass and querying it for analyses.

This is necessary to allow the utilities to work both with the old and
new pass managers, and I also think this makes the interface much more
clear and helps the reader know what analyses the utility can actually
handle. I plan to repeat this process iteratively to clean up all the
pass utilities.

llvm-svn: 226386

5eee895c

[PM] Now that LoopInfo isn't in the Pass type hierarchy, it is much · 691addc2

Chandler Carruth authored Jan 18, 2015

cleaner to derive from the generic base.

Thise removes a ton of boiler plate code and somewhat strange and
pointless indirections. It also remove a bunch of the previously needed
friend declarations. To fully remove these, I also lifted the verify
logic into the generic LoopInfoBase, which seems good anyways -- it is
generic and useful logic even for the machine side.

llvm-svn: 226385

691addc2

Jan 17, 2015

[PM] Cleanup more warnings my refactoring exposed where now we have · bc045a5a

Chandler Carruth authored Jan 17, 2015

unused variables in a no-asserts build.

I've fixed this by putting the entire loop behind an #ifndef as it
contains nothing other than asserts.

llvm-svn: 226377

bc045a5a

[PM] Remove a dead field. · 24fd029a

Chandler Carruth authored Jan 17, 2015

This was dead even before I refactored how we initialized it, but my
refactoring made it trivially dead and it is now caught by a Clang
warning. This fixes the warning and should clean up the -Werror bot
failures (sorry!).

llvm-svn: 226376

24fd029a

[PM] Split the LoopInfo object apart from the legacy pass, creating · 4f8f307c

Chandler Carruth authored Jan 17, 2015

a LoopInfoWrapperPass to wire the object up to the legacy pass manager.

This switches all the clients of LoopInfo over and paves the way to port
LoopInfo to the new pass manager. No functionality change is intended
with this iteration.

llvm-svn: 226373

4f8f307c

[PowerPC] Don't list R11 as a patchpoint scratch register · c19805a7

Hal Finkel authored Jan 17, 2015

R11's status is the same under both the PPC64 ELF V1 and V2 ABIs: it is
reserved for use as an "environment pointer" for compilation models that
require such a thing. We don't, we also don't need a second scratch register,
and because we support only "local" patchpoint call targets, we might as well
let R11 be used for anyregcc patchpoints.

llvm-svn: 226369

c19805a7

Improve DAG combine pass on certain IR vector patterns · 37f316af

Mehdi Amini authored Jan 17, 2015

Loading 2 2x32-bit float vectors into the bottom half of a 256-bit vector
produced suboptimal code in AVX2 mode with certain IR combinations.

In particular, the IR optimizer folded 2f32 + 2f32 -> 4f32, 4f32 + 4f32
(undef) -> 8f32 into a 2f32 + 2f32 -> 8f32, which seems more canonical,
but then mysteriously generated rather bad code; the movq/movhpd combination
didn't match.

The problem lay in the BUILD_VECTOR optimization path. The 2f32 inputs
would get promoted to 4f32 by the type legalizer, eventually resulting
in a BUILD_VECTOR on two 4f32 into an 8f32. The BUILD_VECTOR then, recognizing
these were both half the output size, concatted them and then produced
a shuffle. However, the resulting concat + shuffle was more complex than
it should be; in the case where the upper half of the output is undef, we
probably want to generate shuffle + concat instead.

This enhancement causes the vector_shuffle combine step to recognize this
suboptimal pattern and correct it. I included it there instead of in BUILD_VECTOR
in case the same suboptimal pattern occurs for other reasons.

This results in the optimizer correctly producing the optimal movq + movhpd
sequence for all three variations on this IR, even with AVX2.

I've included a test case.

Radar link: rdar://problem/19287012
Fix for PR 21943.

From: Fiona Glaser <fglaser@apple.com>
llvm-svn: 226360

37f316af

[RuntimeDyld] Tidy up emitCommonSymbols a little. NFC. · 2996895f
Lang Hames authored Jan 17, 2015
```
llvm-svn: 226358
```
2996895f
Remove std::move that was preventing return value optimization. · 73d06526
Richard Trieu authored Jan 17, 2015
```
llvm-svn: 226356
```
73d06526
RegisterCoalescer: Cleanup and improved comment for a subtle detail. · 7618b2b2
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226353
```
7618b2b2
RegisterCoalescer: Cleanup by factoring out a common expression · 0eb940ae
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226352
```
0eb940ae

RegisterCoalescer: Cleanup comment style · e2fa0816

Matthias Braun authored Jan 17, 2015

- Consistenly put comments above the function declaration, not the
  definition. To achieve this some duplicate comments got merged and
  some comment parts describing implementation details got moved into their
  functions.
- Consistently use doxygen comments above functions.
- Do not use doxygen comments inside functions.

llvm-svn: 226351

e2fa0816

RegisterCoalescer: Drive-by typo + whitespace fix · fc6ef3a2
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226350
```
fc6ef3a2
[RuntimeDyld] Remove the brace initialization that was introduced in r226341. · 1f7eab33
Lang Hames authored Jan 17, 2015
```
Evidently MSVC doesn't like it.

llvm-svn: 226349
```
1f7eab33

Update a comment · 287987ca

Philip Reames authored Jan 16, 2015

Be a bit more explicit about the fact that addrspace(1) is not reserved.

llvm-svn: 226344

287987ca

clang-format all the GC related files (NFC) · 36319538
Philip Reames authored Jan 16, 2015
```
Nothing interesting here...

llvm-svn: 226342
```
36319538

[RuntimeDyld] Track symbol visibility in RuntimeDyld. · 6bfd3980

Lang Hames authored Jan 16, 2015

RuntimeDyld symbol info previously consisted of just a Section/Offset pair. This
patch replaces that pair type with a SymbolInfo class that also tracks symbol
visibility. A new method, RuntimeDyld::getExportedSymbolLoadAddress, is
introduced which only returns a non-zero result for exported symbols. For
non-exported or non-existant symbols this method will return zero. The
RuntimeDyld::getSymbolAddress method retains its current behavior, returning
non-zero results for all symbols regardless of visibility.

No in-tree clients of RuntimeDyld are changed. The newly introduced
functionality will be used by the Orc APIs.

No test case: Since this patch doesn't modify the behavior for any in-tree
clients we don't have a good tool to test this with yet. Once Orc is in we can
use it to write regression tests that test these changes.

llvm-svn: 226341

6bfd3980

Jan 16, 2015

Fix the Archive::Child::getRawSize() method used by llvm-objdump’s -archive-headers option · c1271893
Kevin Enderby authored Jan 16, 2015
```
and tweak its use in llvm-objdump.  Add back the test case for the -archive-headers option.

llvm-svn: 226332
```
c1271893
[Hexagon] Converting halfword to doubleword multiply intrinsics. · 823415b8
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226326
```
823415b8
[Hexagon] Converting accumulating halfword multiply intrinsics to patterns. · cd9b2769
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226324
```
cd9b2769

[Hexagon] Beginning converting intrinsics to patterns instead of duplicated... · 3b047e0e

Colin LeMahieu authored Jan 16, 2015

[Hexagon] Beginning converting intrinsics to patterns instead of duplicated definitions.  Converting halfword multiply intrinsics.

llvm-svn: 226318

3b047e0e

[Hexagon] Fix 226309, replacement atomic store patterns didn't actually exist, added new versions. · 54adb6a5
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226315
```
54adb6a5
X86: fix comment typo in AsmParser · c3f8ad3e
Saleem Abdulrasool authored Jan 16, 2015
```
Fix a typo.  NFC.

llvm-svn: 226313
```
c3f8ad3e

Move ownership of GCStrategy objects to LLVMContext · 2b453958

Philip Reames authored Jan 16, 2015

Note: This change ended up being slightly more controversial than expected. Chandler has tentatively okayed this for the moment, but I may be revisiting this in the near future after we settle some high level questions.

Rather than have the GCStrategy object owned by the GCModuleInfo - which is an immutable analysis pass used mainly by gc.root - have it be owned by the LLVMContext. This simplifies the ownership logic (i.e. can you have two instances of the same strategy at once?), but more importantly, allows us to access the GCStrategy in the middle end optimizer. To this end, I add an accessor through Function which becomes the canonical way to get at a GCStrategy instance.

In the near future, this will allows me to move some of the checks from http://reviews.llvm.org/D6808 into the Verifier itself, and to introduce optimization legality predicates for some of the recent additions to InstCombine. (These will follow as separate changes.)

Differential Revision: http://reviews.llvm.org/D6811

llvm-svn: 226311

2b453958