Commits · 4f8f307c77fa5b4de2eec8868b8734b2ab93dd22 · Lorenzo Albano / LLVM bpEVL

Jan 17, 2015

[PM] Split the LoopInfo object apart from the legacy pass, creating · 4f8f307c

Chandler Carruth authored Jan 17, 2015

a LoopInfoWrapperPass to wire the object up to the legacy pass manager.

This switches all the clients of LoopInfo over and paves the way to port
LoopInfo to the new pass manager. No functionality change is intended
with this iteration.

llvm-svn: 226373

4f8f307c

[PowerPC] Don't list R11 as a patchpoint scratch register · c19805a7

Hal Finkel authored Jan 17, 2015

R11's status is the same under both the PPC64 ELF V1 and V2 ABIs: it is
reserved for use as an "environment pointer" for compilation models that
require such a thing. We don't, we also don't need a second scratch register,
and because we support only "local" patchpoint call targets, we might as well
let R11 be used for anyregcc patchpoints.

llvm-svn: 226369

c19805a7

Improve DAG combine pass on certain IR vector patterns · 37f316af

Mehdi Amini authored Jan 17, 2015

Loading 2 2x32-bit float vectors into the bottom half of a 256-bit vector
produced suboptimal code in AVX2 mode with certain IR combinations.

In particular, the IR optimizer folded 2f32 + 2f32 -> 4f32, 4f32 + 4f32
(undef) -> 8f32 into a 2f32 + 2f32 -> 8f32, which seems more canonical,
but then mysteriously generated rather bad code; the movq/movhpd combination
didn't match.

The problem lay in the BUILD_VECTOR optimization path. The 2f32 inputs
would get promoted to 4f32 by the type legalizer, eventually resulting
in a BUILD_VECTOR on two 4f32 into an 8f32. The BUILD_VECTOR then, recognizing
these were both half the output size, concatted them and then produced
a shuffle. However, the resulting concat + shuffle was more complex than
it should be; in the case where the upper half of the output is undef, we
probably want to generate shuffle + concat instead.

This enhancement causes the vector_shuffle combine step to recognize this
suboptimal pattern and correct it. I included it there instead of in BUILD_VECTOR
in case the same suboptimal pattern occurs for other reasons.

This results in the optimizer correctly producing the optimal movq + movhpd
sequence for all three variations on this IR, even with AVX2.

I've included a test case.

Radar link: rdar://problem/19287012
Fix for PR 21943.

From: Fiona Glaser <fglaser@apple.com>
llvm-svn: 226360

37f316af

[RuntimeDyld] Tidy up emitCommonSymbols a little. NFC. · 2996895f
Lang Hames authored Jan 17, 2015
```
llvm-svn: 226358
```
2996895f
Remove std::move that was preventing return value optimization. · 73d06526
Richard Trieu authored Jan 17, 2015
```
llvm-svn: 226356
```
73d06526
RegisterCoalescer: Cleanup and improved comment for a subtle detail. · 7618b2b2
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226353
```
7618b2b2
RegisterCoalescer: Cleanup by factoring out a common expression · 0eb940ae
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226352
```
0eb940ae

RegisterCoalescer: Cleanup comment style · e2fa0816

Matthias Braun authored Jan 17, 2015

- Consistenly put comments above the function declaration, not the
  definition. To achieve this some duplicate comments got merged and
  some comment parts describing implementation details got moved into their
  functions.
- Consistently use doxygen comments above functions.
- Do not use doxygen comments inside functions.

llvm-svn: 226351

e2fa0816

RegisterCoalescer: Drive-by typo + whitespace fix · fc6ef3a2
Matthias Braun authored Jan 17, 2015
```
llvm-svn: 226350
```
fc6ef3a2
[RuntimeDyld] Remove the brace initialization that was introduced in r226341. · 1f7eab33
Lang Hames authored Jan 17, 2015
```
Evidently MSVC doesn't like it.

llvm-svn: 226349
```
1f7eab33

Update a comment · 287987ca

Philip Reames authored Jan 16, 2015

Be a bit more explicit about the fact that addrspace(1) is not reserved.

llvm-svn: 226344

287987ca

clang-format all the GC related files (NFC) · 36319538
Philip Reames authored Jan 16, 2015
```
Nothing interesting here...

llvm-svn: 226342
```
36319538

[RuntimeDyld] Track symbol visibility in RuntimeDyld. · 6bfd3980

Lang Hames authored Jan 16, 2015

RuntimeDyld symbol info previously consisted of just a Section/Offset pair. This
patch replaces that pair type with a SymbolInfo class that also tracks symbol
visibility. A new method, RuntimeDyld::getExportedSymbolLoadAddress, is
introduced which only returns a non-zero result for exported symbols. For
non-exported or non-existant symbols this method will return zero. The
RuntimeDyld::getSymbolAddress method retains its current behavior, returning
non-zero results for all symbols regardless of visibility.

No in-tree clients of RuntimeDyld are changed. The newly introduced
functionality will be used by the Orc APIs.

No test case: Since this patch doesn't modify the behavior for any in-tree
clients we don't have a good tool to test this with yet. Once Orc is in we can
use it to write regression tests that test these changes.

llvm-svn: 226341

6bfd3980

Jan 16, 2015

Fix the Archive::Child::getRawSize() method used by llvm-objdump’s -archive-headers option · c1271893
Kevin Enderby authored Jan 16, 2015
```
and tweak its use in llvm-objdump.  Add back the test case for the -archive-headers option.

llvm-svn: 226332
```
c1271893
[Hexagon] Converting halfword to doubleword multiply intrinsics. · 823415b8
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226326
```
823415b8
[Hexagon] Converting accumulating halfword multiply intrinsics to patterns. · cd9b2769
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226324
```
cd9b2769

[Hexagon] Beginning converting intrinsics to patterns instead of duplicated... · 3b047e0e

Colin LeMahieu authored Jan 16, 2015

[Hexagon] Beginning converting intrinsics to patterns instead of duplicated definitions.  Converting halfword multiply intrinsics.

llvm-svn: 226318

3b047e0e

[Hexagon] Fix 226309, replacement atomic store patterns didn't actually exist, added new versions. · 54adb6a5
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226315
```
54adb6a5
X86: fix comment typo in AsmParser · c3f8ad3e
Saleem Abdulrasool authored Jan 16, 2015
```
Fix a typo.  NFC.

llvm-svn: 226313
```
c3f8ad3e

Move ownership of GCStrategy objects to LLVMContext · 2b453958

Philip Reames authored Jan 16, 2015

Note: This change ended up being slightly more controversial than expected. Chandler has tentatively okayed this for the moment, but I may be revisiting this in the near future after we settle some high level questions.

Rather than have the GCStrategy object owned by the GCModuleInfo - which is an immutable analysis pass used mainly by gc.root - have it be owned by the LLVMContext. This simplifies the ownership logic (i.e. can you have two instances of the same strategy at once?), but more importantly, allows us to access the GCStrategy in the middle end optimizer. To this end, I add an accessor through Function which becomes the canonical way to get at a GCStrategy instance.

In the near future, this will allows me to move some of the checks from http://reviews.llvm.org/D6808 into the Verifier itself, and to introduce optimization legality predicates for some of the recent additions to InstCombine. (These will follow as separate changes.)

Differential Revision: http://reviews.llvm.org/D6811

llvm-svn: 226311

2b453958

[Hexagon] Removing old duplicate atomic load/store patterns. · bb6718b3
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226309
```
bb6718b3

Remove gc.root's findCustomSafePoints mechanism · 7de640a8

Philip Reames authored Jan 16, 2015

Searching all of the existing gc.root implementations I'm aware of (all three of them), there was exactly one use of this mechanism, and that was to implement a performance improvement that should have been applied to the default lowering.

Having this function is requiring a dependency on a CodeGen class (MachineFunction), in a class which is otherwise completely independent of CodeGen. I could solve this differently, but given that I see absolutely no value in preserving this mechanism, I going to just get rid of it.

Note: Tis is the first time I'm intentionally breaking previously supported gc.root functionality. Given 3.6 has branched, I believe this is a good time to do this.

Differential Revision: http://reviews.llvm.org/D7004

llvm-svn: 226305

7de640a8

[Hexagon] Converting old patterns to new versions using classes. · 7d1f6323
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226304
```
7d1f6323

[AVX512] Add intrinsics for masked aligned FP loads and stores · 3e8b22bc

Adam Nemet authored Jan 16, 2015

Similar to the unaligned cases.

Test was generated with update_llc_test_checks.py.

Part of <rdar://problem/17688758>

llvm-svn: 226296

3e8b22bc

IR: Allow 16-bits for column info · 2f5bb313
Duncan P. N. Exon Smith authored Jan 16, 2015
```
Raise the limit for column information from 8 bits to 16 bits.

llvm-svn: 226291
```
2f5bb313

IR: Cleanup dead code, NFC · c9cddb08

Duncan P. N. Exon Smith authored Jan 16, 2015

Line/column fixups already exist in `MDLocation`.  Delete the duplicated
logic in `DebugLoc`.

llvm-svn: 226290

c9cddb08

[Hexagon] Updating call/jump instruction patterns. · 2e3a26de
Colin LeMahieu authored Jan 16, 2015
```
llvm-svn: 226288
```
2e3a26de

[X86][DAG] Disable target specific combine on INSERTPS dag nodes at -O0. · ae47bc6a

Andrea Di Biagio authored Jan 16, 2015

This patch disables target specific combine on X86ISD::INSERTPS dag nodes
if optlevel is CodeGenOpt::None.

The backend currently implements a target specific combine rule that converts
a vector load used by an INSERTPS dag node into a scalar load plus a
scalar_to_vector. This allows ISel to select a single INSERTPSrm instead of
two instructions (i.e. a vector load plus INSERTPSrr).

However, the existing target combine rule on INSERTPS nodes only works under
the assumption that ISel will always be able to match an INSERTPSrm. This is
not true in general at -O0, since the backend only allows folding a load into
the memory operand of an instruction if the optimization level is not
CodeGenOpt::None.

In the example below:

//
__m128 test(__m128 a, __m128 *b) {
  __m128 c = _mm_insert_ps(a, *b, 1 << 6);
  return c;
}
//

Before this patch, at -O0, the backend would have canonicalized the load to 'b'
into a scalar load plus scalar_to_vector. Later on, ISel would have selected an
INSERTPSrr leaving the insertps mask in an inconsistent state:

  movss 4(%rdi), %xmm1
  insertps  $64, %xmm1, %xmm0 # xmm0 = xmm1[1],xmm0[1,2,3].

With this patch, the backend avoids folding the vector load into the operand of
the INSERTPS. The new codegen at -O0 is:

  movaps (%rdi), %xmm1
  insertps  $64, %xmm1, %xmm0 # %xmm1[1],xmm0[1,2,3].

llvm-svn: 226277

ae47bc6a

[mips] Remove a redundant semicolon and add space before curly brackets. NFC. · f476200c
Toma Tabacu authored Jan 16, 2015
```
llvm-svn: 226269
```
f476200c
Revert r226242 - Revert Revert Don't create new comdats in CodeGen · 60b72136
Timur Iskhodzhanov authored Jan 16, 2015
```
This breaks AddressSanitizer (ninja check-asan) on Windows

llvm-svn: 226251
```
60b72136

[PowerPC] Adjust PatchPoints for ppc64le · 52f7c018

Hal Finkel authored Jan 16, 2015

Bill Schmidt pointed out that some adjustments would be needed to properly
support powerpc64le (using the ELF V2 ABI). For one thing, R11 is not available
as a scratch register, so we need to use R12. R12 is also available under ELF
V1, so to maintain consistency, I flipped the order to make R12 the first
scratch register in the array under both ABIs.

llvm-svn: 226247

52f7c018

Fix Reassociate handling of constant in presence of undef float · 590a2700
Mehdi Amini authored Jan 16, 2015
```
http://reviews.llvm.org/D6993

llvm-svn: 226245
```
590a2700

Revert "Revert Don't create new comdats in CodeGen" · 67a79e72

Rafael Espindola authored Jan 16, 2015

This reverts commit r226173, adding r226038 back.

No change in this commit, but clang was changed to also produce trivial comdats for
costructors, destructors and vtables when needed.

Original message:

Don't create new comdats in CodeGen.

This patch stops the implicit creation of comdats during codegen.

Clang now sets the comdat explicitly when it is required. With this patch clang and gcc
now produce the same result in pr19848.

llvm-svn: 226242

67a79e72

Add a new pass "inductive range check elimination" · a1837a34

Sanjoy Das authored Jan 16, 2015

IRCE eliminates range checks of the form

  0 <= A * I + B < Length

by splitting a loop's iteration space into three segments in a way
that the check is completely redundant in the middle segment.  As an
example, IRCE will convert

  len = < known positive >
  for (i = 0; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }

to

  len = < known positive >
  limit = smin(n, len)
  // no first segment
  for (i = 0; i < limit; i++) {
    if (0 <= i && i < len) { // this check is fully redundant
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }
  for (i = limit; i < n; i++) {
    if (0 <= i && i < len) {
      do_something();
    } else {
      throw_out_of_bounds();
    }
  }


IRCE can deal with multiple range checks in the same loop (it takes
the intersection of the ranges that will make each of them redundant
individually).

Currently IRCE does not do any profitability analysis.  That is a
TODO.

Please note that the status of this pass is *experimental*, and it is
not part of any default pass pipeline.  Having said that, I will love
to get feedback and general input from people interested in trying
this out.

This pass was originally r226201.  It was reverted because it used C++
features not supported by MSVC 2012.

Differential Revision: http://reviews.llvm.org/D6693

llvm-svn: 226238

a1837a34

This should fix the build bot clang-cmake-armv7-a15-full failing on · a975d4df
Kevin Enderby authored Jan 16, 2015
```
the macho-archive-headers.test added with r226228.

llvm-svn: 226232
```
a975d4df
R600/SI: Add patterns for v_cvt_{flr|rpi}_i32_f32 · eeb2a7e6
Matt Arsenault authored Jan 15, 2015
```
llvm-svn: 226230
```
eeb2a7e6
Fix edge case when Start overflowed in 32 bit mode · c552c9ab
Filipe Cabecinhas authored Jan 15, 2015
```
llvm-svn: 226229
```
c552c9ab
Add the option, -archive-headers, used with -macho to print the Mach-O archive... · 13023a1a
Kevin Enderby authored Jan 15, 2015
```
Add the option, -archive-headers, used with -macho to print the Mach-O archive headers to llvm-objdump.

llvm-svn: 226228
```
13023a1a

R600/SI: Fix trailing comma with modifiers · 268757ba

Matt Arsenault authored Jan 15, 2015

Instructions with 1 operand can still use source modifiers,
so make sure we don't print an extra comma afterwards.

llvm-svn: 226226

268757ba

[Hexagon] Adding new-value store and bit reverse instructions. · cd9c4e3e
Colin LeMahieu authored Jan 15, 2015
```
llvm-svn: 226224
```
cd9c4e3e