Commits · 0e2171904e1b1b0a95c43227617630bf5f8e7e11 · Roger Ferrer / llvm-epi

Aug 23, 2016

[lanai] Exit early in Mem Alu combiner if sentinel reach. · 0e217190

Jacques Pienaar authored Aug 23, 2016

LanaiMemAluCombiner could try to query the debug value of a list sentinel. Add check to exit early instead.

llvm-svn: 279497

0e217190

[MemorySSA] Remove unused field. NFC. · 7f414b90

George Burgess IV authored Aug 22, 2016

Given that we're not currently using blocker info, and whether or not we
will end up using it it is unclear, don't waste 8 (or 4) bytes of memory
per path node.

llvm-svn: 279493

7f414b90

[InstSimplify] add helper function for SimplifyICmpInst(); NFCI · 67bde286

Sanjay Patel authored Aug 22, 2016

And add a FIXME because the helper excludes folds for vectors. It's
not clear yet how many of these are actually testable (and therefore
necessary?) because later analysis uses computeKnownBits and other
methods to catch many of these cases.

llvm-svn: 279492

67bde286

Fix crash from assert in r279466. · 1523925d

Pete Cooper authored Aug 22, 2016

The assert in r279466 checks that we call the correct version of
Intrinsic::getName.  The version which accepts only an ID should not
be used for intrinsics with overloaded types.  The global-isel
code was calling the wrong version.  The test CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll
will ensure that we call the correct version from now on.

llvm-svn: 279487

1523925d

ADT: Separate some list manipulation API into ilist_base, NFC · 9f5c83b9

Duncan P. N. Exon Smith authored Aug 22, 2016

Separate algorithms in iplist<T> that don't depend on T into ilist_base,
and unit test them.

While I was adding unit tests for these algorithms anyway, I also added
unit tests for ilist_node_base and ilist_sentinel<T>.

To make the algorithms and unit tests easier to write, I also did the
following minor changes as a drive-by:
- encapsulate Prev/Next in ilist_node_base to so that algorithms are
  easier to read, and
- update ilist_node_access API to take nodes by reference.

There should be no real functionality change here.

llvm-svn: 279484

9f5c83b9

Fix header comment for unittests/ADT/ilistTest.cpp · 49a8ebd7
Duncan P. N. Exon Smith authored Aug 22, 2016
```
llvm-svn: 279483
```
49a8ebd7

Aug 22, 2016

[ADT] Actually mutate the iterator VisitStack.back().second, not its copy. · 608ca250

Tim Shen authored Aug 22, 2016

Summary: Before the change, *Opt never actually gets updated by the end
of toNext(), so for every next time the loop has to start over from
child_begin(). This bug doesn't affect the correctness, since Visited prevents
it from re-entering the same node again; but it's slow.

Reviewers: dberris, dblaikie, dannyb

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D23649

llvm-svn: 279482

608ca250

[InstCombine] change param type from Instruction to BinaryOperator for icmp helpers; NFCI · c9196c44
Sanjay Patel authored Aug 22, 2016
```
This saves some casting in the helper functions and eases some further refactoring.

llvm-svn: 279478
```
c9196c44

[GraphTraits] Replace all NodeType usage with NodeRef · f2187ed3

Tim Shen authored Aug 22, 2016

This should finish the GraphTraits migration.

Differential Revision: http://reviews.llvm.org/D23730

llvm-svn: 279475

f2187ed3

ADT: Remove ilist_*sentinel_traits, NFC · b29ec1e0

Duncan P. N. Exon Smith authored Aug 22, 2016

Remove all the dead code around ilist_*sentinel_traits.  This is a
follow-up to gutting them as part of r279314 (originally r278974),
staged to prevent broken builds in sub-projects.

Uses were removed from clang in r279457 and lld in r279458.

llvm-svn: 279473

b29ec1e0

[InstCombine] use m_APInt to allow icmp (shr exact X, Y), 0 folds for splat constant vectors · a3920494
Sanjay Patel authored Aug 22, 2016
```
llvm-svn: 279472
```
a3920494

Add ADT headers to the cmake headers directory for LLVMSupport. NFC. · 067ee5b5

Pete Cooper authored Aug 22, 2016

Xcode and MSVC list the headers and source files for each library.

LLVMSupport lists included the source files for ADT but not the headers.  This
add the ADT headers so that they are browsable by the UI.

llvm-svn: 279470

067ee5b5

Add comments and an assert to follow-up on r279113. NFC. · a5f8c722

Pete Cooper authored Aug 22, 2016

Philip commented on r279113 to ask for better comments as to
when to use the different versions of getName.  Its also possible
to assert in the simple case that we aren't an overloaded intrinsic
as those have to use the more capable version of getName.

Thanks for the comments Philip.

llvm-svn: 279466

a5f8c722

IDFCalculator: Remove unused field. · 775b5541
Daniel Berlin authored Aug 22, 2016
```
llvm-svn: 279465
```
775b5541

AMDGPU: Split SILowerControlFlow into two pieces · 78fc9daf

Matt Arsenault authored Aug 22, 2016

Do most of the lowering in a pre-RA pass. Keep the skip jump
insertion late, plus a few other things that require more
work to move out.

One concern I have is now there may be COPY instructions
which do not have the necessary implicit exec uses
if they will be lowered to v_mov_b32.

This has a positive effect on SGPR usage in shader-db.

llvm-svn: 279464

78fc9daf

MSSA: Factor out phi node placement · 3d512a2d
Daniel Berlin authored Aug 22, 2016
```
llvm-svn: 279462
```
3d512a2d
MSSA: Only rename accesses whose defining access is nullptr · 868381bf
Daniel Berlin authored Aug 22, 2016
```
llvm-svn: 279461
```
868381bf

[SimplifyCFG] Rewrite SinkThenElseCodeToEnd · 5bf21142

James Molloy authored Aug 22, 2016

[Recommitting now an unrelated assertion in SROA is sorted out]

The new version has several advantages:
  1) IMSHO it's more readable and neater
  2) It handles loads and stores properly
  3) It can handle any number of incoming blocks rather than just two. I'll be taking advantage of this in a followup patch.

With this change we can now finally sink load-modify-store idioms such as:

    if (a)
      return *b += 3;
    else
      return *b += 4;

    =>

    %z = load i32, i32* %y
    %.sink = select i1 %a, i32 5, i32 7
    %b = add i32 %z, %.sink
    store i32 %b, i32* %y
    ret i32 %b

When this works for switches it'll be even more powerful.

Round 4. This time we should handle all instructions correctly, and not replace any operands that need to be constant with variables.

This was really hard to determine safely, so the helper function should be put into the Instruction API. I'll do that as a followup.

llvm-svn: 279460

5bf21142

[SROA] Remove incorrect assertion · 0fee97f8

James Molloy authored Aug 22, 2016

Confirmed with aprantl, this assertion is incorrect - code can get here (for example 80-bit FP types) and if it does it's benign. This is exposed by a completely unrelated patch of mine, so stop the compiler falling over.

Original differential: http://reviews.llvm.org/D16187
aprantl's advice to remove assertion: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160815/382129.html

llvm-svn: 279454

0fee97f8

[SSP] Do not set __guard_local to hidden for OpenBSD SSP · a5cc25e5

Tim Shen authored Aug 22, 2016

__guard_local is defined as long on OpenBSD. If the source file contains
a definition of __guard_local, it mismatches with the int8 pointer type
used in LLVM. In that case, Module::getOrInsertGlobal() returns a
cast operation instead of a GlobalVariable. Trying to set the
visibility on the cast operation leads to random segfaults (seen when
compiling the OpenBSD kernel, which also runs with stack protection).

In the kernel, the hidden attribute does not matter. For userspace code,
__guard_local is defined as hidden in the startup code. If a program
re-defines __guard_local, the definition from the startup code will
either win or the linker complains about multiple definitions
(depending on whether the re-defined __guard_local is placed in the
common segment or not).

It also matches what gcc on OpenBSD does.

Thanks Stefan Kempf <sisnkemp@gmail.com> for the patch!

Differential Revision: http://reviews.llvm.org/D23674

llvm-svn: 279449

a5cc25e5

[InstCombine] Allow sinking from unique predecessor with multiple edges · ec8b8cc5

Jun Bum Lim authored Aug 22, 2016

Summary: We can allow sinking if the single user block has only one unique predecessor, regardless of the number of edges. Note that a switch statement with multiple cases can have the same destination.

Reviewers: mcrosier, majnemer, spatel, reames

Subscribers: reames, mcrosier, llvm-commits

Differential Revision: https://reviews.llvm.org/D23722

llvm-svn: 279448

ec8b8cc5

Revert "[SimplifyCFG] Rewrite SinkThenElseCodeToEnd" · 475f4a76
James Molloy authored Aug 22, 2016
```
This reverts commit r279443. It caused buildbot failures.

llvm-svn: 279447
```
475f4a76

[SimplifyCFG] Rewrite SinkThenElseCodeToEnd · 35305269

James Molloy authored Aug 22, 2016

The new version has several advantages:
  1) IMSHO it's more readable and neater
  2) It handles loads and stores properly
  3) It can handle any number of incoming blocks rather than just two. I'll be taking advantage of this in a followup patch.

With this change we can now finally sink load-modify-store idioms such as:

    if (a)
      return *b += 3;
    else
      return *b += 4;

    =>

    %z = load i32, i32* %y
    %.sink = select i1 %a, i32 5, i32 7
    %b = add i32 %z, %.sink
    store i32 %b, i32* %y
    ret i32 %b

When this works for switches it'll be even more powerful.

Round 4. This time we should handle all instructions correctly, and not replace any operands that need to be constant with variables.

This was really hard to determine safely, so the helper function should be put into the Instruction API. I'll do that as a followup.

llvm-svn: 279443

35305269

[X86][AVX] Don't use SubVectorBroadcast if there are additional users of the chain (PR29088) · c8ad5c06
Simon Pilgrim authored Aug 22, 2016
```
We could improve on this by making X86SubVBroadcast a full memory intrinsic similar to X86vzload

llvm-svn: 279441
```
c8ad5c06
Fix Gold Plugin after API change in the LTO API (constify callback type) · 6ec23331
Mehdi Amini authored Aug 22, 2016
```
llvm-svn: 279440
```
6ec23331

[mips][ias] Support .dtprel[d]word and .tprel[d]word directives · eb9ed610

Simon Atanasyan authored Aug 22, 2016

Assembler directives .dtprelword, .dtpreldword, .tprelword, and
.tpreldword generates relocations R_MIPS_TLS_DTPREL32, R_MIPS_TLS_DTPREL64,
R_MIPS_TLS_TPREL32, and R_MIPS_TLS_TPREL64 respectively.

The main motivation for this patch is to be able to write test cases
for checking correctness of the LLD linker's behaviour.

Differential Revision: https://reviews.llvm.org/D23669

llvm-svn: 279439

eb9ed610

[LTO] Constify the Module Hook function (NFC) · f8c2f08c

Mehdi Amini authored Aug 22, 2016

It use to be non-const for the sole purpose of custom handling of
commons symbol. This is moved now in the regular LTO handling now
and such we can constify the callback.

llvm-svn: 279438

f8c2f08c

Reset isUndef when removing subreg from a def operand · 673b347e
Krzysztof Parzyszek authored Aug 22, 2016
```
llvm-svn: 279437
```
673b347e
[X86] Only accept SM_SentinelUndef (-1) as an undefined shuffle mask in range · 13fa3301
Simon Pilgrim authored Aug 22, 2016
```
As discussed on D23027 we should be trying to be more strict on what is an undefined mask value.

llvm-svn: 279435
```
13fa3301
Remove missing file from r279433 reversal · a1d9a674
Artur Pilipenko authored Aug 22, 2016
```
llvm-svn: 279434
```
a1d9a674

Revert -r278267 [ValueTracking] An improvement to IR ValueTracking on Non-negative Integers · bc76ecad

Artur Pilipenko authored Aug 22, 2016

This change cause performance regression on MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt from LNT and some other bechmarks.

See https://reviews.llvm.org/D18777 for details.

llvm-svn: 279433

bc76ecad

Revert -r278269 [IndVarSimplify] Eliminate zext of a signed IV when the IV is... · b78ad9d4

Artur Pilipenko authored Aug 22, 2016

Revert -r278269 [IndVarSimplify] Eliminate zext of a signed IV when the IV is known to be non-negative

This change needs to be reverted in order to revert -r278267 which cause performance regression on MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt from LNT and some other bechmarks.

See comments on https://reviews.llvm.org/D18777 for details.

llvm-svn: 279432

b78ad9d4

[PM] Port LoopDataPrefetch AArch64 tests to new pass manager · a927aa4a

Balaram Makam authored Aug 22, 2016

Reviewers: mcrosier, tejohnson

Subscribers: aemerson, rengolin, mcrosier, llvm-commits

Differential Revision: https://reviews.llvm.org/D23724

llvm-svn: 279431

a927aa4a

[X86][SSE] Avoid specifying unused arguments in SHUFPD lowering · 2279e595

Simon Pilgrim authored Aug 22, 2016

As discussed on PR26491, we are missing the opportunity to make use of the smaller MOVHLPS instruction because we set both arguments of a SHUFPD when using it to lower a single input shuffle.

This patch sets the lowered argument to UNDEF if that shuffle element is undefined. This in turn makes it easier for target shuffle combining to decode UNDEF shuffle elements, allowing combines to MOVHLPS to occur.

A fix to match against MOVHPD stores was necessary as well.

This builds on the improved MOVLHPS/MOVHLPS lowering and memory folding support added in D16956

Adding similar support for SHUFPS will have to wait until have better support for target combining of binary shuffles.

Differential Revision: https://reviews.llvm.org/D23027

llvm-svn: 279430

2279e595

[mips][microMIPS] Implement BLTZC, BLEZC, BGEZC and BGTZC instructions, fix... · f0ed16ea

Hrvoje Varga authored Aug 22, 2016

[mips][microMIPS] Implement BLTZC, BLEZC, BGEZC and BGTZC instructions, fix disassembly and add operand checking to existing B<cond>C implementations
Differential Revision: https://reviews.llvm.org/D22667

llvm-svn: 279429

f0ed16ea

[MC] Remove guard(s). NFCI. · 80d379f2
Davide Italiano authored Aug 22, 2016
```
All the methods are already marked with
LLVM_DUMP_METHOD.

llvm-svn: 279428
```
80d379f2

[ThinLTO][X86] Fix windows build · 8738786b

Simon Pilgrim authored Aug 22, 2016

Windows 'rm' complains about non-existent files if a wildcard is used. Be more explicit about the files deleted to avoid this.

llvm-svn: 279426

8738786b

[X86] Create a new instruction format to handle 4VOp3 encoding. This saves one... · 5f8419da

Craig Topper authored Aug 22, 2016

[X86] Create a new instruction format to handle 4VOp3 encoding. This saves one bit in TSFlags and simplifies MRMSrcMem/MRMSrcReg format handling.

llvm-svn: 279424

5f8419da

[X86] Create a new instruction format to handle MemOp4 encoding. This saves... · 9b20fece

Craig Topper authored Aug 22, 2016

[X86] Create a new instruction format to handle MemOp4 encoding. This saves one bit in TSFlags and simplifies MRMSrcMem/MRMSrcReg format handling.

llvm-svn: 279423

9b20fece

[X86] Space out the encodings of X86 instruction formats. I plan to add some... · 61b62e56

Craig Topper authored Aug 22, 2016

[X86] Space out the encodings of X86 instruction formats. I plan to add some new encodings in future commits and this will reduce the size of those commits. NFC

This tries to keep all the ModRM memory and register forms in their own regions of the encodings. Hoping to make it simple on some of the switch statements that operate on these encodings.

llvm-svn: 279422

61b62e56