Commits · d52cf99f534f9f30bf34efa6ca0a674383ab6594 · Roger Ferrer / llvm-epi

Oct 12, 2017

[SelectionDAG] Simplify the ISD::SIGN_EXTEND/ZERO_EXTEND handling to use less... · d52cf99f

Craig Topper authored Oct 12, 2017

[SelectionDAG] Simplify the ISD::SIGN_EXTEND/ZERO_EXTEND handling to use less temporary APInts by counting bits instead. NFCI

llvm-svn: 315628

d52cf99f

[lit] Raise the logic for enabling clang & lld substitutions to llvm. · ad5997e8

Zachary Turner authored Oct 12, 2017

This paves the way for other projects which might /use/ clang or
lld but not necessarily need to the full set of functionality
available to clang and lld tests to be able to have a basic set
of substitutions that allow a project to run the clang or lld
executables.

llvm-svn: 315627

ad5997e8

[LoopPredication] Check whether the loop is already guarded by the first iteration check condition · ead69ee4
Artur Pilipenko authored Oct 12, 2017
```
llvm-svn: 315623
```
ead69ee4

[DWARF] Fix bad comparator in sortGlobalExprs. · 48e75496

Eli Friedman authored Oct 12, 2017

The comparator passed to std::sort must provide a strict weak ordering;
otherwise, the behavior is undefined.

Fixes an assertion failure generating debug info for globals
split by GlobalOpt. I have a testcase, but not sure how to reduce it,
so not included here.  (Someone else came up with a testcase, but I
can't reproduce the crash with it, presumably because my version of LLVM
ends up sorting the array differently.)

This isn't really a complete fix (see the FIXME in the patch), but at
least it doesn't have undefined behavior.

Differential Revision: https://reviews.llvm.org/D38830

llvm-svn: 315619

48e75496

Revert "Reintroduce "[SCCP] Propagate integer range info for parameters in IPSCCP."" · 993d2e67

Bruno Cardoso Lopes authored Oct 12, 2017

This reverts commit r315593: still affect two bots:

http://lab.llvm.org:8011/builders/clang-with-lto-ubuntu/builds/5308
http://green.lab.llvm.org/green/job/clang-stage2-configure-Rlto/21751/

llvm-svn: 315618

993d2e67

[LoopPredication] Support ule, sle latch predicates · b4527e1c

Artur Pilipenko authored Oct 12, 2017

This is a follow up for the loop predication change 313981 to support ule, sle latch predicates.

Reviewed By: mkazantsev

Differential Revision: https://reviews.llvm.org/D38177

llvm-svn: 315616

b4527e1c

[X86] Add CLWB intrinsic. llvm part · 060cb437
Craig Topper authored Oct 12, 2017
```
llvm-svn: 315613
```
060cb437
Implement custom lowering for ISD::CTTZ_ZERO_UNDEF and ISD::CTTZ. · 5676acad
Wei Ding authored Oct 12, 2017
```
Differential Revision: http://reviews.llvm.org/D37348

llvm-svn: 315610
```
5676acad
AMDGPU/NFC: Move AMDGPU specific note types to ELF.h · 70303c01
Konstantin Zhuravlyov authored Oct 12, 2017
```
Differential Revision: https://reviews.llvm.org/D38747

llvm-svn: 315608
```
70303c01

[X86] Add a bunch of -mcpu strings to the cpus.ll test. · 3dc37cc5

Craig Topper authored Oct 12, 2017

We were missing most of the "core" aliases as well as skylake, cannonlake, and knights landing.

llvm-svn: 315606

3dc37cc5

[NVPTX] Implemented wmma intrinsics and instructions. · 3bafc2f0

Artem Belevich authored Oct 12, 2017

WMMA = "Warp Level Matrix Multiply-Accumulate".
These are the new instructions introduced in PTX6.0 and available
on sm_70 GPUs.

Differential Revision: https://reviews.llvm.org/D38645

llvm-svn: 315601

3bafc2f0

[codeview] Don't emit FPO data in funclet prologues · 1a7e3878
Reid Kleckner authored Oct 12, 2017
```
Attempt 3 to work around bugs in FPO data with funclets.

llvm-svn: 315600
```
1a7e3878

llvm-isel-fuzzer: Work around BUILD_SHARED_LIBS testing issues · 754a1a8a

Justin Bogner authored Oct 12, 2017

Building with BUILD_SHARED_LIBS makes it tricky to copy around
executables at will, since they won't be able to find the LLVM
libraries any more. This makes testing a feature that's based on the
executable name problematic, so we'll just disable these two tests in
that configuration.

We could potentially fix this by symlinking the lib directory into the
test directory, but that wouldn't work on windows, and losing testing
on windows would be far worse than losing testing on a configuration
that's barely even supported.

llvm-svn: 315599

754a1a8a

[TableGen] Allow intrinsics to have up to 8 return values. · 786ca6a1
Artem Belevich authored Oct 12, 2017
```
Differential Revision: https://reviews.llvm.org/D38633

llvm-svn: 315598
```
786ca6a1

Work around lack of Wine support for SetFileInformationByHandle harder · 477c974b

Hans Wennborg authored Oct 12, 2017

In r315079 I added a check for the ERROR_CALL_NOT_IMPLEMENTED error
code, but it turns out earlier versions of Wine just returned false
without setting any error code.

This patch handles the unset error code case.

llvm-svn: 315597

477c974b

AMDGPU: Fix warnings introduced in r315526 · 63e87f5a
Konstantin Zhuravlyov authored Oct 12, 2017
```
llvm-svn: 315596
```
63e87f5a
[ValueTracking] return zero when there's conflict in known bits of a shift (PR34838) · e272be7c
Sanjay Patel authored Oct 12, 2017
```
Poison allows us to return a better result than undef.

llvm-svn: 315595
```
e272be7c

Reintroduce "[SCCP] Propagate integer range info for parameters in IPSCCP." · 326fdcbf

Bruno Cardoso Lopes authored Oct 12, 2017

This is r315288 & r315294, which were reverted due to stage2 bot
failures.

Summary:
This updates the SCCP solver to use of the ValueElement lattice for
parameters, which provides integer range information. The range
information is used to remove unneeded icmp instructions.

For the following function, f() can be optimized to `ret i32 2` with
this change

  source_filename = "sccp.c"
  target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
  target triple = "x86_64-unknown-linux-gnu"

  ; Function Attrs: norecurse nounwind readnone uwtable
  define i32 @main() local_unnamed_addr #0 {
  entry:
    %call = tail call fastcc i32 @f(i32 1)
    %call1 = tail call fastcc i32 @f(i32 47)
    %add3 = add nsw i32 %call, %call1
    ret i32 %add3
  }

  ; Function Attrs: noinline norecurse nounwind readnone uwtable
  define internal fastcc i32 @f(i32 %x) unnamed_addr #1 {
  entry:
    %c1 = icmp sle i32 %x, 100

    %cmp = icmp sgt i32 %x, 300
    %. = select i1 %cmp, i32 1, i32 2
    ret i32 %.
  }

  attributes #1 = { noinline }

Reviewers: davide, sanjoy, efriedma, dberlin

Reviewed By: davide, dberlin

Subscribers: mcrosier, gberry, mssimpso, dberlin, llvm-commits

Differential Revision: https://reviews.llvm.org/D36656

llvm-svn: 315593

326fdcbf

[PowerPC] Add profitablilty check for conversion to mtctr loops · 0724fea2

Lei Huang authored Oct 12, 2017

Add profitability checks for modifying counted loops to use the mtctr instruction.

The latency of mtctr is only justified if there are more than 4 comparisons that
will be removed as a result. Usually counted loops are formed relatively early
and before unrolling, so most low trip count loops often don't survive. However
we want to ensure that if they do, we do not mistakenly update them to mtctr loops.

Use CodeMetrics to ensure we are only doing this for small loops with small trip counts.

Differential Revision: https://reviews.llvm.org/D38212

llvm-svn: 315592

0724fea2

[AMDGPU] For amdpal, widen interpolation mode workaround · c8ffffe4

Tim Renouf authored Oct 12, 2017

Summary:
The interpolation mode workaround ensures that at least one
interpolation mode is enabled in PSInputAddr. It does not also check
PSInputEna on the basis that the user might enable bits in that
depending on run-time state.

However, for amdpal os type, the user does not enable some bits after
compilation based on run-time states; the register values being
generated here are the final ones set in the hardware. Therefore, apply
the workaround to PSInputAddr and PSInputEnable together. (The case
where a bit is set in PSInputAddr but not in PSInputEnable is where the
frontend set up an input arg for a particular interpolation mode, but
nothing uses that input arg. Really we should have an earlier pass that
removes such an arg.)

Reviewers: arsenm, nhaehnle, dstuttard

Subscribers: kzhuravl, wdng, yaxunl, t-tye, llvm-commits

Differential Revision: https://reviews.llvm.org/D37758

llvm-svn: 315591

c8ffffe4

[dump] Remove NDEBUG from test to enable dump methods [NFC] · 3e0199f7

Don Hinton authored Oct 12, 2017

Summary:
Add LLVM_FORCE_ENABLE_DUMP cmake option, and use it along with
LLVM_ENABLE_ASSERTIONS to set LLVM_ENABLE_DUMP.

Remove NDEBUG and only use LLVM_ENABLE_DUMP to enable dump methods.

Move definition of LLVM_ENABLE_DUMP from config.h to llvm-config.h so
it'll be picked up by public headers.

Differential Revision: https://reviews.llvm.org/D38406

llvm-svn: 315590

3e0199f7

[x86] replace isEqualTo with == for efficiency · 3a72909b
Sanjay Patel authored Oct 12, 2017
```
This is a follow-up suggested in D37534.
Patch by Yulia Koval.

llvm-svn: 315589
```
3a72909b
[X86][SSE] Pull out repeated INSERT_VECTOR_ELT code from LowerBUILD_VECTOR... · 0903085e
Simon Pilgrim authored Oct 12, 2017
```
[X86][SSE] Pull out repeated INSERT_VECTOR_ELT code from LowerBUILD_VECTOR v16i8/v8i16 insertion. NFCI.

llvm-svn: 315587
```
0903085e

[cfi-verify] Fix typo, actually check X86 target · 1d473652

Vlad Tsyrklevich authored Oct 12, 2017

The typo in r315556 disabled the cfi-verify unit tests from building
unconditionally, have it correctly check for the X86 target.

llvm-svn: 315581

1d473652

MachineInstr: Make isEqual agree with getHashValue in MachineInstrExpressionTrait · 4a5f522d

Diana Picus authored Oct 12, 2017

MachineInstr::isIdenticalTo has a lot of logic for dealing with register
Defs (i.e. deciding whether to take them into account or ignore them).
This logic gets things wrong in some obscure cases, for instance if an
operand is not a Def for both the current MI and the one we are
comparing to.

I'm not sure if it's possible for this to happen for regular register
operands, but it may happen in the ARM backend for special operands
which use sentinel values for the register (i.e. 0, which is neither a
physical register nor a virtual one).

This causes MachineInstrExpressionTrait::isEqual (which uses
MachineInstr::isIdenticalTo) to return true for the following
instructions, which are the same except for the fact that one sets the
flags and the other one doesn't:
%1114 = ADDrsi %1113, %216, 17, 14, _, def _
%1115 = ADDrsi %1113, %216, 17, 14, _, _

OTOH, MachineInstrExpressionTrait::getHashValue returns different values
for the 2 instructions due to the different isDef on the last operand.
In practice this means that when trying to add those instructions to a
DenseMap, they will be considered different because of their different
hash values, but when growing the map we might get an assertion while
copying from the old buckets to the new buckets because isEqual
misleadingly returns true.

This patch makes sure that isEqual and getHashValue agree, by improving
the checks in MachineInstr::isIdenticalTo when we are ignoring virtual
register definitions (which is what the Trait uses). Firstly, instead of
checking isPhysicalRegister, we use !isVirtualRegister, so that we cover
both physical registers and sentinel values. Secondly, instead of
checking MachineOperand::isReg, we use MachineOperand::isIdenticalTo,
which checks isReg, isSubReg and isDef, which are the same values that
the hash function uses to compute the hash.

Note that the function is symmetric with this change, since if the
current operand is not a Def, we check MachineOperand::isIdenticalTo,
which returns false if the operands have different isDef's.

Differential Revision: https://reviews.llvm.org/D38789

llvm-svn: 315579

4a5f522d

Reinstantiate old/bad deduplication logic that was removed in r315279. · 4d931202

Daniel Jasper authored Oct 12, 2017

While this shouldn't be necessary anymore, we have cases where we run
into the assertion below, i.e. cases with two non-fragment entries for the
same variable at different frame indices.

This should be fixed, but for now, we should revert to a version that
does not trigger asserts.

llvm-svn: 315576

4d931202

Fix warnings. [-Wdocumentation] · 12ab07e0
NAKAMURA Takumi authored Oct 12, 2017
```
llvm-svn: 315573
```
12ab07e0

[AsmParser] Suppress compile warning for targets with no register diags · dab52128

Oliver Stannard authored Oct 12, 2017

This fixes the "switch statement contains 'default' but no 'case' labels"
warnings in table-generated code introduced in r315295.

llvm-svn: 315571

dab52128

[ScheduleDAGInstrs] fix behavior of getUnderlyingObjectsForCodeGen when no... · b49b015b

Hiroshi Inoue authored Oct 12, 2017

[ScheduleDAGInstrs] fix behavior of getUnderlyingObjectsForCodeGen when no identifiable object found

This patch fixes the bug introduced in https://reviews.llvm.org/D35907; the bug is reported by http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20171002/491452.html.

Before D35907, when GetUnderlyingObjects fails to find an identifiable object, allMMOsOkay lambda in getUnderlyingObjectsForInstr returns false and Objects vector is cleared. This behavior is unintentionally changed by D35907.

This patch makes the behavior for such case same as the previous behavior.
Since D35907 introduced a wrapper function getUnderlyingObjectsForCodeGen around GetUnderlyingObjects, getUnderlyingObjectsForCodeGen is modified to return a boolean value to ask the caller to clear the Objects vector.

Differential Revision: https://reviews.llvm.org/D38735

llvm-svn: 315565

b49b015b

[RegisterCoalescer] Don't set read-undef in pruneValues, only clear · a079ef68

Mikael Holmen authored Oct 12, 2017

Summary:
The comments in the code said

 // Remove <def,read-undef> flags. This def is now a partial redef.

but the code didn't just remove read-undef, it could introduce new ones which
could cause errors.

E.g. if we have something like

%vreg1<def> = IMPLICIT_DEF
%vreg2:subreg1<def, read-undef> = op %vreg3, %vreg4
%vreg2:subreg2<def> = op %vreg6, %vreg7

and we merge %vreg1 and %vreg2 then we should not set undef on the second subreg
def, which the old code did.

Now we solve this by actually do what the code comment says. We remove
read-undef flags rather than remove or introduce them.

Reviewers: qcolombet, MatzeB

Reviewed By: MatzeB

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38616

llvm-svn: 315564

a079ef68

Re-commit "llvm-isel-fuzzer: Handle a subset of backend flags in the exec name" · 9ea7fbd1

Justin Bogner authored Oct 12, 2017

Here we add a secondary option parser to llvm-isel-fuzzer (and provide
it for use with other fuzzers). With this, you can copy the fuzzer to
a name like llvm-isel-fuzzer=aarch64-gisel for a fuzzer that fuzzer
AArch64 with GlobalISel enabled, or fuzzer=x86_64 to fuzz x86, with no
flags required. This should be useful for running these in OSS-Fuzz.

Note that this handrolls a subset of cl::opts to recognize, rather
than embedding a complete command parser for argv[0]. If we find we
really need the flexibility of handling arbitrary options at some
point we can rethink this.

This re-applies 315545 using "=" instead of ":" as a separator for
arguments.

llvm-svn: 315557

9ea7fbd1

[cfi-verify] Fix unittest failures w/o x86 target · 85171866

Vlad Tsyrklevich authored Oct 12, 2017

The llvm-cfi-verify unit tests fail if LLVM is built without the X86
target, disable the unit tests from being built unless X86 is enabled
for now.

llvm-svn: 315556

85171866

Revert r315545 "llvm-isel-fuzzer: Handle a subset of backend flags in the executable name" · 022829d8

Hans Wennborg authored Oct 12, 2017

It broke some tests on Windows:

Failing Tests (4):
    LLVM :: tools/llvm-isel-fuzzer/execname-options.ll
    LLVM :: tools/llvm-isel-fuzzer/missing-triple.ll
    LLVM :: tools/llvm-isel-fuzzer/x86-empty-bc.ll
    LLVM :: tools/llvm-isel-fuzzer/x86-empty.ll

> llvm-isel-fuzzer: Handle a subset of backend flags in the executable name
>
> Here we add a secondary option parser to llvm-isel-fuzzer (and provide
> it for use with other fuzzers). With this, you can copy the fuzzer to
> a name like llvm-isel-fuzzer:aarch64-gisel for a fuzzer that fuzzer
> AArch64 with GlobalISel enabled, or fuzzer:x86_64 to fuzz x86, with no
> flags required. This should be useful for running these in OSS-Fuzz.
>
> Note that this handrolls a subset of cl::opts to recognize, rather
> than embedding a complete command parser for argv[0]. If we find we
> really need the flexibility of handling arbitrary options at some
> point we can rethink this.

llvm-svn: 315554

022829d8

[SimplifyIndVar] Replace IVUsers with loop invariant whenever possible · d36f2030
Hongbin Zheng authored Oct 12, 2017
```
Differential Revision: https://reviews.llvm.org/D38415

llvm-svn: 315551
```
d36f2030

docs: Add some links to OSS Fuzz · 8d85ced1

Justin Bogner authored Oct 12, 2017

I'd left a couple of stray links here in a previous commit rather than
writing a paragraph.

llvm-svn: 315550

8d85ced1

docs: Try to fix sphinx build · 857ec155
Justin Bogner authored Oct 12, 2017
```
llvm-svn: 315546
```
857ec155

llvm-isel-fuzzer: Handle a subset of backend flags in the executable name · a5969ce1

Justin Bogner authored Oct 12, 2017

Here we add a secondary option parser to llvm-isel-fuzzer (and provide
it for use with other fuzzers). With this, you can copy the fuzzer to
a name like llvm-isel-fuzzer:aarch64-gisel for a fuzzer that fuzzer
AArch64 with GlobalISel enabled, or fuzzer:x86_64 to fuzz x86, with no
flags required. This should be useful for running these in OSS-Fuzz.

Note that this handrolls a subset of cl::opts to recognize, rather
than embedding a complete command parser for argv[0]. If we find we
really need the flexibility of handling arbitrary options at some
point we can rethink this.

llvm-svn: 315545

a5969ce1

docs: Add some information about Fuzzing LLVM itself · fd5b2a08

Justin Bogner authored Oct 12, 2017

This splits some content out of the libFuzzer docs and adds a fair
amount of detail about the fuzzers in LLVM.

llvm-svn: 315544

fd5b2a08

Speculative build fix 2 · d925f983
Reid Kleckner authored Oct 12, 2017
```
llvm-svn: 315542
```
d925f983
Revert r307036 because of PR34919. · 1736efd1
Wei Mi authored Oct 12, 2017
```
llvm-svn: 315540
```
1736efd1