Commits · 3c56b0bb8f9ec0ff5f6db9fdb0d6a26faf5e3765 · Roger Ferrer / llvm-epi

Aug 23, 2017

[X86] Fix -Wenum-compare warning · 3c56b0bb

Benjamin Kramer authored Aug 23, 2017

lib/Target/X86/X86ISelLowering.cpp:34613:25: error: enumeral mismatch in
conditional expression: 'llvm::ISD::NodeType' vs
'llvm::X86ISD::NodeType'

llvm-svn: 311580

3c56b0bb

[AVX512] Don't create SHRUNKBLEND SDNodes for 512-bit vectors · 853a8d9f

Craig Topper authored Aug 23, 2017

There are no 512-bit blend instructions so we shouldn't create SHRUNKBLEND for them.

On a side note, it looks like there may be a missed opportunity for constant folding TESTM when LHS and RHS are equal.

This fixes PR34139.

Differential Revision: https://reviews.llvm.org/D36992

llvm-svn: 311572

853a8d9f

[X86] Remove X86ISD::FMADD in favor ISD::FMA · f1417ca6

Craig Topper authored Aug 23, 2017

There's no reason to have a target specific node with the same semantics as a target independent opcode.

This should simplify D36335 so that it doesn't need to touch X86ISelDAGToDAG.cpp

Differential Revision: https://reviews.llvm.org/D36983

llvm-svn: 311568

f1417ca6

bpf: close the file descriptor after probe inside getHostCPUNameForBPF · c6d25710
Yonghong Song authored Aug 23, 2017
```
Signed-off-by: Yonghong Song <yhs@fb.com>
llvm-svn: 311567
```
c6d25710

LowerAtomic: Don't skip optnone functions; atomic still need lowering (PR34020) · 66f6fc0a

Hans Wennborg authored Aug 23, 2017

The lowering isn't really an optimization, so optnone shouldn't make a
difference. ARM relies on the pass running when using "-mthread-model
single", because in that mode, it doesn't run AtomicExpand. See bug for
more details.

Differential Revision: https://reviews.llvm.org/D37040

llvm-svn: 311565

66f6fc0a

Fixed invalid variable name in Dockerfile scripts. · b2c0794e

Ilya Biryukov authored Aug 23, 2017

LLVM_SVN_REVISION was used instead of LLVM_SVN_REV.
This caused a revision option to be ignored in Dockerfiles.

llvm-svn: 311564

b2c0794e

Revert r311546 as it breaks build · 3697ebe2

Victor Leschuk authored Aug 23, 2017

http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/4394

llvm-svn: 311560

3697ebe2

Make lit :: shtest-format.py supported on Windows again · 9f11c0bd

Victor Leschuk authored Aug 23, 2017

It was marked as unsupported on Windows in r311230 because on some Win10 
machines it failed or caused hang. The problem was that on these machines
system bash (C:\Windows\System32\bash.exe) was used which requires paths to be
passed like '/mnt/c/path/to/my/script' instead of 'C:\path\to\my\script'.

TODO: we should make lit detect if system bash is used instead of msys and set
appropriate path format.

llvm-svn: 311558

9f11c0bd

Revert r311552: [Bash-autocompletion] Add support for static analyzer flags · a93f087d
Rui Ueyama authored Aug 23, 2017
```
This reverts commit r311552 because it broke ubsan and asan bots.

llvm-svn: 311557
```
a93f087d

[coroutines] CoroBegin from inner coroutines should be considered for spills · 2f55b958

Gor Nishanov authored Aug 23, 2017

Summary:
If a coroutine outer calls another coroutine inner and the inner coroutine body is inlined into the outer, coro.begin from the inner coroutine should be considered for spilling if accessed across suspends.

Prior to this change, coroutine frame building code was not considering any coro.begins for spilling.
With this change, we only ignore coro.begin for the current coroutine, but, any coro.begins that were inlined into the current coroutine are eligible for spills.

Fixes PR34267

Reviewers: GorNishanov

Subscribers: qcolombet, llvm-commits, EricWF

Differential Revision: https://reviews.llvm.org/D37062

llvm-svn: 311556

2f55b958

[Reassociate] Don't canonicalize x + (-Constant * y) -> x - (Constant * y).. · 8db41e9d

Chad Rosier authored Aug 23, 2017

..if the resulting subtract will be broken up later.  This can cause us to get
into an infinite loop.

x + (-5.0 * y)      -> x - (5.0 * y)       ; Canonicalize neg const
x - (5.0 * y)       -> x + (0 - (5.0 * y)) ; Break up subtract
x + (0 - (5.0 * y)) -> x + (-5.0 * y)      ; Replace 0-X with X*-1.

PR34078

llvm-svn: 311554

8db41e9d

[Bash-autocompletion] Add support for static analyzer flags · 5e7071f5

Yuka Takahashi authored Aug 23, 2017

Summary:
This is a patch for clang autocomplete feature.

It will collect values which -analyzer-checker takes, which is defined in
clang/StaticAnalyzer/Checkers/Checkers.inc, dynamically.
First, from ValuesCode class in Options.td, TableGen will generate C++
code in Options.inc. Options.inc will be included in DriverOptions.cpp, and
calls OptTable's addValues function. addValues function will add second
argument to Option's Values class. Values contains string like "foo,bar,.."
which is handed to Values class
in OptTable.

Reviewers: v.g.vassilev, teemperor, ruiu

Subscribers: hiraditya, cfe-commits

Differential Revision: https://reviews.llvm.org/D36782

llvm-svn: 311552

5e7071f5

[globalisel][tablegen] Add support for ImmLeaf without SDNodeXForm · c3885c45

Daniel Sanders authored Aug 23, 2017

Summary:
This patch adds support for predicates on imm nodes but only for ImmLeaf and not for PatLeaf or PatFrag and only where the value does not need to be transformed before being rendered into the instruction.

The limitation on PatLeaf/PatFrag/SDNodeXForm is due to differences in the necessary target-supplied C++ for GlobalISel.

Depends on D36085

Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar

Reviewed By: rovka

Subscribers: kristof.beyls, javed.absar, igorb, llvm-commits

Differential Revision: https://reviews.llvm.org/D36086

llvm-svn: 311546

c3885c45

[ARM] Check for assembler instructions in test. · 5b929600

Florian Hahn authored Aug 23, 2017

Currently this test causes test failures on some machines, due to isel not being registered. Update the test to run all passes and check emitted assembly instructions for now. 

llvm-svn: 311545

5b929600

[ARM] Add missing patterns for insert_subvector. · 214e13d9

Florian Hahn authored Aug 23, 2017

Summary: In some cases, shufflevector instruction can be transformed involving insert_subvector instructions. The ARM backend was missing some insert_subvector patterns, causing a failure during instruction selection. AArch64 has similar patterns.

Reviewers: t.p.northover, olista01, javed.absar, rengolin

Reviewed By: javed.absar

Subscribers: aemerson, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D36796

llvm-svn: 311543

214e13d9

[globalisel][tablegen] Add tests for FeatureBitsets and ComplexPattern predicates. · 49980707
Daniel Sanders authored Aug 23, 2017
```
llvm-svn: 311542
```
49980707

[gold] Test we don't strip globals when producing relocatables. · 06d9eda1

Davide Italiano authored Aug 23, 2017

lld was broken in this regard (PR33097). The gold plugin gets this
right so, no changes needed, but better adding a test.

llvm-svn: 311541

06d9eda1

[InstCombine] Fold branches with irrelevant conditions to a constant. · c7888581

Davide Italiano authored Aug 23, 2017

InstCombine folds instructions with irrelevant conditions to undef.
This, as Nuno confirmed is a bug.
(see https://bugs.llvm.org/show_bug.cgi?id=33409#c1 )

Given the original motivation for the change is that of removing an
USE, we now fold to false instead (which reaches the same goal
without undesired side effects).

Fixes PR33409.

Differential Revision:  https://reviews.llvm.org/D36975

llvm-svn: 311540

c7888581

[PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate · cc555bd0

Hiroshi Inoue authored Aug 23, 2017

- recommitting after fixing a test failure on MacOS

On PPC64, OR (XOR) with a 32-bit immediate can be done with only two instructions, i.e. ori + oris.
But the current LLVM generates three or four instructions for this purpose (and also it clobbers one GPR).

This patch makes PPC backend generate ori + oris (xori + xoris) for OR (XOR) with a 32-bit immediate.

e.g. (x | 0xFFFFFFFF) should be

	ori 3, 3, 65535
	oris 3, 3, 65535

but LLVM generates without this patch

	li 4, 0
	oris 4, 4, 65535
	ori 4, 4, 65535
	or 3, 3, 4

Differential Revision: https://reviews.llvm.org/D34757

llvm-svn: 311538

cc555bd0

[AArch64] Silence unused variable warning in opt mode after r311533 · 3d55cef4
Krasimir Georgiev authored Aug 23, 2017
```
llvm-svn: 311535
```
3d55cef4

[AArch64] ISel legalization debug messages. NFCI. · 24c98189

Sjoerd Meijer authored Aug 23, 2017

Debugging AArch64 instruction legalization and custom lowering is really an
unpleasant experience because it shows nodes that appear out of thin air.
In commit r311444, some debug messages have been added to SelectionDAG, the
target independent part, and this patch adds some AArch64 specific messages.

Differential Revision: https://reviews.llvm.org/D36964

llvm-svn: 311533

24c98189

[Lanai] Remove dead functions from LanaiRegisterInfo · d5d55942

Alex Bradbury authored Aug 23, 2017

getEHExceptionRegister and getEHHandlerRegister are unused and were removed 
from most backends in rL192099. This patch removes them from Lanai.

Differential Revision: https://reviews.llvm.org/D36829

llvm-svn: 311531

d5d55942

Revert rL311526: [PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate · dbb285ca
Hiroshi Inoue authored Aug 23, 2017
```
This reverts commit rL311526 due to failures in some buildbot.

llvm-svn: 311530
```
dbb285ca
[InstCombine] Remove unused argument. NFC · a85f8622
Craig Topper authored Aug 23, 2017
```
llvm-svn: 311529
```
a85f8622
[InstCombine] Replace a simple matcher with a plain old dyn_cast. NFC · a94069fb
Craig Topper authored Aug 23, 2017
```
llvm-svn: 311528
```
a94069fb

[InstCombine] Remove an unnecessary dyn_cast to Instruction and a switch over... · 524c44f7

Craig Topper authored Aug 23, 2017

[InstCombine] Remove an unnecessary dyn_cast to Instruction and a switch over two opcodes. Just dyn_cast to the specific instruction classes individually. NFC

Change the helper methods to take the more specific class as well.

llvm-svn: 311527

524c44f7

[PowerPC] better instruction selection for OR (XOR) with a 32-bit immediate · c4449df1

Hiroshi Inoue authored Aug 23, 2017

On PPC64, OR (XOR) with a 32-bit immediate can be done with only two instructions, i.e. ori + oris.
But the current LLVM generates three or four instructions for this purpose (and also it clobbers one GPR).

This patch makes PPC backend generate ori + oris (xori + xoris) for OR (XOR) with a 32-bit immediate.

e.g. (x | 0xFFFFFFFF) should be

	ori 3, 3, 65535
	oris 3, 3, 65535

but LLVM generates without this patch

	li 4, 0
	oris 4, 4, 65535
	ori 4, 4, 65535
	or 3, 3, 4

Differential Revision: https://reviews.llvm.org/D34757

llvm-svn: 311526

c4449df1

[XRay][CodeGen] Use PIC-friendly code in XRay sleds; remove synthetic references in .text · 0884b732

Dean Michael Berris authored Aug 23, 2017

Summary:
This change achieves two things:

  - Redefine the Custom Event handling instrumentation points emitted by
    the compiler to not require dynamic relocation of references to the
    __xray_CustomEvent trampoline.

  - Remove the synthetic reference we emit at the end of a function that
    we used to keep auxiliary sections alive in favour of SHF_LINK_ORDER
    associated with the section where the function is defined.

To achieve the custom event handling change, we've had to introduce the
concept of sled versioning -- this will need to be supported by the
runtime to allow us to understand how to turn on/off the new version of
the custom event handling sleds. That change has to land first before we
change the way we write the sleds.

To remove the synthetic reference, we rely on a relatively new linker
feature that preserves the sections that are associated with each other.
This allows us to limit the effects on the .text section of ELF
binaries.

Because we're still using absolute references that are resolved at
runtime for the instrumentation map (and function index) maps, we mark
these sections write-able. In the future we can re-define the entries in
the map to use relative relocations instead that can be statically
determined by the linker. That change will be a bit more invasive so we
defer this for later.

Depends on D36816.

Reviewers: dblaikie, echristo, pcc

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D36615

llvm-svn: 311525

0884b732

bpf: add variants of -mcpu=# and support for additional jmp insns · dc1dbf6e

Yonghong Song authored Aug 23, 2017



-mcpu=# will support:
  . generic: the default insn set
  . v1: insn set version 1, the same as generic
  . v2: insn set version 2, version 1 + additional jmp insns
  . probe: the compiler will probe the underlying kernel to
           decide proper version of insn set.

We did not not use -mcpu=native since llc/llvm will interpret -mcpu=native
as the underlying hardware architecture regardless of -march value.

Currently, only x86_64 supports -mcpu=probe. Other architecture will
silently revert to "generic".

Also added -mcpu=help to print available cpu parameters.
llvm will print out the information only if there are at least one
cpu and at least one feature. Add an unused dummy feature to
enable the printout.

Examples for usage:
$ llc -march=bpf -mcpu=v1 -filetype=asm t.ll
$ llc -march=bpf -mcpu=v2 -filetype=asm t.ll
$ llc -march=bpf -mcpu=generic -filetype=asm t.ll
$ llc -march=bpf -mcpu=probe -filetype=asm t.ll
$ llc -march=bpf -mcpu=v3 -filetype=asm t.ll
'v3' is not a recognized processor for this target (ignoring processor)
...
$ llc -march=bpf -mcpu=help -filetype=asm t.ll
Available CPUs for this target:

  generic - Select the generic processor.
  probe   - Select the probe processor.
  v1      - Select the v1 processor.
  v2      - Select the v2 processor.

Available features for this target:

  dummy - unused feature.

Use +feature to enable a feature, or -feature to disable it.
For example, llc -mcpu=mycpu -mattr=+feature1,-feature2
...

Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Signed-off-by: Yonghong Song <yhs@fb.com>
Acked-by: Alexei Starovoitov <ast@kernel.org>
llvm-svn: 311522

dc1dbf6e

Fix tail-merge-after-mbp test · d6c0868d

Matthias Braun authored Aug 23, 2017

The output of this test changed after the fix in r311520 to have
-run-pass=block-placement behave like it does in a normal pipeline.
Adjust the test.

llvm-svn: 311521

d6c0868d

Add test case for r311511 · 8426d134

Matthias Braun authored Aug 23, 2017

This also changes the TailDuplicator to be configured explicitely
pre/post regalloc rather than relying on the isSSA() flag. This was
necessary to have `llc -run-pass` work reliably.

llvm-svn: 311520

8426d134

NFC: fix ToolDrivers syntax and typo errors · cc82cdff
Martell Malone authored Aug 23, 2017
```
infoTable -> InfoTable camelCase
Libtool Options #define offset

llvm-svn: 311517
```
cc82cdff
Update LLVM fuzzers to use the libFuzzer bundled with the compiler toolchain · 0ac90d3f
George Karpenkov authored Aug 23, 2017
```
Differential Revision: https://reviews.llvm.org/D37041

llvm-svn: 311515
```
0ac90d3f

Remove llvm-pdbutil/fuzzer. · 218ea7f6

George Karpenkov authored Aug 23, 2017

The code does not compile, is not maintained, and does not have a buildbot.

Differential Revision: https://reviews.llvm.org/D37032

llvm-svn: 311512

218ea7f6

TargetInstrInfo: Change duplicate() to work on bundles. · 55bc9b3f

Matthias Braun authored Aug 22, 2017

Adds infrastructure to clone whole instruction bundles rather than just
single instructions. This fixes a bug where tail duplication would
unbundle instructions while cloning.

This should unbreak the "Clang Stage 1: cmake, RA, with expensive checks
enabled" build on greendragon. The bot broke with r311139 hitting this
pre-existing bug.

A proper testcase will come next.

llvm-svn: 311511

55bc9b3f

[SelectionDAG] Make ISD::isConstantSplatVector always return an element sized APInt. · 35189d52

Craig Topper authored Aug 22, 2017

This partially reverts r311429 in favor of making ISD::isConstantSplatVector do something not confusing. Turns out the only other user of it was also having to deal with the weird property of it returning a smaller size.

So rather than continue to deal with this quirk everywhere, just make the interface do something sane.

Differential Revision: https://reviews.llvm.org/D37039

llvm-svn: 311510

35189d52

[InstCombine] Remove check for sext of vector icmp from shouldOptimizeCast · ec4b8257

Craig Topper authored Aug 22, 2017

Looks like for 'and' and 'or' we end up performing at least some of the transformations this is bocking in a round about way anyway.

For 'and sext(cmp1), sext(cmp2) we end up later turning it into 'select cmp1, sext(cmp2), 0'. Then we optimize that back to sext (and cmp1, cmp2). This is the same result we would have gotten if shouldOptimizeCast hadn't blocked it. We do something analogous for 'or'.

With this patch we allow that transformation to happen directly in foldCastedBitwiseLogic. And we now support the same thing for 'xor'. This is definitely opening up many other cases, but since we already went around it for some cases hopefully it's ok.

Differential Revision: https://reviews.llvm.org/D36213

llvm-svn: 311508

ec4b8257

Aug 22, 2017

Revert "[llvm-dwarfdump] Print type names in DW_AT_type DIEs" · 4942a0b0
Jonas Devlieghere authored Aug 22, 2017
```
This reverts commit r311492.

llvm-svn: 311499
```
4942a0b0

[llvm-dwarfdump] Print type names in DW_AT_type DIEs · f456d186

Jonas Devlieghere authored Aug 22, 2017

This patch adds printing for DW_AT_type DIEs like it's currently already
the case for DW_AT_specification DIEs.

llvm-svn: 311492

f456d186

WholeProgramDevirt: Create bitcast to i8* at each virtual call site. · 001052a0

Peter Collingbourne authored Aug 22, 2017

We can't reuse the llvm.assume instruction's bitcast because it may not
dominate every user of the vtable pointer.

Differential Revision: https://reviews.llvm.org/D36994

llvm-svn: 311491

001052a0