Commits · 799854904046964b4a037fbdc06df98b24a96132 · Roger Ferrer / llvm-epi

Jan 03, 2018

Remove left-over debug printout from r321692 · 79985490

Hans Wennborg authored Jan 03, 2018

Besides the unsightly print-out, it was causing some buildbots to fail,
e.g. http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/9311

llvm-svn: 321711

79985490

[InstSimplify] Missed optimization in math expression: squashing exp(log), log(exp) · 3d8cd34a

Dmitry Venikov authored Jan 03, 2018

Summary: This patch enables folding following expressions under -ffast-math flag: exp(log(x)) -> x, exp2(log2(x)) -> x, log(exp(x)) -> x, log2(exp2(x)) -> x

Reviewers: spatel, hfinkel, davide

Reviewed By: spatel, hfinkel, davide

Subscribers: scanon, llvm-commits

Differential Revision: https://reviews.llvm.org/D41381

llvm-svn: 321710

3d8cd34a

[ARM][NFC] Avoid recreating MCSubtargetInfo in ARMAsmBackend · 46db78b7

Alex Bradbury authored Jan 03, 2018

After D41349, we can now directly access MCSubtargetInfo from 
createARM*AsmBackend. This patch makes use of this, avoiding the need to 
create a fresh MCSubtargetInfo (which was previously always done with a blank 
CPU and feature string). Given the total size of the change remains pretty 
tiny and we're removing the old explicit destructor, I changed the STI field 
to a reference rather than a pointer.

Differential Revision: https://reviews.llvm.org/D41693

llvm-svn: 321707

46db78b7

[InstCombine] Add test to remove VarArg casts (NFC) · dcc0ba9b
Florian Hahn authored Jan 03, 2018
```
llvm-svn: 321706
```
dcc0ba9b

[TableGen] Add support of Intrinsics with multiple returns · 8b4bdfdb

Hal Finkel authored Jan 03, 2018

This change deals with intrinsics with multiple outputs, for example load
instrinsic with address updated.

DAG selection for Instrinsics could be done either through source code or
tablegen. Handling all intrinsics in source code would introduce a huge chunk
of repetitive code if we have a large number of intrinsic that return multiple
values (see NVPTX as an example). While intrinsic class in tablegen supports
multiple outputs, tablegen only supports Intrinsics with zero or one output on
TreePattern. This appears to be a simple bug in tablegen that is fixed by this
change.

For Intrinsics defined as:

def int_xxx_load_addr_updated: Intrinsic<[llvm_i32_ty, llvm_ptr_ty], [llvm_ptr_ty, llvm_i32_ty], []>;

Instruction will be defined as:

def L32_X: Inst<(outs reg:$d1, reg:$d2), (ins reg:$s1, reg:$s2), "ld32_x $d1, $d2, $s2", [(set i32:$d1, i32:$d2, (int_xxx_load_addr_updated i32:$s1, i32:$s2))]>;

Patch by Wenbo Sun, thanks!

Differential Revision: https://reviews.llvm.org/D32888

llvm-svn: 321704

8b4bdfdb

[AArch64][SVE] Asm: Add restricted register classes for SVE predicate vectors. · dc5e081b

Sander de Smalen authored Jan 03, 2018

Summary:
Add a register class for SVE predicate operands that can only be p0-p7 (as opposed to p0-p15)

Patch [1/3] in a series to add predicated ADD/SUB instructions for SVE.

Reviewers: rengolin, mcrosier, evandro, fhahn, echristo, olista01, SjoerdMeijer, javed.absar

Reviewed By: fhahn

Subscribers: aemerson, javed.absar, tschuett, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D41441

llvm-svn: 321699

dc5e081b

Fix build of WebAssembly and AVR backends after r321692 · 7c093bf1

Alex Bradbury authored Jan 03, 2018

As experimental backends, I didn't have them configured to build in my local 
build config.

llvm-svn: 321696

7c093bf1

Fix incorrect documentation comment left after r321692 · 86b99cb5

Alex Bradbury authored Jan 03, 2018

TargetRegistryInfo::createMCAsmBackend no longer takes a TheTriple parameter.
The majory of the TargetRegistryInfo::create* functions have no or very
limitied per-parameter doc comments, and adding a comment for the
MCSubtargetInfo, MCRegisterInfo and MCTargetOptions parameters seems like it
would add no real value beyond reading the function signature. As such, I've
just deleted the doc comment for TheTriple.

llvm-svn: 321694

86b99cb5

Thread MCSubtargetInfo through Target::createMCAsmBackend · b22f751f

Alex Bradbury authored Jan 03, 2018

Currently it's not possible to access MCSubtargetInfo from a TgtMCAsmBackend. 
D20830 threaded an MCSubtargetInfo reference through 
MCAsmBackend::relaxInstruction, but this isn't the only function that would 
benefit from access. This patch removes the Triple and CPUString arguments 
from createMCAsmBackend and replaces them with MCSubtargetInfo.

This patch just changes the interface without making any intentional 
functional changes. Once in, several cleanups are possible:
* Get rid of the awkward MCSubtargetInfo handling in ARMAsmBackend
* Support 16-bit instructions when valid in MipsAsmBackend::writeNopData
* Get rid of the CPU string parsing in X86AsmBackend and just use a SubtargetFeature for HasNopl
* Emit 16-bit nops in RISCVAsmBackend::writeNopData if the compressed instruction set extension is enabled (see D41221)

This change initially exposed PR35686, which has since been resolved in r321026.

Differential Revision: https://reviews.llvm.org/D41349

llvm-svn: 321692

b22f751f

[GlobalISel][Legalizer] Fix legalization of llvm.smul.with.overflow · 9de62130

Amara Emerson authored Jan 03, 2018

Previously the code for handling G_SMULO didn't properly check for the signed
multiply overflow, instead treating it the same as the unsigned G_UMULO.

Fixes PR35800.

llvm-svn: 321690

9de62130

[llvm-objcopy] Add support for visibility · 30d927a1

Jake Ehrlich authored Jan 02, 2018

I have no clue how this was missed when symbol table support was added. This
change ensures that the visibility of symbols is preserved by default.

llvm-svn: 321681

30d927a1

Jan 02, 2018

Handle the case of live 16-bit subregisters in X86FixupBWInsts · e12e08c6

Andrew Kaylor authored Jan 02, 2018

Differential Revision: https://reviews.llvm.org/D40524

Change-Id: Ie3a405b28503ceae999f5f3ba07a68fa733a2400
llvm-svn: 321674

e12e08c6

[AArch64] fix typos in comments; NFC · 24e6a8bd
Sanjay Patel authored Jan 02, 2018
```
llvm-svn: 321673
```
24e6a8bd

[ValueTracking] recognize min/max of min/max patterns · 78114305

Sanjay Patel authored Jan 02, 2018

This is part of solving PR35717:
https://bugs.llvm.org/show_bug.cgi?id=35717

The larger IR optimization is proposed in D41603, but we can show 
the improvement in ValueTracking using codegen tests because 
SelectionDAG creates min/max nodes based on ValueTracking. 

Any target with min/max ops should show wins here. I chose AArch64
vector ops because they're clean and uniform.

Some Alive proofs for the tests (can't put more than 2 tests in 1 
page currently because the web app says it's too long):
https://rise4fun.com/Alive/WRN
https://rise4fun.com/Alive/iPm
https://rise4fun.com/Alive/HmY
https://rise4fun.com/Alive/CNm
https://rise4fun.com/Alive/LYf

llvm-svn: 321672

78114305

[AArch64] add tests for min/max of min/max (PR35717); NFC · 35a6ee86
Sanjay Patel authored Jan 02, 2018
```
llvm-svn: 321668
```
35a6ee86

[AArch64][GlobalISel] Fix assert fail with unknown intrinsic. · 913918cb

Amara Emerson authored Jan 02, 2018

A call may have an intrinsic name but not have a valid intrinsic ID,
for example with llvm.invariant.group.barrier. If so, treat it as a
normal call like FastISel does.

llvm-svn: 321662

913918cb

[opt-viewer] Check for pygments.lexer.c_cpp · 9e4431ff

Jonas Hahnfeld authored Jan 02, 2018

Some systems still don't have this module which was introduced in
version 2.0 (CentOS 7, sigh).

Differential Revision: https://reviews.llvm.org/D41611

llvm-svn: 321659

9e4431ff

[x86] allow pairs of PCMPEQ for vector-sized integer equality comparisons (PR33325) · 9a80871f

Sanjay Patel authored Jan 02, 2018

This is an extension of D31156 with the goal that we'll allow memcmp() == 0 expansion 
for x86 to use 2 pairs of loads per block.

The memcmp expansion pass (formerly part of CGP) will generate this kind of pattern 
with oversized integer compares, so we want to transform these into x86-specific vector
nodes before legalization splits things into scalar chunks.

See PR33325 for more details:
https://bugs.llvm.org/show_bug.cgi?id=33325

Differential Revision: https://reviews.llvm.org/D41618

llvm-svn: 321656

9a80871f

[AArch64][GlobalISel] Enable GlobalISel at -O0 by default · 854d10d1

Amara Emerson authored Jan 02, 2018

Tests updated to explicitly use fast-isel at -O0 instead of implicitly.

This change also allows an explicit -fast-isel option to override an
implicitly enabled global-isel. Otherwise -fast-isel would have no effect at -O0.

Differential Revision: https://reviews.llvm.org/D41362

llvm-svn: 321655

854d10d1

[BasicBlockUtils] Check for unreachable preds before updating LI in UpdateAnalysisInformation · bdb94309

Anna Thomas authored Jan 02, 2018

Summary:
We are incorrectly updating the LI when loop-simplify generates
dedicated exit blocks for a loop. The issue is that there's an implicit
assumption that the Preds passed into UpdateAnalysisInformation are
reachable. However, this is not true and breaks LI by incorrectly
updating the header of a loop.

One such case is when we generate dedicated exits when the exit block is
a landing pad (through SplitLandingPadPredecessors). There maybe other
cases as well, since we do not guarantee that Preds passed in are
reachable basic blocks.

The added test case shows how loop-simplify breaks LI for the outer loop (and DT in turn)
after we try to generate the LoopSimplifyForm.

Reviewers: davide, chandlerc, sanjoy

Reviewed By: davide

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41519

llvm-svn: 321653

bdb94309

[Hexagon] Fix generation of vector sign extensions · cfe4a361
Krzysztof Parzyszek authored Jan 02, 2018
```
llvm-svn: 321650
```
cfe4a361

Revert r321089: "[DAG] Elide overlapping store" (and subsequent fix in r321204) · cc4903e2

Daniel Jasper authored Jan 02, 2018

Our internal testing has revealed has discovered bugs in PPC builds.
I have forward reproduction instructions to the original author (Nirav).

llvm-svn: 321649

cc4903e2

NFC. Add description comments to Function header · 527784b3

Dmitry Venikov authored Jan 02, 2018

Reviewers: ruiu, davidxl, silvas, brzycki

Reviewed By: brzycki

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D41609

llvm-svn: 321648

527784b3

[AArch64][AsmParser] Add isScalarReg() and repurpose isReg() · c9b3e1cf

Sander de Smalen authored Jan 02, 2018

Summary:
isReg() in AArch64AsmParser.cpp is a bit of a misnomer, and would be better named 'isScalarReg()' instead.

Patch [1/3] in a series to add operand constraint checks for SVE's predicated ADD/SUB.

Reviewers: rengolin, mcrosier, evandro, fhahn, echristo

Reviewed By: fhahn

Subscribers: aemerson, javed.absar, llvm-commits, kristof.beyls

Differential Revision: https://reviews.llvm.org/D41445

llvm-svn: 321646

c9b3e1cf

Strip trailing whitespace. NFCI · 39f50e10
Simon Pilgrim authored Jan 02, 2018
```
llvm-svn: 321644
```
39f50e10
[RISCV] Add Defs Uses information for c.jal and c.addi4spn · 8cb894b3
Alex Bradbury authored Jan 02, 2018
```
Differential Revision: https://reviews.llvm.org/D41339
Patch by Shiva Chen.

llvm-svn: 321643
```
8cb894b3
[RISCV][NFC] Resolve unused variable warning in RISCVISelLowering · 3633d120
Alex Bradbury authored Jan 02, 2018
```
XLenVT in LowerFormalArguments is used only in an assert.

llvm-svn: 321642
```
3633d120

[DAGCombine] Fix for PR35765 · 3570c554

Sam Parker authored Jan 02, 2018

Remove the acceptance of ANY_EXTEND nodes while trying to move and
nodes back to loads.

Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=35765

Differential Revision: https://reviews.llvm.org/D41625

llvm-svn: 321641

3570c554

[X86] Codegen test for pr35765 · 2dea5e0f
Sam Parker authored Jan 02, 2018
```
Committing reproducer test for pr35765, fix to follow.

llvm-svn: 321640
```
2dea5e0f
[SelectionDAG] Teach WidenVecOp_Convert to widen the operation if a widened... · e3b6bd33
Craig Topper authored Jan 02, 2018
```
[SelectionDAG] Teach WidenVecOp_Convert to widen the operation if a widened result type would still be legal.

llvm-svn: 321638
```
e3b6bd33

[InstCombine] Missed optimization in math expression: squashing sqrt functions · a58d8deb

Dmitry Venikov authored Jan 02, 2018

Summary: This patch enables folding under -ffast-math flag sqrt(a) * sqrt(b) -> sqrt(a*b)

Reviewers: hfinkel, spatel, davide

Reviewed By: spatel, davide

Subscribers: davide, llvm-commits

Differential Revision: https://reviews.llvm.org/D41322

llvm-svn: 321637

a58d8deb

Test commit · d2257be8

Dmitry Venikov authored Jan 02, 2018

Reviewers: Quolyk

Reviewed By: Quolyk

Differential Revision: https://reviews.llvm.org/D41561

llvm-svn: 321636

d2257be8

[SelectionDAG] Remove ifs on getTypeAction being TypeWidenVector from some of... · 3e528c75

Craig Topper authored Jan 02, 2018

[SelectionDAG] Remove ifs on getTypeAction being TypeWidenVector from some of the WideVecOp handlers.

We should only be in the handler if the tyep action is TypeWidenVector. There's no reason to try to do anything else.

llvm-svn: 321635

3e528c75

Jan 01, 2018

[ValueTracking] Don't assume shift values are in range · 6720726d
Simon Pilgrim authored Jan 01, 2018
```
Reduced (as best I could...) from oss-fuzz #4857 test case

llvm-svn: 321634
```
6720726d
[InstCombine] Regenerate udiv tests. · af35f5ec
Simon Pilgrim authored Jan 01, 2018
```
llvm-svn: 321633
```
af35f5ec
[X86] Promote vXi1 fp_to_uint/fp_to_sint to vXi32 to avoid scalarization. · c8898b36
Craig Topper authored Jan 01, 2018
```
llvm-svn: 321632
```
c8898b36
[X86] Add test cases for vXi1 fptosi/fptoui. · bb8b79b0
Craig Topper authored Jan 01, 2018
```
Currently we do a lot of scalarization in these test cases.

llvm-svn: 321631
```
bb8b79b0
[X86] Replace custom lowering of vXi1 SINT_TO_FP/UINT_TO_FP with promotion. · e5943bb3
Craig Topper authored Jan 01, 2018
```
The custom lowering was just doing the same thing promotion would do.

llvm-svn: 321630
```
e5943bb3

[SelectionDAG][X86][AArch64] Require targets to specify the promotion type... · a4f99976

Craig Topper authored Jan 01, 2018

[SelectionDAG][X86][AArch64] Require targets to specify the promotion type when using setOperationAction Promote for INT_TO_FP and FP_TO_INT

Currently the promotion for these ignores the normal getTypeToPromoteTo and instead just tries to double the element width. This is because the default behavior of getTypeToPromote to just adds 1 to the SimpleVT, which has the affect of increasing the element count while keeping the scalar size the same.

If multiple steps are required to get to a legal operation type, int_to_fp will be promoted multiple times. And fp_to_int will keep trying wider types in a loop until it finds one that works.

getTypeToPromoteTo does have the ability to query a promotion map to get the type and not do the increasing behavior. It seems better to just let the target specify the promotion type in the map explicitly instead of letting the legalizer iterate via widening.

FWIW, it's worth I think for any other vector operations that need to be promoted, we have to specify the type explicitly because the default behavior of getTypeToPromote isn't useful for vectors. The other types of promotion already require either the element count is constant or the total vector width is constant, but neither happens by incrementing the SimpleVT enum.

Differential Revision: https://reviews.llvm.org/D40664

llvm-svn: 321629

a4f99976

[x86] add runs for more vector variants; NFC · 18962dab
Sanjay Patel authored Jan 01, 2018
```
Preliminary step to see what the effects of D41618 look like.

llvm-svn: 321624
```
18962dab