Commits · d8d940a872e13f962c33ad9516d676be98202425 · Roger Ferrer / llvm-epi

May 17, 2018

[llvm-mca] Add an example showing how to get Intel assembly syntax · d8d940a8
Andrea Di Biagio authored May 17, 2018
```
Patch by Jeff Muizelaar.

llvm-svn: 332627
```
d8d940a8

[X86] Split WriteCMOV + WriteCMOV2 scheduler classes · 2782a19f

Simon Pilgrim authored May 17, 2018

Handle SNB+ targets which treat CMOVA/CMOVBE specially due to partial EFLAGS handling.

llvm-svn: 332626

2782a19f

AMDGPU/SI: Handle infinite loop for the structurizer to work with CFG with infinite loops. · 391bcf88

Changpeng Fang authored May 17, 2018

Summary:
The current StructurizeCFG pass only works for CFG with one exit. AMDGPUUnifyDivergentExitNodes combines multiple "return" blocks and/or "unreachable" blocks
to one exit block for the Structurizer to work. However, infinite loop is another kind of special "exit", and if we don't handle it, the case of multiple exits will prevent the structurizer from working.

In this work, for each infinite loop, we add a dummy edge to the "return" block, and thus the AMDGPUUnifyDivergentExitNodes pass will work with infinite loops.
This will make CFG with infinite loops be structurized.

Reviewer:
nhaehnle

Differential Revision:
https://reviews.llvm.org/D46340

llvm-svn: 332625

391bcf88

[mips] Add support for Global INValidate ASE · daf51693

Petar Jovanovic authored May 17, 2018

This includes

  Instructions: ginvi, ginvt,

  Assembler directives: .set ginv, .set noginv, .module ginv, .module noginv

  Attribute: ginv

  .MIPS.abiflags: GINV (0x20000)

Patch by Vladimir Stefanovic.

Differential Revision: https://reviews.llvm.org/D46268

llvm-svn: 332624

daf51693

[InstCombine] Propagate the nsw/nuw flags from the add in the 'shifty' abs... · bd332588

Craig Topper authored May 17, 2018

[InstCombine] Propagate the nsw/nuw flags from the add in the 'shifty' abs pattern to the sub in the select version.

According to alive this is valid. I'm hoping to use this to make an assumption that the sign bit is zero after this sequence. The only way it wouldn't be is if the input was INT__MIN, but by preserving the flags we can make doing this to INT_MIN UB.

The nuw flags is weird because it creates such a contradiction that the original number would have to be positive meaning we could remove the select entirely, but we don't get that far.

Differential Revision: https://reviews.llvm.org/D46988

llvm-svn: 332623

bd332588

[llvm-mca][X86] Add CMOV test files · e389ea0e
Simon Pilgrim authored May 17, 2018
```
llvm-svn: 332622
```
e389ea0e

[RISCV] Set isReMaterializable on ADDI and LUI instructions · 6a53023b

Alex Bradbury authored May 17, 2018

The isReMaterlizable flag is somewhat confusing, unlike most other instruction
flags it is currently interpreted as a hint (mightBeRematerializable would be
a better name). While LUI is always rematerialisable, for an instruction like
ADDI it depends on its operands. TargetInstrInfo::isTriviallyReMaterializable
will call TargetInstrInfo::isReallyTriviallyReMaterializable, which in turn
calls TargetInstrInfo::isReallyTriviallyReMaterializableGeneric. We rely on
the logic in the latter to pick out instances of ADDI that really are
rematerializable.

The isReMaterializable flag does make a difference on a variety of test
programs. The recently committed remat.ll test case demonstrates how stack
usage is reduce and a unnecessary lw/sw can be removed. Stack usage in the
Proc0 function in dhrystone reduces from 192 bytes to 112 bytes.

For the sake of completeness, this patch also implements
RISCVRegisterInfo::isConstantPhysReg. Although this is called from a number of
places, it doesn't seem to result in different codegen for any programs I've
thrown at it. However, it is called in the rematerialisation codepath and it
seems sensible to implement something correct here.

Differential Revision: https://reviews.llvm.org/D46182

llvm-svn: 332617

6a53023b

[X86][BtVer2] ADC/SBB take 2cy on an ALU pipe, not 1cy like ADD/SUB · b5741f5c
Simon Pilgrim authored May 17, 2018
```
llvm-svn: 332616
```
b5741f5c
[llvm-mca] Hide unrelated flags from the -help output. · 55e9e0fe
Andrea Di Biagio authored May 17, 2018
```
llvm-svn: 332615
```
55e9e0fe
[llvm-exegesis] Remove redudant explicit template instantiations. · a1bee623
Clement Courbet authored May 17, 2018
```
llvm-svn: 332611
```
a1bee623

In thin and full LTO + CFI, direct function calls may go through jump table · 3c6b4e35

Dmitry Mikulin authored May 17, 2018

entries to reach the target. Since these calls don't require type checks,
we can short-circuit them to their real targets.

Differential Revision: https://reviews.llvm.org/D46326

llvm-svn: 332610

3c6b4e35

[llvm-exegesis] Write out inconsistencies to a file. · cf210746

Clement Courbet authored May 17, 2018

Reviewers: gchatelet

Subscribers: tschuett, llvm-commits

Differential Revision: https://reviews.llvm.org/D47013

llvm-svn: 332608

cf210746

[Hexagon] Use addAliasForDirective for data directives · 5e41fc83

Alex Bradbury authored May 17, 2018

Data directives such as .word, .half, .hword are currently parsed using 
HexagonAsmParser::ParseDirectiveValue which effectively duplicates logic from 
AsmParser::parseDirectiveValue. This patch deletes that duplicated logic in 
favour of using addAliasForDirective.

Differential Revision: https://reviews.llvm.org/D46999

llvm-svn: 332607

5e41fc83

[X86] Split WriteADC/WriteADCRMW scheduler classes · 0c0336e0
Simon Pilgrim authored May 17, 2018
```
For integer ALU instructions taking eflags as an input (ADC/SBB/ADCX/ADOX)

llvm-svn: 332605
```
0c0336e0
[llvm-exegesis] Disable failing ARM assembler tests. · 2abea6f2
Clement Courbet authored May 17, 2018
```
llvm-svn: 332604
```
2abea6f2

[llvm-mca] add flag -all-views and flag -all-stats. · 650b5fc6

Andrea Di Biagio authored May 17, 2018

Flag -all-views enables all the views.
Flag -all-stats enables all the views that print hardware statistics.

llvm-svn: 332602

650b5fc6

[llvm-exegesis] Analysis: detect clustering inconsistencies. · 448550d9

Clement Courbet authored May 17, 2018

Summary:
Warn on instructions that should have the same performance
characteristics according to the sched model but actually
differ in their benchmarks.

Next step: Make the display nicer to browse, I was thinking maybe html.

Reviewers: gchatelet

Subscribers: tschuett, llvm-commits

Differential Revision: https://reviews.llvm.org/D46945

llvm-svn: 332601

448550d9

[llvm-exegesis] Disable the tests failing on buildbots while we investigate. · 3d5e08de
Clement Courbet authored May 17, 2018
```
llvm-svn: 332600
```
3d5e08de

[SystemZ] Commenting (NFC) · caafed55

Jonas Paulsson authored May 17, 2018

Some minor commenting in scheduler files.

Review: Ulrich Weigand
llvm-svn: 332599

caafed55

[llvm-exegesis][NFC] Remove dead function. · 3bbdea4a
Clement Courbet authored May 17, 2018
```
llvm-svn: 332597
```
3bbdea4a
[llvm-mca][X86] Add ADX test files · b4fd145f
Simon Pilgrim authored May 17, 2018
```
llvm-svn: 332595
```
b4fd145f
Fix r332592 : X86 tests should use the X86 target, not the native targets. · 0994ec2f
Clement Courbet authored May 17, 2018
```
llvm-svn: 332594
```
0994ec2f

reland r332579: [llvm-exegesis] Update to cover latency through another opcode. · 0e69e2d7

Clement Courbet authored May 17, 2018

Restructuring the code to measure latency and uops.
The end goal is to have this program spawn another process to deal with SIGILL and other malformed programs. It is not yet the case in this redesign, it is still the main program that runs the code (and may crash).
It now uses BitVector instead of Graph for performance reasons.

https://reviews.llvm.org/D46821

(with fixed ARM tests)

Authored by Guillaume Chatelet

llvm-svn: 332592

0e69e2d7

[X86][SNB] Minor scheduler cleanup · ceb4933d
Simon Pilgrim authored May 17, 2018
```
Merge 2 instregex and explain the VMOVDQArr/MOVDQArr difference

llvm-svn: 332591
```
ceb4933d

[AArch64][SVE] Asm: Support for structured ST2, ST3 and ST4 (scalar+scalar) store instructions. · 75cfa341

Sander de Smalen authored May 17, 2018

Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar

Reviewed By: SjoerdMeijer

Differential Revision: https://reviews.llvm.org/D46680

llvm-svn: 332584

75cfa341

Require DominatorTree when requiring/preserving LoopInfo in the old pass manager · 2ca16899

Mikael Holmen authored May 17, 2018

Summary:
Require DominatorTree when requiring/preserving LoopInfo in the old pass manager

BreakCriticalEdges tries to keep LoopInfo and DominatorTree updated if they
exist. However, since commit r321653 and r321805, to update LoopInfo we
must have a DominatorTree, or we will hit an assert.

To fix this we now make a couple of passes that only required/preserved
LoopInfo also require DominatorTree.

This solves PR37334.

Reviewers: eli.friedman, efriedma

Reviewed By: efriedma

Subscribers: efriedma, llvm-commits

Differential Revision: https://reviews.llvm.org/D46829

llvm-svn: 332583

2ca16899

[Analysis] Only use _unlocked stdio functions on linux · c1078872

Martin Storsjö authored May 17, 2018

The existing comment said that the functions were available only
on GNU/Linux (and on certain Android versions), but only checked
T.isGNUEnvironment() which also is true on MinGW (for arch-windows-gnu
triplets), which doesn't have such functions.

Existing checks in the initialize function in TargetLibraryInfo.cpp
also use only T.isOSLinux() to check for glibc features.

This fixes use of stdio on MinGW.

Differential Revision: https://reviews.llvm.org/D47002

llvm-svn: 332581

c1078872

Revert r332579 "[llvm-exegesis] Update to cover latency through another opcode." · 295a554c
Clement Courbet authored May 17, 2018
```
The revision failed to update the ARM tests.

llvm-svn: 332580
```
295a554c

[llvm-exegesis] Update to cover latency through another opcode. · ee110fb7

Clement Courbet authored May 17, 2018

Restructuring the code to measure latency and uops.
The end goal is to have this program spawn another process to deal with SIGILL and other malformed programs. It is not yet the case in this redesign, it is still the main program that runs the code (and may crash).
It now uses BitVector instead of Graph for performance reasons.

https://reviews.llvm.org/D46821

Authored by Guillaume Chatelet

llvm-svn: 332579

ee110fb7

[SROA] Handle PHI with multiple duplicate predecessors · 81a76a38

Bjorn Pettersson authored May 17, 2018

Summary:
The verifier accepts PHI nodes with multiple entries for the
same basic block, as long as the value is the same.

As seen in PR37203, SROA did not handle such PHI nodes properly
when speculating loads over the PHI, since it inserted multiple
loads in the predecessor block and changed the PHI into having
multiple entries for the same basic block, but with different
values.

This patch teaches SROA to reuse the same speculated load for
each PHI duplicate entry in such situations.

Resolves: https://bugs.llvm.org/show_bug.cgi?id=37203

Reviewers: uabelho, chandlerc, hfinkel, bkramer, efriedma

Reviewed By: efriedma

Subscribers: dberlin, efriedma, llvm-commits

Differential Revision: https://reviews.llvm.org/D46426

llvm-svn: 332577

81a76a38

[SROA] pr37267: fix assertion failure in integer widening · f5c0e6c2

Hiroshi Inoue authored May 17, 2018

The current integer widening does not support rewriting partial split slices in rewriteIntegerStore (and rewriteIntegerLoad).
This patch adds explicit checks for this case in isIntegerWideningViableForSlice.
Before r322533, splitting is allowed only for the whole-alloca slice and hence the above case is implicitly rejected by another check `if (DL.getTypeStoreSize(ValueTy) > Size)` because whole-alloca slice is larger than the partition.

Differential Revision: https://reviews.llvm.org/D46750

llvm-svn: 332575

f5c0e6c2

[RISCV] Add support for .half, .hword, .word, .dword directives · cea6db04

Alex Bradbury authored May 17, 2018

These directives are recognised by gas. Support is added through the use of 
addAliasForDirective.

Also match RISC-V gcc in preferring .half and .word for 16-bit and 32-bit data 
directives.

llvm-svn: 332574

cea6db04

[X86] Add OptForSize to a couple load folding patterns. Remove some bad FIXME comments. · a2c52647

Craig Topper authored May 17, 2018

The FIXME comments were about preventing load folding to avoid a partial xmm update. But these instructions use GPR as input when the load isn't folded. This won't help prevent a partial xmm update.

llvm-svn: 332573

a2c52647

[CMake] Support building shared library for Fuchsia · dfbb9416

Petr Hosek authored May 17, 2018

Fuchsia uses ELF as a file format and LLD as the linker so we can
use the same implementation as other ELF based platforms.

Differential Revision: https://reviews.llvm.org/D46991

llvm-svn: 332570

dfbb9416

[Thumb2] fix typo in test from r332548 · 2e50cec5
Sanjay Patel authored May 17, 2018
```
llvm-svn: 332569
```
2e50cec5
Mark test with "REQUIRES: shell" since it directly invokes "sh" and was failing on Windows. · 2dd62a3d
Douglas Yung authored May 17, 2018
```
llvm-svn: 332563
```
2dd62a3d
[AMDGPU] Move lsr test. NFC. · 595fdcf4
Stanislav Mekhanoshin authored May 17, 2018
```
llvm-svn: 332562
```
595fdcf4
[WebAssembly] Fix the opcode number for i64.load16_u. · aef67410
Dan Gohman authored May 17, 2018
```
Fixes PR37488.

llvm-svn: 332561
```
aef67410

[CodeGen] Use MachineInstr::getOperand(0) instead of gets the defs... · 342273a1

Craig Topper authored May 16, 2018

[CodeGen] Use MachineInstr::getOperand(0) instead of gets the defs iterator_range and calling begin. NFC

Defs are well defined to come first in MachineInstr operand list. No need for a more complex indirection.

llvm-svn: 332559

342273a1

Revert 332508 as it caused problems in the clang test suite. · f81f3a83
Greg Clayton authored May 16, 2018
```
llvm-svn: 332555
```
f81f3a83