Commits · 63aca02f084f99c5d3c82703a0bb07b590919460 · Roger Ferrer / llvm-epi

Oct 26, 2018

add myself to the CREDITS.TXT · 63aca02f
Li Jia He authored Oct 26, 2018
```
llvm-svn: 345355
```
63aca02f

Teach the DominatorTree fallback to recalculation when applying updates to speedup JT (PR37929) · 32fd196c

Chijun Sima authored Oct 26, 2018

Summary:
This patch makes the dominatortree recalculate when applying updates with the size of the update vector larger than a threshold. Directly applying updates is usually slower than recalculating the whole domtree in this case. This patch fixes an issue which causes JT running slowly on some inputs.

In bug 37929, the dominator tree is trying to apply 19,000+ updates several times, which takes several minutes.

After this patch, the time used by DT.applyUpdates:

| Input | Before (s) | After (s) | Speedup |
| the 2nd Reproducer in 37929 | 297 | 0.15 | 1980x |
| clang-5.0.0.0.bc | 9.7 | 4.3 | 2.26x |
| clang-5.0.0.4.bc | 11.6 | 2.6 | 4.46x |

Reviewers: kuhar, brzycki, trentxintong, davide, dmgreen, grosser

Reviewed By: kuhar, brzycki

Subscribers: kristina, llvm-commits

Differential Revision: https://reviews.llvm.org/D53245

llvm-svn: 345353

32fd196c

[SystemZ] Implement SystemZOperand::print() · dda46307

Jonas Paulsson authored Oct 26, 2018

SystemZAsmParser can now handle -debug by printing the operands neatly to the
output stream. Before this patch this lead to an llvm_unreachable().

It seems that now '-mllvm -debug' does not cause any crashes anywhere (at
least not on SPEC).

Review: Ulrich Weigand
https://reviews.llvm.org/D53328

llvm-svn: 345349

dda46307

Dump public symbol records in pdb2yaml mode · ed2597e9
Zachary Turner authored Oct 26, 2018
```
llvm-svn: 345348
```
ed2597e9

[SystemZ] Pass the DAG pointer from SystemZAddressingMode::dump(). · e2c5cbc1

Jonas Paulsson authored Oct 26, 2018

In order to print the IR slot number for the memory operand, the DAG pointer
must be passed to SDNode::dump().

The isel-debug.ll test updated to also check for the IR Value reference being
printed correctly.

Review: Ulrich Weigand
https://reviews.llvm.org/D53333

llvm-svn: 345347

e2c5cbc1

Reland "[WebAssembly] LSDA info generation" · 24faf859

Heejin Ahn authored Oct 25, 2018

Summary:
This adds support for LSDA (exception table) generation for wasm EH.
Wasm EH mostly follows the structure of Itanium-style exception tables,
with one exception: a call site table entry in wasm EH corresponds to
not a call site but a landing pad.

In wasm EH, the VM is responsible for stack unwinding. After an
exception occurs and the stack is unwound, the control flow is
transferred to wasm 'catch' instruction by the VM, after which the
personality function is called from the compiler-generated code. (Refer
to WasmEHPrepare pass for more information on this part.)

This patch:
- Changes wasm.landingpad.index intrinsic to take a token argument, to
make this 1:1 match with a catchpad instruction
- Stores landingpad index info and catch type info MachineFunction in
before instruction selection
- Lowers wasm.lsda intrinsic to an MCSymbol pointing to the start of an
exception table
- Adds WasmException class with overridden methods for table generation
- Adds support for LSDA section in Wasm object writer

Reviewers: dschuff, sbc100, rnk

Subscribers: mgorny, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D52748

llvm-svn: 345345

24faf859

[WebAssembly] Support EH instructions in InstPrinter · 3103d3dc

Heejin Ahn authored Oct 25, 2018

Summary: This adds support for exception handling instructions to InstPrinter.

Reviewers: dschuff, aardappel

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D53634

llvm-svn: 345343

3103d3dc

Fix in MachineOperand::printIRValueReference(). · f213f81d

Jonas Paulsson authored Oct 25, 2018

Handle the case where getCurrentFunction() returns nullptr by passing -1 to
printIRSlotNumber(). This will result in <badref> being printed instead of an
assertion failure.

Review: Francis Visoiu Mistrih
https://reviews.llvm.org/D53333

llvm-svn: 345342

f213f81d

[AArch64] Implement FP16FML intrinsics · f0923f16

Bryan Chan authored Oct 25, 2018

Add LLVM intrinsics for the ARMv8.2-A FP16FML vector-form instructions. Add a
DAG pattern to define the indexed-form intrinsics in terms of the vector-form
ones, similarly to how the Dot Product intrinsics were implemented.

Based on a patch by Gao Yiling.

Differential Revision: https://reviews.llvm.org/D53632

llvm-svn: 345337

f0923f16

Delete test case. Assertions can't be tested. · 8370a95d
Heejin Ahn authored Oct 25, 2018
```
llvm-svn: 345336
```
8370a95d
Tidy up test case · cc719ba0
Heejin Ahn authored Oct 25, 2018
```
llvm-svn: 345335
```
cc719ba0

Address comments · 1d13e6be

Heejin Ahn authored Oct 25, 2018

- Add llvm-mc test case (and delete the old one)
- Change report_fatal_error to assertions

llvm-svn: 345334

1d13e6be

[WebAssembly] Error out when block/loop markers mismatch · 1147d914

Heejin Ahn authored Oct 25, 2018

Summary:
Currently InstPrinter ignores if there are mismatches between block/loop
and end markers by skipping the case if ControlFlowStack is empty. I
guess it is better to explicitly error out in this case, because this
signals invalid input.

Reviewers: aardappel

Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D53620

llvm-svn: 345333

1147d914

[SystemZ] NFC reformatting in SystemZTargetTransformInfo.cpp · 2b280ea6
Jonas Paulsson authored Oct 25, 2018
```
Some lines more than 80 characters long reformatted.

llvm-svn: 345331
```
2b280ea6

[SystemZ] Improve getMemoryOpCost() to find foldable loads that are converted. · b7caa809

Jonas Paulsson authored Oct 25, 2018

The SystemZ backend can do arithmetic of memory by loading and then extending
one of the operands. Similarly, a load + truncate can be folded into an
operand.

This patch improves the SystemZ TTI cost function to recognize this.

Review: Ulrich Weigand
https://reviews.llvm.org/D52692

llvm-svn: 345327

b7caa809

DebugInfo: Explain why DW_LLE_(GNU_)startx_length is used · 73c2f197

David Blaikie authored Oct 25, 2018

This isn't the most object-size efficient encoding, but it's the only
one GDB supports for the pre-standard fission format. I've written fixes
for this twice now... - so perhaps this comment will help me remember
why neither of these have been committed and why I shouldn't try to
write a third fix another year from now...

llvm-svn: 345326

73c2f197

[x86] add tests for missed load folding; NFC · c14aafda
Sanjay Patel authored Oct 25, 2018
```
llvm-svn: 345325
```
c14aafda

Oct 25, 2018

[SystemZ] Improve handling and cost estimates of vector integer div/rem · 4645711a

Jonas Paulsson authored Oct 25, 2018

Enable the DAG optimization that converts vector div/rem with constants into
multiply+shifts sequences by expanding them early. This is needed since
ISD::SMUL_LOHI is 'Custom' lowered on SystemZ, and will therefore not be
available to BuildSDIV after legalization.

Better cost values for these instructions based on how they will be
implemented (a constant divisor is cheaper).

Review: Ulrich Weigand
https://reviews.llvm.org/D53196

llvm-svn: 345321

4645711a

llvm-dwarfdump: loclists: Don't expect an (albeit empty) expression for LLE_base_address · 2f9c42c9
David Blaikie authored Oct 25, 2018
```
llvm-svn: 345320
```
2f9c42c9

[Pipeliner] Ignore Artificial dependences while computing recurrences. · ada0f511

Sumanth Gundapaneni authored Oct 25, 2018

The artificial dependencies are not real dependencies. In some cases, they
form circuits with bigger MII. However, they are used to schedule instructions
better.

Differential Revision: https://reviews.llvm.org/D53450

llvm-svn: 345319

ada0f511

[Pipeliner] Remove the unneeded include header(NFC). · dfdbc716
Sumanth Gundapaneni authored Oct 25, 2018
```
Differential Revision: https://reviews.llvm.org/D53451

llvm-svn: 345318
```
dfdbc716

[X86] Change X86 backend to look for 'min-legal-vector-width' attribute... · 813064bf

Craig Topper authored Oct 25, 2018

[X86] Change X86 backend to look for 'min-legal-vector-width' attribute instead of 'required-vector-width' when determining whether 512-bit vectors should be legal.

The required-vector-width attribute was only used for backend testing and has never been generated by clang.

I believe clang is now generating min-legal-vector-width for vector uses in user code.

With this I believe passing -mprefer-vector-width=256 to clang should prevent use of zmm registers in the generated assembly unless the user used a 512-bit intrinsic in their source code.

llvm-svn: 345317

813064bf

[CodeGen] Remove operands from FENTRY_CALL · 5be9e6de

Francis Visoiu Mistrih authored Oct 25, 2018

FENTRY_CALL is actually not taking any input / output operands. The
machine verifier complains now because the target description says that:

* It needs 1 unknown output
* It needs 1 or more variable inputs

llvm-svn: 345316

5be9e6de

[AArch64] Create proper memoperand for multi-vector stores · 53e869da

David Greene authored Oct 25, 2018

Include all of the store's source vector operands when creating the
MachineMemOperand. Previously, we were missing the first operand,
making the store size seem smaller than it really is.

Differential Revision: https://reviews.llvm.org/D52816

llvm-svn: 345315

53e869da

[AArch64][GlobalISel] Simplify a legalizer test. NFC. · f28e81f6
Volkan Keles authored Oct 25, 2018
```
llvm-svn: 345307
```
f28e81f6

[WebAssembly] Use target-independent saturating add · 0aad98fd

Thomas Lively authored Oct 25, 2018

Reviewers: aheejin, dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D53721

llvm-svn: 345299

0aad98fd

[X86] Add some non-AVX512VL command lines to the *vl-vec-test-testn.ll tests. · 4a825e7b
Craig Topper authored Oct 25, 2018
```
This will expose some regressions in the WIP and/or/xor promotion removal patch.

llvm-svn: 345297
```
4a825e7b

[FPEnv] Last BinaryOperator::isFNeg(...) to m_FNeg(...) changes · 384a74b0

Cameron McInally authored Oct 25, 2018

Replacing BinaryOperator::isFNeg(...) to avoid regressions when we
separate FNeg from the FSub IR instruction.

Differential Revision: https://reviews.llvm.org/D53650

llvm-svn: 345295

384a74b0

[X86] Add KNL command lines to movmsk-cmp.ll. · ce0bc381

Craig Topper authored Oct 25, 2018

Some of this code looks pretty bad and we should probably still be using movmskb more with avx512f.

llvm-svn: 345293

ce0bc381

[GlobalISel] LegalizerHelper: Fix the incorrect alignment when splitting... · 60c6affc

Volkan Keles authored Oct 25, 2018

[GlobalISel] LegalizerHelper: Fix the incorrect alignment when splitting loads/stores in narrowScalar

Reviewers: dsanders, bogner, jpaquette, aemerson, ab, paquette

Reviewed By: dsanders

Subscribers: rovka, kristof.beyls, javed.absar, llvm-commits

Differential Revision: https://reviews.llvm.org/D53664

llvm-svn: 345292

60c6affc

[LegalizeDAG] Remove dead SINT_TO_FP legalization code · f02c0f8a

Simon Pilgrim authored Oct 25, 2018

As noticed on D52965, the SINT_TO_FP i64 to f32 legalization code has been dead for years - protected by an assert.

Differential Revision: https://reviews.llvm.org/D53703

llvm-svn: 345290

f02c0f8a

[GISel] LegalizerInfo: Rename MemDesc::Size to SizeInBits to make the value clearer · f87473fe
Volkan Keles authored Oct 25, 2018
```
Requested in D53679.

llvm-svn: 345288
```
f87473fe
[X86] Remove ProcIntelKNL and replace with a SlowPMADDWD flag to use in the... · c10de9a3
Craig Topper authored Oct 25, 2018
```
[X86] Remove ProcIntelKNL and replace with a SlowPMADDWD flag to use in the one place it was checked.

llvm-svn: 345286
```
c10de9a3

[X86] Remove some uarch tuning flags from KNL that look to have been inherited... · 5d787ac4

Craig Topper authored Oct 25, 2018

[X86] Remove some uarch tuning flags from KNL that look to have been inherited from SNB/IVB incorrectly

KNL is based on a modified Silvermont core so I don't think these features apply. I think the LEA flag is probably also wrong, but I'm less sure as I barely understand the 3 LEA flags we have currently.

Differential Revision: https://reviews.llvm.org/D53671

llvm-svn: 345285

5d787ac4

[AArch64][GlobalISel] Fix the LegalityPredicate for lowerIf for G_LOAD/G_STORE · 3a103b1d

Volkan Keles authored Oct 25, 2018

Summary:
Currently, Legalizer is trying to lower G_LOAD with a vector type
that has more than two elements due to the incorrect LegalityPredicate.

This patch fixes the issue by removing the multiplication by 8
as `MemDesc.Size` already contains the size in bits.

Reviewers: dsanders, aemerson

Reviewed By: dsanders

Subscribers: rovka, javed.absar, kristof.beyls, llvm-commits

Differential Revision: https://reviews.llvm.org/D53679

llvm-svn: 345282

3a103b1d

[llvm-mca] Introduce a new base class for mca::Instruction, and change how... · 1e6d0aad

Andrea Di Biagio authored Oct 25, 2018

[llvm-mca] Introduce a new base class for mca::Instruction, and change how read/write information is stored.

This patch introduces a new base class for Instruction named InstructionBase.
Class InstructionBase is responsible for tracking data dependencies with the
help of ReadState and WriteState objects.  Class Instruction now derives from
InstructionBase, and adds extra information related to the `InstrStage` as well
as the `RCUTokenID`.

ReadState and WriteState objects are no longer unique pointers. This avoids
extra heap allocation and pointer checks that weren't really needed.  Now, those
objects are simply stored into SmallVectors.  We use a SmallVector instead of a
std::vector because we expect most instructions to only have a very small number
of reads and writes.  By using a simple SmallVector we also avoid extra heap
allocations most of the time.
In a debug build, this improves the performance of llvm-mca by roughly 10% (I
still have to verify the impact in performance on a release build).

llvm-svn: 345280

1e6d0aad

[AArch64] Refactor Exynos feature sets (NFC) · b53cf993
Evandro Menezes authored Oct 25, 2018
```
llvm-svn: 345279
```
b53cf993
[ARM] Regenerate vdup tests · 8f11ddc3
Simon Pilgrim authored Oct 25, 2018
```
llvm-svn: 345276
```
8f11ddc3

[AArch64] Add EXT patterns for 64-bit EXT of a subvector of a 128-bit vector · 95886520

John Brawn authored Oct 25, 2018

If we have a 64-bit EXT where one of the operands is a subvector of a 128-bit
vector then in some cases we can eliminate an extract_subvector by converting
to a 128-bit EXT of the 128-bit vector.

Differential Revision: https://reviews.llvm.org/D53582

llvm-svn: 345275

95886520

[ARM] Use Cortex-A57 sched model for Cortex-A72 · a16667e7

Sam Parker authored Oct 25, 2018

This mirrors what we already do for AArch64 as the cores are similar.
As discussed in the review, enabling the machine scheduler causes
more variations in performance changes so it is not enabled for now.
This patch improves LNT scores by a geomean of 1.57% at -O3.

Differential Revision: https://reviews.llvm.org/D53562

llvm-svn: 345272

a16667e7