Commits · baad3f6016b87cbd03578a1df6c6ea69280c4608 · Roger Ferrer / llvm-epi

Aug 09, 2018

[NFC] ConstantMerge: factor out some functions · 42ca9ccb

JF Bastien authored Aug 09, 2018

This makes the code easier to read and will make an upcoming patch I have easier to review because that patch needed this refactoring to reuse some of the functions.

llvm-svn: 339391

42ca9ccb

ConstantMerge: update MadeChange when change is made · ebcaa317
JF Bastien authored Aug 09, 2018
```
It was always false, which is obviously wrong.

llvm-svn: 339390
```
ebcaa317
[LICM] Suppress a compiler warning noticed by one of the bots · 7d794331
Philip Reames authored Aug 09, 2018
```
llvm-svn: 339388
```
7d794331

[RISC-V] Fixed alias for addi x2, x2, 0 · 10de2349

Ana Pazos authored Aug 09, 2018

A missing check for non-zero immediate in MCOperandPredicate
caused c.addi16sp sp, 0 to be selected which is not a valid
instruction.

llvm-svn: 339381

10de2349

[LICM] hoist fences out of loops w/o memory operations · ca256d93

Philip Reames authored Aug 09, 2018

The motivating case is an otherwise dead loop with a fence in it. At the moment, this goes all the way through the optimizer and we end up emitting an entirely pointless loop on x86. This case may seem a bit contrived, but we've seen it in real code as the result of otherwise reasonable lowering strategies combined w/thread local memory optimizations (such as escape analysis).

To handle this simple case, we can teach LICM to hoist must execute fences when there is no other memory operation within the loop.

Differential Revision: https://reviews.llvm.org/D50489

llvm-svn: 339378

ca256d93

Fix typo · ed4f5175
Stephen Kelly authored Aug 09, 2018
```
llvm-svn: 339377
```
ed4f5175

Remove obsolete policy settings · de6dde8b

Stephen Kelly authored Aug 09, 2018

Summary:
The line

 cmake_minimum_required(VERSION 3.4.3)

already has the effect of setting to NEW all policies present in that
release:

 https://cmake.org/cmake/help/v3.4/manual/cmake-policies.7.html

Subscribers: mgorny, llvm-commits

Differential Revision: https://reviews.llvm.org/D50407

llvm-svn: 339376

de6dde8b

[InstCombine] allow fsub+fmul FMF folds for vectors · 55accd7d
Sanjay Patel authored Aug 09, 2018
```
llvm-svn: 339368
```
55accd7d

Fix few g++ 8 warning with non obvious copy object operations · 89005c33

David Carlier authored Aug 09, 2018

Reviewers: dblaikie, dexonsmith	

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D50296

llvm-svn: 339367

89005c33

[NFC] Remove magic bool param in RAUW · e69ae76b
JF Bastien authored Aug 09, 2018
```
Use an enum class instead.

llvm-svn: 339366
```
e69ae76b
[Hexagon] Map ISD::TRAP to J2_trap0(#0) · 75c2ca36
Krzysztof Parzyszek authored Aug 09, 2018
```
llvm-svn: 339365
```
75c2ca36

SCEV should forget all loops containing a deleted block. · bf9fe793

Alina Sbirlea authored Aug 09, 2018

Summary:
LoopSimplifyCFG should update ScEv for all loops after a block is deleted.
If the deleted block "Succ" is part of L, then it is part of all parent loops, so forget topmost loop.

Reviewers: greened, mkazantsev, sanjoy

Subscribers: jlebar, javed.absar, uabelho, llvm-commits

Differential Revision: https://reviews.llvm.org/D50422

llvm-svn: 339363

bf9fe793

[llvm-objcopy] Add --prefix-symbols option · 7a3dc2c1
Paul Semel authored Aug 09, 2018
```
Differential Revision: https://reviews.llvm.org/D50381

llvm-svn: 339362
```
7a3dc2c1
[InstCombine] add vector tests for fsub+fmul; NFC · 37379029
Sanjay Patel authored Aug 09, 2018
```
llvm-svn: 339361
```
37379029

[GlobalOpt] Don't apply fastcc if it would break inalloca invariants · 80c6ec11

Reid Kleckner authored Aug 09, 2018

The inalloca parameter has to be the only parameter passed in memory.
Changing the convention to fastcc can break that.

At some point we should teach global opt how to optimize ABI attributes
like inalloca and maybe byval. These attributes are mainly used to match
C ABIs. They are harder for LLVM to optimize and they don't always
generate the best code.

Fixes PR38487

llvm-svn: 339360

80c6ec11

[SelectionDAG] try harder to convert funnel shift to rotate · 15d1501a

Sanjay Patel authored Aug 09, 2018

Similar to rL337966 - if the DAGCombiner's rotate matching was 
working as expected, I don't think we'd see any test diffs here.

AArch only goes right, and PPC only goes left. 
x86 has both, so no diffs there.

Differential Revision: https://reviews.llvm.org/D50091

llvm-svn: 339359

15d1501a

[llvm-objcopy] Add --dump-section · a42dec7a
Paul Semel authored Aug 09, 2018
```
Differential Revision: https://reviews.llvm.org/D49979

llvm-svn: 339358
```
a42dec7a

extend folding fsub/fadd to fneg for FMF · ca382546

Michael Berg authored Aug 09, 2018

Summary: This change provides a common optimization path for both Unsafe and FMF driven optimization for this fsub fold adding reassociation, as it the flag that most closely represents the translation

Reviewers: spatel, wristow, arsenm

Reviewed By: spatel

Subscribers: wdng

Differential Revision: https://reviews.llvm.org/D50195

llvm-svn: 339357

ca382546

[ARM] Adjust the feature set for Exynos · 8c436627

Evandro Menezes authored Aug 09, 2018

Enable `FeatureZCZeroing`, `FeatureHasSlowFPVMLx`, `FeatureExpandMLx`,
`FeatureProfUnpredicate`, `FeatureSlowVDUP32`, `FeatureSlowVGETLNi32`,
`FeatureSplatVFPToNeon`, `FeatureHasRetAddrStack`, `FeatureSlowFPBrcc` for
all Exynos processors.

llvm-svn: 339356

8c436627

[ARM] Replace processor check with feature · 9a92fe0c

Evandro Menezes authored Aug 09, 2018

Add new feature, `FeatureUseWideStrideVFP`, that replaces the need for a
processor check.  Otherwise, NFC.

llvm-svn: 339354

9a92fe0c

[MC][PredicateExpander] Extend the grammar to support simple switch and return statements. · f3bde048

Andrea Di Biagio authored Aug 09, 2018

This patch introduces tablegen class MCStatement.

Currently, an MCStatement can be either a return statement, or a switch
statement.

```
MCStatement:
   MCReturnStatement
   MCOpcodeSwitchStatement
```

A MCReturnStatement expands to a return statement, and the boolean expression
associated with the return statement is described by a MCInstPredicate.

An MCOpcodeSwitchStatement is a switch statement where the condition is a check
on the machine opcode. It allows the definition of multiple checks, as well as a
default case. More details on the grammar implemented by these two new
constructs can be found in the diff for TargetInstrPredicates.td.

This patch makes it easier to read the body of auto-generated TargetInstrInfo
predicates.

In future, I plan to reuse/extend the MCStatement grammar to describe more
complex target hooks. For now, this is just a first step (mostly a minor
cosmetic change to polish the new predicates framework).

Differential Revision: https://reviews.llvm.org/D50457

llvm-svn: 339352

f3bde048

[MC] Remove PhysRegSize from MCRegisterClass · c8b782ce

Bjorn Pettersson authored Aug 09, 2018

Summary:
The interface to get size and spill size of a register
was moved from MCRegisterInfo to TargetRegisterInfo over
a year ago. Afaik the old interface has bee around
to give out-of-tree targets a chance to adapt to the
new interface.

One problem with the old MCRegisterClass::PhysRegSize was that
it represented the size of a register as "size in bits" / 8.
So a register had to be a multiple of eight bits wide for the
size to be correct (and the byte size for the target needed to
be eight bits).

Reviewers: kparzysz, qcolombet

Reviewed By: kparzysz

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D47199

llvm-svn: 339350

c8b782ce

[InstCombine] reduce code duplication; NFC · ebec4204
Sanjay Patel authored Aug 09, 2018
```
llvm-svn: 339349
```
ebec4204
[TargetLowering] Add BuildSDIVPattern helper to BuildExactSDIV (NFCI). · a9f95429
Simon Pilgrim authored Aug 09, 2018
```
As requested in D50392, pull the magic constant calculations out into a helper function.

llvm-svn: 339346
```
a9f95429
[ARM] FP16: codegen support for VTRN · 806f70d2
Sjoerd Meijer authored Aug 09, 2018
```
Differential Revision: https://reviews.llvm.org/D50454

llvm-svn: 339340
```
806f70d2

[X86][SSE] Remove PMULDQ/PMULUDQ by zero · 511c3fc5

Simon Pilgrim authored Aug 09, 2018

Exposed by D50328

Differential Revision: https://reviews.llvm.org/D50328

llvm-svn: 339337

511c3fc5

[X86][SSE] Combine (some) target shuffles with multiple uses · 01ae462f

Simon Pilgrim authored Aug 09, 2018

As discussed on D41794, we have many cases where we fail to combine shuffles as the input operands have other uses.

This patch permits these shuffles to be combined as long as they don't introduce additional variable shuffle masks, which should reduce instruction dependencies and allow the total number of shuffles to still drop without increasing the constant pool.

However, this may mean that some memory folds may no longer occur, and on pre-AVX require the occasional extra register move.

This also exposes some poor PMULDQ/PMULUDQ codegen which was doing unnecessary upper/lower calculations which will in fact fold to zero/undef - the fix will be added in a followup commit.

Differential Revision: https://reviews.llvm.org/D50328

llvm-svn: 339335

01ae462f

vs integration: bump version number · 79cf42e8
Hans Wennborg authored Aug 09, 2018
```
llvm-svn: 339330
```
79cf42e8
vs integration: update the manifest to require VS 2017 · 0d35871a
Hans Wennborg authored Aug 09, 2018
```
It previously erroneously said only VS2015 was required.

llvm-svn: 339329
```
0d35871a
[X86] Improved sched models for X86 XCHG*rr and XADD*rr instructions. · 24f63bcb
Andrew V. Tischenko authored Aug 09, 2018
```
Differential Revision: https://reviews.llvm.org/D49861

llvm-svn: 339321
```
24f63bcb
cmake: don't pack system libs unless CMAKE_INSTALL_UCRT_LIBRARIES is set (PR38476) · 5df524f8
Hans Wennborg authored Aug 09, 2018
```
llvm-svn: 339319
```
5df524f8

[NVPTX] Select atomic loads and stores · 20526bf4

Jonas Hahnfeld authored Aug 09, 2018

According to PTX ISA .volatile has the same memory synchronization
semantics as .relaxed.sys, so it can be used to implement monotonic
atomic loads and stores. This is important for OpenMP's atomic
construct where
 - 'read's and 'write's are lowered to atomic loads and stores, and
 - an update of float or double types are lowered into a cmpxchg loop.
(Note that PTX could do better because it has atom.add.f{32,64} but
LLVM's atomicrmw instruction only allows integer types.)

Higher levels of atomicity (like acquire and release) need additional
synchronization properties which were added with PTX ISA 6.0 / sm_70.
So using these instructions still results in an error.

Differential Revision: https://reviews.llvm.org/D50391

llvm-svn: 339316

20526bf4

[RISCV] Add "lla" pseudo-instruction to assembler · 577a97e2

Roger Ferrer Ibanez authored Aug 09, 2018

This pseudo-instruction is similar to la but uses PC-relative addressing
unconditionally. This is, la is only different to lla when using -fPIC. This
pseudo-instruction seems often forgotten in several specs but it is definitely
mentioned in binutils opcodes/riscv-opc.c. The semantics are defined both in
page 37 of the "RISC-V Reader" book but also in function macro found in
gas/config/tc-riscv.c.

This is a very first step towards adding PIC support for Linux in the RISC-V
backend.

The lla pseudo-instruction expands to a sequence of auipc + addi with a couple
of pc-rel relocations where the second points to the first one. This is
described in
https://github.com/riscv/riscv-elf-psabi-doc/blob/master/riscv-elf.md#pc-relative-symbol-addresses

For now, this patch only introduces support of that pseudo instruction at the
assembler parser.

Differential Revision: https://reviews.llvm.org/D49661

llvm-svn: 339314

577a97e2

[LICM] Add tests for future hoisting of fence instructions [NFC] · 954eab10

Philip Reames authored Aug 09, 2018

The main interesting case is a fence in an otherwise dead loop or one containing only arithmetic.  This can happen as a result of DSE or other transforms from seemingly reasonable initial IR.  

llvm-svn: 339310

954eab10

[NFC] ConstantMerge: don't insert when find should be used · 3f270336

JF Bastien authored Aug 09, 2018

Summary: DenseMap's operator[] performs an insertion if the entry isn't found. The second phase of ConstantMerge isn't trying to insert anything: it's just looking to see if the first phased performed an insertion. Use find instead, avoiding insertion of every single global initializer in the map of constants. This has the side-effect of making all entries in CMap non-null (because only global declarations would have null initializers, and that would be a bug).

Subscribers: dexonsmith, llvm-commits

Differential Revision: https://reviews.llvm.org/D50476

llvm-svn: 339309

3f270336

[LICM] Add an assert to ensure all instruction types needing aliasing are handled [NFC] · 22b20a09
Philip Reames authored Aug 09, 2018
```
llvm-svn: 339308
```
22b20a09

[CMake] Use normalized Windows target triples · eb46c95c

Petr Hosek authored Aug 09, 2018

Changes the default Windows target triple returned by
GetHostTriple.cmake from the old environment names (which we wanted to
move away from) to newer, normalized ones. This also requires updating
all tests to use the new systems names in constraints.

Differential Revision: https://reviews.llvm.org/D47381

llvm-svn: 339307

eb46c95c

[DWARF] Verifier now handles .debug_types sections. · 508b0815
Paul Robinson authored Aug 08, 2018
```
Differential Revision: https://reviews.llvm.org/D50466

llvm-svn: 339302
```
508b0815
[x86] add test for commuted variant for fsub fold; NFC · f9a80fe8
Sanjay Patel authored Aug 08, 2018
```
llvm-svn: 339300
```
f9a80fe8

[DAGCombiner] loosen constraints for fsub+fadd fold · e47dc1a4

Sanjay Patel authored Aug 08, 2018

isNegatibleForFree() should not matter here (as the test diffs show)
because it's always a win to replace an fsub+fadd with fneg. The
problem in D50195 persists because either (1) we are doing these
folds in the wrong order or (2) we're missing another fold for fadd.

llvm-svn: 339299

e47dc1a4