Commits · 0f30960619f9c6b3566d3d4ed9a9bdd6ee0d3725 · Roger Ferrer / llvm-epi

Sep 30, 2019

Reland "[utils] Implement the llvm-locstats tool" · 0f309606

Djordje Todorovic authored Sep 30, 2019

The tool reports verbose output for the DWARF debug location coverage.
The llvm-locstats for each variable or formal parameter DIE computes what
percentage from the code section bytes, where it is in scope, it has
location description. The line 0 shows the number (and the percentage) of
DIEs with no location information, but the line 100 shows the number (and
the percentage) of DIEs where there is location information in all code
section bytes (where the variable or parameter is in the scope). The line
50..59 shows the number (and the percentage) of DIEs where the location
information is in between 50 and 59 percentage of its scope covered.

Differential Revision: https://reviews.llvm.org/D66526

llvm-svn: 373183

0f309606

[SystemZ] Add SystemZPostRewrite in addPostRegAlloc() instead at -O0. · e794c049

Jonas Paulsson authored Sep 30, 2019

SystemZPostRewrite needs to be run before (it may emit COPYs) the Post-RA
pseudo pass also at -O0, so it should be added in addPostRegAlloc().

Review: Ulrich Weigand
llvm-svn: 373182

e794c049

[X86] Remove some redundant isel patterns. NFCI · 5951e3f8

Craig Topper authored Sep 30, 2019

These are all also implemented in avx512_logical_lowering_types
with support for masking.

llvm-svn: 373181

5951e3f8

AMDGPU/GlobalISel: Fix select for v2s16 and/or/xor · 317d991f
Matt Arsenault authored Sep 30, 2019
```
llvm-svn: 373180
```
317d991f
[test] Change llvm-readobj --arm-attributes to --arch-specific after r373125 · 34f9e98a
Fangrui Song authored Sep 30, 2019
```
llvm-svn: 373179
```
34f9e98a

[X86] Split v16i32/v8i64 bitreverse on avx512f targets without avx512bw to... · 1b0ea0a1

Craig Topper authored Sep 30, 2019

[X86] Split v16i32/v8i64 bitreverse on avx512f targets without avx512bw to enable the use of vpshufb on the 256-bit halves.

llvm-svn: 373177

1b0ea0a1

Undef the macros after their use · 9a5e3d39

Aditya Kumar authored Sep 30, 2019

Summary:

Reviewers:
t.p.northover

Subscribers:

Differential Revision: https://reviews.llvm.org/D46378

llvm-svn: 373176

9a5e3d39

[X86] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds after r373174 · 6c320b22
Fangrui Song authored Sep 30, 2019
```
llvm-svn: 373175
```
6c320b22

[X86] Remove -x86-experimental-vector-widening-legalization command line flag · 1069c019

Craig Topper authored Sep 29, 2019

This was added back to allow some performance regressions to be
investigated. The main perf issue was fixed shortly after adding
this back and no other major issues have been reported. So I
think its safe to remove this again.

llvm-svn: 373174

1069c019

Sep 29, 2019

[X86] Add custom isel logic to match VPTERNLOG from 2 logic ops. · 0e3f6591

Craig Topper authored Sep 29, 2019

There's room from improvement here, but this is a decent
starting point.

There are a few minor regressions in the vector-rotate tests,
where we are now forming a vpternlog from an and before we get
a chance to form it for a bitselect that we were matching
previously. This results in an AND and an ANDN feeding the
vpternlog where previously we just had an AND after the
vpternlog. I think we can probably DAG combine the AND with
the bitselect to get back to similar codegen.

llvm-svn: 373172

0e3f6591

Add test case peeking through vector concat when combining insert into shuffles. NFC · aabf8cbf
Amaury Sechet authored Sep 29, 2019
```
llvm-svn: 373171
```
aabf8cbf

[LLVM-C][Ocaml] Add MergeFunctions and DCE pass · a6d9d312

Aditya Kumar authored Sep 29, 2019

MergeFunctions and DCE pass are missing from OCaml/C-api. This patch
adds them.

Differential Revision: https://reviews.llvm.org/D65071

Reviewers: whitequark, hiraditya, deadalnix

Reviewed By: whitequark

Subscribers: llvm-commits

Tags: #llvm

Authored by: kren1

llvm-svn: 373170

a6d9d312

[Docs] Moves article links to new pages · eb78dea4

DeForest Richards authored Sep 29, 2019

Moves existing article links on the Programming, Subsystem, and Reference documentation pages to new locations. Also moves Github Repository and Publications links to the sidebar.

llvm-svn: 373169

eb78dea4

[MC] Emit unused undefined symbol even if its binding is not set · c5133606

Fangrui Song authored Sep 29, 2019

For the following two cases, we currently suppress the symbols. This
patch emits them (compatible with GNU as).

* `test2_a = undef`: if `undef` is otherwise unused.
* `.hidden hidden`: if `hidden` is unused. This is the main point of the
  patch, because omitting the symbol would cause a linker semantic
  difference.

It causes a behavior change that is not compatible with GNU as:

.weakref foo1, bar1

When neither foo1 nor bar1 is used, we now emit bar1, which is arguably
more consistent.

Another change is that we will emit .TOC. for .TOC.@tocbase .  For this
directive, suppressing .TOC. can be seen as a size optimization, but we
choose to drop it for simplicity and consistency.

llvm-svn: 373168

c5133606

[DivRemPairs] Don't assert that we won't ever get expanded-form rem pairs in... · d30093bb

Roman Lebedev authored Sep 29, 2019

[DivRemPairs] Don't assert that we won't ever get expanded-form rem pairs in different BB's (PR43500)

If we happen to have the same div in two basic blocks,
and in one of those we also happen to have the rem part,
we'd match the div-rem pair, but the wrong ones.
So let's drop overly-ambiguous assert.

Fixes https://bugs.llvm.org/show_bug.cgi?id=43500

llvm-svn: 373167

d30093bb

[SLP] Fix for PR31847: Assertion failed: (isLoopInvariant(Operands[i], L) &&... · 8b1eeafb

Alexey Bataev authored Sep 29, 2019

[SLP] Fix for PR31847: Assertion failed: (isLoopInvariant(Operands[i], L) && "SCEVAddRecExpr operand is not loop-invariant!")

Initially SLP vectorizer replaced all going-to-be-vectorized
instructions with Undef values. It may break ScalarEvaluation and may
cause a crash.
Reworked SLP vectorizer so that it does not replace vectorized
instructions by UndefValue anymore. Instead vectorized instructions are
marked for deletion inside if BoUpSLP class and deleted upon class
destruction.

Reviewers: mzolotukhin, mkuper, hfinkel, RKSimon, davide, spatel

Subscribers: RKSimon, Gerolf, anemet, hans, majnemer, llvm-commits, sanjoy

Differential Revision: https://reviews.llvm.org/D29641

llvm-svn: 373166

8b1eeafb

[PowerPC] Fix conditions of assert in PPCAsmPrinter · 72b544e6

Jinsong Ji authored Sep 29, 2019

Summary:
g++ build emits warning:

llvm/lib/Target/PowerPC/PPCAsmPrinter.cpp:667:77: error: suggest parentheses around ?&&? within ?||? [-Werror=parentheses]
     assert(MO.isGlobal() || MO.isCPI() || MO.isJTI() || MO.isBlockAddress() &&
                                                         ~~~~~~~~~~~~~~~~~~~~^~
            "Unexpected operand type for LWZtoc pseudo.");

I believe the intension is to assert all different types,
so we should add a parentheses to include all '||'.

Reviewers: #powerpc, sfertile, hubert.reinterpretcast, Xiangling_L

Reviewed By: Xiangling_L

Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, shchenz, steven.zhang, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68180

llvm-svn: 373164

72b544e6

[ARM] Cortex-M4 schedule additions · 120a5e9a

David Green authored Sep 29, 2019

This is an attempt to fill in some of the missing instructions from the
Cortex-M4 schedule, and make it easier to do the same for other ARM cpus.

- Some instructions are marked as hasNoSchedulingInfo as they are pseudos or
  otherwise do not require scheduling info
- A lot of features have been marked not supported
- Some WriteRes's have been added for cvt instructions.
- Some extra instruction latencies have been added, notably by relaxing the
  regex for dsp instruction to catch more cases, and some fp instructions.

This goes a long way to get the CompleteModel working for this CPU. It does not
go far enough as to get all scheduling info for all output operands correct.

Differential Revision: https://reviews.llvm.org/D67957

llvm-svn: 373163

120a5e9a

[Docs] Adds sections for Command Line and LibFuzzer articles · ac596993
DeForest Richards authored Sep 29, 2019
```
Adds sections for Command Line and Libfuzzer articles on Programming Documentation page.

llvm-svn: 373158
```
ac596993
[X86] Enable isel to fold broadcast loads that have been bitcasted from FP into a vpternlog. · 494bfd9f
Craig Topper authored Sep 29, 2019
```
llvm-svn: 373157
```
494bfd9f

[X86] Move bitselect matching to vpternlog into X86ISelDAGToDAG.cpp · b6a2207b

Craig Topper authored Sep 29, 2019

This allows us to reduce the use count on the condition node before
the match. This enables load folding for that operand without
relying on the peephole pass. This will be improved on for
broadcast load folding in a subsequent commit.

This still requires a bunch of isel patterns for vXi16/vXi8 types
though.

llvm-svn: 373156

b6a2207b

[X86] Enable canonicalizeBitSelect for AVX512 since we can use VPTERNLOG now. · 0ac4aace
Craig Topper authored Sep 29, 2019
```
llvm-svn: 373155
```
0ac4aace
[X86] Match (or (and A, B), (andn (A, C))) to VPTERNLOG with AVX512. · 6195ed83
Craig Topper authored Sep 29, 2019
```
This uses a similar isel pattern as we used for vpcmov with XOP.

llvm-svn: 373154
```
6195ed83

Sep 28, 2019

[NFC] Move hot cold splitting class to header file · 2adae76c

Aditya Kumar authored Sep 28, 2019

Summary:  This is to facilitate unittests

Reviewers: compnerd, vsk, tejohnson, sebpop, brzycki, SirishP

Reviewed By: tejohnson

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68079

llvm-svn: 373151

2adae76c

[PowerPC] make tests immune to improved undef handling · 520876d8

Sanjay Patel authored Sep 28, 2019

The fma mutate test will not exercise what it was intended to test
once we simplify those ops immediately, but the test will still
pass with the existing CHECKs, so I'm leaving it in case that
still has minimal value.

llvm-svn: 373149

520876d8

[GlobalISel Enable memcpy inlining with optsize. · 7d62e480
Amara Emerson authored Sep 28, 2019
```
We should be disabling inline for minsize, not optsize.

llvm-svn: 373143
```
7d62e480

[TimeProfiler] Fix "OptModule" section and add new "Backend" sections · f7a428ec

Anton Afanasyev authored Sep 28, 2019

Remove unnecessary "OptModule" section. Add "PerFunctionPasses",
"PerModulePasses" and "CodeGenPasses" sections under "Backend" section.

llvm-svn: 373142

f7a428ec

Add an operand to memory intrinsics to denote the "tail" marker. · 509a4947

Amara Emerson authored Sep 28, 2019

We need to propagate this information from the IR in order to be able to safely
do tail call optimizations on the intrinsics during legalization. Assuming
it's safe to do tail call opt without checking for the marker isn't safe because
the mem libcall may use allocas from the caller.

This adds an extra immediate operand to the end of the intrinsics and fixes the
legalizer to handle it.

Differential Revision: https://reviews.llvm.org/D68151

llvm-svn: 373140

509a4947

AMDGPU/GlobalISel: Avoid getting MRI in every function · 76f44f6b

Matt Arsenault authored Sep 28, 2019

Store it in AMDGPUInstructionSelector to avoid boilerplate in nearly
every select function.

llvm-svn: 373139

76f44f6b

[X86] Add broadcast load unfolding support for VPTESTMD/Q and VPTESTNMD/Q. · 8b5ad3d1
Craig Topper authored Sep 28, 2019
```
llvm-svn: 373138
```
8b5ad3d1

[X86] Stop using UpdateNodeOperands in combineGatherScatter. Create new nodes... · 82a707e9

Craig Topper authored Sep 28, 2019

[X86] Stop using UpdateNodeOperands in combineGatherScatter. Create new nodes like most other DAG combines.

Creating new nodes is what we usually do. Have to explicitly
check that we don't update to an existing node and having
to manually manage the worklist is unusual.

We can probably add a helper function to reduce the duplication
of having to check if we should create a gather or scatter, but
I wanted to just get the simple thing done.

llvm-svn: 373137

82a707e9

[X86] Split combineGatherScatter into a version for generic ISD nodes and... · 22984ebd

Craig Topper authored Sep 28, 2019

[X86] Split combineGatherScatter into a version for generic ISD nodes and another version for X86 specific nodes.

The majority of the code doesn't run on the X86 nodes today since
its gated by isBeforeLegalizeOps and we don't formm X86 nodes
until after that. Except for a couple special case in type
legalization. But I think we would probably break those if
some of the transforms fire on them.

I want to remove the hardcoded operand numbers and the unusual
use of UpdateNodeOperands. Being able to know which ISD opcodes
are present should help with that.

llvm-svn: 373136

22984ebd

[SampleFDO] Create a separate flag profile-accurate-for-symsinlist to handle · f0c4e70e

Wei Mi authored Sep 27, 2019

profile symbol list.

Currently many existing users using profile-sample-accurate want to reduce
code size as much as possible. Their use cases are different from the scenario
profile symbol list tries to handle -- the major motivation of adding profile
symbol list is to get the major memory/code size saving without introduce
performance regression. So to keep the behavior of profile-sample-accurate
unchanged, we think decoupling these two things and using a new flag to
control the handling of profile symbol list may be better.

When profile-sample-accurate and the new flag profile-accurate-for-symsinlist
are both present, since profile-sample-accurate is a user assertion we let it
have a higher precedence.

Differential Revision: https://reviews.llvm.org/D68047

llvm-svn: 373133

f0c4e70e

[llvm-lipo] Add support for -arch · fa6584c5

Alexander Shaposhnikov authored Sep 27, 2019

Add support for -arch.

Differential revision: https://reviews.llvm.org/D68116

Test plan: make check-all

llvm-svn: 373132

fa6584c5

[X86] Add test case to show missed opportunity to turn (add (zext (vXi1 X)),... · 305c811f

Craig Topper authored Sep 27, 2019

[X86] Add test case to show missed opportunity to turn (add (zext (vXi1 X)), Y) -> (sub Y, (sext (vXi1 X))) with avx512.

With avx512, the vXi1 type is legal. And we can more easily sign
extend them to vector registers. zext requires a sign extend and
a shift.

If we can easily turn the zext into a sext we should.

llvm-svn: 373131

305c811f

Sep 27, 2019

[PatternMatch] Add m_SExtOrSelf(), m_ZExtOrSExtOrSelf() matchers + unittests · 8c39d016

Roman Lebedev authored Sep 27, 2019

m_SExtOrSelf() is for consistency.

m_ZExtOrSExtOrSelf() is motivated by the D68103/r373106 :
sometimes it is useful to look past any extensions of the shift amount,
and m_ZExtOrSExtOrSelf() may be exactly the tool to do that.

llvm-svn: 373128

8c39d016

[llvm-readobj] Rename --arm-attributes to --arch-specific · 121ef04f

Yi Kong authored Sep 27, 2019

This is for compatibility with GNU readobj. --arm-attributes option is
left as a hidden alias due to large number of tests using it.

Differential Revision: https://reviews.llvm.org/D68110

llvm-svn: 373125

121ef04f

[InstSimplify] generalize FP folds with undef/NaN; NFC · 8cecc30c
Sanjay Patel authored Sep 27, 2019
```
We can reuse this logic for things like fma.

llvm-svn: 373119
```
8cecc30c
Revert [Dominators][CodeGen] Clean up MachineDominators · 159ef377
Jakub Kuderski authored Sep 27, 2019
```
This reverts r373101 (git commit 72c57ec3)

llvm-svn: 373117
```
159ef377
Revert XFAIL a codegen test AArch64/tailmerging_in_mbp.ll · 9bccdfcd
Jakub Kuderski authored Sep 27, 2019
```
This reverts r373103 (git commit a524e630)

llvm-svn: 373116
```
9bccdfcd