Commits · b7d50ba1ee39bbcea9497f105d8e9c651cc925b4 · Lorenzo Albano / LLVM bpEVL

Feb 28, 2020

[MLIR] Refactor library initialization of JitRunner. · b7d50ba1

Stephen Neuendorffer authored Feb 27, 2020

Previously, lib/Support/JitRunner.cpp was essentially a complete application,
performing all library initialization, along with dealing with command line
arguments and actually running passes.  This differs significantly from
mlir-opt and required a dependency on InitAllDialects.h.  This dependency
is significant, since it requires a dependency on all of the resulting
libraries.

This patch refactors the code so that tools are responsible for library
initialization, including registering all dialects, prior to calling
JitRunnerMain.  This places the concern about which dialects to support
with the end application, enabling more extensibility at the cost of
a small amount of code duplication between tools.  It also fixes
BUILD_SHARED_LIBS=on.

Differential Revision: https://reviews.llvm.org/D75272

b7d50ba1

[MLIR] Refactor library handling for conversions. · c07fb9e0

Stephen Neuendorffer authored Feb 26, 2020

Collect a list of conversion libraries in cmake, so we don't have to
list these explicitly in most binaries.

Differential Revision: https://reviews.llvm.org/D75222

c07fb9e0

[MLIR] Refactor handling of dialect libraries · 58695528

Stephen Neuendorffer authored Feb 26, 2020

Instead of creating extra libraries we don't really need, collect a
list of all dialects and use that instead.

Differential Revision: https://reviews.llvm.org/D75221

58695528

[mlir] Fix typo · 4dc39ae7

Jacques Pienaar authored Feb 28, 2020

4dc39ae7

Add a pass that specializes parallel loops for easier unrolling and vectorization · 5abf128d

Benjamin Kramer authored Feb 27, 2020

This matches loops with an affine.min upper bound, limiting the trip
count to a constant, and rewrites them into two loops, one with constant
upper bound and one with variable upper bound. The assumption is that
the constant upper bound loop will be unrolled and vectorized, which is
preferable if this is the hot path.
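
As a rough illustration of the rewrite, here is the same idea in plain C++ rather than MLIR. The tile size `kTile` and the function `sumTile` are hypothetical names chosen for this sketch; the loop bound `min(kTile, remaining)` plays the role of the affine.min upper bound.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical tile size; stands in for the constant in the affine.min bound.
constexpr int kTile = 4;

// Sums up to kTile elements starting at `start`.
// Assumes 0 <= start <= data.size().
int sumTile(const std::vector<int> &data, int start) {
  const int remaining = static_cast<int>(data.size()) - start;
  const int ub = std::min(kTile, remaining); // the "affine.min" upper bound
  int sum = 0;
  if (ub == kTile) {
    // Hot path: constant trip count, easy to unroll and vectorize.
    for (int i = 0; i < kTile; ++i)
      sum += data[start + i];
  } else {
    // Remainder path: variable trip count.
    for (int i = 0; i < ub; ++i)
      sum += data[start + i];
  }
  return sum;
}
```

Both branches compute the same result; only the first has a trip count the compiler knows statically.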

Differential Revision: https://reviews.llvm.org/D75240

5abf128d

[AST Matchers] Fix bug in 'optionally' matcher wherein all previous bindings... · 586f13ae

Yitzhak Mandelbaum authored Feb 28, 2020

[AST Matchers] Fix bug in 'optionally' matcher wherein all previous bindings are cleared when all inner matchers fail.

Summary: The implementation of 'optionally' doesn't preserve bindings when none of the submatchers succeed. This patch adds a regression test for that behavior and fixes it.
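
The intended behavior can be sketched with a toy model. This is not Clang's implementation: `Bindings`, `ToyMatcher` and `toyOptionally` are hypothetical names, and bindings are modelled as a plain map.

```cpp
#include <functional>
#include <map>
#include <string>
#include <utility>
#include <vector>

using Bindings = std::map<std::string, int>;
// A toy matcher: returns true on success and may modify the bindings.
using ToyMatcher = std::function<bool(int Node, Bindings &)>;

// Toy 'optionally': always succeeds. Bindings from inner matchers that
// succeed are kept; if none succeed, the incoming bindings are untouched.
bool toyOptionally(const std::vector<ToyMatcher> &Inner, int Node,
                   Bindings &B) {
  for (const ToyMatcher &M : Inner) {
    Bindings Scratch = B; // work on a copy so a failed matcher cannot clobber B
    if (M(Node, Scratch))
      B = std::move(Scratch); // commit only on success
  }
  return true;
}
```

With this shape, a failing inner matcher that clears its scratch bindings leaves the caller's earlier bindings intact.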

Reviewers: aaron.ballman, sbenza

Subscribers: cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D75365

586f13ae

[DAGCombine] Fix alias analysis for unaligned accesses · 1de10705

David Green authored Feb 28, 2020

The alias analysis in DAG Combine looks at the BaseAlign, the Offset and
the Size of two accesses, and determines if they are known to access
different parts of memory by the fact that they are different offsets
from inside that "alignment window". It does not seem to account for
accesses that are not a multiple of the size, and may overflow from one
alignment window into another.

For example, in the test case we have a 19-byte memset that is split into
a 16-byte NEON store and an unaligned 4-byte store with a 15-byte
offset. This 15-byte offset (with a base align of 8) wraps around to the
next alignment window. When compared to an access that is at a 16-byte
offset (of the same 4-byte size and 8-byte base align), the two accesses
are said not to alias.

I've fixed this here by just ensuring that the offsets are a multiple of
the size, ensuring that they don't overlap by wrapping. Fixes PR45035,
which was exposed by the UseAA changes in the arm backend.
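
The arithmetic from the example can be checked with a simplified model of the alignment-window test. This is a sketch, not the LLVM source: `provablyDisjoint` and its `RequireMultiple` flag are hypothetical, and offsets are assumed to be non-negative.

```cpp
#include <cstdint>

// Simplified model of the "alignment window" test. Returns true when two
// equally sized accesses are claimed not to alias. RequireMultiple toggles
// the extra condition described above (offsets must be multiples of Size).
bool provablyDisjoint(int64_t Off0, int64_t Off1, int64_t Size,
                      int64_t BaseAlign, bool RequireMultiple) {
  if (Off0 == Off1 || BaseAlign <= Size)
    return false;
  if (RequireMultiple && (Off0 % Size != 0 || Off1 % Size != 0))
    return false; // an access may spill into the next alignment window
  const int64_t OffAlign0 = Off0 % BaseAlign;
  const int64_t OffAlign1 = Off1 % BaseAlign;
  return OffAlign0 + Size <= OffAlign1 || OffAlign1 + Size <= OffAlign0;
}
```

For offsets 15 and 16 with size 4 and base align 8, the window offsets are 7 and 0. Without the extra condition, 0 + 4 <= 7 holds and the accesses are wrongly reported as disjoint, even though bytes 16 to 18 are touched by both. With the condition, 15 is not a multiple of 4, so the model conservatively declines to prove disjointness.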

Differential Revision: https://reviews.llvm.org/D75238

1de10705

[VectorCombine] Fix assert on compare extract index · 4fa63fd4

Austin Kerbow authored Feb 28, 2020

Extract index could be a different integral type.

Differential Revision: https://reviews.llvm.org/D75327

4fa63fd4

[libc++] update GCC cherry-pick to build 4.8.5 · b4b4259a

Eric Fiselier authored Feb 28, 2020

b4b4259a

[SLP][NFC] Assert that tree entry operands completed when scheduler looks for dependencies. · d723ec4f

Valery N Dmitriev authored Feb 27, 2020

This change adds an assertion to prevent a tricky bug related to the recursive
approach of building the vectorization tree. The for loop below takes the number
of operands directly from the tree entry rather than from the scalars.
If the entry turns out to be incomplete at this moment (i.e. not all operands
are set), then not all the dependencies will be seen by the scheduler.
This can lead to failed scheduling (and thus failed vectorization)
for a perfectly vectorizable tree.
Here is a code example that is likely to fire the assertion:
for (i : VL0->getNumOperands()) {
  ...
  TE->setOperand(i, Operands);
  buildTree_rec(Operands, Depth + 1,...);
}

The correct way is a two-step process: first set all operands on the tree
entry, and then recursively process each operand.
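
The difference between the two orderings can be shown with a toy model. `ToyEntry` and `ToyBuilder` are hypothetical stand-ins, not the SLP vectorizer's classes; the recursive step only records whether it saw an incomplete parent entry.

```cpp
// Toy stand-in for a tree entry: it records how many operands it expects
// and how many have been set so far.
struct ToyEntry {
  unsigned NumOperands = 0;
  unsigned NumSet = 0;
  bool isComplete() const { return NumSet == NumOperands; }
};

struct ToyBuilder {
  // How often a recursive step observed a partially filled parent entry.
  unsigned IncompleteObservations = 0;

  // Models the recursive call, during which the scheduler may inspect Parent.
  void recurse(const ToyEntry &Parent) {
    if (!Parent.isComplete())
      ++IncompleteObservations;
  }

  // Problematic pattern: set one operand, then recurse immediately.
  void buildInterleaved(ToyEntry &TE) {
    for (unsigned I = 0; I < TE.NumOperands; ++I) {
      ++TE.NumSet;
      recurse(TE);
    }
  }

  // Two-step pattern: set every operand first, then recurse.
  void buildTwoStep(ToyEntry &TE) {
    for (unsigned I = 0; I < TE.NumOperands; ++I)
      ++TE.NumSet;
    for (unsigned I = 0; I < TE.NumOperands; ++I)
      recurse(TE);
  }
};
```

For an entry with two operands, the interleaved version recurses once while only one operand is set; the two-step version never does.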

Differential Revision: https://reviews.llvm.org/D75296

d723ec4f

[SLP]Update test checks, NFC. · afa45d23

Alexey Bataev authored Feb 28, 2020

afa45d23

[X86] Recognize CVTPH2PS from STRICT_FP_EXTEND · c0d0e6b1

Craig Topper authored Feb 28, 2020

This should avoid scalarizing the cvtph2ps intrinsics with D75162

Differential Revision: https://reviews.llvm.org/D75304

c0d0e6b1

[lld][WebAssembly] Handle mixed strong and weak undefined symbols · a57f1a54

Sam Clegg authored Feb 27, 2020

When there are both strong and weak references to an undefined
symbol, ensure that the strong reference prevails in the output symbol,
generating the correct error.
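
A minimal sketch of the resolution rule, assuming a two-valued binding. `Binding`, `mergeUndefined` and `reportsUndefinedError` are hypothetical names and do not come from lld.

```cpp
enum class Binding { Weak, Strong };

// Merges the bindings of two references to the same undefined symbol:
// a strong reference wins over a weak one, regardless of order.
constexpr Binding mergeUndefined(Binding Existing, Binding New) {
  return (Existing == Binding::Strong || New == Binding::Strong)
             ? Binding::Strong
             : Binding::Weak;
}

// In this model only a strong undefined reference is reported as an error.
constexpr bool reportsUndefinedError(Binding B) {
  return B == Binding::Strong;
}
```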

Test case copied from lld/test/ELF/weak-and-strong-undef.s

Differential Revision: https://reviews.llvm.org/D75322

a57f1a54

Devirtualize a call on alloca without waiting for post inline cleanup and next... · f16d2bec

Hiroshi Yamauchi authored Feb 28, 2020

Devirtualize a call on alloca without waiting for post inline cleanup and next DevirtSCCRepeatedPass iteration.

This aims to fix a missed inlining case.

If there's a virtual call in the callee on an alloca (stack allocated object) in
the caller, and the callee is inlined into the caller, the post-inline cleanup
would devirtualize the virtual call. However, under the new pass manager,
whether DevirtSCCRepeatedPass reiterates is decided by a heuristic; if the next
iteration doesn't happen, we may miss inlining the devirtualized call.
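
A source-level pattern of the kind described might look like the following. The types and function names are hypothetical, and whether a given compiler version actually devirtualizes and inlines the call depends on its pass pipeline and flags.

```cpp
struct Shape {
  virtual ~Shape() = default;
  virtual int sides() const { return 0; }
};

struct Square : Shape {
  int sides() const override { return 4; }
};

// Callee: a virtual call through a reference whose dynamic type is not
// known inside this function.
inline int countSides(const Shape &S) { return S.sides(); }

// Caller: Sq is a stack object (an alloca in LLVM IR). Once countSides is
// inlined here, the dynamic type is visible, so the virtual call can be
// devirtualized and then becomes an inlining candidate itself.
int callerOnStackObject() {
  Square Sq;
  return countSides(Sq);
}
```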

This enables inlining in clang/test/CodeGenCXX/member-function-pointer-calls.cpp.

This is a second commit after a revert
https://reviews.llvm.org/rG4569b3a86f8a4b1b8ad28fe2321f936f9d7ffd43 and a fix
https://reviews.llvm.org/rG41e06ae7ba91.

Differential Revision: https://reviews.llvm.org/D69591

f16d2bec

[CallPromotionUtils] Add missing promotion legality check to tryPromoteCall. · 41e06ae7

Hiroshi Yamauchi authored Feb 27, 2020

Summary: This fixes the crash that led to the revert of D69591.

Reviewers: davidxl

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D75307

41e06ae7

[SLP][NFC] Delete some unreachable code. · 02e5e47e

Valery N Dmitriev authored Feb 27, 2020

This patch deletes some dead code from the SLP vectorizer.
A couple of changes were taken out of D57059 to slightly lighten it,
plus one more similar case was fixed.

Differential Revision: https://reviews.llvm.org/D75276

02e5e47e

Revert "[NFC][ARM] Update test" · 0590c9b9

Christopher Tetreault authored Feb 28, 2020

Summary:
There exists no corresponding code change for this commit, and this
commit causes downstream breakages.

This reverts commit 2db5547c.

Reviewers: samparker

Subscribers: kristof.beyls, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D75358

0590c9b9

[mlir] [VectorOps] Add vector.broadcast to EDSC · a8a7ee10

aartbik authored Feb 27, 2020

Reviewers: nicolasvasilache, andydavis1

Reviewed By: nicolasvasilache

Subscribers: mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D75320

a8a7ee10

[AVX512] Add strict-fp cvtph2ps constrained tests · bfa0aaf3

Simon Pilgrim authored Feb 28, 2020

As suggested on D75162

bfa0aaf3

[F16C] Add strict-fp constrained tests · a06402cc

Simon Pilgrim authored Feb 28, 2020

As suggested on D75162

a06402cc

[mlir] Add reifyReturnShape to shaped type OpInterface · e706533f

Jacques Pienaar authored Feb 28, 2020

This call results in inserting operations that compute the return shape
dynamically for the operation.

e706533f

[Inliner] Inlining should honor nobuiltin attributes · f9ca75f1

Teresa Johnson authored Feb 06, 2020

Summary:
Final patch in series to fix inlining between functions with different
nobuiltin attributes/options, which was specifically an issue in LTO.
See discussion on D61634 for background.

The prior patch in this series (D67923) enabled per-Function TLI
construction that identified the nobuiltin attributes.

Here I have allowed inlining to proceed if the callee's nobuiltins are a
subset of the caller's nobuiltins, but not in the reverse case, which
should be conservatively correct. This is controlled by a new option,
-inline-caller-superset-nobuiltin, which is enabled by default.
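
The subset rule can be sketched with ordinary sets. This is a simplified model, not the inliner's code: `NoBuiltinSet` and `calleeNoBuiltinsAreSubset` are hypothetical names, and nobuiltin attributes are modelled as function-name strings.

```cpp
#include <algorithm>
#include <set>
#include <string>

using NoBuiltinSet = std::set<std::string>;

// Inlining is allowed in this model when every builtin the callee disables
// is also disabled in the caller. std::includes needs sorted ranges, which
// std::set guarantees.
bool calleeNoBuiltinsAreSubset(const NoBuiltinSet &Caller,
                               const NoBuiltinSet &Callee) {
  return std::includes(Caller.begin(), Caller.end(), Callee.begin(),
                       Callee.end());
}
```

A callee with no nobuiltin restrictions passes trivially, while a callee that disables something the caller still treats as a builtin is rejected.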

Reviewers: hfinkel, gchatelet, chandlerc, davidxl

Subscribers: arsenm, jvesely, nhaehnle, mehdi_amini, eraman, hiraditya, haicheng, dexonsmith, kerbowa, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D74162

f9ca75f1

Fix MSVC "32-bit shift implicitly converted to 64 bits" warning. NFCI. · b6e80864

Simon Pilgrim authored Feb 28, 2020

b6e80864

[TargetLowering] SimplifyDemandedBits - fix SCALAR_TO_VECTOR knownbits bug · 4bc6f633

Simon Pilgrim authored Feb 28, 2020

We can only report the knownbits for a SCALAR_TO_VECTOR node if we only demand the 0'th element - the upper elements are undefined and shouldn't be trusted.

This is causing a number of regressions that need addressing but we need to get the bugfix in first.
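
The condition in the fix can be modelled as a check on a demanded-elements bitmask. This is a sketch under the assumption that bit i of the mask means element i is demanded; `canReportScalarKnownBits` is a hypothetical helper, not an LLVM function.

```cpp
#include <cstdint>

// Bit i of DemandedElts is set when vector element i is demanded. The
// scalar operand only describes element 0, so its known bits may be
// reported only when element 0 is the sole demanded element.
constexpr bool canReportScalarKnownBits(uint64_t DemandedElts) {
  return DemandedElts == 1;
}
```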

4bc6f633

[Transform][MemCpyOpt] Add missing DebugLoc to %tmpbitcast · 2809abbd

Pierre-vh authored Feb 26, 2020

Fix for https://bugs.llvm.org/show_bug.cgi?id=37967

Differential Revision: https://reviews.llvm.org/D75173

2809abbd

Reland with a MSAN fix · c8bfed05

Krzysztof Parzyszek authored Feb 26, 2020

In some cases when HexagonTargetLowering::allowsMemoryAccess returned
true, it did not set the "Fast" argument, leaving it uninitialized.

[Hexagon] Improve casting of boolean HVX vectors to scalars

- Mark memory access for bool vectors as disallowed in target lowering.
  This will prevent combining bitcasts of bool vectors with stores.
- Replace the actual bitcasting code with a faster version.
- Handle casting of v16i1 to i16.

c8bfed05

[ARM] MVE VMLAS · e2a2f3f7

David Green authored Feb 28, 2020

This adds extra patterns for the VMLAS MVE instruction, which performs
Qda = Qda * Qn + Rm, a similar pattern to the existing VMLA. The sinking
of splat(Rm) into the loop is already performed, meaning we just need
extra Pat's in tablegen.
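
A lane-wise model of the stated operation, as a sketch only. It assumes four 32-bit lanes and wrapping unsigned arithmetic; `Vec4` and `vmlasModel` are hypothetical names.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

using Vec4 = std::array<uint32_t, 4>;

// Lane-wise model of Qda = Qda * Qn + Rm, with the scalar Rm applied to
// every lane (the splat(Rm) mentioned above).
Vec4 vmlasModel(Vec4 Qda, const Vec4 &Qn, uint32_t Rm) {
  for (std::size_t I = 0; I < Qda.size(); ++I)
    Qda[I] = Qda[I] * Qn[I] + Rm;
  return Qda;
}
```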

Differential Revision: https://reviews.llvm.org/D75115

e2a2f3f7

[ARM] Additional MVE VMLA tests. NFC · 78e5d134

David Green authored Feb 27, 2020

78e5d134

Skip TemplateSpecializedType in modernize-pass-by-value. · 365c99fd

Karasev Nikita authored Feb 28, 2020

The existing 'modernize-pass-by-value' check works only with non-template values in
initializers. Fixes PR37210.

365c99fd

[cmake][msvc] Don't disable C4345 any more. · d76fddf2

Simon Pilgrim authored Feb 28, 2020

This shouldn't be relevant now that we just support VS2017+.

d76fddf2

[Utils] Make some scripts directly executable · 395e2c06

Jay Foad authored Feb 28, 2020

395e2c06

[AMDGPU] Mark the scheduling model as complete · 970558df

Jay Foad authored Feb 28, 2020

970558df

[AMDGPU] Update a comment missed in 74e2974a · addcbc40

Jay Foad authored Feb 28, 2020

addcbc40

Fix buildbots after c074f523. · f5e3c039

Alexey Lapshin authored Feb 28, 2020

Removed unused function getSectionByName() from dsymutil/DwarfStreamer.cpp.

f5e3c039

[clang-tidy] Added virtual isLanguageVersionSupported to ClangTidyCheck · 39c4246e

Nathan James authored Feb 28, 2020

Summary:
Motivated by [[ https://bugs.llvm.org/show_bug.cgi?id=45045 | Tune inspections to a specific C++ standard. ]]
Moves the isLanguageVersionSupported virtual function from `MakeSmartPtrCheck` to the base `ClangTidyCheck` class.
This will disable registering matchers or pp callbacks on unsupported language versions for a check.
Having it as a standalone function is cleaner than manually disabling the check in the register function and should hopefully
encourage check developers to actually restrict the check based on language version.
As an added bonus this could enable automatic detection of what language version a check runs on for the purpose of documentation generation
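
The shape of the hook can be sketched with simplified stand-in types. None of the `Toy*` names below are the real clang-tidy classes, and `activeChecks` is a hypothetical driver; only the idea of a virtual `isLanguageVersionSupported` on the base class comes from the summary above.

```cpp
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for language options.
struct ToyLangOpts {
  bool CPlusPlus11 = false;
};

class ToyCheck {
public:
  virtual ~ToyCheck() = default;
  // By default a check supports every language version.
  virtual bool isLanguageVersionSupported(const ToyLangOpts &) const {
    return true;
  }
  virtual std::string name() const = 0;
};

class ToyAnyVersionCheck : public ToyCheck {
public:
  std::string name() const override { return "any-version"; }
};

class ToyCxx11Check : public ToyCheck {
public:
  bool isLanguageVersionSupported(const ToyLangOpts &LO) const override {
    return LO.CPlusPlus11;
  }
  std::string name() const override { return "needs-cxx11"; }
};

// The driver consults the hook once, so checks need no manual early return
// in their registration code.
std::vector<std::string>
activeChecks(const std::vector<const ToyCheck *> &Checks,
             const ToyLangOpts &LO) {
  std::vector<std::string> Active;
  for (const ToyCheck *C : Checks)
    if (C->isLanguageVersionSupported(LO))
      Active.push_back(C->name());
  return Active;
}
```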

Reviewers: aaron.ballman, gribozavr2, Eugene.Zelenko, JonasToth, alexfh, hokein

Reviewed By: gribozavr2

Subscribers: xazax.hun, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D75289

39c4246e

[clang-format] Improve C# handling of spaces in square brackets · f8296152

Jonathan Coe authored Feb 28, 2020

Reviewers: MyDeveloperDay, krasimir

Reviewed By: krasimir

Subscribers: cfe-commits

Tags: #clang-format, #clang

Differential Revision: https://reviews.llvm.org/D75336

f8296152

[RISCV] Compress instructions based on function features · ca950a6b

Simon Cook authored Feb 28, 2020

When running under LTO, it is common to not specify the architecture
spec, which is used for setting up the target machine, and instead rely
on features specified in each function to generate the correct
instructions.

This works for the code generator, but the RISC-V backend uses the
AsmPrinter to do instruction compression, which does not see these
features but instead uses a MCSubtargetInfo object to see whether
compression is enabled. Since this is configured based on the
TargetMachine at startup, it will result in compressed instructions not
being emitted when it has not been given the 'c' TargetFeature, but the
function has it.

This changes the RISCVAsmPrinter to re-initialize the STI feature set
based on the current MachineFunction, such that compressed instructions
are now correctly emitted regardless of the method used to enable them.

Differential revision: https://reviews.llvm.org/D73339

ca950a6b

[gn build] Port 6af859dc · 29fb0b13

LLVM GN Syncbot authored Feb 28, 2020

29fb0b13

[ELF][LLD][ARM] Add missing REQUIRES: arm to tests · 1b025665

Peter Smith authored Feb 28, 2020

Fix buildbots that don't build ARM backend.

1b025665

[DebugInfo] Re-implement LexicalScopes dominance method, add unit tests · 6af859dc

Jeremy Morse authored Feb 28, 2020

Way back in D24994, the combination of LexicalScopes::dominates and
LiveDebugValues was identified as having worst-case quadratic complexity,
but it wasn't triggered by any code path at the time. I've since run into a
scenario where this occurs, in a very large basic block where large numbers
of inlined DBG_VALUEs are present.

The quadratic-ness comes from LiveDebugValues::join calling "dominates" on
every variable location, and LexicalScopes::dominates potentially touching
every instruction in a block to test for the presence of a scope. We have,
however, already computed the presence of scopes in blocks, in the
"InsnRanges" of each scope. This patch switches the dominates method to
examine whether a block is present in a scope's InsnRanges, avoiding
walking through the whole block.
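
The complexity difference can be illustrated with a toy model: one query walks every instruction in a block, the other consults a per-scope set of blocks computed once up front. All `Toy*` types and both function names are hypothetical and do not mirror the LexicalScopes data structures.

```cpp
#include <unordered_set>
#include <vector>

struct ToyInstr {
  int ScopeId; // scope the instruction belongs to
};
using ToyBlock = std::vector<ToyInstr>;

// Walk-based query: cost grows with the number of instructions in the block.
bool scopePresentByWalk(const ToyBlock &Block, int ScopeId) {
  for (const ToyInstr &I : Block)
    if (I.ScopeId == ScopeId)
      return true;
  return false;
}

// Lookup-based query: each scope remembers which blocks contain its
// instructions, so the query is a set lookup.
struct ToyScope {
  int Id = 0;
  std::unordered_set<int> BlocksWithInstrs;
};

bool scopePresentByLookup(const ToyScope &S, int BlockId) {
  return S.BlocksWithInstrs.count(BlockId) != 0;
}
```

When the walk-based query is issued once per variable location in a very large block, the total work becomes quadratic; the lookup keeps each query roughly constant-time.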

At the same time, fix getMachineBasicBlocks to account for the fact that
InsnRanges can cover multiple blocks, and add some unit tests, as
LexicalScopes didn't have any.

Differential revision: https://reviews.llvm.org/D73725

6af859dc