Commits · 348f22eac83d9a3ee946e41be43fe507f04a89b6 · Lorenzo Albano / LLVM bpEVL

Dec 13, 2019

Correct gcc vector splat conversion from float to int-vector · 348f22ea

Erich Keane authored Dec 13, 2019

In looking into some other code, I came across this issue where a
float converted to a gcc integer vector via a splat causes it to miss
the float-to-integral cast, which causes some REALLY strange codegen
bugs.

The AST looked like:
`-ImplicitCastExpr <col:13>
'gcc_int_2':'__attribute__((__vector_size__(2 * sizeof(int)))) int' <VectorSplat>
        `-ImplicitCastExpr <col:13> 'float' <LValueToRValue>
                  `-DeclRefExpr <col:13> 'float' lvalue ParmVar
                  0x556f16a5dc90 'f' 'float'

Despite the type of the VectorSplat cast as printed, it ended up
becoming a vector of float, which caused non-matching instructions. For
example, IntVector + a float constant resulted in:

add <2 x i32> %8, <2 x float> <float 3.000000e+00, float 3.000000e+00>

This patch corrects the conversion so that the float is first converted
to an integral, THEN splatted.

348f22ea

[RISCV] Move DebugLoc Copy into CompressInstEmitter · a0f43b00

Sam Elliott authored Dec 13, 2019

Summary:
This copy ensures that debug location information is kept for
compressed instructions. There are places where both compressInstruction and
uncompressInstruction are called that were not doing this copy, discarding some
debug info.

This change merely moves the copy into the generated file, so you cannot forget
to copy the location over when compressing or uncompressing.

Reviewers: asb, luismarques

Reviewed By: luismarques

Subscribers: sameer.abuasal, aprantl, hiraditya, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, kito-cheng, shiva0217, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, Jim, s.egerton, pzheng, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D67493

a0f43b00

Revert "[VectorUtils] Introduce the Vector Function Database (VFDatabase)." · 19f73f0d

Francesco Petrogalli authored Dec 13, 2019

This reverts commit 0be81968.

The VFDatabase needs some rework to be able to handle vectorization
and subsequent scalarization of intrinsics in out-of-tree versions of
the compiler. For more details, see the discussion in
https://reviews.llvm.org/D67572.

19f73f0d

[profile] Fix a crash when -fprofile-remapping-file= triggers an error · 193da743
Fangrui Song authored Dec 13, 2019
```
Reviewed By: wmi

Differential Revision: https://reviews.llvm.org/D71485
```
193da743
[InstSimplify] improve test coverage for insert+splat; NFC · 940600ae
Sanjay Patel authored Dec 13, 2019

940600ae

[DAGCombiner] fold shift-trunc-shift to shift-mask-trunc (2nd try) · 2f0c7fd2

Sanjay Patel authored Dec 13, 2019

The initial attempt (rG89633320) botched the logic by reversing
the source/dest types. Added x86 tests for additional coverage.
The vector tests show a potential improvement (fold vector load
instead of broadcasting), but that's a known/existing problem.

This fold is done in IR by instcombine, and we have a special
form of it already here in DAGCombiner, but we want the more
general transform too:
https://rise4fun.com/Alive/3jZm

Name: general
Pre: (C1 + zext(C2) < 64)
%s = lshr i64 %x, C1
%t = trunc i64 %s to i16
%r = lshr i16 %t, C2
=>
%s2 = lshr i64 %x, C1 + zext(C2)
%a = and i64 %s2, zext((1 << (16 - C2)) - 1)
%r = trunc %a to i16

Name: special
Pre: C1 == 48
%s = lshr i64 %x, C1
%t = trunc i64 %s to i16
%r = lshr i16 %t, C2
=>
%s2 = lshr i64 %x, C1 + zext(C2)
%r = trunc %s2 to i16

...because D58017 exposes a regression without this fold.

2f0c7fd2

[PGO][PGSO] Enable size optimizations in code gen / target passes for cold code. · ed50e606

Hiroshi Yamauchi authored Nov 07, 2019

Summary: Split off of D67120.

Reviewers: davidxl

Subscribers: hiraditya, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D71288

ed50e606

[NFC] Guard scudo_standalone's optional dependency on GWP-ASan behind flags. · d6c445ea
Mitch Phillips authored Dec 13, 2019

d6c445ea

[ARM] Fix in ICE when retrieving the number of micro-ops for vlldm/vlstm · 8e8e3181

Momchil Velikov authored Dec 13, 2019

The big switch in `ARMBaseInstrInfo::getNumMicroOps` is missing cases for
`VLLDM` and `VLSTM`, which are currently defined with itineraries having a
dynamic count of micro-ops.

Assuming an optimistic case in which these instruction do not actually perform
loads or stores, and with the idea that Armv8-m cores are supposed to use the
new style scheduling models, this patch just sets the itinerary for those two
instructions to `NoItinerary`.

Differential Revision: https://reviews.llvm.org/D71266

8e8e3181

gn docs: remove obsolete reference to monorepo · b5059421
Nico Weber authored Dec 13, 2019

b5059421
[lldb/Test] C++ test should use CXXFLAGS_EXTRAS · 1ef7c426
Jonas Devlieghere authored Dec 13, 2019
```
Thanks Ted Woodward for noticing this.
```
1ef7c426
[lldb/Host] Use cmakedefine01 for LLDB_ENABLE_POSIX · 3011d55f
Jonas Devlieghere authored Dec 12, 2019
```
Rename LLDB_DISABLE_POSIX to LLDB_ENABLE_POSIX and use cmakedefine01 for
consistency.
```
3011d55f

[libomptarget] Build most of common/src for amdgcn · 40d72134

Jon Chesterfield authored Dec 13, 2019

Summary:
[libomptarget] Build most of common/src for amdgcn

Excluding parallel.cu, which uses an integer min() from cuda,
Excluding support.cu, which calls malloc that is not yet available for amdgcn

Reviewers: jdoerfert, ABataev, grokos

Reviewed By: jdoerfert

Subscribers: gregrodgers, ronlieb, jvesely, mgorny, openmp-commits

Tags: #openmp

Differential Revision: https://reviews.llvm.org/D71446

40d72134

[GWP-ASan] [Scudo] ifdef entire GWP-ASan tests. · a00cd6df

Mitch Phillips authored Dec 13, 2019

Turns out that gtest in LLVM is only 1.8.0 (the newest version 1.10.0)
supports the GTEST_SKIP() macro, and apparently I didn't build w/o
GWP-ASan.

Should fix the GN bot, as well as any bots that may spuriously break on
platforms where the code wasn't correctly ifdef'd out as well.

a00cd6df

Revert "[ELF] Allow getErrPlace() to work before Out::bufferStart is set" · 17063abd

Vlad Tsyrklevich authored Dec 13, 2019

This reverts commit 2bbd32f5, it was
causing UBSan failures like the following:
lld/ELF/Target.cpp:103:41: runtime error: applying non-zero offset 24 to null pointer

17063abd

[AArch64] Emit PAC/BTI .note.gnu.property flags · d53e6186

Momchil Velikov authored Dec 13, 2019

This patch make LLVM emit the processor specific program property types
defined in AArch64 ELF spec
https://developer.arm.com/docs/ihi0056/f/elf-for-the-arm-64-bit-architecture-aarch64-abi-2019q2-documentation

A file containing no functions gets both property flags. Otherwise, a property
is set iff all the functions in the file have the corresponding attribute.

Patch by Daniel Kiss and Momchil Velikov.

Differential Revision: https://reviews.llvm.org/D71019

d53e6186

[MC][PowerPC] Fix a crash when redefining a symbol after .set · f99eedeb

Fangrui Song authored Dec 12, 2019

Fix PR44284. This is probably not valid assembly but we should not crash.

Reviewed By: luporl, #powerpc, steven.zhang

Differential Revision: https://reviews.llvm.org/D71443

f99eedeb

[ARM][MVE][Intrinsics] All vqdmulhq/vqrdmulhq tests should be for signed numbers. · a2cd4600
Mark Murray authored Dec 13, 2019
```
Fix broken tests. I can't yet explain how they worked locally pre-commit.
```
a2cd4600
[ARM][MVE] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=Off builds after D71062 · f16377f1
Fangrui Song authored Dec 13, 2019

f16377f1

[ELF] Update st_size when merging a common symbol with a shared symbol · 69d10d28

Fangrui Song authored Dec 06, 2019

When a common symbol is merged with a shared symbol, increase st_size if
the shared symbol has a larger st_size. At runtime, the executable's
symbol overrides the shared symbol.  The shared symbol may be created
from common symbols in a previous link.  This rule makes sure we pick
the largest size among all common symbols.

This behavior matches GNU ld. See
https://sourceware.org/bugzilla/show_bug.cgi?id=25236 for discussions.

A shared symbol does not hold alignment constraints. Ignore the
alignment update.

Reviewed By: peter.smith

Differential Revision: https://reviews.llvm.org/D71161

69d10d28

[Scudo] [GWP-ASan] Add GWP-ASan to Scudo Standalone. · ed4618ed

Mitch Phillips authored Dec 13, 2019

Summary:
Adds GWP-ASan to Scudo standalone. Default parameters are pulled across from the
GWP-ASan build. No backtrace support as of yet.

Reviewers: cryptoad, eugenis, pcc

Reviewed By: cryptoad

Subscribers: merge_guards_bot, mgorny, #sanitizers, llvm-commits, cferris, vlad.tsyrklevich, pcc

Tags: #sanitizers, #llvm

Differential Revision: https://reviews.llvm.org/D71229

ed4618ed

[ARM][MVE][Intrinsics] remove extraneous intrinsics. (Reapply) · c1ef116c

Mark Murray authored Dec 13, 2019

Summary:
I overstepped my reach and generated too many intrinsics; these never
made it into the tests.

Remove these extras. Some needed to be signed-olny, and there were some
possible but unrequired _x variants that needed an extra argument to
IntrinsicMX to allow [de-]selection at compile-time.

Reviewers: simon_tatham

Subscribers: kristof.beyls, dmgreen, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71466

c1ef116c

gn build: Merge 84728e65 · 65a3e1dc
LLVM GN Syncbot authored Dec 13, 2019

65a3e1dc

Revert "[ARM][MVE][Intrinsics] remove extraneous intrinsics." · 34536db7

Dmitri Gribenko authored Dec 13, 2019

This reverts commit 0eb09927.

The code does not compile:
http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/20462

34536db7

[llvm-exegesis][mips] Add BenchmarkResultTest unit test · 84728e65

Miloš Stojanović authored Dec 13, 2019

Test writing and reading benchmark instructions to and from disc, and
check calculations of min, max and avg values from a list of benchmark
measures.

Differential Revision: https://reviews.llvm.org/D71265

84728e65

[clangd] Fall back to selecting token-before-cursor if token-after-cursor fails. · b60896fa

Sam McCall authored Dec 11, 2019

Summary:
The problem:

LSP specifies that Positions are between characters. Therefore when a position
(or an empty range) is used to target elements of the source code, there is an
ambiguity - should we look left or right of the cursor?

Until now, SelectionTree resolved this to the right except in trivial cases
(where there's whitespace, semicolon, or eof on the right).
This meant that it's unable to e.g. out-line `int foo^()` today.

Complicating this, LSP notwithstanding the cursor is *on* a character in many
editors (mostly terminal-based). In these cases there's no ambiguity - we must
"look right" - but there's also no way to tell in LSP.

(Several features currently resolve this by using getBeginningOfIdentifier,
which tries to rewind and supports end-of-identifier. But this relies on
raw lexing and is limited and buggy).

Precedent: well - most other languages aren't so full of densely packed symbols
that we might want to target. Bias-towards-identifier works well enough.
MS C++ for vscode seems to mostly use bias-toward-identifier too.
The problem with this solution is it doesn't provide any way to target some
things such as the constructor call in Foo^(bar());

Presented solution:

When an ambiguous selection is found, we generate *both* possible selection
trees. We try to run the feature on the rightward tree first, and then on the
leftward tree if it fails.

This is basically do-what-I-mean, the main downside is the need to do this on
a feature-by-feature basis (because each feature knows what "fail" means).
The most complicated instance of this is Tweaks, where the preferred selection
may vary tweak-by-tweak.

Wrinkles:

While production behavior is pretty consistent, this introduces some
inconsistency in testing, depending whether the interface we're testing is
inside or outside the "retry" wrapper.

In particular, for many features like Hover, the unit tests will show production
behavior, while for Tweaks the harness would have to run the loop itself if
we want this.

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71345

b60896fa

[Tooling/Syntax] Helpers to find spelled tokens touching a location. · 22f81250

Sam McCall authored Dec 11, 2019

Summary: Useful when positions are used to target nodes, with before/after ambiguity.

Reviewers: ilya-biryukov, kbobyrev

Subscribers: cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71356

22f81250

[ARM][MVE][Intrinsics] remove extraneous intrinsics. · 0eb09927

Mark Murray authored Dec 13, 2019

Summary:
I overstepped my reach and generated too many intrinsics; these never
made it into the tests.

Remove these extras. Some needed to be signed-olny, and there were some
possible but unrequired _x variants that needed an extra argument to
IntrinsicMX to allow [de-]selection at compile-time.

Reviewers: simon_tatham

Subscribers: kristof.beyls, dmgreen, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71466

0eb09927

[ARM][MVE] Make VPT invalid for tail predication · 84593f05

Sam Parker authored Dec 11, 2019

We've been marking VPT incompatible instructions as invalid for tail
predication too, though this may not strictly be true. VPT are
incompatible and, unless its the first predicate def in a loop,
they shouldn't be compatible for tail predication either.

Differential Revision: https://reviews.llvm.org/D71410

84593f05

[llvm-dwarfdump][Statistics] Don't count coverage less than 1% as 0% · d5655c4d

Kristina Bessonova authored Dec 05, 2019

Summary:
This is a follow up for D70548.
Currently, variables with debug info coverage between 0% and 1% are put into
zero-bucket. D70548 changed the way statistics calculate a variable's coverage:
we began to use enclosing scope rather than a possible variable life range.
Thus more variables might be moved to zero-bucket despite they have some debug
info coverage.
The patch is to distinguish between a variable that has location info but
it's significantly less than its enclosing scope and a variable that doesn't
have it at all.

Reviewers: djtodoro, aprantl, dblaikie, avl

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D71070

d5655c4d

Reland [DataLayout] Fix occurrences that size and range of pointers are assumed to be the same. · 97572775

Nicola Zaghen authored Dec 13, 2019

GEP index size can be specified in the DataLayout, introduced in D42123. However, there were still places
in which getIndexSizeInBits was used interchangeably with getPointerSizeInBits. This notably caused issues
with Instcombine's visitPtrToInt; but the unit tests was incorrect, so this remained undiscovered.

This fixes the buildbot failures.

Differential Revision: https://reviews.llvm.org/D68328

Patch by Joseph Faulls!

97572775

[libomptarget][nfc] Add nop syncwarp function for amdgcn · 56adcebf
Jon Chesterfield authored Dec 13, 2019

56adcebf
[x86] add tests for shift-trunc-shift; NFC · dc9e6ba9
Sanjay Patel authored Dec 12, 2019
```
More coverage for a possible generic transform.
```
dc9e6ba9

[ARM][MVE] Add vector reduction intrinsics with two vector operands · 99581fd4

Mikhail Maltsev authored Dec 13, 2019

Summary:
This patch adds intrinsics for the following MVE instructions:
* VABAV
* VMLADAV, VMLSDAV
* VMLALDAV, VMLSLDAV
* VRMLALDAVH, VRMLSLDAVH

Each of the above 4 groups has a corresponding new LLVM IR intrinsic,
since the instructions cannot be easily represented using
general-purpose IR operations.

Reviewers: simon_tatham, ostannard, dmgreen, MarkMurrayARM

Reviewed By: MarkMurrayARM

Subscribers: merge_guards_bot, kristof.beyls, hiraditya, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D71062

99581fd4

[llvm-dwarfdump][Statistics] Change the coverage buckets representation. NFC · 1cc4b603

Kristina Bessonova authored Dec 11, 2019

Summary:
This changes the representation of 'coverage buckets' in llvm-dwarfdump and
llvm-locstats to one that makes more clear what the buckets contain.

See some related details in D71070.

Reviewers: djtodoro, aprantl, cmtice, jhenderson

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D71366

1cc4b603

[ARM][MVE] Add intrinsics for more immediate shifts. · 25305a93

Simon Tatham authored Dec 13, 2019

Summary:
This fills in the remaining shift operations that take a single vector
input and an immediate shift count: the `vqshl`, `vqshlu`, `vrshr` and
`vshll[bt]` families.

`vshll[bt]` (which shifts each input lane left into a double-width
output lane) is the most interesting one. There are separate MC
instruction ids for shifting by exactly the input lane width and
shifting by less than that, because the instruction encoding is so
completely different for the lane-width special case. So I had to
write two sets of patterns to match based on the immediate shift
count, which involved adding a ComplexPattern matcher to avoid the
general-case pattern accidentally matching the special case too. For
that family I've made sure to add an llc codegen test for both
versions of each instruction.

I'm experimenting with a new strategy for parametrising the isel
patterns for all these instructions: adding extra fields to the
relevant `Instruction` subclass itself, which are ignored by the
Tablegen backends that generate the MC data, but can be retrieved from
each instance of that instruction subclass when it's passed as a
template parameter to the multiclass that generates its isel patterns.
A nice effect of that is that I can fill in those informational fields
using `let` blocks, rather than having to type them out once per
instruction at `defm` time.

(As a result, quite a lot of existing instruction `def`s are
reindented by this patch, so it's clearer to read with whitespace
changes ignored.)

Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard

Reviewed By: MarkMurrayARM

Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D71458

25305a93

[ARM] Add custom strict fp conversion lowering when non-strict is custom · 01ba201a

John Brawn authored Dec 05, 2019

We have custom lowering for operations converting to/from floating-point types
when we don't have hardware support for those types, and this doesn't interact
well with the target-independent legalization of the strict versions of these
operations. Fix this by adding similar custom lowering of the strict versions.

This fixes the last of the assertion failures in the CodeGen/ARM/fp-intrinsics
test, with the remaining failures due to poor instruction selection.

Differential Revision: https://reviews.llvm.org/D71127

01ba201a

Revert "AMDGPU: Try to commute sub of boolean ext" · fce1a6f5

Tim Renouf authored Dec 03, 2019

This reverts commit 69fcfb7d.

As shown in the test I attached to this commit, the change I reverted
causes a problem with "zext(cc1) - zext(cc2)". It commuted
the operands to the sub and used different logic to select the addc/subc
instruction:
   sub zext (setcc), x => addcarry 0, x, setcc
   sub sext (setcc), x => subcarry 0, x, setcc

... but that is bogus. I believe it is not possible to fold those commuted
patterns into any form of addcarry or subcarry. It may have worked as
intended before "AMDGPU: Change boolean content type to 0 or 1" because
the setcc was considered to be -1 rather than 1.

Differential Revision: https://reviews.llvm.org/D70978

Change-Id: If2139421aa6c935cbd1d925af58fe4a4aa9e8f43

fce1a6f5

[llvm-locstats] Avoid the locstats when no scope bytes coverage found · baea9136

Djordje Todorovic authored Dec 12, 2019

If the total number of PC range bytes in each variable's enclosing scope
('scope bytes total') is 0, we will have division by zero.

Differential Revision: https://reviews.llvm.org/D71415

baea9136

[Sema] Improve diagnostic about addr spaces for overload candidates · ed8dadb3

Anastasia Stulova authored Dec 13, 2019

Allow sending address spaces into diagnostics to simplify and improve
error reporting. Improved wording of diagnostics for address spaces
in overloading.

Tags: #clang

Differential Revision: https://reviews.llvm.org/D71111

ed8dadb3