Commits · 683e89722aa0fd9afd41fb0799e46e1b28db1de6 · Lorenzo Albano / LLVM bpEVL

Mar 10, 2021

Revert "[cmake] Enable -Werror=return-type" · 683e8972
Dave Lee authored Mar 10, 2021
```
This reverts commit ce94a161.
```
683e8972

[AST] Add generator for source location introspection · d627a27d

Stephen Kelly authored Dec 12, 2020

Generate a json file containing descriptions of AST classes and their
public accessors which return SourceLocation or SourceRange.

Use the JSON file to generate a C++ API and implementation for accessing
the source locations and method names for accessing them for a given AST
node.

This new API can be used to implement 'srcloc' output in clang-query:

  http://ce.steveire.com/z/m_kTIo

In this first version of this feature, only the accessors for Stmt
classes are generated, not Decls, TypeLocs etc.  Those can be added
after this change is reviewed, as this change is mostly about
infrastructure of these code generators.

Differential Revision: https://reviews.llvm.org/D93164

d627a27d

[nfc] [lldb] Remove variable ranges_base in DWARFUnit::AddUnitDIE · 08d33aa6
Jan Kratochvil authored Mar 10, 2021

08d33aa6
Add noreturn attribute to non-returning functions · 6976255f
Aditya Kumar authored Feb 23, 2021
```
Differential Revision: https://reviews.llvm.org/D97308
```
6976255f

llvm-lto: default Relocation Model should be selected by the TargetMachine. · be646e31

Wael Yehia authored Mar 10, 2021

Right now, the createTargetMachine function in LTOBackend.cpp (used by llvm-lto, and other components) selects the default Relocation Model when none is specified in the module.
Other components (such as opt and llc) that construct a TargetMachine delegate the decision on the default value to the polymorphic TargetMachine's constructor.

This commit aligns llvm-lto with other components.

Reviewed By: daltenty, fhahn

Differential Revision: https://reviews.llvm.org/D97507

be646e31

[AArch64] Extend vecreduce -> udot handling to mla reductions · 1a808286

David Green authored Mar 10, 2021

We previously have lowering for:
  vecreduce.add(zext(X)) to vecreduce.add(UDOT(zero, X, one))
This extends that to also handle:
  vecreduce.add(mul(zext(X), zext(Y)) to vecreduce.add(UDOT(zero, X, Y))
It extends the existing code to optionally handle a mul with equal
extends.

Differential Revision: https://reviews.llvm.org/D97280

1a808286

[Attributor] Attributor call site specific AAValueConstantRange · d75c9e61

kuterd authored Jan 24, 2021

This patch makes uses of the context bridges introduced in D83299 to make
AAValueConstantRange call site specific.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D83744

d75c9e61

[lldb] Ignore linkage diagnostic for LLDBSwigPythonBreakpointCallbackFunction (NFC) · daf36998

Dave Lee authored Mar 10, 2021

Ignore `-Wreturn-type-c-linkage` diagnostics for `LLDBSwigPythonBreakpointCallbackFunction`.

The function is defined in `python-wrapper.swig` which uses `extern "C" { ... }` blocks.
The declaration of this function in `ScriptInterpreterPython.cpp` already uses these
same pragmas to silence the warning there.

This prevents `-Werror` builds from failing.

Differential Revision: https://reviews.llvm.org/D98368

daf36998

[lldb/Platform] Skip very slow xcrun queries for simulator platforms, NFC · ac29c352

Vedant Kumar authored Mar 09, 2021

GetXcodeSDK() consistently takes over 1 second to complete if the
queried SDK is missing, because `xcrun` doesn't cache negative lookups.

Because there are multiple simulator platforms, this can add 4+ seconds
to `lldb -b some_object_file.o`.

To work around this, skip the call to GetXcodeSDK() when setting up
simulator platforms if the specified arch doesn't have what looks like a
simulator triple.

Some other ways to fix this:
- Fix caching in xcrun (rdar://74882205)
- Test for arch compat before calling SomePlatform::CreateInstance() (much
  larger change)

Differential Revision: https://reviews.llvm.org/D98272

ac29c352

[flang][driver] Formatting OpenMP sema check as per clang-format · 361e9bf9
Arnamoy Bhattacharyya authored Mar 10, 2021

361e9bf9

[NFC] Fix a compiler warning · 49942c6d

Quentin Colombet authored Mar 10, 2021

Fix a warning caused by -Wrange-loop-analysis

Patch by Xiaoqing Wu <xiaoqing_wu@apple.com>

Differential Revision: https://reviews.llvm.org/D98297

49942c6d

[AArch64] Extend vecreduce -> udot handling to v8i8 · a02f5068

David Green authored Mar 10, 2021

https://reviews.llvm.org/D88577 added v16i8 vecreduce to udot/sdot
lowering. This extends that to v8i8 too, generalizing the pattern to
handle the extra types.

Differential Revision: https://reviews.llvm.org/D97279

a02f5068

[VPlan] Support to widen select intructions in VPlan native path · 0de8aeae

Mauri Mustonen authored Mar 10, 2021

Add support to widen select instructions in VPlan native path by using a correct recipe when such instructions are encountered. This is already used by inner loop vectorizer.

Previously select instructions get handled by the wrong recipe and resulted in unreachable instruction errors like this one: https://bugs.llvm.org/show_bug.cgi?id=48139.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D97136

0de8aeae

Replace func name with regex in update_cc_test_checks · bf58d6a1

Giorgis Georgakoudis authored Feb 19, 2021

The patch adds an argument to update_cc_test_checks for replacing a function name matching a regex. This functionality is needed to match generated function signatures that include file hashes. Example:

The function signature for the following function:

`__omp_offloading_50_b84c41e__Z9ftemplateIiET_i_l30_worker`

with `--replace-function-regex "__omp_offloading_[0-9]+_[a-z0-9]+_(.*)"` will become:

`CHECK-LABEL: @{{__omp_offloading_[0-9]+_[a-z0-9]+__Z9ftemplateIiET_i_l30_worker}}(`

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D97107

bf58d6a1

[mlir] Remove unnecessary copying of pass options · 6a291ed0

Christian Sigg authored Mar 10, 2021

I missed a comment in D98279 that you don't need to copy pass options.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D98366

6a291ed0

[llvm-objcopy][NFC] replace class Buffer/MemBuffer/FileBuffer with streams. · 4f16e177

Alexey Lapshin authored Oct 24, 2020

During D88827 it was requested to remove the local implementation
of Memory/File Buffers:

// TODO: refactor the buffer classes in LLVM to enable us to use them here
// directly.

This patch uses raw_ostream instead of Buffers. Generally, using streams
could allow us to reduce memory usages. No need to load all data into the
memory - the data could be streamed through a smaller buffer.
Thus, this patch uses raw_ostream as an interface for output data:

Error executeObjcopyOnBinary(CopyConfig &Config,
                             object::Binary &In,
                             raw_ostream &Out);

Note 1. This patch does not change the implementation of Writers
so that data would be directly stored into raw_ostream.
This is assumed to be done later.

Note 2. It would be better if Writers would be implemented in a such way
that data could be streamed without seeking/updating. If that would be
inconvenient then raw_ostream could be replaced with raw_pwrite_stream
to have a possibility to seek back and update file headers.
This is assumed to be done later if necessary.

Note 3. Current FileOutputBuffer allows using a memory-mapped file.
The raw_fd_ostream (which could be used if data should be stored in the file)
does not allow us to use a memory-mapped file. Memory map functionality
could be implemented for raw_fd_ostream:

It is possible to add resize() method into raw_ostream.

class raw_ostream {
  void resize(uint64_t size);
}

That method, implemented for raw_fd_ostream, could create a memory-mapped file.
The streamed data would be written into that memory file then.
Thus we would be able to use memory-mapped files with raw_fd_ostream.
This is assumed to be done later if necessary.

Differential Revision: https://reviews.llvm.org/D91028

4f16e177

[mlir][spirv] Define spv.Image Operation · 619c1505

Weiwei Li authored Mar 10, 2021



co-authered-by: Alan Liu <alanliu.yf@gmail.com>

Reviewed By: antiagainst

Differential Revision: https://reviews.llvm.org/D98270

619c1505

[AMDGPU] Disable SCC bit on fp atomics · 9931b1f7
Stanislav Mekhanoshin authored Mar 08, 2021
```
Differential Revision: https://reviews.llvm.org/D98221
```
9931b1f7

[AMDGPU] Always expand system scope fp atomics on gfx90a · 574a9dab

Stanislav Mekhanoshin authored Mar 05, 2021

FP atomics in system scope cannot be used and shall always
be expanded in a CAS loop.

Differential Revision: https://reviews.llvm.org/D98085

574a9dab

Run non-filechecked commands in update_cc_test_checks.py · a2abe225

Giorgis Georgakoudis authored Feb 19, 2021

Some tests in clang require running non-filechecked commands to generate the actual filecheck input. For example, tests for openmp offloading require generating the host bc without any checking, before running the clang command to actually generate the filechecked IR of the target device. This patch enables `update_cc_test_checks.py` to run non-filechecked run lines in-place.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D97068

a2abe225

[dfsan] Update fast16labels.ll test · 05c2c8aa

George Balatsouras authored Mar 09, 2021

Remove hard-coded shadow width references. Separate CHECK lines that only apply to fast16 mode.

Reviewed By: stephan.yichao.zhao

Differential Revision: https://reviews.llvm.org/D98308

05c2c8aa

[DSE] Extending isOverwrite to support offsetted fully overlapping stores · 989051d5

Matteo Favaro authored Mar 10, 2021

The isOverwrite function is making sure to identify if two stores
are fully overlapping and ideally we would like to identify all the
instances of OW_Complete as they'll yield possibly killable stores.
The current implementation is incapable of spotting instances where
the earlier store is offsetted compared to the later store, but
still fully overlapped. The limitation seems to lie on the
computation of the base pointers with the
GetPointerBaseWithConstantOffset API that often yields different
base pointers even if the stores are guaranteed to partially overlap
(e.g. the alias analysis is returning AliasResult::PartialAlias).

The patch relies on the offsets computed and cached by BatchAAResults
(available after D93529) to determine if the offsetted overlapping
is OW_Complete.

Differential Revision: https://reviews.llvm.org/D97676

989051d5

[lld-macho][NFC] add const to pointer/reference induction variables of range-based for loops · 98fe9e41

Greg McGary authored Mar 09, 2021

Pointer and reference induction variables of range-based for loops are often const, and code authors often lax about qualifying them.

Differential Revision: https://reviews.llvm.org/D98317

98fe9e41

Remove original implementation of UniqueInternalLinkageNames pass. · 0ba1ebcb

Sriraman Tallam authored Mar 08, 2021

D96109 was recently submitted which contains the refactored implementation of
-funique-internal-linakge-names by adding the unique suffixes in clang rather
than as an LLVM pass. Deleting the former implementation in this change.

Differential Revision: https://reviews.llvm.org/D98234

0ba1ebcb

[InstCombine] Regenerate test checks (NFC) · e19160c8
Nikita Popov authored Mar 10, 2021

e19160c8
Revert "[mlir][Vector][Affine] Improve affine vectorizer algorithm" · 79da91c5
Alex Zinenko authored Mar 10, 2021
```
This reverts commit 95db7b4a.

This breaks vectorize_2d.mlir and vectorize_3d.mlir test under ASAN (use
after free).
```
79da91c5
Revert "[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer." · ed715536
Alex Zinenko authored Mar 10, 2021
```
This reverts commit 77a9d154.

Parent commit is broken.
```
ed715536

[RuntimeDyld] Support more relocations · e4b40616

Rafael Auler authored Mar 03, 2021

This patch introduces functionality used by BOLT when
re-linking the final binary. It adds new relocation types that
are currently unsupported by RuntimeDyldELF.

Reviewed By: lhames

Differential Revision: https://reviews.llvm.org/D97899

e4b40616

[lldb] Fix PushPlan to set subplan to private · 354d1053

Dave Lee authored Feb 17, 2021

Call `SetPrivate(true)` for subplans pushed via `PushPlan()`, as described in its
docstring.

Differential Revision: https://reviews.llvm.org/D96916

354d1053

[NFC] Fix compiler warnings · 66dab2fa

Quentin Colombet authored Mar 10, 2021

Fix warnings caused by -Wrange-loop-analysis.

Patch by Xiaoqing Wu <xiaoqing_wu@apple.com>

Differential Revision: https://reviews.llvm.org/D98298

66dab2fa

[mlir][Affine][Vector] Add initial support for 'iter_args' to Affine vectorizer. · 77a9d154

Diego Caballero authored Mar 10, 2021

This patch adds support for vectorizing loops with 'iter_args' when those loops
are not a vector dimension. This allows vectorizing outer loops with an inner
'iter_args' loop (e.g., reductions). Vectorizing scenarios where 'iter_args'
loops are vector dimensions would require more work (e.g., analysis,
generating horizontal reduction, etc.) not included in this patch.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D97892

77a9d154

[mlir][Vector][Affine] Improve affine vectorizer algorithm · 95db7b4a

Diego Caballero authored Mar 10, 2021

This patch replaces the root-terminal vectorization approach implemented in the
Affine vectorizer with a topological order approach that vectorizes all the
operations within the target loop nest. These are the most important changes
introduced by the new algorithm:
  * Removed tracking of root and terminal ops. Existing vectorization
    functionality is preserved and extended so that loop nests without
    root-terminal chains can be vectorized.
  * Vectorizing a loop nest now only requires a single topological traversal.
  * A new vector loop nest is incrementally built along the vectorization
    process. The original scalar loop is kept intact. No cloning guard is needed
    to recover the scalar loop if vectorization fails. This approach also
    simplifies the challenging task of replacing a loop operation amid the
    vectorization process without invalidating the analysis information that
    depends on the original loop.
  * Vectorization of specific operations has been implemented as independent,
    preparing them to be moved to a potential vectorization interface.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D97442

95db7b4a

[PowerPC] Implement patterns for PC-Rel zextload/extload byte loads · 8b540c54

Amy Kwan authored Mar 04, 2021

This patch adds patterns to select the PC-Relative extloadi1 and zextloadi1 byte loads.

Differential Revision: https://reviews.llvm.org/D98042

8b540c54

[clang] Don't assert in EmitAggregateCopy on trivial_abi types · c8227f06
Arthur Eubanks authored Mar 03, 2021
```
Fixes PR42961.

Reviewed By: rnk

Differential Revision: https://reviews.llvm.org/D97872
```
c8227f06

[DebugInfo][NFC] Refactor BinOp+GEP salvaging in salvageDebugInfoImpl · 81b8357e

gbtozers authored Dec 08, 2020

This patch refactors out the salvaging of GEP and BinOp instructions into
separate functions, in preparation for further changes to the salvaging of these
instructions coming in another patch; there should be no functional change as a
result of this refactor.

Differential Revision: https://reviews.llvm.org/D92851

81b8357e

[RISCV][SelectionDAG] Introduce an ISD::SPLAT_VECTOR_PARTS node that can... · 9106d045

Craig Topper authored Mar 10, 2021

[RISCV][SelectionDAG] Introduce an ISD::SPLAT_VECTOR_PARTS node that can represent a splat of 2 i32 values into a nxvXi64 vector for riscv32.

On riscv32, i64 isn't a legal scalar type but we would like to
support scalable vectors of i64.

This patch introduces a new node that can represent a splat made
of multiple scalar values. I've used this new node to solve the current
crashes we experience when getConstant is used after type legalization.

For RISCV, we are now default expanding SPLAT_VECTOR to SPLAT_VECTOR_PARTS
when needed and then handling the SPLAT_VECTOR_PARTS later during
LegalizeOps. I've remove the special case I previously put in for
ABS for D97991 as the default expansion is now able to succesfully
use getConstant.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D98004

9106d045

[RISCV] Starting fixing issues that prevent us from testing vXi64 intrinsics on RV32. · 0c73a506

Craig Topper authored Mar 10, 2021

Currently we crash in type legalization any time an intrinsic
uses a scalar i64 on RV32.

This patch adds support for type legalizing this to prevent
crashing. I don't promise that it uses the best possible codegen
just that it is functional.

This first version handles 3 cases. vmv.v.x intrinsic, vmv.s.x
intrinsic and intrinsics that take a scalar input, splat it and
then do some operation.

For vmv.v.x we'll either rely on hardware sign extension for
constants or we'll convert it to multiple splats and bit
manipulation.

For vmv.s.x we use a really unoptimal sequence inspired by what
we do for an INSERT_VECTOR_ELT.

For the third case we'll either try to use the .vi form for
constants or convert to a complicated splat and bitmanip and use
the .vv form of the operation.

I've renamed the ExtendOperand field to SplatOperand now use it
specifically for the third case. The first two cases are handled
by custom lowering specifically for those intrinsics.

I haven't updated all tests yet, but I tried to cover a subset
that includes single-width, widening, and narrowing.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D97895

0c73a506

[compiler-rt] Fix stale incremental builds when using `LLVM_BUILD_EXTERNAL_COMPILER_RT=ON`. · a159f91c

Dan Liew authored Mar 09, 2021

When building with `LLVM_BUILD_EXTERNAL_COMPILER_RT=ON` (e.g. Swift does
this) we do an "external" build of compiler-rt where we build
compiler-rt with the just built clang.

Unfortunately building in this mode had a bug where compiler-rt would
not get rebuilt if compiler-rt sources changed. This is problematic
for incremental builds because it meant that the compiler-rt binaries
were stale.

The fix is to use the `BUILD_ALWAYS` ExternalProject_Add option which
means the build command for compiler-rt is always run.

In principle if all of the following are true:

* compiler-rt has already been built.
* there are no compiler-rt source changes.
* the compiler hasn't changed.
* ninja is being used as the generator for the compiler-rt build.

then the overhead for always running the build command for incremental
builds is negligible.

However, in practice clang gets rebuilt everytime the HEAD commit
changes (due to commit hash being embedded in the output of `--version`)
which means all of compiler-rt will be rebuilt everytime this happens.
While this is annoying it's better to do the slow but correct thing
rather than the fast but incorrect thing.

rdar://75150660

Differential Revision: https://reviews.llvm.org/D98291

a159f91c

[flang] Fix call to CHECK() on overriding an erroneous type-bound procedure · 40e26180

Peter Steinfeld authored Mar 10, 2021

You can define a base type with a type-bound procedure which is erroneously
missing a NOPASS attribute and then define another type that extends the base
type and overrides the erroneous procedure. In this case, when we perform
semantic checking on the overriding procedure, we verify the "pass index" of
the overriding procedure. The attempt to get the procedure's pass index fails
a call to CHECK().

I fixed this by calling SetError() on the symbol of the overridden procedure in
the base type. Then, I check HasError() before executing the code that invokes
the failing call to CHECK(). I also added a test that will cause the compiler
to fail the call to CHECK() without this change.

Differential Revision: https://reviews.llvm.org/D98355

40e26180

[lldb] [test] Update XFAILs for FreeBSD/aarch64 · f47a84bc
Michał Górny authored Mar 03, 2021

f47a84bc