Commits · 0de8aeae7249d314f25b5188c91b04b9a24003ad · Lorenzo Albano / LLVM bpEVL

Mar 10, 2021

[VPlan] Support to widen select intructions in VPlan native path · 0de8aeae

Mauri Mustonen authored Mar 10, 2021

Add support to widen select instructions in VPlan native path by using a correct recipe when such instructions are encountered. This is already used by inner loop vectorizer.

Previously select instructions get handled by the wrong recipe and resulted in unreachable instruction errors like this one: https://bugs.llvm.org/show_bug.cgi?id=48139.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D97136

0de8aeae

Replace func name with regex in update_cc_test_checks · bf58d6a1

Giorgis Georgakoudis authored Feb 19, 2021

The patch adds an argument to update_cc_test_checks for replacing a function name matching a regex. This functionality is needed to match generated function signatures that include file hashes. Example:

The function signature for the following function:

`__omp_offloading_50_b84c41e__Z9ftemplateIiET_i_l30_worker`

with `--replace-function-regex "__omp_offloading_[0-9]+_[a-z0-9]+_(.*)"` will become:

`CHECK-LABEL: @{{__omp_offloading_[0-9]+_[a-z0-9]+__Z9ftemplateIiET_i_l30_worker}}(`

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D97107

bf58d6a1

[llvm-objcopy][NFC] replace class Buffer/MemBuffer/FileBuffer with streams. · 4f16e177

Alexey Lapshin authored Oct 24, 2020

During D88827 it was requested to remove the local implementation
of Memory/File Buffers:

// TODO: refactor the buffer classes in LLVM to enable us to use them here
// directly.

This patch uses raw_ostream instead of Buffers. Generally, using streams
could allow us to reduce memory usages. No need to load all data into the
memory - the data could be streamed through a smaller buffer.
Thus, this patch uses raw_ostream as an interface for output data:

Error executeObjcopyOnBinary(CopyConfig &Config,
                             object::Binary &In,
                             raw_ostream &Out);

Note 1. This patch does not change the implementation of Writers
so that data would be directly stored into raw_ostream.
This is assumed to be done later.

Note 2. It would be better if Writers would be implemented in a such way
that data could be streamed without seeking/updating. If that would be
inconvenient then raw_ostream could be replaced with raw_pwrite_stream
to have a possibility to seek back and update file headers.
This is assumed to be done later if necessary.

Note 3. Current FileOutputBuffer allows using a memory-mapped file.
The raw_fd_ostream (which could be used if data should be stored in the file)
does not allow us to use a memory-mapped file. Memory map functionality
could be implemented for raw_fd_ostream:

It is possible to add resize() method into raw_ostream.

class raw_ostream {
  void resize(uint64_t size);
}

That method, implemented for raw_fd_ostream, could create a memory-mapped file.
The streamed data would be written into that memory file then.
Thus we would be able to use memory-mapped files with raw_fd_ostream.
This is assumed to be done later if necessary.

Differential Revision: https://reviews.llvm.org/D91028

4f16e177

[AMDGPU] Disable SCC bit on fp atomics · 9931b1f7
Stanislav Mekhanoshin authored Mar 08, 2021
```
Differential Revision: https://reviews.llvm.org/D98221
```
9931b1f7

[AMDGPU] Always expand system scope fp atomics on gfx90a · 574a9dab

Stanislav Mekhanoshin authored Mar 05, 2021

FP atomics in system scope cannot be used and shall always
be expanded in a CAS loop.

Differential Revision: https://reviews.llvm.org/D98085

574a9dab

Run non-filechecked commands in update_cc_test_checks.py · a2abe225

Giorgis Georgakoudis authored Feb 19, 2021

Some tests in clang require running non-filechecked commands to generate the actual filecheck input. For example, tests for openmp offloading require generating the host bc without any checking, before running the clang command to actually generate the filechecked IR of the target device. This patch enables `update_cc_test_checks.py` to run non-filechecked run lines in-place.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D97068

a2abe225

[dfsan] Update fast16labels.ll test · 05c2c8aa

George Balatsouras authored Mar 09, 2021

Remove hard-coded shadow width references. Separate CHECK lines that only apply to fast16 mode.

Reviewed By: stephan.yichao.zhao

Differential Revision: https://reviews.llvm.org/D98308

05c2c8aa

[DSE] Extending isOverwrite to support offsetted fully overlapping stores · 989051d5

Matteo Favaro authored Mar 10, 2021

The isOverwrite function is making sure to identify if two stores
are fully overlapping and ideally we would like to identify all the
instances of OW_Complete as they'll yield possibly killable stores.
The current implementation is incapable of spotting instances where
the earlier store is offsetted compared to the later store, but
still fully overlapped. The limitation seems to lie on the
computation of the base pointers with the
GetPointerBaseWithConstantOffset API that often yields different
base pointers even if the stores are guaranteed to partially overlap
(e.g. the alias analysis is returning AliasResult::PartialAlias).

The patch relies on the offsets computed and cached by BatchAAResults
(available after D93529) to determine if the offsetted overlapping
is OW_Complete.

Differential Revision: https://reviews.llvm.org/D97676

989051d5

Remove original implementation of UniqueInternalLinkageNames pass. · 0ba1ebcb

Sriraman Tallam authored Mar 08, 2021

D96109 was recently submitted which contains the refactored implementation of
-funique-internal-linakge-names by adding the unique suffixes in clang rather
than as an LLVM pass. Deleting the former implementation in this change.

Differential Revision: https://reviews.llvm.org/D98234

0ba1ebcb

[InstCombine] Regenerate test checks (NFC) · e19160c8
Nikita Popov authored Mar 10, 2021

e19160c8

[RuntimeDyld] Support more relocations · e4b40616

Rafael Auler authored Mar 03, 2021

This patch introduces functionality used by BOLT when
re-linking the final binary. It adds new relocation types that
are currently unsupported by RuntimeDyldELF.

Reviewed By: lhames

Differential Revision: https://reviews.llvm.org/D97899

e4b40616

[NFC] Fix compiler warnings · 66dab2fa

Quentin Colombet authored Mar 10, 2021

Fix warnings caused by -Wrange-loop-analysis.

Patch by Xiaoqing Wu <xiaoqing_wu@apple.com>

Differential Revision: https://reviews.llvm.org/D98298

66dab2fa

[PowerPC] Implement patterns for PC-Rel zextload/extload byte loads · 8b540c54

Amy Kwan authored Mar 04, 2021

This patch adds patterns to select the PC-Relative extloadi1 and zextloadi1 byte loads.

Differential Revision: https://reviews.llvm.org/D98042

8b540c54

[DebugInfo][NFC] Refactor BinOp+GEP salvaging in salvageDebugInfoImpl · 81b8357e

gbtozers authored Dec 08, 2020

This patch refactors out the salvaging of GEP and BinOp instructions into
separate functions, in preparation for further changes to the salvaging of these
instructions coming in another patch; there should be no functional change as a
result of this refactor.

Differential Revision: https://reviews.llvm.org/D92851

81b8357e

[RISCV][SelectionDAG] Introduce an ISD::SPLAT_VECTOR_PARTS node that can... · 9106d045

Craig Topper authored Mar 10, 2021

[RISCV][SelectionDAG] Introduce an ISD::SPLAT_VECTOR_PARTS node that can represent a splat of 2 i32 values into a nxvXi64 vector for riscv32.

On riscv32, i64 isn't a legal scalar type but we would like to
support scalable vectors of i64.

This patch introduces a new node that can represent a splat made
of multiple scalar values. I've used this new node to solve the current
crashes we experience when getConstant is used after type legalization.

For RISCV, we are now default expanding SPLAT_VECTOR to SPLAT_VECTOR_PARTS
when needed and then handling the SPLAT_VECTOR_PARTS later during
LegalizeOps. I've remove the special case I previously put in for
ABS for D97991 as the default expansion is now able to succesfully
use getConstant.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D98004

9106d045

[RISCV] Starting fixing issues that prevent us from testing vXi64 intrinsics on RV32. · 0c73a506

Craig Topper authored Mar 10, 2021

Currently we crash in type legalization any time an intrinsic
uses a scalar i64 on RV32.

This patch adds support for type legalizing this to prevent
crashing. I don't promise that it uses the best possible codegen
just that it is functional.

This first version handles 3 cases. vmv.v.x intrinsic, vmv.s.x
intrinsic and intrinsics that take a scalar input, splat it and
then do some operation.

For vmv.v.x we'll either rely on hardware sign extension for
constants or we'll convert it to multiple splats and bit
manipulation.

For vmv.s.x we use a really unoptimal sequence inspired by what
we do for an INSERT_VECTOR_ELT.

For the third case we'll either try to use the .vi form for
constants or convert to a complicated splat and bitmanip and use
the .vv form of the operation.

I've renamed the ExtendOperand field to SplatOperand now use it
specifically for the third case. The first two cases are handled
by custom lowering specifically for those intrinsics.

I haven't updated all tests yet, but I tried to cover a subset
that includes single-width, widening, and narrowing.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D97895

0c73a506

[InstCombine][SimplifyLibCalls] An extra sqrtf was produced because of... · 7c49f3c7

Daniil Seredkin authored Mar 10, 2021

[InstCombine][SimplifyLibCalls] An extra sqrtf was produced because of transformations in optimizePow function

See: https://bugs.llvm.org/show_bug.cgi?id=47613

There was an extra sqrt call because shrinking emitted a new powf and at the same time optimizePow replaces the previous pow with sqrt and as the result we have two instructions that will be in worklist of InstCombie despite the fact that %powf is not used by anyone (it is alive because of errno).

As the result we have two instructions:

  %powf = call fast float @powf(float %x, float 5.000000e-01)
  %sqrt = call fast double @sqrt(double %dx)

%powf will be converted to %sqrtf on a later iteration.

As a quick fix for that I moved shrinking to the end of optimizePow so that pow is replaced with sqrt at first that allows not to emit a new shrunk powf.

Differential Revision: https://reviews.llvm.org/D98235

7c49f3c7

[RISCV] Manually split vector operands to VECREDUCE when handling vXi64 vectors on RV32. · 1e391186

Craig Topper authored Mar 10, 2021

The type legalizer will visit the result before the operands. To
avoid creating an illegal target specific node or falling back to
scalarization, we need to manually split vector operands.

This still doesn't handle the case of non-power of 2 operands
which need to be widened. I'm not sure the type legalizer is
ready for it. I think we would need to insert an
INSERT_SUBVECTOR with the power of 2 type we want, with an undef
first operand, and the non-power of 2 orignal operand as the vector
to insert. Then fill in the neutral elements into the elements the
padded elements. Alternatively we INSERT_SUBVECTOR into a neutral vector.
From there we carry on splitting if needed to get to a legal type
then do the target specific code.

The problem with this is the type legalizer doesn't know how to
widen an insert_subvector yet. We would need to add that including
the handling for a non-undef first vector.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D98292

1e391186

Revert "[LoopInterchange] Replace tightly-nesting-ness check with the one from `LoopNest`" · 7ff2768b
Ta-Wei Tu authored Mar 11, 2021
```
This reverts commit df9158c9.
```
7ff2768b

[DebugInfo] Handle DBG_VALUES with multiple variable location operands in MIR · 1db137b1

Stephen Tozer authored Mar 10, 2021

This patch adds handling for DBG_VALUE_LIST in the MIR-passes (after
finalize-isel), excluding the debug liveness passes and DWARF emission. This
most significantly affects MachineSink, which now needs to consider all used
registers of a debug value when sinking, but for most passes this change is
simply replacing getDebugOperand(0) with an iteration over all debug operands.

Differential Revision: https://reviews.llvm.org/D92578

1db137b1

[dfsan] Tracking origins at phi nodes · 6a9a686c

Jianzhou Zhao authored Mar 09, 2021

This is a part of https://reviews.llvm.org/D95835.

Reviewed-by: morehouse

Differential Revision: https://reviews.llvm.org/D98268

6a9a686c

[DSE] Handle memmove with equal non-const sizes · c68b560b

Dávid Bolvanský authored Mar 10, 2021

Follow up for fhahn's D98284. Also fixes a case from PR47644.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D98346

c68b560b

[DSE] Add tests that require phi translation to be removed. · 077dc5c8
Florian Hahn authored Mar 09, 2021

077dc5c8

[AMDGPU] Fix isReallyTriviallyReMaterializable for V_MOV_* · 70f013fd

Jay Foad authored Mar 10, 2021

D57708 changed SIInstrInfo::isReallyTriviallyReMaterializable to reject
V_MOVs with extra implicit operands, but it accidentally rejected all
V_MOVs because of their implicit use of exec. Fix it but avoid adding a
moderately expensive call to MI.getDesc().getNumImplicitUses().

In real graphics shaders this changes quite a few vgpr copies into move-
immediates, which is good for avoiding stalls on GFX10.

Differential Revision: https://reviews.llvm.org/D98347

70f013fd

Reapply "[DebugInfo] Add DWARF emission for DBG_VALUE_LIST" · e64f3ccc
Stephen Tozer authored Mar 10, 2021
```
This reverts commit 429c6ecb.
```
e64f3ccc

[SystemZ][NFC] Renaming of ELF specific variables. · 023b5c1e

Yusra Syeda authored Mar 09, 2021

Rename ELF specific variables, making it easier to add the XPLink
variables in future patches.

Reviewed By: abhina.sreeskantharajan, Kai

Differential Revision: https://reviews.llvm.org/D98199

023b5c1e

Revert "[DebugInfo] Add DWARF emission for DBG_VALUE_LIST" · 429c6ecb

Stephen Tozer authored Mar 10, 2021

This reverts commit 0da27ba5.

This revision was causing an error on the sanitizer-x86_64-linux-autoconf build.

429c6ecb

[DebugInfo] Add DWARF emission for DBG_VALUE_LIST · 0da27ba5

gbtozers authored Sep 11, 2020

This patch allows DBG_VALUE_LIST instructions to be emitted to DWARF with valid
DW_AT_locations. This change mainly affects DbgEntityHistoryCalculator, which
now tracks multiple registers per value, and DwarfDebug+DwarfExpression, which
can now emit multiple machine locations as part of a DWARF expression.

Differential Revision: https://reviews.llvm.org/D83495

0da27ba5

[AArch64] Add missing intrinsics for scalar FP rounding · 25951c5a
Jingu Kang authored Mar 09, 2021
```
Differential Revision: https://reviews.llvm.org/D98269
```
25951c5a

GlobalISel: Try to combine G_[SU]DIV and G_[SU]REM · 4c6ab48f

Christudasan Devadasan authored Mar 10, 2021

It is good to have a combined `divrem` instruction when the
`div` and `rem` are computed from identical input operands.
Some targets can lower them through a single expansion that
computes both division and remainder. It effectively reduces
the number of instructions than individually expanding them.

Reviewed By: arsenm, paquette

Differential Revision: https://reviews.llvm.org/D96013

4c6ab48f

Revert "[clangd] Enable reflection for clangd-index-server" · 99b01cf2
Kadir Cetinkaya authored Mar 10, 2021
```
This reverts commit 8080ea4c.

As discussed offline we should only do that for debug builds.
```
99b01cf2

[NFC] Unify FIME with FIXME in comments · 481079e2

Jinzheng Tu authored Mar 10, 2021

There are 5 occurrences FIME and 15333 FIXME. All of them should be FIXME.

Reviewed By: alexfh

Differential Revision: https://reviews.llvm.org/D98321

481079e2

[Statepoint Lowering] Fix the crash with gc.relocate in a separate block · 2fccd1b0

Serguei Katkov authored Mar 10, 2021

If it was decided to relocate derived pointer using the spill its value is
not exported in general case.
When gc.relocate is located in an another block than a statepoint we cannot
get SD for derived value but for spill case it is not required at all.
However implementation of gc.relocate lowering unconditionally request SD value
causing the assert triggering.

The CL fixes this by handling spill case earlier than SD is really required.

Reviewers: reames, dantrushin
Reviewed By: dantrushin
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D98324

2fccd1b0

[DebugInfo] Process DBG_VALUE_LIST in LiveDebugVariables · 7d0cafba

gbtozers authored Sep 11, 2020

This patch adds support for DBG_VALUE_LIST in the LiveDebugVariables pass. The
changes are mostly in computeIntervals, extendDef, and addDefsFromCopies; when
extending the def of a DBG_VALUE_LIST the live ranges of every used register
must be considered, and when such a def is killed by more than one of its used
registers being killed at the same time it is necessary to find valid copies of
all of those registers to create a new def with.

The DebugVariableValue class has also been changed to reference multiple
location numbers instead of just one. This has been accomplished by using a
C-style array with a unique_ptr and an array length packed into 6 bits, to
minimize the size of the class (which must be kept low to be used with
IntervalMap). This may not be the most efficient solution possible, and should
be looked at if performance issues arise.

Differential Revision: https://reviews.llvm.org/D83895

7d0cafba

Avoid shuffle self-assignment in EXPENSIVE_CHECKS builds · 35bf23e9

Alex Richardson authored Mar 09, 2021

Some versions of libstdc++ perform self-assignment in std::shuffle. This
breaks the EXPENSIVE_CHECKS builds of TableGen due to an incorrect assertion
in libstdc++.

See https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85828.

Fixes https://llvm.org/PR37652

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D98167

35bf23e9

[SLC] Simplify strcpy and friends with non-zero address spaces · b26d6758

Alex Richardson authored Mar 08, 2021

The current logic in TargetLibraryInfoImpl::getLibFunc() was only treating
strcpy, etc. with i8* arguments in address space zero as a valid library
function. However, in the CHERI and Morello targets we expect all libc
functions to use address space 200 arguments.

This commit updates isValidProtoForLibFunc() to check that the argument
is a pointer type. This also drops the check for i8* since we should not
be checking the pointee type any more.

Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D95142

b26d6758

[SLC] Baseline test for missed strcpy optimizations in non-zero AS · 81e2550f
Alex Richardson authored Mar 08, 2021
```
This will be fixed in D95142

Differential Revision: https://reviews.llvm.org/D95138
```
81e2550f

[DSE] Handle memcpy/memset with equal non-const sizes. · 8d9b9c0e

Florian Hahn authored Mar 10, 2021

Currently DSE misses cases where the size is a non-const IR value, even
if they match. For example, this means that llvm.memcpy/llvm.memset
calls are not eliminated, even if they write the same number of bytes.

This patch extends isOverwite to try to get IR values for the number of
bytes written from the analyzed instructions. If the values match,
alias checks are performed and the result is returned.

At the moment this only covers llvm.memcpy/llvm.memset. In the future,
we may enable MemoryLocation to also track variable sizes, but this
simple approach should allow us to cover the important cases in DSE.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D98284

8d9b9c0e

[DSE] Add tests with memset & memcpy combinations and non-const sizes. · 52932876
Florian Hahn authored Mar 10, 2021

52932876

[NFC] [PowerPC] Remove unsafe-fp-math in some tests · e82a54ae

Qiu Chaofan authored Mar 10, 2021

As we're going to replace this ambiguous option with more precise
instruction-level fast-math description, some tests need to be updated
and the option doesn't play any role in some of them.

e82a54ae