Commits · cc96d2d6bc9b0841c172a5211c558e188a8fa28b · Lorenzo Albano / LLVM bpEVL

Mar 07, 2022

Apply clang-tidy fixes for modernize-use-emplace to MLIR (NFC) · cc96d2d6
Mehdi Amini authored Mar 07, 2022

cc96d2d6
Apply clang-tidy fixes for modernize-use-default-member-init to MLIR (NFC) · 671e30a1
Mehdi Amini authored Mar 07, 2022

671e30a1
Apply clang-tidy fixes for modernize-loop-convert to MLIR (NFC) · e6e36b9c
Mehdi Amini authored Mar 07, 2022

e6e36b9c
Apply clang-tidy fixes for llvm-qualified-auto to MLIR (NFC) · cfdf9747
Mehdi Amini authored Mar 07, 2022

cfdf9747
Apply clang-tidy fixes for bugprone-macro-parentheses to MLIR (NFC) · 393c6db7
Mehdi Amini authored Mar 07, 2022

393c6db7

[CoroElide] Remove fallback for frame layout determination · 1bd33691

Nikita Popov authored Mar 04, 2022

Only determine the frame layout based on dereferenceable and align
attributes, and remove the type-based fallback, which is incompatible
with opaque pointers. The dereferenceable attribute is required,
while the align attribute uses default alignment of 1 (commonly,
align 1 attributes do not get placed, relying on default alignment).
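
The rule described above can be sketched in a few lines. This is only an illustrative model, not the CoroElide code: `frame_layout` and the `attrs` dictionary are hypothetical stand-ins for querying the attributes on the resume function's frame argument.

```python
from typing import Optional, Tuple


def frame_layout(attrs: dict) -> Optional[Tuple[int, int]]:
    """Return (size, alignment) for a coroutine frame, or None if unknown.

    `attrs` is a hypothetical stand-in for the parameter attributes,
    e.g. {"dereferenceable": 24, "align": 8}.
    """
    size = attrs.get("dereferenceable")
    if size is None:
        # dereferenceable is required; there is no type-based fallback.
        return None
    # A missing align attribute is treated as the default alignment of 1.
    align = attrs.get("align", 1)
    return size, align
```

With this model, a frame argument carrying only `dereferenceable(16)` yields a size of 16 and an alignment of 1, while one without `dereferenceable` yields no layout at all.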

The CoroSplit pass producing the resume function adds the necessary
attributes in https://github.com/llvm/llvm-project/blob/7daed359111f6d151fef447f520f85ef1dabedf6/llvm/lib/Transforms/Coroutines/CoroSplit.cpp#L840,
and their presence is checked in coro-debug.ll at least.

Differential Revision: https://reviews.llvm.org/D120988

1bd33691

[clang][modules] Fix failing test · 2d26f163

Jan Svoboda authored Mar 07, 2022

This test started failing on Windows after b45888e9 due to path separators not matching up.

2d26f163

Remove Simon Atanasyan from the code owners list. MIPS Backend. · 7daed359
Simon Atanasyan authored Mar 07, 2022

7daed359

[Coroutines] Allow FramePtr to be an Argument · 9bca4ea3

Nikita Popov authored Mar 04, 2022

With opaque pointers, after splitRetconCoroutine() the FramePtr
may be an Argument rather than an Instruction. With typed pointers,
this currently doesn't happen because the FramePtr would be a
bitcast instruction.

Fix this by making FramePtr a Value and adding a helper for the
"after FramePtr" insertion point, which would be the start of the
function in the Argument case.

Differential Revision: https://reviews.llvm.org/D120994

9bca4ea3

[clang][modules] Report module maps affecting `no_undeclared_includes` modules · b45888e9

Jan Svoboda authored Mar 07, 2022

Since D106876, PCM files don't report module maps as input files unless they contributed to the compilation.

Reporting only module maps of (transitively) imported modules is not enough, though. For modules marked with `[no_undeclared_includes]`, other module maps affect the compilation by introducing anti-dependencies.

This patch makes sure such module maps are being reported as input files.

Depends on D120463.

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D120464

b45888e9

[clang][modules] NFC: Simplify and clarify test · 242b24c1

Jan Svoboda authored Mar 07, 2022

This patch simplifies a test that checks only used module map files are reported as input files in PCM files.

Instead of using opaque `diff`, this patch uses `clang -module-file-info` and `FileCheck` to verify this.

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D120463

242b24c1

[AArch64] Turn truncating buildvectors into truncates · d9633d14

David Green authored Mar 07, 2022

When lowering large v16f32->v16i8 fp_to_si_sat, the fp_to_si_sat node is
split several times, creating an illegal v4i8 concat that gets expanded
into a BUILD_VECTOR. After some combining and other legalisation, it
ends up as a buildvector that extracts from 4 vectors, looking like
BUILDVECTOR(a0,a1,a2,a3,b0,b1,b2,b3,c0,c1,c2,c3,d0,d1,d2,d3). That is
really a v16i32->v16i8 truncate in disguise.

This adds a ReconstructTruncateFromBuildVector method to detect the
pattern, converting it back into the legal "concat(trunc(concat(trunc(a),
trunc(b))), trunc(concat(trunc(c), trunc(d))))" tree. The extracted
nodes could also be v4i16, in which case the truncates are not needed.
All those truncates and concats then become uzip1's, which is much
better than expanding by moving vector lanes around.
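
The equivalence the patch relies on can be checked with plain integer lists. The sketch below is a model under assumptions, not SelectionDAG code: vectors are Python lists, the four inputs are taken to be v4i32, and the inner and outer truncates are assumed to go i32->i16 and i16->i8 respectively. All function names are made up for illustration.

```python
def trunc_lanes(lanes, bits):
    """Truncate every integer lane to its low `bits` bits."""
    mask = (1 << bits) - 1
    return [x & mask for x in lanes]


def build_vector(a, b, c, d):
    """BUILDVECTOR(a0..a3, b0..b3, c0..c3, d0..d3) with each lane cut to i8."""
    return [lane & 0xFF for vec in (a, b, c, d) for lane in vec]


def truncate_tree(a, b, c, d):
    """concat(trunc(concat(trunc(a), trunc(b))), trunc(concat(trunc(c), trunc(d))))."""
    lo = trunc_lanes(trunc_lanes(a, 16) + trunc_lanes(b, 16), 8)
    hi = trunc_lanes(trunc_lanes(c, 16) + trunc_lanes(d, 16), 8)
    return lo + hi
```

Both functions produce the same 16 lanes for any inputs, which is why the lane-by-lane buildvector can be replaced by the truncate/concat tree.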

Differential Revision: https://reviews.llvm.org/D119469

d9633d14

[libc] Fix alignment logic in TLS image size calculation. · c74c3442
Siva Chandra Reddy authored Mar 07, 2022

c74c3442
[gn build] Port 5f621567 · d7480d06
LLVM GN Syncbot authored Mar 07, 2022

d7480d06

[mlir][NFC] Move Translation.h to a Tools/mlir-translate directory · ee1d447e

River Riddle authored Mar 05, 2022

Translation.h is currently awkwardly shoved into the top-level mlir, even though it is
specific to the mlir-translate tool. This commit moves it to a new Tools/mlir-translate
directory, which is intended for libraries used to implement tools. It also splits the
translate registry from the main entry point, to more closely mirror what mlir-opt
does.

Differential Revision: https://reviews.llvm.org/D121026

ee1d447e

[mlir][NFC] Move MlirOptMain to the Tools/ directory · 6b7d211a

River Riddle authored Mar 04, 2022

MlirOptMain is currently awkwardly shoved into mlir/Support. This commit
moves it to the Tools/ directory, which is intended for libraries used to
implement tools.

Differential Revision: https://reviews.llvm.org/D121025

6b7d211a

[mlir][NFC] Move Parser.h to Parser/ · 9eaff423

River Riddle authored Mar 04, 2022

There is no reason for this file to be at the top-level, and
its current placement predates the Parser/ folder's existence.

Differential Revision: https://reviews.llvm.org/D121024

9eaff423

[ConstraintElimination] Remove dead variables when dropping constraints. · 542c3351

Florian Hahn authored Mar 07, 2022

This patch extends ConstraintElimination to also remove dead variables
when removing a constraint. When a constraint is removed because it is
out of scope, all new variables added for this constraint can also be
removed.

This keeps the total size of the systems much smaller, because it
reduces the number of variables drastically.

It also fixes a bug where variables were removed incorrectly.
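
The idea of dropping variables together with the constraint that introduced them can be sketched as a small scoped stack. This is a simplified model, not the ConstraintElimination data structures: `ScopedConstraintSystem` and its methods are hypothetical, and it assumes constraints are removed in strict stack order.

```python
class ScopedConstraintSystem:
    """Toy constraint stack that tracks which variables each constraint added."""

    def __init__(self):
        self.variables = []    # variable names, in order of introduction
        self.constraints = []  # stack of (constraint, number of new variables)

    def add(self, constraint, used_vars):
        # Record only the variables this constraint sees for the first time.
        new_vars = [v for v in dict.fromkeys(used_vars) if v not in self.variables]
        self.variables.extend(new_vars)
        self.constraints.append((constraint, len(new_vars)))

    def pop(self):
        # Removing a constraint also removes the variables it introduced,
        # which sit at the tail because removal follows stack order.
        constraint, num_new = self.constraints.pop()
        if num_new:
            del self.variables[-num_new:]
        return constraint
```

Because each entry remembers how many variables it added, popping never touches variables that an older, still-live constraint depends on.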

Fixes https://github.com/llvm/llvm-project/issues/54228

542c3351

[ConstraintElimination] Add test from PR54228. · 4ad1ed3a
Florian Hahn authored Mar 07, 2022

Test for https://github.com/llvm/llvm-project/issues/54228

4ad1ed3a

[X86] Update some of the AVX512 intrinsic tests to avoid adds. · be85f55b

Luo, Yuanke authored Mar 07, 2022

As noticed in D119654, by adding the masked intrinsics results together
we can end up with the selects being canonicalized away from the
intrinsic - this isn't what we want to test here so replace with an
insertvalue chain into an aggregate instead to retain all the results.

be85f55b

[VP] Introducing VectorBuilder, the VP intrinsic builder · 5f621567

Simon Moll authored Mar 07, 2022

VectorBuilder wraps around an IRBuilder and
VectorBuilder::createVectorInstructions emits VP intrinsics as if they
were regular instructions.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D105283

5f621567

[Attributor] Remove function pointer restriction for AAAlign · a9b03d9e

Nikita Popov authored Mar 03, 2022

This check is not compatible with opaque pointers. We can avoid
it by adjusting the getPointerAlignment() implementation to avoid
creating unnecessary ptrtoint expressions for bitcasted pointers.
The code already uses OnlyIfReduced to not create an expression
if it does not simplify, and this makes sure that folding a
bitcast and ptrtoint into a ptrtoint doesn't count as a
simplification.

Differential Revision: https://reviews.llvm.org/D120904

a9b03d9e

[AArch64] Use NPM for cost model tests. NFC · 43b63824

David Green authored Mar 07, 2022

As per the other tests, this switches the run lines back to using the
NPM via
-passes='print<cost-model>' -cost-kind=throughput 2>&1 -disable-output

43b63824

[SCEV] Enable verification under EXPENSIVE_CHECKS · 81b43b23

Nikita Popov authored Feb 28, 2022

SCEV verification should no longer affect results of subsequent
queries, and our lit tests as well as llvm-test-suite pass with
SCEV verification enabled, so I think we can enable it by default
under EXPENSIVE_CHECKS now.

Differential Revision: https://reviews.llvm.org/D120708

81b43b23

[LoongArch] Add EncoderMethods for transformed immediate operands · c063f9da

Weining Lu authored Mar 07, 2022

This patch is split out of D120476; thanks to myhsu.

'Transformed' means the encoding of an immediate is not the same as
its binary representation. For example, the `bl` instruction
requires a signed 28-bit integer as its operand and the low 2 bits
must be 0. So only the upper 26 bits need to be encoded into
the instruction.

For this reason, this kind of immediate needs a custom
`EncoderMethod` to produce the value that is actually encoded into
the instruction.

Currently these immediates include:
```
  uimm2_plus1
  simm14_lsl2
  simm16_lsl2
  simm21_lsl2
  simm26_lsl2
```

This patch adds those `EncoderMethod`s and revises the related .mir test
from the previous patch.
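
As a rough illustration of what such a transformed immediate looks like, the sketch below models the `simmN_lsl2` family in Python. It is not the LoongArch `EncoderMethod` implementation: the function names are hypothetical, and where the field is placed inside the instruction word is not modeled.

```python
def encode_simm_lsl2(value: int, bits: int) -> int:
    """Encode a signed immediate whose low 2 bits must be zero.

    `bits` is the width of the encoded field (26 for simm26_lsl2), so the
    operand itself is a signed (bits + 2)-bit integer.
    """
    total = bits + 2
    lo, hi = -(1 << (total - 1)), (1 << (total - 1)) - 1
    if not lo <= value <= hi:
        raise ValueError(f"{value} does not fit in a signed {total}-bit integer")
    if value & 0b11:
        raise ValueError(f"low 2 bits of {value} must be 0")
    # Drop the two zero bits and keep the field as an unsigned bit pattern.
    return (value >> 2) & ((1 << bits) - 1)


def decode_simm_lsl2(field: int, bits: int) -> int:
    """Inverse of encode_simm_lsl2: sign-extend the field and shift left by 2."""
    if field & (1 << (bits - 1)):
        field -= 1 << bits
    return field << 2
```

For example, a branch offset of 8 is stored as the field value 2, and decoding the field restores the original offset.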

Reviewed By: xen0n, MaskRay

Differential Revision: https://reviews.llvm.org/D120545

c063f9da

[SCEV] Enable verification in LoopPM · d1e880ac

Nikita Popov authored Feb 25, 2022

Currently, we hardly ever actually run SCEV verification, even in
tests with -verify-scev. This is because the NewPM LPM does not
verify SCEV. The reason for this is that SCEV verification can
actually change the result of subsequent SCEV queries, which means
that you see different transformations depending on whether
verification is enabled or not.

To allow verification in the LPM, this limits verification to
BECounts that have actually been cached. It will not calculate
new BECounts.
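
The "verify only what is cached" idea can be sketched as follows. This is a toy model rather than ScalarEvolution code: `BECountCache`, its `compute` callback, and the string loop keys are all hypothetical.

```python
class BECountCache:
    """Toy cache of backedge-taken counts keyed by loop."""

    def __init__(self, compute):
        self._compute = compute
        self._cache = {}

    def get(self, loop):
        # Normal queries compute and memoize on demand.
        if loop not in self._cache:
            self._cache[loop] = self._compute(loop)
        return self._cache[loop]

    def cached_loops(self):
        return set(self._cache)

    def verify(self, recompute):
        """Return cached loops whose count differs from a fresh computation.

        Only existing entries are checked; nothing is added to the cache,
        so running verification does not change later query results.
        """
        return [loop for loop, count in list(self._cache.items())
                if recompute(loop) != count]
```

The point of the design is that `verify` is read-only with respect to the cache, so enabling it cannot change which transformations run afterwards.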

BackedgeTakenInfo::getExact() is still not entirely readonly,
it still calls getUMinFromMismatchedTypes(). But I hope that this
is not problematic in the same way. (This could be avoided by
performing the umin in the other SCEV instance, but this would
require duplicating some of the code.)

Differential Revision: https://reviews.llvm.org/D120551

d1e880ac

[mlir] Use empty() instead of checking size() == 0 (NFC) · ef193a7a
Adrian Kuegel authored Mar 07, 2022

ef193a7a

[SCEV] Fully invalidate SCEVUnknown on RAUW · 8133778d

Nikita Popov authored Feb 17, 2022

When a SCEVUnknown gets RAUWd, we currently drop it from the folding
set, but don't forget memoized values. I believe we should be
treating RAUW the same way as deletion here and invalidate all
caches and dependent expressions.

I don't have any specific cases where this causes issues right now,
but it does address the FIXME in https://reviews.llvm.org/D119488.

Differential Revision: https://reviews.llvm.org/D120033

8133778d

[clang][parser] Stop dragging an EndLoc around when parsing attributes · 7b969b0b

Timm Bäder authored Mar 03, 2022

It's almost always entirely unused and if it is used, the end of the
attribute range can be used instead.

Differential Revision: https://reviews.llvm.org/D120888

7b969b0b

[MLIR] Change call sites from deprecated `parseSourceFile()` to `parseSourceFile<ModuleOp>()`. · 0dc66b76

Christian Sigg authored Mar 06, 2022

Mark `parseSourceFile()` deprecated. The functions will be removed two weeks after landing this change.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D121075

0dc66b76

[Attributor] Determine potentially loaded values through memory · 5af11ec3

Johannes Doerfert authored Mar 06, 2022

We already look through memory to determine where a value that is stored
might pop up again (potential copies). This patch introduces the other
direction with similar logic. If a value is loaded, we can follow all
the accesses to the pointer (or better object) and try to determine what
value might have been stored.

5af11ec3

[Attributor] Handle undef and null in AAAlignFloating · eb73af4a

Johannes Doerfert authored Mar 06, 2022

Both `undef` and `nullptr` are maximally aligned. This is especially
important as we often see `undef` until a proper value has been
identified during simplification.

eb73af4a

[Attributor] Use CFG reasoning also for read accesses · ad26e199

Johannes Doerfert authored Mar 01, 2022

With D106397 we used CFG reasoning to filter out writes that will not
interfere with a given load instruction. With this patch we use the
same logic (modulo the reversal in reachability check order) for store
instructions. As an example, we can now prove stores to shared memory
are dead if all the loads of the shared memory are not reachable from
them.
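
The reachability part of this argument can be sketched on a toy CFG. This is a simplified model, not the Attributor implementation: nodes stand for individual program points, the function names are hypothetical, and aliasing, overwriting stores, and interprocedural effects are ignored.

```python
from collections import deque


def reachable_from(cfg, start):
    """Nodes reachable from `start` by following at least one CFG edge."""
    seen = set()
    work = deque(cfg.get(start, ()))
    while work:
        node = work.popleft()
        if node in seen:
            continue
        seen.add(node)
        work.extend(cfg.get(node, ()))
    return seen


def store_is_dead(cfg, store, loads):
    """A store is dead in this model if no load of the memory is reachable from it."""
    return not (reachable_from(cfg, store) & set(loads))
```

In a straight-line CFG where the only load precedes the store, the store is reported dead; adding a back edge from the store to the load makes it live again.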

ad26e199

[Attributor] Improve isValidAtPosition (mostly for old PM) · acb37734

Johannes Doerfert authored Feb 25, 2022

To minimize the test difference between old and new PM we perform a
local dominance check if no dominator tree is available.

acb37734

[PowerPC] Add generic fnmsub intrinsic · b2497e54

Qiu Chaofan authored Mar 07, 2022

Currently in Clang, we have two types of builtins for the fnmsub operation:
one for float/double vectors, which are transformed into IR operations, and
one for float/double scalars, which generate the corresponding intrinsics.

But for the vector version of builtin, the 3 op chain may be recognized
as expensive by some passes (like early cse). We need some way to keep
the fnmsub form until code generation.

This patch introduces ppc.fnmsub.* intrinsic to unify four fnmsub
intrinsics.

Reviewed By: shchenz

Differential Revision: https://reviews.llvm.org/D116015

b2497e54

[Attributor][NFCI] Introduce fine-grained anonymous namespaces · ff758372
Johannes Doerfert authored Feb 24, 2022

ff758372

[Attributor][OpenMPOpt][FIX] Register simplification callbacks · 192a34dd

Johannes Doerfert authored Feb 24, 2022

Heap-2-stack and heap-2-shared can replace an allocation call with
something else. To avoid us deriving information from the allocator
implementation we register a simplification callback now that will
force us to stop at the call site. We probably should create the
replacement memory eagerly and return that instead though.

192a34dd

[Attributor][FIX] Use maximal access for dereferenceability deduction · 5859ae6a

Johannes Doerfert authored Feb 24, 2022

While we can use range information when we derive dereferenceability we
must make sure to pick the right end of the range. Before, we always went
with the minimal offset, which is not correct if we want to combine
the base dereferenceability with some offset. In that case it's the
maximum that gives the correct result.
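
One way to read the fix is with a small arithmetic model. The sketch below is an interpretation, not the Attributor code: `derived_dereferenceable` is a hypothetical helper, and the conservative handling of negative offsets is an assumption made here for illustration.

```python
def derived_dereferenceable(base_bytes: int, offset_min: int, offset_max: int) -> int:
    """Bytes known dereferenceable at `base + offset`, offset in [offset_min, offset_max].

    The base pointer is known dereferenceable for `base_bytes` bytes. To be
    sound for every offset in the range, the worst case (largest offset)
    decides how many bytes remain.
    """
    if offset_min < 0:
        # Assumption: a possibly negative offset may point before the base,
        # so this model conservatively claims nothing.
        return 0
    return max(base_bytes - offset_max, 0)
```

With a 16-byte base and an offset in [4, 12], only 4 bytes can be claimed; using the minimal offset would have claimed 12, which is wrong whenever the actual offset is 12.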

5859ae6a

[Attributor][FIX] Initialize stack variable · 1fcd4d0e
Johannes Doerfert authored Feb 24, 2022

1fcd4d0e
Revert "[OpenMP][NFCI] Use RAII lock guards in libomptarget where possible" · 7ead7e90
Johannes Doerfert authored Mar 06, 2022

This reverts commit ff50e81b as it broke
the buildbots, see https://reviews.llvm.org/D121060#3362737.

7ead7e90