Commits · 5af8bacc940243038478da1c92c3481cbdfcece3 · Lorenzo Albano / LLVM bpEVL

Jun 23, 2021

[InstSimplify] Add more poison folding optimizations · 5af8bacc

Juneyoung Lee authored Jun 22, 2021

This adds more poison folding optimizations to InstSimplify.

Since all binary operators propagate poison, these are fine.

Also, the precondition of `select cond, undef, x` -> `x` is relaxed to allow the case when `x` is undef.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D104661

5af8bacc

[lldb] Remove CommandReturnObject's SetError(StringRef) · 1b1c8e4a

David Spickett authored Jun 22, 2021

Replacing existing uses with AppendError.

SetError is also part of the SBI API. This remains
but instead of calling the underlying SetError it
will call AppendError.

Reviewed By: teemperor

Differential Revision: https://reviews.llvm.org/D104768

1b1c8e4a

[Verifier] Fail on overrunning and invalid indices for {insert,extract} vector intrinsics · 3c4dbf6e

Joe Ellis authored Jun 18, 2021

With regards to overrunning, the langref (llvm/docs/LangRef.rst)
specifies:

   (llvm.experimental.vector.insert)
   Elements ``idx`` through (``idx`` + num_elements(``subvec``) - 1)
   must be valid ``vec`` indices. If this condition cannot be determined
   statically but is false at runtime, then the result vector is
   undefined.

   (llvm.experimental.vector.extract)
   Elements ``idx`` through (``idx`` + num_elements(result_type) - 1)
   must be valid vector indices. If this condition cannot be determined
   statically but is false at runtime, then the result vector is
   undefined.

For the non-mixed cases (e.g. inserting/extracting a scalable into/from
another scalable, or inserting/extracting a fixed into/from another
fixed), it is possible to statically check whether or not the above
conditions are met. This was previously missing from the verifier, and
if the conditions were found to be false, the result of the
insertion/extraction would be replaced with an undef.

With regards to invalid indices, the langref (llvm/docs/LangRef.rst)
specifies:

    (llvm.experimental.vector.insert)
    ``idx`` represents the starting element number at which ``subvec``
    will be inserted. ``idx`` must be a constant multiple of
    ``subvec``'s known minimum vector length.

    (llvm.experimental.vector.extract)
    The ``idx`` specifies the starting element number within ``vec``
    from which a subvector is extracted. ``idx`` must be a constant
    multiple of the known-minimum vector length of the result type.

Similarly, these conditions were not previously enforced in the
verifier. In some circumstances, invalid indices were permitted
silently, and in other circumstances, an undef was spawned where a
verifier error would have been preferred.

This commit adds verifier checks to enforce the constraints above.

Differential Revision: https://reviews.llvm.org/D104468

3c4dbf6e

[TTI] Make assertion compatible with opaque pointers · cfb1cb44
Nikita Popov authored Jun 23, 2021
```
Dropping the TODO here because it applies to all uses of this method.
```
cfb1cb44

[LLParser] Remove special handling for call address space · 3ee6f1a4

Nikita Popov authored Jun 22, 2021

Spin-off from D104740: I don't think this special handling is needed
anymore. Calls in textual IR are annotated with addrspace(N) (which
defaults to the program address space from data layout) and specifies
the expected pointer address space of the callee. There is no need
to special-case the program address space on top of that, as it
already is the default expected address space, and we shouldn't
allow use of the program address space if the call was explicitly
annotated with some other address space.

The IsCall parameter is retained because it will be used again soon.

Differential Revision: https://reviews.llvm.org/D104752

3ee6f1a4

[mlir][LLVMIR] Fold ExtractValueOp coming from InsertValueOp · f0d43a29
Nicolas Vasilache authored Jun 23, 2021
```
Differential Revision: https://reviews.llvm.org/D104769
```
f0d43a29
[AMDGPU] Stop using LegacyLegalizerInfo. NFCI. · dfb8c087
Jay Foad authored Jun 04, 2021
```
Differential Revision: https://reviews.llvm.org/D103684
```
dfb8c087

[IR] Simplify createReplacementInstr · 157473a5

Jay Foad authored Jun 11, 2021

NFCI, although the test change shows that ConstantExpr::getAsInstruction
is better than the old implementation of createReplacementInstr because
it propagates things like the sdiv "exact" flag.

Differential Revision: https://reviews.llvm.org/D104124

157473a5

[mlir][linalg] Change the FillOp library call signature. · f1844f15

Tobias Gysi authored Jun 23, 2021

Adapt the FillOp library call signature to the updated operand order introduced in https://reviews.llvm.org/D10412. The patch reverts the special treatment of FillOp in LinalgToStandard.

Differential Revision: https://reviews.llvm.org/D104360

f1844f15

[llvm] Update tests that got missed in adee485a . · aa58fdb3
Florian Hahn authored Jun 23, 2021

aa58fdb3

[SCEV] Support signed predicates in applyLoopGuards. · adee485a

Florian Hahn authored Jun 23, 2021

This adds handling for signed predicates, similar to how unsigned
predicates are already handled.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D104732

adee485a

[SCEV] Add tests with single-cond range check generated by InstComb. · 5ab96fa1
Florian Hahn authored Jun 22, 2021

5ab96fa1

[AMDGPU] Simplify collectReachableCallees. NFCI. · c65f3f56

Jay Foad authored Jun 22, 2021

Don't use SCC iterators when we're only interested in reachability.
Use df_begin/df_end inline to find reachable nodes.

Differential Revision: https://reviews.llvm.org/D104704

c65f3f56

[mlir][linalg] Adapt the FillOp builder signature. · 7cef24ee

Tobias Gysi authored Jun 23, 2021

Change the build operand order from output, value to value, output. The patch makes the argument order consistent with the pretty printed order updated by https://reviews.llvm.org/D104356.

Differential Revision: https://reviews.llvm.org/D104359

7cef24ee

[AMDGPU] Propagate LDS align into to instructions · 2b43209e
Stanislav Mekhanoshin authored Jun 14, 2021
```
Differential Revision: https://reviews.llvm.org/D104316
```
2b43209e

[LLD] [MinGW] Silence the printouts in one test. NFC. · f1a18fb6

Martin Storsjö authored Jun 18, 2021

This particular linker invocation is only run to check that we accept
options, but we don't inspect the generated command line. As all other
commands in the file have their output piped to FileCheck, the lit test
doesn't print any other output; therefore silence this one for consistency
as well.

f1a18fb6

[llvm-objcopy][MachO] Fix namespace style issues · 011b502c
Fangrui Song authored Jun 23, 2021

011b502c

[LLD] [MinGW] Print the lld-link command to stderr · fdf54f5c

Martin Storsjö authored Jun 18, 2021

This is consistent with how clang prints its internal commands with
-### and -v.

When linking with -verbose, we get log messages from the actual
linking written to stderr. By printing the command to the same stream,
we make sure they appear in a sensible chronological order.

Differential Revision: https://reviews.llvm.org/D104527

fdf54f5c

[mlir][linalg] Change the pretty printed FillOp operand order. · a21a6f51

Tobias Gysi authored Jun 23, 2021

The patch changes the pretty printed FillOp operand order from output, value to value, output. The change is a follow up to https://reviews.llvm.org/D104121 that passes the fill value using a scalar input instead of the former capture semantics.

Differential Revision: https://reviews.llvm.org/D104356

a21a6f51

[MLIR] Generalize detecting mods during slice computing · a873b6d4

Vinayaka Bandishti authored Jun 23, 2021

During slice computation of affine loop fusion, detect one id as the mod
of another id w.r.t a constant in a more generic way. Restrictions on
co-efficients of the ids is removed. Also, information from the
previously calculated ids is used for simplification of affine
expressions, e.g.,

If `id1` = `id2`,
  `id_n - divisor * id_q - id_r + id1 - id2 = 0`, is simplified to:
  `id_n - divisor * id_q - id_r = 0`.

If `c` is a non-zero integer,
  `c*id_n - c*divisor * id_q - c*id_r = 0`, is simplified to:
  `id_n - divisor * id_q - id_r = 0`.

Reviewed By: bondhugula, ayzhuang

Differential Revision: https://reviews.llvm.org/D104614

a873b6d4

[NFC][PDL] Fix documentation typo, redundant test · 0e551122

Vinayaka Bandishti authored Jun 23, 2021

Correct a documentation typo, and delete a duplicate test in
`pdl-to-pdl-interp-rewriter.mlir`.

Reviewed By: pr4tgpt, bondhugula, rriddle

Differential Revision: https://reviews.llvm.org/D104688

0e551122

Revert "[AArch64LoadStoreOptimizer] Recommit: Generate more STPs by renaming registers earlier" · 1cb7849a

Martin Storsjö authored Jun 23, 2021

This reverts commit ea011ec5.

This still causes some miscompiles, I'll follow up in the phabricator
review with a sample of that issue (which is part of the sample of
the previous issue).

1cb7849a

[TableGen] Fix printing second PC-relative operand · 36111f28

Igor Kudrin authored Jun 23, 2021

If an instruction has several operands and a PC-relative one is not the
first of them, the generator may produce the code that does not pass the
'Address' parameter to the printout method. For example, for an Arm
instruction 'LE LR, $imm', it reuses the same code as for other
instructions where the second operand is not PC-relative:

void ARMInstPrinter::printInstruction(...) {
...
  case 11:
    // BF16VDOTI_VDOTD, BF16VDOTI_VDOTQ, BF16VDOTS_VDOTD, ...
    printOperand(MI, 1, STI, O);
    O << ", ";
    printOperand(MI, 2, STI, O);
    break;
...

The patch fixes that by considering 'PCRel' when comparing
'AsmWriterOperand' values.

Differential Revision: https://reviews.llvm.org/D104698

36111f28

[M68k] Fix incorrect #include-ed file in M68kSubtarget · dfafd56d

Min-Yih Hsu authored Jun 22, 2021

In https://reviews.llvm.org/rG2193347e72fa , a cpp file is accidentally
included instead of its header file counterpart. This patch fixes this
error.

dfafd56d

[M68k] Add testcases for shift and rotate instructions · 0365af1a

Jim Lin authored Jun 23, 2021

Add codegen testcases for lsl, lsr, asr, rol and ror instructions.

Reviewed By: myhsu

Differential Revision: https://reviews.llvm.org/D104685

0365af1a

[M68k] Refactor codegen patterns for logic operations and add tests for it · 5cb5225c

Jim Lin authored Jun 23, 2021

Refactor pat for and, or and xor operation and add missing tests for it

Reviewed By: myhsu

Differential Revision: https://reviews.llvm.org/D104626

5cb5225c

[LoopDeletion] Exploit undef Phi inputs when symbolically executing 1st iteration · 842b4c83

Max Kazantsev authored Jun 23, 2021

Follow-up on Roman's idea expressed in D103959.
- If a Phi has undefined inputs from live blocks:
   - and no other inputs, assume it is undef itself;
   - and exactly one non-undef input, we can assume that all undefs are equal to this input.

Differential Revision: https://reviews.llvm.org/D104618
Reviewed By: lebedev.ri, nikic

842b4c83

Revert "[CodeGen] Don't create fake FunctionDecls when generating block/byref" · f681fd92

Zequan Wu authored Jun 22, 2021

That commit causes crash with error "!dbg attachment points at wrong subprogram for function" on iOS platforms.

This reverts commit f4c06bcb.

f681fd92

[Test] Clear out br i1 undef from tests to avoid UB · 976926e8

Max Kazantsev authored Jun 23, 2021

We don't want to test possible unexpected impact of such
branches. Replacing them with regular conditions. Idea by
Nikita Popov.

976926e8

[LSR] Filter out zero factors. PR50765 · b7d2c173

Max Kazantsev authored Jun 23, 2021

Zero factor leads to division by zero and failure of corresponding
assert as shown in PR50765. We should filter out such factors.

Differential Revision: https://reviews.llvm.org/D104702
Reviewed By: huihuiz, reames

b7d2c173

Fix typo in Toy Tutorial Ch-4 · 4666f309
Jack Xia authored Jun 23, 2021
```
multiple_transpose -> multiply_transpose
```
4666f309
[mlir] Fix GCC5 build after D104516 · 84bd07af
River Riddle authored Jun 23, 2021
```
GCC5 isn't able to implicitly capture `this` properly in an `auto` lambda.
```
84bd07af
[mlir][OpDefGen] Don't emit attribute name getters when there are no attributes · c43e8c0e
River Riddle authored Jun 23, 2021
```
This avoids generating otherwise unnecessary methods.
```
c43e8c0e

[OpenMP] Introduce an CMake find module for OpenMP Target support · 72d4cd62

Joseph Huber authored Jun 21, 2021

This introduces a CMake find module for detecting target offloading support in
a compiler. The goal is to make it easier to incorporate target offloading into
a cmake project.

Reviewed By: tianshilei1992

Differential Revision: https://reviews.llvm.org/D104710

72d4cd62

[mlir] Fix slicing-utils.mlir test after D104516 · 0246dd30
River Riddle authored Jun 23, 2021
```
Remove the duplicate unnecessary CHECK labels at the bottom of the file.
```
0246dd30
[gn build] don't build ubsan_minimal on mac · e8c8ce09
Nico Weber authored Jun 22, 2021
```
It doesn't build there, see http://45.33.8.238/macm1/12180/step_4.txt
```
e8c8ce09

[mlir] Add a ThreadPool to MLIRContext and refactor MLIR threading usage · 6569cf2a

River Riddle authored Jun 23, 2021

This revision refactors the usage of multithreaded utilities in MLIR to use a common
thread pool within the MLIR context, in addition to a new utility that makes writing
multi-threaded code in MLIR less error prone. Using a unified thread pool brings about
several advantages:

* Better thread usage and more control
We currently use the static llvm threading utilities, which do not allow multiple
levels of asynchronous scheduling (even if there are open threads). This is due to
how the current TaskGroup structure works, which only allows one truly multithreaded
instance at a time. By having our own ThreadPool we gain more control and flexibility
over our job/thread scheduling, and in a followup can enable threading more parts of
the compiler.

* The static nature of TaskGroup causes issues in certain configurations
Due to the static nature of TaskGroup, there have been quite a few problems related to
destruction that have caused several downstream projects to disable threading. See
D104207 for discussion on some related fallout. By having a ThreadPool scoped to
the context, we don't have to worry about destruction and can ensure that any
additional MLIR thread usage ends when the context is destroyed.

Differential Revision: https://reviews.llvm.org/D104516

6569cf2a

[mlir][NFC] Cleanup the MLIRTestReducer pass · 18465bcf
River Riddle authored Jun 23, 2021

18465bcf

[libcxx][NFC] prepares `<type_traits>` for moving out forward and swap · cafae056

Christopher Di Bella authored Jun 22, 2021

* `<type_traits>` depends on `std::forward`, so we replaced it with
  `static_cast<T&&>`.
* `swap`'s return type is confusing, so it's been rearranged to improve
   readabilitiy.

cafae056

[Remarks] Make memsize remarks report as an analysis, not a missed opportunity. · 493d6928
Jon Roelofs authored Jun 10, 2021
```
Differential revision: https://reviews.llvm.org/D104078
```
493d6928