Commits · 09266e4af04ec2dc3a3afc19a3f9d5658d482a44 · Lorenzo Albano / LLVM bpEVL

Nov 13, 2020

[ObjC][ARC] Clear the lists of basic blocks and instructions before · 09266e4a
Akira Hatanaka authored Nov 12, 2020
```
continuing the loop

This fixes a bug introduced in c6f1713c.
```
09266e4a
[ORC] Make WrapperFunctionResult::zeroInit static · d3715b5a
Lang Hames authored Nov 13, 2020

d3715b5a
[ORC] Remove designated initializer. · bdf26d8d
Lang Hames authored Nov 13, 2020

bdf26d8d

[ORC] Break up OrcJIT library, add Orc-RPC based remote TargetProcessControl · 1d0676b5

Lang Hames authored Nov 11, 2020

implementation.

This patch aims to improve support for out-of-process JITing using OrcV2. It
introduces two new class templates, OrcRPCTargetProcessControlBase and
OrcRPCTPCServer, which together implement the TargetProcessControl API by
forwarding operations to an execution process via an Orc-RPC Endpoint. These
utilities are used to implement out-of-process JITing from llvm-jitlink to
a new llvm-jitlink-executor tool.

This patch also breaks the OrcJIT library into three parts:
  -- OrcTargetProcess: Contains code needed by the JIT execution process.
  -- OrcShared: Contains code needed by the JIT execution and compiler
     processes
  -- OrcJIT: Everything else.

This break-up allows JIT executor processes to link against OrcTargetProcess
and OrcShared only, without having to link in all of OrcJIT. Clients executing
JIT'd code in-process should start linking against OrcTargetProcess as well as
OrcJIT.

In the near future these changes will enable:
  -- Removal of the OrcRemoteTargetClient/OrcRemoteTargetServer class templates
     which provided similar functionality in OrcV1.
  -- Restoration of Chapter 5 of the Building-A-JIT tutorial series, which will
     serve as a simple usage example for these APIs.
  -- Implementation of lazy, cross-target compilation in lli's -jit-kind=orc-lazy
     mode.

1d0676b5

[AsmPrinter] fix -disable-debug-info option · 9606ef03

Jameson Nash authored Nov 12, 2020

This option was in a rather convoluted place, causing global parameters
to be set in awkward and undesirable ways to try to account for it
indirectly. Add tests for the -disable-debug-info option and ensure we
don't print unintended markers from unintended places.

Reviewed By: dstenb

Differential Revision: https://reviews.llvm.org/D91083

9606ef03

[X86] Use EVT::getIntegerVT instead of MVT::getIntegerVT where the type can be i2 or i4. · 114f0446

Craig Topper authored Nov 12, 2020

This was a mistake introduced in D91294. I'm not sure how to
exercise this with the existing code, but I hit it while trying
some follow up experiments.

114f0446

[X86] When storing v1i1/v2i1/v4i1 to memory, make sure we store zeros in the rest of the byte · a4124e45

Craig Topper authored Nov 12, 2020

We can't store garbage in the unused bits. It possible that something like zextload from i1/i2/i4 is created to read the memory. Those zextloads would be legalized assuming the extra bits are 0.

I'm not sure that the code in lowerStore is executed for the v1i1/v2i1/v4i1 case. It looks like the DAG combine in combineStore may have converted them to v8i1 first. And I think we're missing some cases to avoid going to the stack in the first place. But I don't have time to investigate those things at the moment so I wanted to focus on the correctness issue.

Should fix PR48147.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D91294

a4124e45

[IndVars] Replace checks with invariants if we cannot remove them · 77efb73c

Max Kazantsev authored Nov 13, 2020

If we cannot prove that the check is trivially true, but can prove that it either
fails on the 1st iteration or never fails, we can replace it with first iteration check.

Differential Revision: https://reviews.llvm.org/D88527
Reviewed By: skatkov

77efb73c

Suppress trailing template arguments equivalent to default arguments · 7602ef76
Richard Smith authored Nov 12, 2020
```
when printing the name of a member of a class template specialization.
```
7602ef76

Fix MLIR lit test configuration after cmake Python detection change · a9386bb0

Mehdi Amini authored Nov 13, 2020

07f1047f changed the CMake detection to use find_package(Python3 ...
but didn't update the lit configuration to use the expected Python3_EXECUTABLE
cmake variable to point to the interpreter path.
This resulted in an empty path on MacOS.

a9386bb0

[Tests][LoopVect] Exercise basic uniform memory operand logic · d4e81cd9
Philip Reames authored Nov 12, 2020

d4e81cd9

[OpenMP] Fixed a bug when displaying affinity · 24d0ef0f

Shilei Tian authored Nov 12, 2020

Currently the affinity format string has initial value. When users set
the format via OMP_AFFINITY_FORMAT, it will overwrite the format string. However,
when copying the format, the tailing null is missing. As a result, if the user
format string is shorter than default value, the remaining part in the default
value still makes effort. This bug is not exposed because the test case doesn't
check the end of a string. It only checks whether given output "contains" the
check string.

Reviewed By: AndreyChurbanov

Differential Revision: https://reviews.llvm.org/D91309

24d0ef0f

[hip] Remove the coercion on aggregate kernel arguments. · 8920ef06

Michael Liao authored Nov 10, 2020

- If an aggregate argument is indirectly accessed within kernels, direct
  passing results in unpromotable `alloca`, which degrade performance
  significantly. InferAddrSpace pass is enhanced in
  [D91121](https://reviews.llvm.org/D91121) to take the assumption that
  generic pointers loaded from the constant memory could be regarded
  global ones. The need for the coercion on aggregate arguments is
  mitigated.

Differential Revision: https://reviews.llvm.org/D89980

8920ef06

[Polly] Fix memory leak. · 243511a2
Michael Kruse authored Nov 12, 2020

243511a2

[InstCombine] fold sub of low-bit masked value from offset of same value · 0abde4bc

Sanjay Patel authored Nov 12, 2020

There might be some demanded/known bits way to generalize this,
but I'm not seeing it right now.

This came up as a regression when I was looking at a different
demanded bits improvement.

https://rise4fun.com/Alive/5fl

  Name: general
  Pre: ((-1 << countTrailingZeros(C1)) & C2) == 0
  %a1 = add i8 %x, C1
  %a2 = and i8 %x, C2
  %r = sub i8 %a1, %a2
  =>
  %r = and i8 %a1, ~C2

  Name: test 1
  %a1 = add i8 %x, 192
  %a2 = and i8 %x, 10
  %r = sub i8 %a1, %a2
  =>
  %r = and i8 %a1, -11

  Name: test 2
  %a1 = add i8 %x, -108
  %a2 = and i8 %x, 3
  %r = sub i8 %a1, %a2
  =>
  %r = and i8 %a1, -4

0abde4bc

[InstCombine] add tests for sub with masked bits; NFC · 87e006be
Sanjay Patel authored Nov 12, 2020

87e006be

[MLIR] Fix standard -> LLVM conversion to fail for unsupported memref element type. · 5883c4b4

Rahul Joshi authored Nov 12, 2020

- Move isSupportedMemRefType() to ConvertToLLVMPatterns and check if the
  memref element type is supported there.

Differential Revision: https://reviews.llvm.org/D91374

5883c4b4

[flang] Document DO CONCURRENT's problems (NFC) · c2bccd66
peter klausler authored Aug 25, 2020
```
Differential Revision: https://reviews.llvm.org/D86556
```
c2bccd66

[lldb/DataFormatters] Display null C++ pointers as nullptr · 406ad187

Jonas Devlieghere authored Nov 12, 2020

Display null pointer as `nullptr`, `nil` and `NULL` for C++,
Objective-C/Objective-C++ and C respectively. The original motivation
for this patch was to display a null std::string pointer as nullptr
instead of "", but the fix seemed generic enough to be done for all
summary providers.

Differential revision: https://reviews.llvm.org/D77153

406ad187

[AMDGPU] Remove scratch rsrc from spill pseudos · 5ab17021
Stanislav Mekhanoshin authored Nov 09, 2020
```
Differential Revision: https://reviews.llvm.org/D91110
```
5ab17021
[gn build] (manually) port 410626c9 · fa9f4133
Nico Weber authored Nov 12, 2020

fa9f4133

Nov 12, 2020

[mlir] Make tensor_to_memref op docs match reality · 79688028

Sean Silva authored Nov 12, 2020

The previous code defined it as allocating a new memref for its result.
However, this is not how it is treated by the dialect conversion framework,
that does the equivalent of inserting and folding it away internally
(even independent of any canonicalization patterns that we have
defined).

The semantics as they were previously written were also very
constraining: Nontrivial analysis is needed to prove that the new
allocation isn't needed for correctness (e.g. to avoid aliasing).
By removing those semantics, we avoid losing that information.

Differential Revision: https://reviews.llvm.org/D91382

79688028

[mlir] Bufferize tensor constant ops · faa66b1b

Sean Silva authored Nov 09, 2020

We lower them to a std.global_memref (uniqued by constant value) + a
std.get_global_memref to produce the corresponding memref value.
This allows removing Linalg's somewhat hacky lowering of tensor
constants, now that std properly supports this.

Differential Revision: https://reviews.llvm.org/D91306

faa66b1b

[mlir] Fix subtensor_insert bufferization. · ad2f9f67

Sean Silva authored Nov 12, 2020

It was incorrect in the presence of a tensor argument with multiple
uses.

The bufferization of subtensor_insert was writing into a converted
memref operand, but there is no guarantee that the converted memref for
that operand is safe to write into. In this case, the same converted
memref is written to in-place by the subtensor_insert bufferization,
violating the tensor-level semantics.

I left some comments in a TODO about ways forward on this. I will be
working actively on this problem in the coming days.

Differential Revision: https://reviews.llvm.org/D91371

ad2f9f67

[AArch64][GlobalISel] Select CSINC and CSINV for G_SELECT with constants · d0ba6c40

Jessica Paquette authored Nov 03, 2020

Select the following:

- G_SELECT cc, 0, 1 -> CSINC zreg, zreg, cc
- G_SELECT cc 0, -1 -> CSINV zreg, zreg cc
- G_SELECT cc, 1, f -> CSINC f, zreg, inv_cc
- G_SELECT cc, -1, f -> CSINV f, zreg, inv_cc
- G_SELECT cc, t, 1 -> CSINC t, zreg, cc
- G_SELECT cc, t, -1 -> CSINC t, zreg, cc

(IR example: https://godbolt.org/z/YfPna9)

These correspond to a bunch of the AArch64csel patterns in AArch64InstrInfo.td.

Unfortunately, it doesn't seem like we can import patterns that use NZCV like
those ones do. E.g.

```
def : Pat<(AArch64csel GPR32:$tval, (i32 1), (i32 imm:$cc), NZCV),
          (CSINCWr GPR32:$tval, WZR, (i32 imm:$cc))>;
```

So we have to manually select these for now.

This replaces `selectSelectOpc` with an `emitSelect` function, which performs
these optimizations.

Differential Revision: https://reviews.llvm.org/D90701

d0ba6c40

[VE] Support vld intrinsics · 410626c9

Kazushi (Jam) Marukawa authored Nov 10, 2020

Add intrinsics for vector load instructions.  Add a regression test also.

Reviewed By: simoll

Differential Revision: https://reviews.llvm.org/D91332

410626c9

[LoopVectorize] regenerate test checks; NFC · 9e0c3565
Sanjay Patel authored Nov 12, 2020

9e0c3565

[LLDB] Fix handling of bit-fields in a union · bae9aedb

shafik authored Nov 12, 2020

When parsing DWARF and laying out bit-fields we don't properly take into account when they are in a union, they will all have a zero offset.

Differential Revision: https://reviews.llvm.org/D91118

bae9aedb

[PhaseOrdering] regenerate test checks; NFC · d5e89e8f
Sanjay Patel authored Nov 12, 2020

d5e89e8f
[InstCombine] add tests for low-mask-of-add; NFC · 96f4aa67
Sanjay Patel authored Nov 12, 2020

96f4aa67

Some updates/fixes to the creduce script. · 0c80b542

Amy Huang authored Aug 07, 2019

This was motivated by changes to llvm's `not --crash` disabling symbolization
but I ended up removing `not` from the script entirely because it
returns differently depending on whether clang "crashes" or exits for some
other reason. The script had to choose between calling `not` and `not --crash`
and sometimes it was wrong.

The script also now disables symbolization when we don't read the stack
trace because symbolizing is kind of slow.

Differential Revision: https://reviews.llvm.org/D91372

0c80b542

[AMDGPU] Enable multi-dword flat scratch load/stores · cf6565f6
Stanislav Mekhanoshin authored Nov 12, 2020
```
Differential Revision: https://reviews.llvm.org/D91384
```
cf6565f6
[mlir][Python] Fix 'unreferenced local variable' warning on MSVC. · 4726a402
Stella Laurenzo authored Nov 11, 2020
```
Differential Revision: https://reviews.llvm.org/D91282
```
4726a402

[PatternMatch] Add single index InsertValue matcher. · c1f6f300

Florian Hahn authored Nov 12, 2020

This patch adds a new matcher for single index InsertValue instructions,
similar to the existing matcher for ExtractValue.

Reviewed By: lebedev.ri

Differential Revision: https://reviews.llvm.org/D91352

c1f6f300

[OPENMP]Fix PR47790: segfault in frontend while parsing Objective-C with OpenMP. · 07b568a9
Alexey Bataev authored Nov 12, 2020
```
Need to check if the sema is actually finishing a function decl.

Differential Revision: https://reviews.llvm.org/D91376
```
07b568a9

[VE] Disable -fsigaddr option for VE · 9c504ec0

Kazushi (Jam) Marukawa authored Nov 12, 2020

VE needs to support integrated assembler and "nas".  This "nas"
doesn't recognize ".sigaddr" pseudo mnemonics, so need to disable
it.  This patch disable it on VE by default.  Also add a regression
test for that.

Reviewed By: simoll

Differential Revision: https://reviews.llvm.org/D91350

9c504ec0

[flang] Include source information in an invalid file-unit-number message · 04a14798

peter klausler authored Nov 12, 2020

An io-unit that is an internal-file-variable is syntactically identical
to a file-unit-number expression that is a variable reference. An
ambiguous unit is initially parsed as an internal-file-variable. If
semantic analysis determines that the unit is not of character type,
it is rewritten as an internal-file-variable. This modification must
retain source coordinate information.

Differential revision: https://reviews.llvm.org/D91375

04a14798

[fuzzer] Add Windows Visual C++ exception intercept · f897e82b

Joe Pletcher authored Nov 12, 2020

Adds a new option, `handle_winexcept` to try to intercept uncaught
Visual C++ exceptions on Windows. On Linux, such exceptions are handled
implicitly by `std::terminate()` raising `SIBABRT`. This option brings the
Windows behavior in line with Linux.

Unfortunately this exception code is intentionally undocumented, however
has remained stable for the last decade. More information can be found
here: https://devblogs.microsoft.com/oldnewthing/20100730-00/?p=13273

Reviewed By: morehouse, metzman

Differential Revision: https://reviews.llvm.org/D89755

f897e82b

[flang] Recognize END FILE as ENDFILE in free form source · 6c516cda
peter klausler authored Nov 12, 2020
```
The ENDFILE statement may be spelled as two words.

Differential revision: https://reviews.llvm.org/D91377
```
6c516cda

[NFC][NewPM] Reuse PassBuilder callbacks with -O0 · 3a7b57b7

Arthur Eubanks authored Nov 05, 2020

This removes lots of duplicated code which was necessary before
https://reviews.llvm.org/D89158.
Now we can use PassBuilder::runRegisteredEPCallbacks().
This is mostly sanitizers.

There is likely more that can be done to simplify, but let's start with this.

Reviewed By: ychen

Differential Revision: https://reviews.llvm.org/D90870

3a7b57b7