Commits · 20c6e0749461147df19a3b126d1a48106c63c351 · Lorenzo Albano / LLVM bpEVL

Jan 15, 2020

[mlir] Enable printing of FuncOp in the generic form. · 20c6e074

River Riddle authored Jan 14, 2020

Summary:
This was previously disabled as FunctionType TypeAttrs could not be roundtripped in the IR. This has been fixed, so we can now generically print FuncOp.

Depends On D72429

Reviewed By: jpienaar, mehdi_amini

Differential Revision: https://reviews.llvm.org/D72642

20c6e074

make -fmodules-codegen and -fmodules-debuginfo work also with PCHs · cbc9d22e

Luboš Luňák authored Nov 03, 2019

Allow to build PCH's (with -building-pch-with-obj and the extra .o file)
with -fmodules-codegen -fmodules-debuginfo to allow emitting shared code
into the extra .o file, similarly to how it works with modules. A bit of
a misnomer, but the underlying functionality is the same. This saves up
to 20% of build time here.

Differential Revision: https://reviews.llvm.org/D69778

cbc9d22e

Jan 14, 2020

fix recent -fmodules-codegen fix test · b5b2cf7a
Luboš Luňák authored Jan 14, 2020

b5b2cf7a

-fmodules-codegen should not emit extern templates · 729530f6

Luboš Luňák authored Nov 03, 2019

If a header contains 'extern template', then the template should be provided
somewhere by an explicit instantiation, so it is not necessary to generate
a copy. Worse, this can lead to an unresolved symbol, because the codegen's
object file will not actually contain functions from such a template
because of the GVA_AvailableExternally, but the object file for the explicit
instantiation will not contain them either because it will be blocked
by the information provided by the module.

Differential Revision: https://reviews.llvm.org/D69779

729530f6

[mlir][Linalg] Update the semantics, verifier and test for Linalg with tensors. · f52d7173

Nicolas Vasilache authored Jan 11, 2020

Summary:
This diff fixes issues with the semantics of linalg.generic on tensors that appeared when converting directly from HLO to linalg.generic.
The changes are self-contained within MLIR and can be captured and tested independently of XLA.

The linalg.generic and indexed_generic are updated to:

To allow progressive lowering from the value world (a.k.a tensor values) to
the buffer world (a.k.a memref values), a linalg.generic op accepts
mixing input and output ranked tensor values with input and output memrefs.

```
%1 = linalg.generic #trait_attribute %A, %B {other-attributes} :
  tensor<?x?xf32>,
  memref<?x?xf32, stride_specification>
  -> (tensor<?x?xf32>)
```

In this case, the number of outputs (args_out) must match the sum of (1) the
number of output buffer operands and (2) the number of tensor return values.
The semantics is that the linalg.indexed_generic op produces (i.e.
allocates and fills) its return values.

Tensor values must be legalized by a buffer allocation pass before most
transformations can be applied. Such legalization moves tensor return values
into output buffer operands and updates the region argument accordingly.

Transformations that create control-flow around linalg.indexed_generic
operations are not expected to mix with tensors because SSA values do not
escape naturally. Still, transformations and rewrites that take advantage of
tensor SSA values are expected to be useful and will be added in the near
future.

Subscribers: bmahjour, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72555

f52d7173

[DAGCombine] Replace `getIntPtrConstant()` with `getVectorIdxTy()`. · 8d07f8d9

Michael Liao authored Jan 14, 2020

- Prefer `getVectorIdxTy()` as the index operand type for
  `EXTRACT_SUBVECTOR` as targets expect different types by overloading
  `getVectorIdxTy()`.

8d07f8d9

[OPENMP]Do not emit special virtual function for NVPTX target. · a48600c0

Alexey Bataev authored Jan 14, 2020

There are no special virtual function handlers (like __cxa_pure_virtual)
defined for NVPTX target, so just emit such functions as null pointers
to prevent issues with linking and unresolved references.

a48600c0

[mlir] Use double format when parsing bfloat16 hexadecimal values · 1bd14ce3

River Riddle authored Jan 14, 2020

Summary: bfloat16 doesn't have a valid APFloat format, so we have to use double semantics when storing it. This change makes sure that hexadecimal values can be round-tripped properly given this fact.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D72667

1bd14ce3

Remove trailing `;`. NFC. · a3490e3e
Michael Liao authored Jan 14, 2020

a3490e3e

[AArch64][GlobalISel]: Support @llvm.{return,frame}address selection. · 6078f2fe

Amara Emerson authored Jan 14, 2020

These intrinsics expand to a variable number of instructions so just like in
ISelLowering.cpp we use custom code to deal with them.

Committing Tim's original patch.

Differential Revision: https://reviews.llvm.org/D65656

6078f2fe

[Driver][test] Fix Driver/hexagon-toolchain-elf.c for -DCLANG_DEFAULT_LINKER=lld builds · 1ca51c06
Fangrui Song authored Jan 13, 2020
```
Reviewed By: nathanchance, sidneym

Differential Revision: https://reviews.llvm.org/D72668
```
1ca51c06

[LegalizeTypes] Remove untested code from ExpandIntOp_UINT_TO_FP · 9ee90ea5

Craig Topper authored Jan 14, 2020

This code is untested in tree because the "APFloat::semanticsPrecision(sem) >= SrcVT.getSizeInBits() - 1" check is false for most combinations for int and fp types except maybe i32 and f64. For that you would need i32 to be an illegal type, but f64 to be legal and have custom handling for legalizing the split sint_to_fp. The precision check itself was added in 2010 to fix a double rounding issue in the algorithm that would occur if the sint_to_fp was not able to do the conversion without rounding.

Differential Revision: https://reviews.llvm.org/D72728

9ee90ea5

[GVN] fix comment/argument name to match actual implementation. NFC · fe37d9ec
Fedor Sergeev authored Jan 15, 2020

fe37d9ec

[clang][test][NFC] Use more widely supported sanitizer for file dependency tests · 986202fa

Jan Korous authored Jan 14, 2020

The tests aren't concerned at all by the actual sanitizer - only by blacklist being reported as a dependency.
We're unfortunately limited by platform support for any particular sanitizer but we can at least use one that is widely supported.

Post-commit review:
https://reviews.llvm.org/D72729

986202fa

[InstCombine] Fix worklist management when removing guard intrinsic · 04e58615

Nikita Popov authored Jan 11, 2020

When multiple guard intrinsics are merged into one, currently the
result of eraseInstFromFunction() is returned -- however, this
should only be done if the current instruction is being removed.
In this case we're removing a different instruction and should
instead report that the current one has been modified by returning it.

For this test case, this reduces the number of instcombine iterations
from 5 to 2 (the minimum possible).

Differential Revision: https://reviews.llvm.org/D72558

04e58615

[DebugInfo] Add option to clang to limit debug info that is emitted for classes. · 651128f5

Amy Huang authored Jan 03, 2020

Summary:
This patch adds an option to limit debug info by only emitting complete class
type information when its constructor is emitted. This applies to classes
that have nontrivial user defined constructors.

I implemented the option by adding another level to `DebugInfoKind`, and
a flag `-flimit-debug-info-constructor`.

Total object file size on Windows, compiling with RelWithDebInfo:
  before: 4,257,448 kb
  after:  2,104,963 kb

And on Linux
  before: 9,225,140 kb
  after:  4,387,464 kb

According to the Windows clang.pdb files, here is a list of types that are no
longer complete with this option enabled: https://reviews.llvm.org/P8182

Reviewers: rnk, dblaikie

Subscribers: aprantl, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D72427

651128f5

[analyzer] Fix SARIF column locations · 5ee616a7
Joe Ranieri authored Jan 14, 2020
```
Differential revision: https://reviews.llvm.org/D70689
```
5ee616a7

dotest.py: Add option to pass extra lldb settings to dotest · b53d44b1

Adrian Prantl authored Jan 14, 2020

The primary motivation for this is to add another dimension to the
Swift LLDB test matrix, but this seems generally useful.

Differential Revision: https://reviews.llvm.org/D72662

b53d44b1

[libcxx] [Windows] Make a more proper implementation of strftime_l for mingw with msvcrt.dll · 337e4359

Martin Storsjö authored Oct 28, 2019

This also makes this function consistent with the rest of the
libc++ provided fallbacks.

The locale support in msvcrt.dll is very limited anyway; it can
only be configured processwide, not per thread, and it only seems
to support the locales "C" and "" (the user set locale), so it's
hard to make any meaningful automatic test for it. But manually tested,
this change does make time formatting locale code in libc++ output
times in the user requested format, when using locale "".

Differential Revision: https://reviews.llvm.org/D69554

337e4359

[SVE] Add patterns for MUL immediate instruction. · 26d96126

Danilo Carvalho Grael authored Jan 13, 2020

Summary: Add the missing MUL pattern for integer immediate instructions.

Reviewers: sdesmalen, huntergr, efriedma, c-rhodes, kmclaughlin

Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits, amehsan

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72654

26d96126

[Driver] Ignore -fno-semantic-interposition · 5d1b3ba6

Fangrui Song authored Jan 14, 2020

Fedora wants to build projects with -fno-semantic-interposition (e.g.
https://fedoraproject.org/wiki/Changes/PythonNoSemanticInterpositionSpeedup),
which is supported by GCC>=5.

Clang's current behavior is similar to -fno-semantic-interposition and
the end goal is to make it more so
(https://lists.llvm.org/pipermail/llvm-dev/2016-November/107625.html).
Ignore this option.

We should let users know -fsemantic-interposition is not currently
supported, so it should remain a hard error.

Reviewed By: serge-sans-paille

Differential Revision: https://reviews.llvm.org/D72724

5d1b3ba6

[OpenMP][Tool] Runtime warning for missing TSan-option · 2d4571bf

Joachim Protze authored Jan 13, 2020

TSan spuriously reports for any OpenMP application a race on the initialization
of a runtime internal mutex:

```
Atomic read of size 1 at 0x7b6800005940 by thread T4:
  #0 pthread_mutex_lock <null> (a.out+0x43f39e)
  #1 __kmp_resume_64 <null> (libomp.so.5+0x84db4)

Previous write of size 1 at 0x7b6800005940 by thread T7:
  #0 pthread_mutex_init <null> (a.out+0x424793)
  #1 __kmp_suspend_initialize_thread <null> (libomp.so.5+0x8422e)
```

According to @AndreyChurbanov this is a false positive report, as the control
flow of the runtime guarantees the ordering of the mutex initialization and
the lock:
https://software.intel.com/en-us/forums/intel-open-source-openmp-runtime-library/topic/530363

To suppress this report, I suggest the use of
TSAN_OPTIONS='ignore_uninstrumented_modules=1'.
With this patch, a runtime warning is provided in case an OpenMP application
is built with Tsan and executed without this Tsan-option.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D70412

2d4571bf

[NewPM] Port MergeFunctions pass · 41033186

Nikita Popov authored Jan 10, 2020

This ports the MergeFunctions pass to the NewPM. This was rather
straightforward, as no analyses are used.

Additionally MergeFunctions needs to be conditionally enabled in
the PassBuilder, but I left that part out of this patch.

Differential Revision: https://reviews.llvm.org/D72537

41033186

[OPENMP]Improve handling of possibly incorrectly mapped types. · 48bad08a
Alexey Bataev authored Jan 14, 2020
```
Need to analayze the type of the expression for mapping, not the type of
the declaration.
```
48bad08a

[InstCombine] Fix infinite loop due to bitcast <-> phi transforms · 65c0805b

Nikita Popov authored Jan 13, 2020

Fix for https://bugs.llvm.org/show_bug.cgi?id=44245.

The optimizeBitCastFromPhi() and FoldPHIArgOpIntoPHI() end up
fighting against each other, because optimizeBitCastFromPhi()
assumes that bitcasts of loads will get folded. This doesn't
happen here, because a dangling phi node prevents the one-use
fold in https://github.com/llvm/llvm-project/blob/master/llvm/lib/Transforms/InstCombine/InstCombineLoadStoreAlloca.cpp#L620-L628 from triggering.

This patch fixes the issue by explicitly performing the load
combine as part of the bitcast of phi transform. Other attempts
to force the load to be combined first were ultimately too
unreliable.

Differential Revision: https://reviews.llvm.org/D71164

65c0805b

[InstCombine] Make combineLoadToNewType a method; NFC · b4dd928f
Nikita Popov authored Jan 13, 2020
```
So it can be reused as part of other combines.
In particular for D71164.
```
b4dd928f

[InstCombine] Fix user iterator invalidation in bitcast of phi transform · 652cd7c1

Nikita Popov authored Jan 13, 2020

This fixes the issue encountered in D71164. Instead of using a
range-based for, manually iterate over the users and advance the
iterator beforehand, so we do not skip any users due to iterator
invalidation.

Differential Revision: https://reviews.llvm.org/D72657

652cd7c1

[InstCombine] Add test for iterator invalidation bug; NFC · fa632340
Nikita Popov authored Jan 13, 2020

fa632340

[nfc][libomptarget] Refactor nvptx/target_impl.cu · 2a43688a

Jon Chesterfield authored Jan 14, 2020

Summary:
[nfc][libomptarget] Refactor nxptx/target_impl.cu

Use __kmpc_impl_atomic_add instead of atomicAdd to match the rest of the file.
Alternatively, target_impl.cu could use the cuda functions directly. Using a mixture in this
file was an oversight, happy to resolve in either direction.

Removed some comments that look outdated.

Call __kmpc_impl_unset_lock directly to avoid a redundant diagnostic and remove an implict
dependency on interface.h.

Reviewers: ABataev, grokos, jdoerfert

Reviewed By: jdoerfert

Subscribers: jfb, openmp-commits

Tags: #openmp

Differential Revision: https://reviews.llvm.org/D72719

2a43688a

[nfc][libomptarget] Refactor amdgcn target_impl · 2d287bec

Jon Chesterfield authored Jan 14, 2020

Summary:
[nfc][libomptarget] Refactor amdgcn target_impl

Removes references to internal libraries from the header
Standardises on C++ mangling for all the target_impl functions
Update comment block
clang-format
Move some functions into a new target_impl.hip source file

This lays the groundwork for implementing the remaining unresolved
symbols in the target_impl.hip source.

Reviewers: jdoerfert, grokos, ABataev, ronlieb

Reviewed By: jdoerfert

Subscribers: jvesely, mgorny, jfb, openmp-commits

Tags: #openmp

Differential Revision: https://reviews.llvm.org/D72712

2d287bec

Fix NetBSD bot after ([Clang][Driver]... · 88b8cb72

Alexandre Ganea authored Jan 14, 2020

Fix NetBSD bot after b4a99a06 ([Clang][Driver] Re-use the calling process instead of creating a new process for the cc1 invocation)

88b8cb72

[InstCombine] add test for possible cast-of-select transform; NFC · 57cb4685
Sanjay Patel authored Jan 14, 2020

57cb4685

[MachineScheduler] Reduce reordering due to mem op clustering · b777e551

Jay Foad authored Jan 14, 2020

Summary:
Mem op clustering adds a weak edge in the DAG between two loads or
stores that should be clustered, but the direction of this edge is
pretty arbitrary (it depends on the sort order of MemOpInfo, which
represents the operands of a load or store). This often means that two
loads or stores will get reordered even if they would naturally have
been scheduled together anyway, which leads to test case churn and goes
against the scheduler's "do no harm" philosophy.

The fix makes sure that the direction of the edge always matches the
original code order of the instructions.

Reviewers: atrick, MatzeB, arsenm, rampitec, t.p.northover

Subscribers: jvesely, wdng, nhaehnle, kristof.beyls, hiraditya, javed.absar, arphaman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72706

b777e551

[lldb/test] test_breakpoints_func_full from... · ab72db7f

Stella Stamenova authored Jan 14, 2020

[lldb/test] test_breakpoints_func_full from TestNamespace.NamespaceBreakpointTestCase is now passing on Windows

After https://reviews.llvm.org/D70846, the test is now passing on Windows

ab72db7f

[gn build] Port 36fcbb83 · 527f5a47
LLVM GN Syncbot authored Jan 14, 2020

527f5a47

Added readability-qualified-auto check · 36fcbb83

Nathan James authored Jan 14, 2020

Adds a check that detects any auto variables that are deduced to a pointer or
a const pointer then adds in the const and asterisk according. Will also
check auto L value references that could be written as const. This relates
to the coding standard
https://llvm.org/docs/CodingStandards.html#beware-unnecessary-copies-with-auto

36fcbb83

[RISCV] Allow shrink wrapping for RISC-V · cd800f3b

lewis-revill authored Jan 14, 2020

Enabling shrink wrapping requires ensuring the insertion point of the
epilogue is correct for MBBs without a terminator, in which case the
instruction to adjust the stack pointer is the last instruction in the
block.

Differential Revision: https://reviews.llvm.org/D62190

cd800f3b

[ThinLTO/WPD] Remove an overly-aggressive assert · 2cefb939

Teresa Johnson authored Jan 13, 2020

Summary:
An assert added to the index-based WPD was trying to verify that we only
have multiple vtables for a given guid when they are all non-external
linkage. This is too conservative because we may have multiple external
vtable with the same guid when they are in comdat. Remove the assert,
as we don't have comdat information in the index, the linker should
issue an error in this case.

See discussion on D71040 for more information.

Reviewers: evgeny777, aganea

Subscribers: mehdi_amini, inglorion, hiraditya, steven_wu, dexonsmith, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72648

2cefb939

[ELF] Delete the RelExpr member R_HINT. NFC · bec1b55c

Fangrui Song authored Dec 22, 2019

R_HINT is ignored like R_NONE. There are no strong reasons to keep
R_HINT. The largest RelExpr member R_RISCV_PC_INDIRECT is 60 now.

Differential Revision: https://reviews.llvm.org/D71822

bec1b55c

[ThinLTO] Handle variable with twice promoted name (Rust) · 7dc4bbf8

Teresa Johnson authored Jan 14, 2020

Summary:
Ensure that we can internalize values produced from two rounds of
promotion.

Note that this cannot happen currently via clang, but in other use cases
such as the Rust compiler which does a first round of ThinLTO on library
code, producing bitcode, and a second round on the final binary.

In particular this can happen if a function is exported and promoted,
ending up with a ".llvm.${hash}" suffix, and then goes through a round
of optimization creating an internal switch table expansion variable
that is internal and contains the promoted name of the enclosing
function. This variable will be promoted in the second round of ThinLTO
if @foo is imported again, and therefore ends up with two
".llvm.${hash}" suffixes. Only the final one should be stripped when
consulting the index to locate the summary.

Reviewers: wmi

Subscribers: mehdi_amini, inglorion, hiraditya, JDevlieghere, steven_wu, dexonsmith, arphaman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D72711

7dc4bbf8