Commits · f28e710db720a913f4b508a9dc43f25e81629e72 · Lorenzo Albano / LLVM bpEVL

Sep 10, 2021

[OpenMP] Make CUDA math library functions SPMD amenable · f28e710d

Joseph Huber authored Aug 30, 2021

This patch adds the SPMD amenable assumption to the CUDA math library
definitions in Clang. Previously these functions would block SPMD
execution on the device because they're intrinsic calls into the library
and can't be calculated. These functions don't have side-effects so they
are safe to execute in SPMD mode.

Depends on D105937

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D108958

f28e710d

[libc] Add extension functions fedisableexcept, feenableexcept and fegetexcept. · 0da5ac1a

Siva Chandra Reddy authored Sep 09, 2021

Reviewed By: michaelrj

Differential Revision: https://reviews.llvm.org/D109613

0da5ac1a

[hwasan] Do not instrument accesses to uninteresting allocas. · 09391e7e

Florian Mayer authored Sep 09, 2021

This leads to a statistically significant improvement when using -hwasan-instrument-stack=0: https://bit.ly/3AZUIKI.
When enabling stack instrumentation, the data appears to get better, but not statistically significantly so. This is consistent
with the very moderate improvements I have seen for stack safety otherwise, so I expect it to improve when the underlying
issue of that is resolved.

Reviewed By: eugenis

Differential Revision: https://reviews.llvm.org/D108457

09391e7e

[Sanitizers] intercept netent, protoent and mincore on FreeBSD. · 8fdd821a

David Carlier authored Sep 10, 2021

Also intercept netent on Linux.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D109287

8fdd821a

[stack-safety] Allow to determine safe accesses. · 57335b6e

Florian Mayer authored Sep 10, 2021

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D109503

57335b6e

[clang] Fix typo in test from · 23f256f2

Nico Weber authored Sep 10, 2021

We want the driver-level flag here, else the test passes for the wrong reasons.
See comments on https://reviews.llvm.org/D99901.

23f256f2

[CodeGen, Target] Use pred_empty and succ_empty (NFC) · c9fca53a

Kazu Hirata authored Sep 10, 2021

c9fca53a

[lldb] Add support for debugging via the dynamic linker. · 03df9710

Rumeet Dhindsa authored Sep 10, 2021

This patch adds support for shared library load when the executable is
called through ld.so.

Differential Revision: https://reviews.llvm.org/D108061

03df9710

[clang] `aligned_alloc` allocation function specifies alignment in first arg,... · f3c2094d

Roman Lebedev authored Sep 10, 2021

[clang] `aligned_alloc` allocation function specifies alignment in first arg, manifest that knowledge

Mainly, if a constant value was passed as an alignment,
then we correctly annotate the alignment of the returned value
of @aligned_alloc. And if it wasn't constant,
then we also don't lose that, but emit an assumption.
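
As a rough illustration of the property this commit lets the compiler rely on, the sketch below calls `std::aligned_alloc` with the alignment as the first argument and checks the returned address at run time. The helper name `allocationIsAligned` is hypothetical and is not part of the patch; the example only assumes a C++17 standard library that provides `std::aligned_alloc`.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstdlib>

// Hypothetical helper (not from the patch): allocates `size` bytes with
// std::aligned_alloc and reports whether the returned address is a multiple
// of `alignment`. `alignment` should be a power of two supported by the
// platform, and `size` a multiple of `alignment`.
bool allocationIsAligned(std::size_t alignment, std::size_t size) {
  // The alignment is the FIRST argument, the size the second.
  void *p = std::aligned_alloc(alignment, size);
  if (p == nullptr)
    return false; // allocation failed; nothing to check
  bool aligned = reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
  std::free(p);
  return aligned;
}
```

With a constant alignment such as `allocationIsAligned(64, 128)`, the alignment is known at compile time, which is the case where the commit message says the returned value can be annotated directly.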

f3c2094d

[NFCI][clang] Move allocation alignment manifestation for malloc-like into Sema from Codegen · 85ba583e

Roman Lebedev authored Sep 10, 2021

... so that it happens right next to `AddKnownFunctionAttributesForReplaceableGlobalAllocationFunction()`,
which is good for consistency.

85ba583e

[NFC][clang] Improve test coverage for alignment manifestation on aligned allocation functions · 50d7ecc5

Roman Lebedev authored Sep 10, 2021

50d7ecc5

[AArch64ISelLowering] Fix null pointer access in performSVEAndCombine. · da4a2fd8

Huihui Zhang authored Sep 10, 2021

When combining 'and' of an unsigned unpack and shuffle instruction,
bail early if shuffle is not constructed from a constant integer.

Reviewed By: paulwalker-arm

Differential Revision: https://reviews.llvm.org/D109556

da4a2fd8

[openmp][amdgpu] Update SupportAndFAQ docs · f244af5c

Jon Chesterfield authored Sep 10, 2021

f244af5c

[AggressiveInstCombine] Add `udiv` and `urem` instrs to TruncInstCombine DAG · 54d8ebbb

Anton Afanasyev authored Sep 07, 2021

Add `udiv` and `urem` instructions to the DAG post-dominated by `trunc`,
allowing TruncInstCombine to reduce bitwidth of expressions containing these
instructions. It is sufficient to require that all truncated bits of both
operands are zeros: https://alive2.llvm.org/ce/z/yiithn
(`urem` case is identical).

Differential Revision: https://reviews.llvm.org/D109515
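
The sufficiency condition stated above can be checked by brute force for a small bit width. The following is a minimal sketch, assuming a 16-bit "wide" type truncated to 8 bits; the function names are hypothetical and the code is independent of the actual TruncInstCombine implementation.

```cpp
#include <cstdint>

// Hypothetical check: for all 16-bit operands whose upper 8 bits are zero,
// compare "divide in 16 bits, then truncate to 8 bits" against
// "divide directly in 8 bits". The same is done for the remainder.
bool narrowingIsSafe() {
  for (uint32_t x = 0; x <= 0xFF; ++x) {
    for (uint32_t y = 1; y <= 0xFF; ++y) { // y == 0 is skipped (division by zero)
      uint16_t wideX = static_cast<uint16_t>(x);
      uint16_t wideY = static_cast<uint16_t>(y);
      uint8_t wideDiv = static_cast<uint8_t>(wideX / wideY);
      uint8_t wideRem = static_cast<uint8_t>(wideX % wideY);
      uint8_t narrowX = static_cast<uint8_t>(x);
      uint8_t narrowY = static_cast<uint8_t>(y);
      uint8_t narrowDiv = static_cast<uint8_t>(narrowX / narrowY);
      uint8_t narrowRem = static_cast<uint8_t>(narrowX % narrowY);
      if (wideDiv != narrowDiv || wideRem != narrowRem)
        return false;
    }
  }
  return true;
}

// Shows why the condition matters: if a truncated bit of an operand is set,
// the wide and narrow results can differ.
bool truncatedBitBreaksNarrowing() {
  uint16_t x = 0x0100; // bit 8 is set, so it is lost by truncation
  uint16_t y = 0x0002;
  uint8_t wide = static_cast<uint8_t>(x / y); // 256 / 2 = 128
  uint8_t narrow = static_cast<uint8_t>(static_cast<uint8_t>(x) /
                                        static_cast<uint8_t>(y)); // 0 / 2 = 0
  return wide != narrow;
}
```

Both functions return `true`: narrowing is safe when the truncated bits are zero, and the second function gives one operand pair where it is not.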

54d8ebbb

[Test][AggressiveInstCombine] Add test for `udiv` and `urem` · ea7b2c14

Anton Afanasyev authored Sep 09, 2021

Precommit test for D109515

ea7b2c14

Revert "[OpenMP] Group side-effects to improve guarding efficiency" · d2f206e0

Johannes Doerfert authored Sep 10, 2021

This reverts commit ca134c39.

There seems to be a problem with the tests, investigating now:
  https://lab.llvm.org/buildbot/#/builders/61/builds/14574

d2f206e0

Revert "[GlobalOpt][FIX] Do not embed initializers into AS!=0 globals" · d9a8d208

Johannes Doerfert authored Sep 10, 2021

This reverts commit 7dbba337.

There seems to be a problem with the tests, investigating now:
  https://lab.llvm.org/buildbot/#/builders/61/builds/14574

d9a8d208

[OpenMP][Docs] Remove old/outdated webpage · 9f844aee

Johannes Doerfert authored Aug 23, 2021

This should have happened a long time ago. Now that openmp.llvm.org
redirects to openmp.llvm.org/docs, we have completely switched over to
the Sphinx documentation page.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D108588

9f844aee

[OpenMP] Encode `omp [...] assume[...]` assumptions with `omp[x]` prefix · 45e8e084

Johannes Doerfert authored Jul 13, 2021

Since these assumptions are coming from OpenMP it makes sense to mark
them as such in the generic IR encoding. Standardized assumptions will
be named
  omp_ASSUMPTION_NAME
and extensions will be named
  ompx_ASSUMPTION_NAME
which is the OpenMP 5.2 syntax for "extensions" of any kind.

This also matches what the OpenMP-Opt pass expects.

Summarized,
  #pragma omp [...] assume[s] no_parallelism
now generates the same IR assumption annotation as
  __attribute__((assume("omp_no_parallelism")))

Reviewed By: jhuber6

Differential Revision: https://reviews.llvm.org/D105937

45e8e084

[GlobalOpt][FIX] Do not embed initializers into AS!=0 globals · 7dbba337

Johannes Doerfert authored Sep 02, 2021

Not all address spaces support initializers for globals and we can
therefore not set them without checking if they are allowed. This
patch adds a hook into TTI to check if an AS allows non-undef
initializers. By default we disable it for all but address space 0;
NVPTX and AMDGPU targets allow all but address space 3.

Reviewed By: tra

Differential Revision: https://reviews.llvm.org/D109337
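
The policy described above can be modelled as a virtual hook with a conservative default and a target override. This is only a sketch under stated assumptions: `TargetInfoModel`, `GpuTargetModel`, `allowsNonUndefInitializer`, and `mayEmbedInitializer` are hypothetical names, not the real TTI interface added by the patch.

```cpp
// Hypothetical stand-in for the target hook; not the real LLVM class.
struct TargetInfoModel {
  virtual ~TargetInfoModel() = default;
  // Default policy from the commit message: only address space 0 may
  // receive a non-undef initializer.
  virtual bool allowsNonUndefInitializer(unsigned AddrSpace) const {
    return AddrSpace == 0;
  }
};

// Models the NVPTX/AMDGPU policy from the commit message: every address
// space except 3 may receive a non-undef initializer.
struct GpuTargetModel : TargetInfoModel {
  bool allowsNonUndefInitializer(unsigned AddrSpace) const override {
    return AddrSpace != 3;
  }
};

// What an optimization would ask before embedding an initializer into a
// global that lives in address space `AddrSpace`.
bool mayEmbedInitializer(const TargetInfoModel &Target, unsigned AddrSpace) {
  return Target.allowsNonUndefInitializer(AddrSpace);
}
```

For example, `mayEmbedInitializer(TargetInfoModel{}, 1)` is `false` under the default policy, while `mayEmbedInitializer(GpuTargetModel{}, 1)` is `true` and `mayEmbedInitializer(GpuTargetModel{}, 3)` is `false`.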

7dbba337

[OpenMP] Group side-effects to improve guarding efficiency · ca134c39

Johannes Doerfert authored Aug 11, 2021

When we guard side-effects as part of SPMDzation we do it for
consecutive instructions that need guarding. This patch will try to
reorder guarded side-effects in a block to decrease the number of
guarded regions we need. It does not use any smarts, e.g., alias
analysis, to move side-effects over non-interfering reads. Instead,
it only moves side-effects downwards to the next guarded side-effect
if nothing in between could possibly have been affected.

Reviewed By: ggeorgakoudis

Differential Revision: https://reviews.llvm.org/D109070
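
The reordering idea can be illustrated with a toy model, which is not the OpenMP-Opt implementation. Assume a basic block is a string in which `G` is a side-effect that needs guarding, `N` is an instruction that cannot be affected by moving a side-effect past it, and `A` is an instruction that might be affected. The encoding and both function names are hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>

// Counts maximal runs of consecutive 'G' entries; each run would need one
// guarded region in this toy model.
std::size_t countGuardedRegions(const std::string &block) {
  std::size_t regions = 0;
  bool inRegion = false;
  for (char c : block) {
    if (c == 'G') {
      if (!inRegion)
        ++regions;
      inRegion = true;
    } else {
      inRegion = false;
    }
  }
  return regions;
}

// Moves each 'G' downwards to sit directly before the next 'G', but only if
// everything in between is 'N' (cannot be affected). Walks from the end of
// the block so already-formed groups keep growing.
std::string sinkSideEffects(std::string block) {
  for (std::size_t i = block.size(); i-- > 0;) {
    if (block[i] != 'G')
      continue;
    std::size_t j = i + 1;
    while (j < block.size() && block[j] == 'N')
      ++j;
    // Only move if the scan stopped at another 'G' and there is a gap.
    if (j < block.size() && block[j] == 'G' && j > i + 1)
      std::rotate(block.begin() + i, block.begin() + i + 1, block.begin() + j);
  }
  return block;
}
```

In this model `"GNGNG"` needs three guarded regions before reordering and one after it (`"NNGGG"`), whereas in `"GAGNG"` the first side-effect stays put because an `A` sits in the way, leaving two regions (`"GANGG"`).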

ca134c39

[ARM] Remove unused tblgen arguments. NFC · deefeffb

David Green authored Sep 10, 2021

As per D109359, this removes or makes use of some of the existing unused
NEON and base ARM tblgen arguments.

deefeffb

[CallLowering] Support opaque pointers · 14afbe94

Nikita Popov authored Sep 10, 2021

Always use the byval/inalloca/preallocated type (which is required
nowadays), don't fall back on the pointer element type.

This requires adding Function::getParamPreallocatedType() to
mirror the CallBase API, so that the templated code can work with
both.

14afbe94

[IR] Remove unused parameter (NFC) · d34d2bbe

Nikita Popov authored Sep 10, 2021

d34d2bbe

[RISCV] Enable CGP to sink splat operands of Add/Sub/Mul/Shl/LShr/AShr · 1b736bda

Craig Topper authored Sep 10, 2021

LICM may have pulled out a splat, but with .vx instructions we
can fold it into an operation.

This patch enables CGP to reverse the LICM transform and move the
splat back into the loop.

I've started with the commutable integer operations and shifts, but we can
extend this with more operations in future patches.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D109394

1b736bda

[RISCV] Teach vsetvli insertion that stores don't use the policy bits in vtype. · 6c7cadb8

Craig Topper authored Sep 09, 2021

This can avoid a vsetvl after a tail undisturbed operation.

Differential Revision: https://reviews.llvm.org/D109549

6c7cadb8

[lldb] [test] Remove parent check in Subprocess/clone-follow-child-softbp.test · 4e7ac6fa

Michał Górny authored Sep 10, 2021

Hopefully this will resolve the remaining flakiness.

4e7ac6fa

[lld][WebAssembly] Cleanup output of --verbose · 3a7bcba3

Sam Clegg authored Sep 10, 2021

Remove some unnecessary logging from wasm-ld when running under
`--verbose`.  Unlike `-debug` this logging is available in release
builds.  This change makes it a little more minimal/readable.

Also, avoid compiling the `debugWrite` function in release builds
where it does nothing.  This should remove a lot of debug strings from
the binary, and avoid having to construct unused debug strings at
runtime.

Differential Revision: https://reviews.llvm.org/D109583

3a7bcba3

[lldb] [test] Skip A/vRun/QEnvironment* tests on Windows, and fix them · d727bd69

Michał Górny authored Sep 10, 2021

Skip A/vRun/QEnvironment* tests on Windows as testing for output is
known not to work there.  Add a missing output check to the vRun test.

d727bd69

[lldb] [test] Attempt to fix gdb_remote_client A/vRun tests on Windows · 784281d3

Michał Górny authored Sep 10, 2021

784281d3

[lldb] [test] Mark new launch/QEnvironment tests as llgs category · c362f610

Michał Górny authored Sep 10, 2021

c362f610

[lldb] [test] Skip file permission tests on Windows · 9a4379c3

Michał Górny authored Sep 10, 2021

9a4379c3

[ARM] Remove unused tblgen arguments. NFCI · 6b7cdb40

David Green authored Sep 10, 2021

As per D109359, this removes or makes use of some of the existing unused
MVE tblgen arguments.

6b7cdb40

[WebAssembly][libObject] Avoid re-use of Section object during parsing · e4b2f305

Sam Clegg authored Aug 31, 2021

The re-use of this struct across iterations of the loop was causing
fields (specifically Name) to be incorrectly shared between multiple
sections.

Differential Revision: https://reviews.llvm.org/D108984
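
The class of bug described above can be reproduced in a few lines. The sketch below uses simplified, hypothetical stand-ins (`RawSection`, `Section`, `parseReusing`, `parseFresh`) rather than the real libObject types, and assumes that only custom sections assign a name during parsing.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical, simplified stand-ins; not the real llvm::object types.
struct RawSection {
  uint32_t Type;
  std::string CustomName; // only meaningful for custom sections
};

struct Section {
  uint32_t Type = 0;
  std::string Name;
};

constexpr uint32_t kCustomSection = 0; // assumed id for custom sections

// Buggy pattern: one Section object is declared outside the loop, so a Name
// set for a custom section leaks into later sections that never set it.
std::vector<Section> parseReusing(const std::vector<RawSection> &input) {
  std::vector<Section> out;
  Section sec;
  for (const RawSection &raw : input) {
    sec.Type = raw.Type;
    if (raw.Type == kCustomSection)
      sec.Name = raw.CustomName;
    out.push_back(sec);
  }
  return out;
}

// Fixed pattern: a fresh Section per iteration starts with an empty Name.
std::vector<Section> parseFresh(const std::vector<RawSection> &input) {
  std::vector<Section> out;
  for (const RawSection &raw : input) {
    Section sec;
    sec.Type = raw.Type;
    if (raw.Type == kCustomSection)
      sec.Name = raw.CustomName;
    out.push_back(sec);
  }
  return out;
}
```

Given a custom section named `"producers"` followed by a non-custom section, `parseReusing` reports the name `"producers"` for both, while `parseFresh` leaves the second name empty.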

e4b2f305

[clang-offload-bundler] Fix compatibility testing for non-assert builds · 4a25c3fb

Saiyedul Islam authored Sep 10, 2021

The test using -debug-only=CodeObjectCompatibility was failing in
non-assert builds, so it has been moved to a different file which
requires asserts.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D109592

4a25c3fb

[OpaquePtr] Forbid mixing typed and opaque pointers · 90ec6dff

Nikita Popov authored Sep 04, 2021

Currently, opaque pointers are supported in two forms: the
-force-opaque-pointers mode, where all pointers are opaque and
typed pointers do not exist, and a simple ptr type that can
coexist with typed pointers.

This patch removes support for the mixed mode. You either get
typed pointers, or you get opaque pointers, but not both. In the
(current) default mode, using ptr is forbidden. In -opaque-pointers
mode, all pointers are opaque.

The motivation here is that the mixed mode introduces additional
issues that don't exist in fully opaque mode. D105155 is an example
of a design problem. Looking at D109259, it would probably need
additional work to support mixed mode (e.g. to generate GEPs for
typed base but opaque result). Mixed mode will also end up
inserting many casts between i8* and ptr, which would require
significant additional work to consistently avoid.

I don't think the mixed mode is particularly valuable, as it
doesn't align with our end goal. The only thing I've found it to
be moderately useful for is adding some opaque pointer tests in
between typed pointer tests, but I think we can live without that.

Differential Revision: https://reviews.llvm.org/D109290

90ec6dff

[InstCombine] add tests for X == 0 ? 0 : X * Y ; NFC · 745f82b8

Filipp Zhinkin authored Sep 10, 2021

These are the tests for D108408 with current baseline results.

745f82b8

[AArch64] Regenerate some test checks. NFC · 2c5590ad

David Green authored Sep 10, 2021

This updates some test files, most of which already use update_test_check-style
checks, and regenerates the check lines with the script, making them more maintainable.

2c5590ad

[clang][deps] Test diagnostic options are being respected · 7afabc2e

Jan Svoboda authored Sep 10, 2021

This patch tests code in D108976. This split is necessary to avoid a temporary regression.

Depends on D108974

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D109158

7afabc2e

[clang][deps] Sanitize both instances of DiagnosticOptions · 993f60ae

Jan Svoboda authored Sep 10, 2021

During dependency scanning, we generally want to suppress -Werror. Apply the same logic to the DiagnosticOptions instance used for command-line parsing.

This fixes a test failure on the PS4 bot, where the system header directory could not be found, which was reported due to -Werror being on the command line and not being sanitized.

993f60ae