Commits · b86dbe0c23e3d284699da551afc2ca9c634a6c62 · Lorenzo Albano / LLVM bpEVL

Feb 16, 2021

Fix Flang build after addition of a new OpenMP clauses for a clang patch (D76342).
Flang is using TableGen to generation the declaration of clause checks and the new clause
was missing a definiton.

Reviewed By: klausler

Differential Revision: https://reviews.llvm.org/D96808

b86dbe0c

[lldb] Delete superfluous semicolon · 901ea296
Pavel Labath authored 4 years ago

901ea296
[lldb] Remove some nanosleep preprocessor magic · 1db2888e
Pavel Labath authored 4 years ago
```
It's obsolete, as we use (portable) std::this_thread::sleep_for now.
```
1db2888e
[RISCV] Add isel patterns for fixed vector fmsub/fnmadd/fnmsub. · 61a238e6
Craig Topper authored 4 years ago

61a238e6

[mlir] Fix memref_cast + subview folder when reducing rank · adfd3c70

Thomas Raoux authored 4 years ago

When the destination of the subview has a lower rank than its source we need to
fix the result type of the new subview op.

Differential Revision: https://reviews.llvm.org/D96804

adfd3c70

[clangd] Fix windows buildbots after ecea7218 · cdef5a71
Kadir Cetinkaya authored 4 years ago

cdef5a71
[gn build] Port ecea7218 · f350fe8c
LLVM GN Syncbot authored 4 years ago

f350fe8c
[gn build] Port 310b3530 · abb75702
LLVM GN Syncbot authored 4 years ago

abb75702

[FileCollector] Fix that the file system case-sensitivity check was inverted · f88b502d

Raphael Isemann authored 4 years ago

real_path returns an `std::error_code` which evaluates to `true` in case an
error happens and `false` if not. This code was checking the inverse, so
case-insensitive file systems ended up being detected as case sensitive.

Tested using an LLDB reproducer test as we anyway need a real file system and
also some matching logic to detect whether the respective file system is
case-sensitive (which the test is doing via some Python checks that we can't
really emulate with the usual FileCheck logic).

Fixes rdar://67003004

Reviewed By: JDevlieghere

Differential Revision: https://reviews.llvm.org/D96795

f88b502d

[clangd] Treat paths case-insensitively depending on the platform · ecea7218

Kadir Cetinkaya authored 4 years ago

Path{Match,Exclude} and MountPoint were checking paths case-sensitively
on all platforms, as with other features, this was causing problems on
windows. Since users can have capital drive letters on config files, but
editors might lower-case them.

This patch addresses that issue by:
- Creating regexes with case-insensitive matching on those platforms.
- Introducing a new pathIsAncestor helper, which performs checks in a
  case-correct manner where needed.

Differential Revision: https://reviews.llvm.org/D96690

ecea7218

[RISCV] Add add/sub saturation tests that exist on ARM/AArch64/X86 · acfab44e

Craig Topper authored 4 years ago

There have been some recent changes to the type legalization for
some of these intrinsics so I thought it would be good to have
coverage.

acfab44e

[SampleFDO][NFC] Refactor SampleProfile.cpp · 310b3530

Rong Xu authored 4 years ago

Refactor SampleProfile.cpp to use the core code in CodeGen.
The main changes are:
(1) Move SampleProfileLoaderBaseImpl class to a header file.
(2) Split SampleCoverageTracker to a head file and a cpp file.
(3) Move the common codes (common options and callsiteIsHot())
to the common cpp file.

Differential Revision: https://reviews.llvm.org/D96455

310b3530

libunwind: Don't attempt to authenticate a null return address. · cddc53ef

Peter Collingbourne authored 4 years ago

Null return addresses can appear at the bottom of the stack (i.e. the
frame corresponding to the entry point). Authenticating these addresses
will set the error code in the address, which will lead to a segfault
in the sigreturn trampoline detection code. Fix this problem by not
authenticating null addresses.

Differential Revision: https://reviews.llvm.org/D96560

cddc53ef

Revert "[AArch64][GlobalISel] Fold constants into G_GLOBAL_VALUE" · 962b73dd

Jessica Paquette authored 4 years ago

This reverts commit 61b4702a.

We were seeing some test failures in SPECINT2006 due to this change. Reverting
to investigate.

962b73dd

[SystemZ][ZOS] Prefer -nostdlib++ as opposed to -nodefaultlibs when building c++ libraries · 5f9be2c3

Zbigniew Sarbinowski authored 4 years ago

Let's use -nostdlib++ rather than -nodefaultlibs when building libc++/libc++abi/libunwind libraries. The default is -nostdlib++ if supported by a build compiler like it is the case with clang, otherwise -nodefaultlibs is used as before.

This change is needed to avoid additional changes at the link step and not to increase the maintenance costs. If clang with -nodefaultlibs is used all the libraries which are removed but required would have to be manually added in. This set of libraries are unique and will send out.

The propose change will allow to make the link step simple for other platforms as well.

Reviewed By: #libc, #libc_abi, ldionne

Differential Revision: https://reviews.llvm.org/D95875

5f9be2c3

[OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). · 6c050052

Michael Kruse authored 4 years ago

The tile directive is in OpenMP's Technical Report 8 and foreseeably will be part of the upcoming OpenMP 5.1 standard.

This implementation is based on an AST transformation providing a de-sugared loop nest. This makes it simple to forward the de-sugared transformation to loop associated directives taking the tiled loops. In contrast to other loop associated directives, the OMPTileDirective does not use CapturedStmts. Letting loop associated directives consume loops from different capture context would be difficult.

A significant amount of code generation logic is taking place in the Sema class. Eventually, I would prefer if these would move into the CodeGen component such that we could make use of the OpenMPIRBuilder, together with flang. Only expressions converting between the language's iteration variable and the logical iteration space need to take place in the semantic analyzer: Getting the of iterations (e.g. the overload resolution of `std::distance`) and converting the logical iteration number to the iteration variable (e.g. overload resolution of `iteration + .omp.iv`). In clang, only CXXForRangeStmt is also represented by its de-sugared components. However, OpenMP loop are not defined as syntatic sugar. Starting with an AST-based approach allows us to gradually move generated AST statements into CodeGen, instead all at once.

I would also like to refactor `checkOpenMPLoop` into its functionalities in a follow-up. In this patch it is used twice. Once for checking proper nesting and emitting diagnostics, and additionally for deriving the logical iteration space per-loop (instead of for the loop nest).

Differential Revision: https://reviews.llvm.org/D76342

6c050052

[mlir] Simplify ModuleTranslation for LLVM IR · ce8f10d6

Alex Zinenko authored 4 years ago

A series of preceding patches changed the mechanism for translating MLIR to
LLVM IR to use dialect interface with delayed registration. It is no longer
necessary for specific dialects to derive from ModuleTranslation. Remove all
virtual methods from ModuleTranslation and factor out the entry point to be a
free function.

Also perform some cleanups in ModuleTranslation internals.

Depends On D96774

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D96775

ce8f10d6

[DAG] PromoteIntRes_ADDSUBSHLSAT - promote ISD::UADDSAT as clamped add · df45c181

Simon Pilgrim authored 4 years ago

Similar to D96622, we're better off just promoting uaddsat(x,y) -> umin(add(x,y),c) instead of trying to perform a shifted uaddsat.

I initially tried to just use shifted promotion in cases where we didn't have a legal/custom umin - but we don't appear to have any targets that have uaddsat but not umin, so imo we're better off always using the umin and avoid an untested shifted uaddsat code path.

Differential Revision: https://reviews.llvm.org/D96767

df45c181

[RISCV] Add support for fixed vector mask logic operations. · 07ca13fe
Craig Topper authored 4 years ago
```
Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D96741
```
07ca13fe

[SelectionDAG][AArch64] Restrict matchUnaryPredicate to only handle... · 064ada4e

Craig Topper authored 4 years ago

[SelectionDAG][AArch64] Restrict matchUnaryPredicate to only handle SPLAT_VECTOR for scalable vectors.

fde24661 added support for
scalable vectors to matchUnaryPredicate by handling SPLAT_VECTOR in
addition to BUILD_VECTOR. This was used to enabled UDIV/SDIV/UREM/SREM
by constant expansion in BuildUDIV/BuildSDIV in TargetLowering.cpp

The caller there expects to call getBuildVector from the match factors.
This leads to a crash right now if there is a SPLAT_VECTOR of
fixed vectors since the number of vectors won't match the number
of elements.

To fix this, this patch updates the callers to check the opcode
instead of whether the type is fixed or scalable. This assumes
that only 3 opcodes are handled by matchUnaryPredicate so
I've added an assertion to the final else to check that opcode.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D96174

064ada4e

[mlir] tighten LLVM dialect verifiers to generate valid LLVM IR · 2ab57c50

Alex Zinenko authored 4 years ago

Verification of the LLVM IR produced when translating various MLIR dialects was
only active when calling the translation programmatically. This has led to
several cases of invalid LLVM IR being generated that could not be caught with
textual mlir-translate tests. Add verifiers for these cases and fix the tests
in preparation for enforcing the validation of LLVM IR.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D96774

2ab57c50

[AArch64] Convert CMP/SELECT sign patterns to OR & ASR. · 211147c5

Florian Hahn authored 4 years ago

ICMP & SELECT patterns extracting the sign of a value can be simplified
to OR & ASR (see  https://alive2.llvm.org/ce/z/Xx4iZ0).

This does not save any instructions in IR, but it is profitable on
AArch64, because we need at least 2 extra instructions to materialize 1
and -1 for the SELECT.

The improvements result in ~5% speedups on loops of the form

    static int sign_of(int x) {
      if (x < 0) return -1;
      return 1;
    }

    void foo(const int *x, int *res, int cnt) {
      for (int i=0;i<cnt;i++)
        res[i] = sign_of(x[i]);
    }

Reviewed By: dmgreen

Differential Revision: https://reviews.llvm.org/D96596

211147c5

[libc][NFC] Make few maths functions buildable outside of LLVM libc build. · dba14814

Siva Chandra Reddy authored 4 years ago

Few math functions manipulate errno. They assumed that LLVM libc's errno
is available. However, that might not be the case when these functions
are used in a libc which does not use LLVM libc's errno. This change
switches such uses of LLVM libc's errno to the normal public errno macro.
This does not affect LLVM libc's build because the include order ensures
we get LLVM libc's errno. Also, the header check rule ensures we are only
including LLVM libc's errno.h.

dba14814

[analyzer] Fix a warning · 1a323c8a

Kazu Hirata authored 4 years ago

This patch fixes a warning from -Wcovered-switch-default.  The switch
statement in question handles all the enum values.

1a323c8a

[mlir] add verifiers for NVVM and ROCDL kernel attributes · 9cd47a26

Alex Zinenko authored 4 years ago

Make sure they can only be attached to LLVM functions as a result of converting
GPU functions to the LLVM Dialect.

9cd47a26

[coro async] Don't promote allocas to the frame or rewrite swifterror if... · 627cfd43

Arnold Schwaighofer authored 4 years ago

[coro async] Don't promote allocas to the frame or rewrite  swifterror if there are no suspend points

Also don't call function to update the call graph if there are no
clones. The function will fail.

rdar://74277860

Differential Revision: https://reviews.llvm.org/D96620

627cfd43

[flang][fir] Add fir-opt tool · 8260232c

clementval authored 4 years ago

This patch introduce the fir-opt tool. Similar to mlir-opt for FIR.
It will be used in following patches to test fir opt and round-trip.

Reviewed By: schweitz, mehdi_amini

Differential Revision: https://reviews.llvm.org/D96535

8260232c

[ARM] Use rGPR for writeback vldrs · 1e007cf4

David Green authored 4 years ago

From what I can tell, a writeback is unpredictable with LR for both
loads and stores. This changes the operand from a gprnopc to a rGPR in
both cases (which I believe is essentially a NFC due to the tied-def
already being a rGPR.)

Differential Revision: https://reviews.llvm.org/D96723

1e007cf4

AMDGPU: Remove kills following clusters of memory instruction · a7455d7b

Matt Arsenault authored 4 years ago

In a future commit, soft clauses will be hinted with kill instructions
rather than forced together with bundles. Look for kills that look
like this, and erase them. I'm not sure if the check for specific uses
is worthwhile, or if it would be better to just unconditionally erase
kills.

This reduces test churn in a future patch.

a7455d7b

[DAG] Fold shuffle(bop(shuffle(x,y),shuffle(z,w)),bop(shuffle(a,b),shuffle(c,d))) · 5dfba562

Simon Pilgrim authored 4 years ago

Fold shuffle(bop(shuffle(x,y),shuffle(z,w)),bop(shuffle(a,b),shuffle(c,d))) -> bop(shuffle(x,y),shuffle(z,w)),bop(shuffle(a,b),shuffle(c,d))

Attempt to fold from a shuffle of a pair of binops to a binop of shuffles, as long as one/both of the binop sources are also shuffles that can be merged with the outer shuffle. This should guarantee that we remove one binop without introducing any additional shuffles.

Technically there's potential for a merged shuffle's lowering to be poorer than the original shuffle, but it could also be better, and I'm not seeing any regressions as long as we keep the 'don't merge splats' rule already present in MergeInnerShuffle.

This expands and generalizes an existing X86 combine and attempts to merge either of each binop's sources (with an on-the-fly commutation of the shuffle mask) - we couldn't do that in the x86 version as it had to stay in a form that DAGCombine's MergeInnerShuffle would still recognise.

Differential Revision: https://reviews.llvm.org/D96345

5dfba562

AMDGPU: Fix debug info handling in post-RA bundler · c320e819

Matt Arsenault authored 4 years ago

This was allowing debug instructions to break the bundling, which
would change scheduling behavior. Bundle debug info / kills inside
the bundle. This seems to work OK, although the asm printer doesn't
understand these in a bundle. This implicitly expects the memory
legalizer to unbundle. It would probably be slightly nicer to move
these after.

Rewrite the loop to be clearer and make sure we don't end a bundle on
a meta instruction, only allow them in between other valid bundle
instructions.

c320e819

Reduce the number of attributes attached to each function · 3c8bf29f

serge-sans-paille authored 4 years ago

This takes advantage of the implicit default behavior to reduce the number of
attributes, which in turns reduces compilation time. I've observed -3% in
instruction count when compiling sqlite3 amalgamation with -O0

Differential Revision: https://reviews.llvm.org/D96400

3c8bf29f

[mlir][vector] Add missing support for contract of integer lowering. · 397336dc

Thomas Raoux authored 4 years ago

Some of the lowering of vector.contract didn't support integer case. Since
reduction of integer cannot accumulate we always break up the reduction op, it
should be merged by a separate canonicalization if possible.

Differential Revision: https://reviews.llvm.org/D96461

397336dc

[mlir] Add canonicalization for tensor_cast + tensor_to_memref · 807e5467

Thomas Raoux authored 4 years ago

This helps bufferization passes by removing tensor_cast operations.

Differential Revision: https://reviews.llvm.org/D96745

807e5467

[mlir][vector] Move splitting transfer ops into a separate entry point · cb1a4235

Lei Zhang authored 4 years ago

These patterns unrolls transfer read/write ops if the vector consumers/
producers are extract/insert slices op. Transfer ops can map to hardware
load/store functionalities, where the vector size matters for bandwidth
considerations. So these patterns should be collected separately, instead
of being generic canonicalization patterns.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D96782

cb1a4235

[libc++] Build thread_win32.cpp only if LIBCXX_HAS_PTHREAD_API is not set · 0f5020af

Colin Finck authored 4 years ago

This allows building libc++ against winpthreads from mingw-w64 to support
operating systems older than Windows 7. The remaining libc++ code already
supports `WIN32` with `LIBCXX_HAS_PTHREAD_API`.

Note that there is also the older "pthreads-win32". However, that support
library implements `pthread_t` as a struct, which violates the libc++
assumption that `pthread_t` is always a scalar and can be compared,
ordered, and set to zero.

Differential Revision: https://reviews.llvm.org/D96339

0f5020af

[mlir][vector] Add support for unrolling vector.fma · d8c7f442
Lei Zhang authored 4 years ago
```
Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D96781
```
d8c7f442

[lldb/test] Test lldb-server named pipe functionality on windows · 85f025e5

Pavel Labath authored 4 years ago

lldb-server can use a named pipe to communicate the port number it is
listening on. This windows bits of this are already implemented, but we
did not have a test for that, most likely because python does not have
native pipe functionality.

This patch implements the windows bits necessary to test this. I'm using
the ctypes package to call the native APIs directly to avoid a
dependency to non-standard python packages. This introduces some amount
of boilerplate, but our named pipe use case is fairly limited, so we
should not end up needing to wrap large chunks of windows APIs.

Surprisingly to changes to lldb-server were needed to make the test
pass.

Differential Revision: https://reviews.llvm.org/D96260

85f025e5

[clangd] Give modules access to filesystem, scheduler, and index. · b6e52d8f

Sam McCall authored 4 years ago

This finally makes it possible to implement useful modules.

Differential Revision: https://reviews.llvm.org/D96726

b6e52d8f

[gn build] Port 40cc63ea · b2db4934
LLVM GN Syncbot authored 4 years ago

b2db4934