Commits · a257bde420ca96246863082b8e01f0f2b1141621 · Roger Ferrer / llvm-epi-0.8

Feb 14, 2020

AMDGPU/GlobalISel: Handle G_BSWAP · a257bde4
Matt Arsenault authored Feb 12, 2020

a257bde4

[libc++] Remove cycle between <type_traits> and <cstddef> · cccf1ef0

Eric Fiselier authored Feb 14, 2020

This was caused by byte depending on traits. This patch moves
the minimal amount of meta-programming into <cstddef> to break the cycle.

cccf1ef0

Fix compilation breakage introduced by 8404aeb5. · 0d2ba657
Alexandre Ganea authored Feb 14, 2020
```
Also fix BitVector unittest failure when DLLVM_ENABLE_ASSERTIONS are OFF, introduced by d110c3a9.
```
0d2ba657

[Driver] Rename AddGoldPlugin to addLTOOptions. NFC · 597dfb3b

Fangrui Song authored Feb 13, 2020

AddGoldPlugin does more than adding `-plugin path/to/LLVMgold.so`.
It works with lld and GNU ld, and adds other LTO options.
So AddGoldPlugin is no longer a suitable name.

Reviewed By: tejohnson

Differential Revision: https://reviews.llvm.org/D74591

597dfb3b

Reverting D73027 [DependenceAnalysis] Dependecies for loads marked with... · cae643d5

Evgeniy Brevnov authored Feb 14, 2020

Reverting D73027 [DependenceAnalysis] Dependecies for loads marked with "ivnariant.load" should not be shared with general accesses(PR42151).

cae643d5

add type_traits include as required for std::integral_constant · e337fb07
Eric Fiselier authored Feb 14, 2020

e337fb07

Revert "Reland D74436 "Change clang option -ffp-model=precise to select ffp-contract=on"" · 9122b92f

Melanie Blower authored Feb 14, 2020

This reverts commit 0a1123eb.
Want to revert this because it's causing trouble for PowerPC
I also fixed test fp-model.c which was looking for an incorrect error message

9122b92f

[Support] On Windows, ensure hardware_concurrency() extends to all CPU sockets and all NUMA groups · 8404aeb5

Alexandre Ganea authored Feb 13, 2020

The goal of this patch is to maximize CPU utilization on multi-socket or high core count systems, so that parallel computations such as LLD/ThinLTO can use all hardware threads in the system. Before this patch, on Windows, a maximum of 64 hardware threads could be used at most, in some cases dispatched only on one CPU socket.

== Background ==
Windows doesn't have a flat cpu_set_t like Linux. Instead, it projects hardware CPUs (or NUMA nodes) to applications through a concept of "processor groups". A "processor" is the smallest unit of execution on a CPU, that is, an hyper-thread if SMT is active; a core otherwise. There's a limit of 32-bit processors on older 32-bit versions of Windows, which later was raised to 64-processors with 64-bit versions of Windows. This limit comes from the affinity mask, which historically is represented by the sizeof(void*). Consequently, the concept of "processor groups" was introduced for dealing with systems with more than 64 hyper-threads.

By default, the Windows OS assigns only one "processor group" to each starting application, in a round-robin manner. If the application wants to use more processors, it needs to programmatically enable it, by assigning threads to other "processor groups". This also means that affinity cannot cross "processor group" boundaries; one can only specify a "preferred" group on start-up, but the application is free to allocate more groups if it wants to.

This creates a peculiar situation, where newer CPUs like the AMD EPYC 7702P (64-cores, 128-hyperthreads) are projected by the OS as two (2) "processor groups". This means that by default, an application can only use half of the cores. This situation could only get worse in the years to come, as dies with more cores will appear on the market.

== The problem ==
The heavyweight_hardware_concurrency() API was introduced so that only *one hardware thread per core* was used. Once that API returns, that original intention is lost, only the number of threads is retained. Consider a situation, on Windows, where the system has 2 CPU sockets, 18 cores each, each core having 2 hyper-threads, for a total of 72 hyper-threads. Both heavyweight_hardware_concurrency() and hardware_concurrency() currently return 36, because on Windows they are simply wrappers over std::thread::hardware_concurrency() -- which can only return processors from the current "processor group".

== The changes in this patch ==
To solve this situation, we capture (and retain) the initial intention until the point of usage, through a new ThreadPoolStrategy class. The number of threads to use is deferred as late as possible, until the moment where the std::threads are created (ThreadPool in the case of ThinLTO).

When using hardware_concurrency(), setting ThreadCount to 0 now means to use all the possible hardware CPU (SMT) threads. Providing a ThreadCount above to the maximum number of threads will have no effect, the maximum will be used instead.
The heavyweight_hardware_concurrency() is similar to hardware_concurrency(), except that only one thread per hardware *core* will be used.

When LLVM_ENABLE_THREADS is OFF, the threading APIs will always return 1, to ensure any caller loops will be exercised at least once.

Differential Revision: https://reviews.llvm.org/D71775

8404aeb5

[clang-scan-deps] Switch to using a ThreadPool · d9049e87

Alexandre Ganea authored Feb 13, 2020

Use a ThreadPool instead of plain std::threads in clang-scan-deps.
This is needed to further support https://reviews.llvm.org/D71775.

Differential Revision: https://reviews.llvm.org/D74569

d9049e87

[ADT] Support BitVector as a key in DenseSet/Map · d110c3a9

Alexandre Ganea authored Feb 13, 2020

This patch adds DenseMapInfo<> support for BitVector and SmallBitVector.

This is part of https://reviews.llvm.org/D71775, where a BitVector is used as a thread affinity mask.

d110c3a9

Fix line endings produced by update_cc_test_checks.py · c2931070

Alex Richardson authored Feb 14, 2020

Use the same appraoch as update_llc_test_checks.py to always write \n
line endings. This should fix the Windows buildbots.

c2931070

[libc++] Remove unnecessary typenames from std/numerics/c.math/abs.pass.cpp · f54e7b4e

Louis Dionne authored Feb 14, 2020

There are some unnecessary typenames in std/numerics/c.math/abs.pass.cpp;
e.g. they're not in a dependent context.

Patch by Bryce Adelstein Lelbach

Differential Revision: https://reviews.llvm.org/D72106

f54e7b4e

Revert "[clang-tools-extra] fix the check for if '-latomic' is necessary" · 13700c38
Luís Marques authored Feb 14, 2020
```
This reverts commit 1d40c415.
This seemed to have caused build failures on ARM/AArch64.
```
13700c38

[clangd] Update the CompletionItemKind. · 5dc2314d

Haojian Wu authored Feb 14, 2020

Summary: Fix some FIXMEs.

Reviewers: kadircet

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D74609

5dc2314d

Move update_cc_test_checks.py tests to clang · 61dd0603

Alex Richardson authored Feb 14, 2020

Having tests that depend on clang inside llvm/ are not a good idea since
it can break incremental `ninja check-llvm`.

Fixes https://llvm.org/PR44798

Reviewed By: lebedev.ri, MaskRay, rsmith
Differential Revision: https://reviews.llvm.org/D74051

61dd0603

[clangd] Add tracer to the rename workflow, NFC · 74c97ca1

Haojian Wu authored Feb 11, 2020

Reviewers: kbobyrev

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D74395

74c97ca1

Reenable "Always import constants" after compile time fixes · 2102ef8a

Teresa Johnson authored Feb 12, 2020

Summary:
Reenables importing of constants by default, which was disabled in
D73724 due to excessive thin link times. These inefficiencies were
fixed in D73851.

I re-measured thin link times for a number of binaries that had compile
time explosions with importing of constants previously and confirmed
they no longer have any notable increases with it enabled.

Reviewers: wmi, evgeny777

Subscribers: hiraditya, arphaman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D74512

2102ef8a

[AArch64] Add BIT/BIF support. · b6a9fe20

Pavel Iliin authored Feb 04, 2020

This patch added generation of SIMD bitwise insert BIT/BIF instructions.
In the absence of GCC-like functionality for optimal constraints satisfaction
during register allocation the bitwise insert and select patterns are matched
by pseudo bitwise select BSP instruction with not tied def.
It is expanded later after register allocation with def tied
to BSL/BIT/BIF depending on operands registers.
This allows to get rid of redundant moves.

Reviewers: t.p.northover, samparker, dmgreen

Reviewed By: dmgreen

Differential Revision: https://reviews.llvm.org/D74147

b6a9fe20

[clang-tools-extra] fix the check for if '-latomic' is necessary · 1d40c415

Gokturk Yuksek authored Feb 14, 2020

Summary:
The CheckAtomic module performs two tests to determine if passing
'-latomic' to the linker is required: one for 64-bit atomics, and
another for non-64-bit atomics. clangd only uses the result from
HAVE_CXX_ATOMICS64_WITHOUT_LIB. This is incomplete because there are
uses of non-64-bit atomics in the code, such as the ReplyOnce::Replied
of type std::atomic<bool> defined in clangd/ClangdLSPServer.cpp.

Fix by also checking for the result of HAVE_CXX_ATOMICS_WITHOUT_LIB.

See also: https://reviews.llvm.org/D68964

Reviewers: ilya-biryukov, nridge, kadircet, beanz, compnerd, luismarques
Reviewed By: luismarques
Tags: #clang
Differential Revision: https://reviews.llvm.org/D69869

1d40c415

[Driver][RISCV] Add RedHat Linux RISC-V triple · 9816e726

Luís Marques authored Feb 14, 2020

Summary: Adds the RedHat Linux triple to the list of 64-bit RISC-V triples.
Without this the gcc libraries wouldn't be found by clang on a redhat/fedora
system, as the search list included `/usr/lib/gcc/riscv64-redhat-linux-gnu`
but the correct path didn't include the `-gnu` suffix.

Reviewers: lenary, asb, dlj
Reviewed By: lenary
Tags: #clang
Differential Revision: https://reviews.llvm.org/D74399

9816e726

[test][DebugInfo] Fix signed/unsigned comparison problem in test · a55dec7d
James Henderson authored Feb 14, 2020
```
This caused build bot failures:
http://lab.llvm.org:8011/builders/ppc64le-lld-multistage-test/builds/8568/
```
a55dec7d

[libc++] span: Fix incorrect static asserts · 0a0e0afa

Louis Dionne authored Feb 14, 2020

The static asserts in span<T, N>::front() and span<T, N>::back() are
incorrect as they may be triggered from valid code due to evaluation
of a never taken branch:

    span<int, 0> foo;
    if (!foo.empty()) {
        auto x = foo.front();
    }

The problem is that the branch is always evaluated by the compiler,
creating invalid compile errors for span<T, 0>.

Thanks to Michael Schellenberger Costa for the patch.

Differential Revision: https://reviews.llvm.org/D71995

0a0e0afa

[clang][Index] Introduce a TemplateParm SymbolKind · 84240e0d

Kadir Cetinkaya authored Jan 30, 2020

Summary:
Currently template parameters has symbolkind `Unknown`. This patch
introduces a new kind `TemplateParm` for templatetemplate, templatetype and
nontypetemplate parameters.

Also adds tests in clangd hover feature.

Reviewers: sammccall

Subscribers: kristof.beyls, ilya-biryukov, jkorous, arphaman, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D73696

84240e0d

[clang][DeclPrinter] Implement visitors for {TemplateType,NonTypeTemplate}Parms · c45fb35b

Kadir Cetinkaya authored Jan 30, 2020

Reviewers: sammccall, hokein

Subscribers: kristof.beyls, jkorous, arphaman, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D73693

c45fb35b

[lldb] Print result when expect_expr unexpectedly succeeds · a57ad008
Raphael Isemann authored Feb 14, 2020

a57ad008

[X86][SSE] lowerShuffleAsBitRotate - lower to vXi8 shuffles to ROTL on pre-SSSE3 targets · 2492075a

Simon Pilgrim authored Feb 14, 2020

Without PSHUFB we are better using ROTL (expanding to OR(SHL,SRL)) than using the generic v16i8 shuffle lowering - but if we can widen to v8i16 or more then the existing shuffles are still the better option.

REAPPLIED: Original commit rG11c16e71598d was reverted at rGde1d90299b16 as it wasn't accounting for later lowering. This version emits ROTLI or the OR(VSHLI/VSRLI) directly to avoid the issue.

2492075a

llvm/cmake/config.guess: add support for riscv32 and riscv64 · de1c2877

Luís Marques authored Feb 14, 2020

Summary: LLVM configuration fails with 'unable to guess system type' on riscv64.
Add support for detecting riscv32 and riscv64 systems.

Patch by Gokturk Yuksek (gokturk)
Reviewers: erichkeane, rengolin, mgorny, aaron.ballman, beanz, luismarques
Reviewed By: luismarques
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D68899

de1c2877

[OpenMP] Lower taskyield using OpenMP IR Builder · 2bef1c0e

Roger Ferrer authored Dec 12, 2019

This is similar to D69828.

Special codegen for enclosing untied tasks is still done in clang.

Differential Revision: https://reviews.llvm.org/D70799

2bef1c0e

[llvm-ar] Simplify Windows comparePaths NFCI · 430fc538

Andrew Ng authored Feb 11, 2020

Replace use of widenPath in comparePaths with UTF8ToUTF16. widenPath
does a lot more than just conversion from UTF-8 to UTF-16. This is not
necessary for CompareStringOrdinal and could possibly even cause
problems.

Differential Revision: https://reviews.llvm.org/D74477

430fc538

[DebugInfo] Error if unsupported address size detected in line table · fe6983a7

James Henderson authored Feb 03, 2020

Prior to this patch, if a DW_LNE_set_address opcode was parsed with an
address size (i.e. with a length after the opcode) of anything other 1,
2, 4, or 8, an llvm_unreachable would be hit, as the data extractor does
not support other values. This patch introduces a new error check that
verifies the address size is one of the supported sizes, in common with
other places within the DWARF parsing.

This patch also fixes calculation of a generated line table's size in
unit tests. One of the tests in this patch highlighted a bug introduced
in 1271cde4, when non-byte operands were used as arguments for
extended or standard opcodes.

Reviewed by: dblaikie

Differential Revision: https://reviews.llvm.org/D73962

fe6983a7

[lldb/DWARF] Don't hold a unique SymbolFileDWARFDwo in a DWARFUnit · 9dc84e9b

Pavel Labath authored Feb 06, 2020

This is the second dwp preparatory patch. When a SymbolFileDWARFDwo will
hold more than one split unit, it will not be able to be uniquely owned
by a single DWARFUnit. I achieve this by changing the
unique_ptr<SymbolFileDWARFDwo> member of DWARFUnit to
shared_ptr<DWARFUnit>. The shared_ptr points to a DWARFUnit, but it is
in fact holding the entire SymbolFileDWARFDwo alive. This is the same
method used by llvm DWARFUnit (except that is uses the DWARFContext
class).

Differential Revision: https://reviews.llvm.org/D73782

9dc84e9b

[OpenMP] Lower taskwait using OpenMP IR Builder · a82f35e1

Roger Ferrer authored Dec 12, 2019

The code generation is exactly the same as it was.

But not that the special handling of untied tasks is still handled by
emitUntiedSwitch in clang.

Differential Revision: https://reviews.llvm.org/D69828

a82f35e1

[doc] Clarify responsibility for fixing experimental target problems · 4e1c49cf

James Henderson authored Feb 13, 2020

Experimental targets are meant to be maintained by the community behind
the target. They are not monitored by the primary build bots. This
change clarifies that it is this communities responsibility for things
like test fixes related to the target caused by changes unrelated to
that target.

See http://lists.llvm.org/pipermail/llvm-dev/2020-February/139115.html
for a full discussion.

Reviewed by: rupprecht, lattner, MaskRay

Differential Revision: https://reviews.llvm.org/D74538

4e1c49cf

Do not build the CUBIN conversion pass when NVPTX Backend isn't configured · 850cb135

Mehdi Amini authored Feb 14, 2020

This pass would currently build, but fail to run when this backend isn't
linked in. On the other hand, we'd like it to initialize only the NVPTX
backend, which isn't possible if we continue to build it without the
backend available. Instead of building a broken configuration, let's
skip building the pass entirely.

Differential Revision: https://reviews.llvm.org/D74592

850cb135

[lldb/dotest] Remove the "exclusive test subdir" concept · 07211d95

Pavel Labath authored Feb 13, 2020

Summary:
This was added in 2018 (r339929), when we were still using the
hand-rolled test runner.

It does not seem to be relevant anymore. In fact as far as I can tell,
it's a big no-op now as the exclusive_test_subdir variable is never set.

Reviewers: vsk, JDevlieghere

Subscribers: lldb-commits

Tags: #lldb

Differential Revision: https://reviews.llvm.org/D74551

07211d95

[mlir] Fix argument attribute attribute reassignment in ConvertStandardToLLVM · 39cb2a8f

Alex Zinenko authored Feb 14, 2020

The commit switching the calling convention for memrefs (5a177805)
inadvertently introduced a bug in the function argument attribute conversion:
due to incorrect indexing of function arguments it was not assigning the
attributes to the arguments beyond those generated from the first original
argument. This was not caught in the commit since the test suite does have a
test for converting multi-argument functions with argument attributes. Fix the
bug and add relevant tests.

39cb2a8f

[lldb] Remove accidentally checked-in debugging code · 189c7013
Pavel Labath authored Feb 14, 2020

189c7013

[VE] Support for PIC (global data and calls) · 60431bd7

Kazushi (Jam) Marukawa authored Feb 14, 2020

Summary: Support for PIC with tests for global variables and function calls.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D74536

60431bd7

[VecotrCombine] Fix unused variable for assertion disabled builds · 1674f772
Kadir Cetinkaya authored Feb 14, 2020

1674f772
[NFC][ARM] Convert some pointers to references. · fd01b2f4
Sam Parker authored Feb 14, 2020

fd01b2f4