Commits · 009e032634b3bd7fc32071ac2344b12142286477 · Lorenzo Albano / LLVM bpEVL

Nov 07, 2019

Temporarily Revert "[LV] Apply sink-after & interleave-groups as VPlan transformations (NFC)" · 009e0326
Eric Christopher authored Nov 06, 2019
```
as it's causing assert failures.

This reverts commit 100e797a.
```
009e0326
[OPENMP] [DOCS] fix section formatting issues [NFC] · 9f10cc2d
Kelvin Li authored Nov 06, 2019
```
Differential Revision: https://reviews.llvm.org/D69909
```
9f10cc2d

Keep import function list for inlinee profile update · ba1dfae0

Wenlei He authored Nov 01, 2019

Summary:
When adjusting function entry counts after inlining, Funciton::setEntryCount is called without providing an import function list. The side effect of that is the previously set import function list will be dropped. The import function list is used by ThinLTO to help import hot cross module callee for LTO inlining, so dropping that during ThinLTO pre-link may adversely affect LTO inlining. The fix is to keep the list while updating entry counts for inlining.

Reviewers: wmi, davidxl, tejohnson

Subscribers: mehdi_amini, hiraditya, dexonsmith, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69736

ba1dfae0

[AArch64][SVE] Add remaining patterns and intrinsics for add/sub/mad patterns · e55b536d

Danilo Carvalho Grael authored Nov 06, 2019

Add pattern matching and intrinsics for the following instructions:

predicated orr, eor, and, bic
predicated mul, smulh, umulh, sdiv, udiv, sdivr, udivr
predicated smax, umax, smin, umin, sabd, uabd
mad, msb, mla, mls

https://reviews.llvm.org/D69588

e55b536d

Revert "gn build: (manually) merge b5913e6d" · fe6fee94
Nico Weber authored Nov 06, 2019
```
This reverts commit c52efdc5,
because b5913e6d got reverted.
```
fe6fee94
Revert "Introduce llvm-install-name-tool" · 7d83c298
Alexander Shaposhnikov authored Nov 06, 2019
```
This reverts commit b5913e6d.
```
7d83c298
AMDGPU: Select global atomicrmw fadd · e16a7138
Matt Arsenault authored Aug 29, 2019
```
This only works if there is no use of the return value.
```
e16a7138

TableGen: Remove assert that pattern results match input number · 9f9f42db

Matt Arsenault authored Aug 29, 2019

AMDGPU has some atomic instructions that do not return the previous
result, and can only be selected if there are no uses. The source
pattern will only match if the use is empty, so it should be safe to
discard the result.

9f9f42db

Temporarily Revert: · e511c4b0

Eric Christopher authored Nov 06, 2019

 "[SLP] Generalization of stores vectorization."
 "[SLP] Fix -Wunused-variable. NFC"
 "[SLP] Vectorize jumbled stores."

As they're causing significant (10-30x) compile time regressions on
vectorizable code.

The primary cause of the compile-time regression is f228b537.

This reverts commits:

f228b537
5503455c
21d498c9

e511c4b0

[LLDB] Adding caching to libc++ std::function formatter for lookups that require scanning symbols · e18f4db2

shafik authored Nov 06, 2019

Performance issues lead to the libc++ std::function formatter to be disabled.
This change is the first of two changes that should address the performance issues and allow us to enable the formatter again.
In some cases we end up scanning the symbol table for the callable wrapped by std::function for those cases we will now cache the results and used the cache in subsequent look-ups. This still leaves a large cost for the initial lookup which will be addressed in the next change.

Differential Revision: https://reviews.llvm.org/D67111

e18f4db2

[AMDGPU] Add handling of 160 bit registers in analyzeResourceUsage · d17bcf2b
Stanislav Mekhanoshin authored Nov 06, 2019
```
This was omitted. Also SReg_96Reg missed IsSGPR assignment.

Differential Revision: https://reviews.llvm.org/D69919
```
d17bcf2b

unwind: restore the LINKER_LANGUAGE · e74e61ff

Saleem Abdulrasool authored Nov 06, 2019

Have CMake treat the unwind libraries as C libraries rather than C++.
There is no C++ runtime dependency at runtime.  This ensures that we do
not accidentally end up with a link against the C++ runtime.

We need to explicitly reset the implicitly linked libraries for C++ to
ensure that we do not have CMake force the link against the C++ runtime.
This adjustment should enable the NetBSD bots to be happy with this
change.

e74e61ff

unwind: reflow some of the build rules (NFC) · aa582e36
Saleem Abdulrasool authored Nov 06, 2019
```
Reflow the CMake properties to take less vertical space.  This just
makes it easier to read.  NFC.
```
aa582e36

[LoopPred] Enable new transformation by default · 8748be77

Philip Reames authored Nov 06, 2019

The basic idea of the transform is to convert variant loop exit conditions into invariant exit conditions by changing the iteration on which the exit is taken when we know that the trip count is unobservable.  See the original patch which introduced the code for a more complete explanation.

The individual parts of this have been reviewed, the result has been fuzzed, and then further analyzed by hand, but despite all of that, I will not be suprised to see breakage here.  If you see problems, please don't hesitate to revert - though please do provide a test case.  The most likely class of issues are latent SCEV bugs and without a reduced test case, I'll be essentially stuck on reducing them.

(Note: A bunch of tests were opted out of the new transform to preserve coverage.  That landed in a previous commit to simplify revert cycles if they turn out to be needed.)

8748be77

[LoopPred] Selectively disable to preserve test cases · 20cbb6cd

Philip Reames authored Nov 06, 2019

I'm about to enable the new loop predication transform by default.  It has the effect of completely destroying many read only loops - which happen to be a super common idiom in our test cases.  So as to preserve test coverage of other transforms, disable the new transform where it would cause sharp test coverage regressions.

(This is semantically part of the enabling commit.  It's committed separate to ease revert if the actual flag flip gets reverted.)

20cbb6cd

gn build: (manually) merge b5913e6d · c52efdc5
Nico Weber authored Nov 06, 2019

c52efdc5

When lowering calls and tail calls in AArch64, the register mask and · 8d694a45

Eric Christopher authored Nov 06, 2019

return value location depends on the calling convention of the callee.
`F.getCallingConv()`, however, is the caller CC. Correct it to the
callee CC from `CallLoweringInfo`.

Fixes PR43449

Patch by Shu-Chun Weng!

8d694a45

[lldb] Mark ASan & TSan as test dependencies · 703c97be

Jonas Devlieghere authored Nov 06, 2019

Without asan and tsan as test dependencies, you might end up with a
clang that points to sanitizer runtime library that hasn't been build
yet.

703c97be

[test] Fix apple_simulator_test decorator when simulators are unavailable · a6b5daa7
Alex Langford authored Nov 06, 2019
```
In the case where xcodebuild fails as you set up simulator tests, you
would fail because `feature` is never defined.
```
a6b5daa7
[lldb] Remove dead code from STLUtils.h · cfca0056
Jonas Devlieghere authored Nov 06, 2019

cfca0056

Nov 06, 2019

[docs] Fix references to a renamed flag. · baaa0973

Lang Hames authored Nov 06, 2019

The -use-mcjit option was replaced with -jit-kind=mcjit a while back. This patch
updates the docs to reflect that.

Patch by Yu Jian. Thanks Jian!

baaa0973

[ConstantRange] Add `subWithNoWrap()` method · 7fbe5d4b

Roman Lebedev authored Nov 07, 2019

Summary:
Much like D67339, adds ConstantRange handling for
when we know no-wrap behavior of the `sub`.

Unlike addWithNoWrap(), we only get lucky re returning empty set
for signed wrap. For unsigned, we must perform overflow check manually.

A patch that makes use of this in LVI (CVP) to be posted later.

Reviewers: nikic, shchenz, efriedma

Reviewed By: nikic

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69918

7fbe5d4b

[ConstantRange] Cleanup addWithNoWrap() by just piggybacking on sadd_sat()/uadd_sat() · 365d729e

Roman Lebedev authored Nov 07, 2019

As discussed in https://reviews.llvm.org/D69918
that happens to work as intended, and returns empty set if
there is always an overflow because we get lucky with intersection.
Since there's now an explicit test for that, let's prefer cleaner code.

365d729e

[ConstantRange] TestAddWithNo*WrapExhaustive: check that all overflow means empty set · b5ddcb9f

Roman Lebedev authored Nov 07, 2019

As disscussed in https://reviews.llvm.org/D69918 / https://reviews.llvm.org/D67339
that is an implied postcondition, but it's not really fully tested.

b5ddcb9f

[JITLink] Refactor EH-frame handling to support eh-frames with existing relocs. · 76aee8a3

Lang Hames authored Nov 04, 2019

Some targets (E.g. MachO/arm64) use relocations to fix some CFI record fields
in the eh-frame section. When relocations are used the initial (pre-relocation)
content of the eh-frame section can no longer be interpreted by following the
eh-frame specification. This causes errors in the existing eh-frame parser.

This patch moves eh-frame handling into two LinkGraph passes that are run after
relocations have been parsed (but before they are applied). The first] pass
breaks up blocks in the eh-frame section into per-CFI-record blocks, and the
second parses blocks of (potentially multiple) CFI records and adds the
appropriate edges to any CFI fields that do not have existing relocations.
These passes can be run independently of one another. By handling eh-frame
splitting/fixing with LinkGraph passes we can both re-use existing relocations
for CFI record fields and avoid applying eh-frame fixups before parsing the
section (which would complicate the linker and require extra temporary
allocations of working memory).

76aee8a3

Testuite: Support Asan test with remote testing · 8243918f

Fred Riss authored Nov 06, 2019

To do so, we need to register the sanitizer libraries with the target
so that they get uploaded before running. This patch adds a helper to
the test class to this effect.

8243918f

[LLDB] Fix handling for the clang name mangling extension for block invocations · 83393d27
shafik authored Nov 06, 2019
```
Add support for clangs  mangling extension for block invocations.

Differential Revision: https://reviews.llvm.org/D69738
```
83393d27
[Orc] Fix iterator usage after remove · 007d173e
Alexandre Ganea authored Nov 06, 2019
```
Differential Revision: https://reviews.llvm.org/D69805
```
007d173e

[JumpThreading] Factor out code to clone instructions (NFC) · f0f73ed8

Kazu Hirata authored Nov 06, 2019

Summary:
This patch factors out code to clone instructions -- partly for
readability and partly to facilitate an upcoming patch of my own.

Reviewers: wmi

Subscribers: hiraditya, jfb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69861

f0f73ed8

[WC] Fix a subtle bug in our definition of widenable branch · 686f449e

Philip Reames authored Nov 06, 2019

We had a subtle, but nasty bug in our definition of a widenable branch, and thus in the transforms which used that utility. Specifically, we returned true for any branch which included a widenable condition within it's condition, regardless of whether that widenable condition also had other uses.

The problem is that the result of the WC() call is defined to be one particular value. As such, all users must agree as to what that value is. If we widen a branch without also updating *all other users* of the WC in the same way, we have broken the required semantics.

Most of the textual diff is updating existing transforms not to leave dead uses hanging around. They're largely NFC as the dead instructions would be immediately deleted by other passes. The reason to make these changes is so that the transforms preserve the widenable branch form.

In practice, we don't get bitten by this only because it isn't profitable to CSE WC() calls and the lowering pass from guards uses distinct WC calls per branch.

Differential Revision: https://reviews.llvm.org/D69916

686f449e

[Analysis] Attribute deref/deref_or_null should not prevent tail call optimization · 62ad2128
Dávid Bolvanský authored Nov 06, 2019

62ad2128
[lldb] Record framework build path and use it everywhere · 77a60f0d
Haibo Huang authored Oct 30, 2019
```
This avoids config time dependencies on liblldb. And enables other refactoring.
```
77a60f0d

[LoopPred] Fix two subtle issues found by inspection · 9bfa5ab3

Philip Reames authored Nov 06, 2019

This patch fixes two issues noticed by inspection when going to enable the loop predication code in IndVarSimplify.

Issue 1 - Both the LoopPredication transform, and the already on by default optimizeLoopExits transform, modify the exit count of the exits they modify. (either to 0 or Infinity) Looking at the code more closely, this was not reflected into SCEV and we were instead running later transforms with incorrect SCEVs. Fixing this requires forgetting the loop, weakening a too strong assert, and updating SCEV to not pessimize results when a loop is provable untaken. I haven't been able to find a test case to demonstrate the miscompile.

Issue 2 - For modules without a data layout, we can end up with unsized pointer typed exit counts. Just bail out of this case.

I think these are the last two issues which need addressed before we enable this by default. The code has already survived a decent amount of fuzzing without revealing either of the above.

Differential Revision: https://reviews.llvm.org/D69695

9bfa5ab3

[lit] Protect full test suite from FILECHECK_OPTS · 6cecd3c3

Joel E. Denny authored Jul 25, 2019

lit's test suite calls lit multiple times for various sample test
suites.  `FILECHECK_OPTS` is safe for FileCheck calls in lit's test
suite.  It's not safe for FileCheck calls in the sample test suites,
whose output affects the results of lit's test suite.

Without this patch, only one such sample test suite is protected from
`FILECHECK_OPTS`, and currently `shtest-shell.py` breaks with
`FILECHECK_OPTS=-vv`.  Moreover, it's hard to predict the future,
especially false passes.  Thus, this patch protects all existing and
future sample test suites from `FILECHECK_OPTS` (and the deprecated
`FILECHECK_DUMP_INPUT_ON_FAILURE`).

Reviewed By: probinson

Differential Revision: https://reviews.llvm.org/D65156

6cecd3c3

[X86] Clamp large constant shift amounts for MMX shift intrinsics to 8-bits. · 641d2e52

Craig Topper authored Nov 06, 2019

The MMX intrinsics for shift by immediate take a 32-bit shift
amount but the hardware for shifting by immediate only encodes
8-bits. For the intrinsic we don't require the shift amount to
fit in 8-bits in the frontend because we don't check that its an
immediate in the frontend. If its is not an immediate we move it
to an MMX register and use the shift by register.

But if it is an immediate we'll use the shift by immediate
instruction. But we need to change the shift amount to 8-bits.
We were previously doing this accidentally by masking it in the
encoder. But this can make a large shift amount into a small
in bounds shift amount. Instead we should clamp larger shift
amounts to 255 so that the they don't become in bounds.

Fixes PR43922

641d2e52

[AArch64] Re-add patterns for (s/u)mull2. · 35cf9a1f

Eli Friedman authored Nov 04, 2019

These patterns were added in D46009, but removed in D54276 due to
missing test coverage.

Differential Revision: https://reviews.llvm.org/D69831

35cf9a1f

[clang-format] [NFC] update the documentation in Format.h to allow... · eadb65f2

paulhoad authored Nov 06, 2019

[clang-format] [NFC] update the documentation in Format.h to allow dump_format_style.py to get a little closer to being correct. (part 2)

Summary:
a change {D67541} cause LanguageStandard to now be subtly different from all other clang-format options, in that the Enum value (less the prefix) is not always allowed as valid as the configuration option.

This caused the ClangFormatStyleOptions.rst and the Format.h to diverge so that the ClangFormatStyleOptions.rst could no longer be generated from the Format.h using dump_format_stlye.py

This fix tried to remedy that:

1) by allowing an additional comment (in Format.h) after the enum to be used as the `in configuration ( XXXX )`  text, and changing the dump_format_style.py to support that.

This makes the following code:

```
enum {
...
LS_Cpp03, // c++03
LS_Cpp11, // c++11
...
};
```

would render as:

```* ``LS_Cpp03`` (in configuration: ``c++03``)
* ``LS_Cpp11`` (in configuration: ``c++11``)
```

And we also  move the deprecated alias into the text of the enum (otherwise it won't be added at the end as an option)

This patch includes a couple of other whitespace changes which help bring Format.h and ClangFormatStyleOptions.rst almost back into line and regeneratable...  (there is still one more)

Reviewers: klimek, mitchell-stellar, sammccall

Reviewed By: mitchell-stellar, sammccall

Subscribers: mrexodia, cfe-commits

Tags: #clang, #clang-format

Differential Revision: https://reviews.llvm.org/D69433

eadb65f2

Introduce llvm-install-name-tool · b5913e6d

Alexander Shaposhnikov authored Oct 17, 2019

This diff adds a new "driver" for llvm-objcopy
which is supposed to emulate the behavior of install-name-tool.

Differential revision: https://reviews.llvm.org/D69146

Test plan: make check-all

b5913e6d

Fix a typo in my previous commit · 2293b3f1
Steven Wu authored Nov 06, 2019

2293b3f1

[NFC] Add SUPPORT_PLUGINS to add_llvm_executable() · 6740a88d

David Tenty authored Nov 06, 2019

Summary:
this allows us to move logic about when it is appropriate set
LLVM_NO_DEAD_STRIP out of each tool and into add_llvm_executable,
which will enable future platform specific handling.

This is a follow on to the reverted D69356

Reviewers: hubert.reinterpretcast, beanz, lhames

Reviewed By: beanz

Subscribers: mgorny, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D69638

6740a88d