- Aug 28, 2017
-
-
Craig Topper authored
[X86] Use getUnpackl helper to create an ISD::VECTOR_SHUFFLE instead of using X86ISD::UNPCKL in reduceVMULWidth. This runs fairly early, so we should use target-independent nodes if possible. llvm-svn: 311873
-
Craig Topper authored
Without this, the madd.ll and sad.ll test cases both trigger assertion failures if you run them with SSE2 disabled. llvm-svn: 311872
-
Lang Hames authored
handleExpected is similar to handleErrors, but takes an Expected<T> as its first input value and a fallback functor as its second, followed by an arbitrary list of error handlers (equivalent to the handler list of handleErrors). If the first input value is a success value then it is returned from handleExpected unmodified. Otherwise the contained error(s) are passed to handleErrors, along with the handlers. If handleErrors returns success (indicating that all errors have been handled) then handleExpected runs the fallback functor and returns its result. If handleErrors returns a failure value then the failure value is returned and the fallback functor is never run.

This simplifies the process of re-trying operations that return Expected values. Without this utility such retry logic is cumbersome, as the internal Error must be explicitly extracted from the Expected value, inspected to see if it's handleable and then consumed:

  enum FooStrategy { Aggressive, Conservative };
  Expected<Foo> tryFoo(FooStrategy S);

  Expected<Foo> Result;
  (void)!!Result; // "Check" Result so that it can be safely overwritten.
  if (auto ValOrErr = tryFoo(Aggressive))
    Result = std::move(ValOrErr);
  else {
    auto Err = ValOrErr.takeError();
    if (Err.isA<HandleableError>()) {
      consumeError(std::move(Err));
      Result = tryFoo(Conservative);
    } else
      return std::move(Err);
  }

With handleExpected, this can be re-written as:

  auto Result = handleExpected(
      tryFoo(Aggressive),
      []() { return tryFoo(Conservative); },
      [](HandleableError&) { /* discard to handle */ });

llvm-svn: 311870
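Editor's note: a minimal sketch of how such a helper could be composed from handleErrors, assuming only the Expected/Error primitives in llvm/Support/Error.h; this is illustrative and not necessarily the committed implementation:

  #include "llvm/Support/Error.h"
  #include <utility>

  // Sketch: pass success values through unchanged, otherwise let the handlers
  // try to consume the error; only if everything was handled run the fallback.
  template <typename T, typename RecoveryFtor, typename... HandlerTs>
  llvm::Expected<T> handleExpectedSketch(llvm::Expected<T> ValOrErr,
                                         RecoveryFtor &&RecoveryPath,
                                         HandlerTs &&... Handlers) {
    if (ValOrErr)
      return ValOrErr;                        // success: returned unmodified
    if (auto Err = llvm::handleErrors(ValOrErr.takeError(),
                                      std::forward<HandlerTs>(Handlers)...))
      return std::move(Err);                  // unhandled error: propagate it
    return RecoveryPath();                    // all errors handled: run fallback
  }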
-
Dehao Chen authored
struct string {
  ~string();
};
void f2();
void f1(int) { f2(); }
void run(int c) {
  string body;
  while (true) {
    if (c)
      f1(c);
    else
      f1(c);
  }
}

Will recommit once the issue is fixed. llvm-svn: 311864
-
- Aug 27, 2017
-
-
Petar Jovanovic authored
This patch enables generation of NMADD and NMSUB instructions when an fneg node is present. These instructions are currently only generated when an fsub node is present. Patch by Stanislav Ocovaj. Differential Revision: https://reviews.llvm.org/D34507 llvm-svn: 311862
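Editor's note: as a rough source-level illustration (an assumption-level sketch; whether the negated fused forms are actually selected depends on the target and on FP contraction/fast-math settings):

  // Illustrative only: expression shapes whose DAGs contain an fneg of a
  // multiply-add / multiply-sub, the pattern this change teaches the backend
  // to match as NMADD/NMSUB.
  double nmadd_shape(double a, double b, double c) { return -(a * b + c); }
  double nmsub_shape(double a, double b, double c) { return -(a * b - c); }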
-
Javed Absar authored
Move condition code support functions to Utils and remove code duplication. Reviewed by: @fhahn, @asb Differential Revision: https://reviews.llvm.org/D37179 llvm-svn: 311860
-
Craig Topper authored
[AVX512] Add more patterns for using masked moves for subvector extracts of the lowest subvector. This time with bitcasts between the vselect and the extract. llvm-svn: 311856
-
Sanjay Patel authored
As noted in the FIXME, this could be improved more, but this is the smallest fix that helps: https://bugs.llvm.org/show_bug.cgi?id=34111 llvm-svn: 311853
-
Sanjay Patel authored
llvm-svn: 311852
-
Javed Absar authored
Simplify getDRegFromQReg function Reviewed by: @fhahn, @asb Differential Revision: https://reviews.llvm.org/D37118 llvm-svn: 311850
-
Ayal Zaks authored
Original commit r311077 of D32871 was reverted in r311304 due to failures reported in PR34248. This recommit fixes PR34248 by restricting the packing of predicated scalars into vectors only when vectorizing, avoiding doing so when unrolling w/o vectorizing. Added a test derived from the reproducer of PR34248. llvm-svn: 311849
-
Jatin Bhateja authored
llvm-svn: 311847
-
Craig Topper authored
[X86] Add a target-specific DAG combine to combine extract_subvector from all zero/one build_vectors. llvm-svn: 311841
-
Craig Topper authored
llvm-svn: 311840
-
Davide Italiano authored
llvm-svn: 311838
-
Craig Topper authored
[AVX512] Add patterns to match masked extract_subvector with bitcasts between the vselect and the extract_subvector. Remove the late DAG combine. We used to do a late DAG combine to move the bitcasts out of the way, but I'm starting to think that it's better to canonicalize extract_subvector's type to match the type of its input. I've seen some cases where we've formed two different extract_subvector from the same node where one had a bitcast and the other didn't. Add some more test cases to ensure we've also got most of the zero masking covered too. llvm-svn: 311837
-
- Aug 26, 2017
-
-
Don Hinton authored
Summary: Remove redundant explicit template instantiation. This was reported by Andrew Kelley building release_50 with gcc7.2.0 on MacOS: duplicate symbol llvm::DominatorTreeBase. Reviewers: kuhar, andrewrk, davide, hans Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37185 llvm-svn: 311835
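Editor's note: a hedged sketch of how a redundant explicit instantiation produces this kind of duplicate-symbol link error (generic names, not the actual GenericDomTree code):

  // header (illustrative)
  template <typename NodeT> struct TreeBase { void recalculate() {} };
  extern template struct TreeBase<int>;  // declaration only: instantiated elsewhere

  // one .cpp file provides the single explicit instantiation definition:
  template struct TreeBase<int>;

  // a second .cpp file repeating "template struct TreeBase<int>;" emits another
  // out-of-line definition of the same symbols, which some toolchains (here,
  // gcc 7.2.0 on MacOS) reject as a duplicate symbol at link time.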
-
Jatin Bhateja authored
Differential Revision: https://reviews.llvm.org/D37183 llvm-svn: 311834
-
Jatin Bhateja authored
Summary: If all the operands of a BUILD_VECTOR extract elements from same vector then split the vector efficiently based on the maximum vector access index. This will also fix PR 33784 Reviewers: zvi, delena, RKSimon, thakis Reviewed By: RKSimon Subscribers: chandlerc, eladcohen, llvm-commits Differential Revision: https://reviews.llvm.org/D35788 llvm-svn: 311833
-
Jatin Bhateja authored
Summary: This reverts commit rL311247. Differential Revision: https://reviews.llvm.org/D36927 llvm-svn: 311832
-
Daniel Berlin authored
NewGVN: Fix PR33204 - We need to add memory users when we bypass memorydefs for loads, not just when we do it for stores. llvm-svn: 311829
-
Craig Topper authored
We were suppressing most uses of INC/DEC, but this one seems to have been missed. llvm-svn: 311828
-
Petr Hosek authored
This reverts commit r311826 because it's failing on llvm-i686-linux-RA. llvm-svn: 311827
-
Petr Hosek authored
This change adds support for SHT_SYMTAB sections. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D34167 llvm-svn: 311826
-
Petr Hosek authored
The current file layout algorithm in llvm-objcopy is simple but difficult to reason about. It also makes it very complicated to support nested segments and segments whose offsets fall before the end of the program headers. To support these cases and simplify one of the most critical parts of llvm-objcopy, I rewrote the layout algorithm. Laying out segments first solves most of the issues encountered by the previous algorithm. Patch by Jake Ehrlich Differential Revision: https://reviews.llvm.org/D36494 llvm-svn: 311825
-
Hiroshi Yamauchi authored
Summary: Add options -print-bfi/-print-bpi that dump block frequency and branch probability info like -view-block-freq-propagation-dags and -view-machine-block-freq-propagation-dags do but in text. This is useful when the graph is very large and complex (the dot command crashes, lines/edges too close to tell apart, hard to navigate without textual search) or simply when text is preferred. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37165 llvm-svn: 311822
-
Craig Topper authored
[AVX512] Add patterns to use masked moves to implement masked extract_subvector of the lowest subvector. This only supports 32 and 64 bit element sizes for now. But we could probably do 16 and 8-bit elements with BWI. llvm-svn: 311821
-
Craig Topper authored
This includes tests for extracting 128-bits from a 256-bit vector and zero masking. llvm-svn: 311820
-
Craig Topper authored
We can probably add patterns to fix some of them. But the ones that use 'and' as their root node emit an X86ISD::CMP node in front of the 'and', which is then pattern matched to a 'test' instruction. We can't use a tablegen pattern to fix that because we can't remap the cmp result to the flag output of a TBM instruction. llvm-svn: 311819
-
Chandler Carruth authored
to instructions. These can't be reasonably matched in tablegen due to the handling of flags, so we have to do this in C++ code. We only did it for `inc` and `dec` historically; this starts fleshing that out to more interesting instructions. Notably, this handles transferring operands to `add` and `sub`. Currently this forces them into a register. The next patch will add support for keeping immediate operands as immediates. Then I'll extend this beyond just `add` and `sub`. I'm not super thrilled by the repeated switches in the code but everything else I tried was really ugly or problematic. Many thanks to Craig Topper for the suggestions about where to even begin here and how to make this stuff work. Differential Revision: https://reviews.llvm.org/D37130 llvm-svn: 311806
-
Davide Italiano authored
Fixes PR34325. llvm-svn: 311805
-
Davide Italiano authored
Prior to this change (and after r311371), we computed it unconditionally, causing severe compile time regressions (in some cases, 5 to 10x). llvm-svn: 311804
-
Matt Morehouse authored
This reverts r311801 due to a bot failure. llvm-svn: 311803
-
- Aug 25, 2017
-
-
Matt Morehouse authored
Summary: - Don't sanitize __sancov_lowest_stack. - Don't instrument leaf functions. - Add CoverageStackDepth to Fuzzer and FuzzerNoLink. Reviewers: vitalybuka, kcc Reviewed By: kcc Subscribers: cfe-commits, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D37156 llvm-svn: 311801
-
Sanjay Patel authored
I don't know enough to add a custom scrubber for AArch64, so I just re-used ARM. llvm-svn: 311795
-
Kostya Serebryany authored
llvm-svn: 311794
-
Sanjay Patel authored
llvm-svn: 311793
-
Haicheng Wu authored
Change the early exit condition from Cost > Threshold to Cost >= Threshold because the inline condition is Cost < Threshold. Differential Revision: https://reviews.llvm.org/D37087 llvm-svn: 311791
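Editor's note: a minimal sketch of the off-by-one being fixed (variable names are illustrative, not the actual InlineCost code):

  // The inliner only accepts a call site when Cost < Threshold, so the
  // analysis may stop as soon as Cost >= Threshold; exiting only on
  // Cost > Threshold would keep analyzing a call site that can no longer
  // be inlined.
  bool shouldKeepAnalyzing(int Cost, int Threshold) {
    if (Cost >= Threshold)   // previously: Cost > Threshold
      return false;          // early exit: inlining is already ruled out
    return true;
  }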
-
Craig Topper authored
[InstCombine] Don't fall back to only calling computeKnownBits if the upper bit of Add/Sub is demanded. Just create an all 1s demanded mask and continue recursing like normal. The recursive calls should be able to handle an all 1s mask and do the right thing. The only time we should care about knowing whether the upper bit was demanded is when we need to know if we should clear the NSW/NUW flags. Now that we have a consistent path through the code for all cases, use KnownBits::computeForAddSub to compute the known bits at the end since we already have the LHS and RHS. My larger goal here is to move the code that turns add into xor if only 1 bit is demanded and no bits below it are non-zero from InstCombiner::OptAndOp to here. This will allow it to be more general instead of just looking for 'add' and 'and' with constant RHS. Differential Revision: https://reviews.llvm.org/D36486 llvm-svn: 311789
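Editor's note: the add-to-xor rewrite mentioned as the larger goal can be illustrated with a small self-contained check (a simplified model, not the InstCombine code itself):

  #include <cassert>
  #include <cstdint>

  // If only bit 3 is demanded and neither operand has set bits below bit 3,
  // no carry can reach the demanded bit, so add and xor agree on it.
  int main() {
    for (uint32_t X = 0; X < (1u << 16); ++X) {
      uint32_t A = X & ~7u;                        // bits 0..2 cleared
      assert(((A + 8u) & 8u) == ((A ^ 8u) & 8u));  // demanded mask = 0x8
    }
    return 0;
  }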
-
Craig Topper authored
[InstCombine] Add tests to show missed opportunities to combine bit tests hidden by a sign compare and a truncate. NFC llvm-svn: 311784
-