Commits · cee313d288a4faf0355d76fb6e0e927e211d08a5 · Lorenzo Albano / LLVM bpEVL

Apr 17, 2019

Revert "Temporarily Revert "Add basic loop fusion pass."" · cee313d2
Eric Christopher authored Apr 17, 2019
```
The reversion apparently deleted the test/Transforms directory.

Will be re-reverting again.

llvm-svn: 358552
```
cee313d2

Temporarily Revert "Add basic loop fusion pass." · a8634351

Eric Christopher authored Apr 17, 2019

As it's causing some bot failures (and per request from kbarton).

This reverts commit r358543/ab70da07.

llvm-svn: 358546

a8634351

Add basic loop fusion pass. · ab70da07

Kit Barton authored Apr 17, 2019

This patch adds a basic loop fusion pass. It will fuse loops that conform to the
following 4 conditions:
  1. Adjacent (no code between them)
  2. Control flow equivalent (if one loop executes, the other loop executes)
  3. Identical bounds (both loops iterate the same number of iterations)
  4. No negative distance dependencies between the loop bodies.

The pass does not make any changes to the IR to create opportunities for fusion.
Instead, it checks if the necessary conditions are met and if so it fuses two
loops together.

The pass has not been added to the pass pipeline yet, and thus is not enabled by
default. It can be run stand alone using the -loop-fusion option.

Phabricator: https://reviews.llvm.org/D55851
llvm-svn: 358543

ab70da07

Apr 16, 2019

[EarlyCSE] detect equivalence of selects with inverse conditions and commuted operands (PR41101) · e08783e2

Sanjay Patel authored Apr 16, 2019

This is 1 of the problems discussed in the post-commit thread for:
rL355741 / http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190311/635516.html
and filed as:
https://bugs.llvm.org/show_bug.cgi?id=41101

Instcombine tries to canonicalize some of these cases (and there's room for improvement
there independently of this patch), but it can't always do that because of extra uses.
So we need to recognize these commuted operand patterns here in EarlyCSE. This is similar
to how we detect commuted compares and commuted min/max/abs.

Differential Revision: https://reviews.llvm.org/D60723

llvm-svn: 358523

e08783e2

[CVP] Simplify umulo and smulo that cannot overflow · 52b24ee9

Nikita Popov authored Apr 16, 2019

If a umul.with.overflow or smul.with.overflow operation cannot
overflow, simplify it to a simple mul nuw / mul nsw. After the
refactoring in D60668 this is just a matter of removing an
explicit check against multiplications.

Differential Revision: https://reviews.llvm.org/D60791

llvm-svn: 358521

52b24ee9

[SLP] Refactoring of the operand reordering code. · 82ffa88a

Simon Pilgrim authored Apr 16, 2019

This is a refactoring patch which should have all the functionality of the current code. Its goal is twofold:
i. Cleanup and simplify the reordering code, and
ii. Generalize reordering so that it will work for an arbitrary number of operands, not just 2.

This is the second patch in a series of patches that will enable operand reordering across chains of operations. An example of this was presented in EuroLLVM'18 https://www.youtube.com/watch?v=gIEn34LvyNo .

Committed on behalf of @vporpo (Vasileios Porpodas)

Differential Revision: https://reviews.llvm.org/D59973

llvm-svn: 358519

82ffa88a

[CVP] Add tests for non-overflowing mulo; NFC · 5a301779
Nikita Popov authored Apr 16, 2019
```
Should be simplified to simple mul.

llvm-svn: 358517
```
5a301779

[InstCombine] Prune fshl/fshr with masked operands · 5ecd6a48

Nikita Popov authored Apr 16, 2019

If a constant shift amount is used, then only some of the LHS/RHS
operand bits are demanded and we may be able to simplify based on
that. InstCombineSimplifyDemanded already had the necessary support
for that, we just weren't calling it with fshl/fshr as root.

In particular, this allows us to relax some masked funnel shifts
into simple shifts, as shown in the tests.

Patch by Shawn Landden.

Differential Revision: https://reviews.llvm.org/D60660

llvm-svn: 358515

5ecd6a48

[InstCombine] Add tests for fshl/fshr with masked operands; NFC · f700081a

Nikita Popov authored Apr 16, 2019

Baseline tests for D60660.

Patch by Shawn Landden.

Differential Revision: https://reviews.llvm.org/D60688

llvm-svn: 358514

f700081a

[Tests] Add branch_weights to latches so that test is not effected by future... · c44b68e2

Philip Reames authored Apr 16, 2019

[Tests] Add branch_weights to latches so that test is not effected by future profitability patch to LoopPredication

llvm-svn: 358506

c44b68e2

Re-commit r357452: SimplifyCFG SinkCommonCodeFromPredecessors: Also sink... · 21eb771d

Hans Wennborg authored Apr 16, 2019

Re-commit r357452: SimplifyCFG SinkCommonCodeFromPredecessors: Also sink function calls without used results (PR41259)

The original commit caused false positives from AddressSanitizer's
use-after-scope checks, which have now been fixed in r358478.

> The code was previously checking that candidates for sinking had exactly
> one use or were a store instruction (which can't have uses). This meant
> we could sink call instructions only if they had a use.
>
> That limitation seemed a bit arbitrary, so this patch changes it to
> "instruction has zero or one use" which seems more natural and removes
> the need to special-case stores.
>
> Differential revision: https://reviews.llvm.org/D59936

llvm-svn: 358483

21eb771d

[LSR] Rewrite misses some fixup locations if it splits critical edge · fda04268

Quentin Colombet authored Apr 15, 2019

If LSR split critical edge during rewriting phi operands and
phi node has other pending fixup operands, we need to
update those pending fixups. Otherwise formulae will not be
implemented completely and some instructions will not be eliminated.

llvm.org/PR41445

Differential Revision: https://reviews.llvm.org/D60645

Patch by: Denis Bakhvalov <denis.bakhvalov@intel.com>

llvm-svn: 358457

fda04268

Apr 15, 2019

[EarlyCSE] add more tests for double-negated select condition; NFC · 800a0c3e
Sanjay Patel authored Apr 15, 2019
```
llvm-svn: 358454
```
800a0c3e
[EarlyCSE] add test for select condition double-negation; NFC · 5ae05d81
Sanjay Patel authored Apr 15, 2019
```
llvm-svn: 358444
```
5ae05d81
[Tests] Add a few more tests for LoopPredication w/invariant loads · af808ee2
Philip Reames authored Apr 15, 2019
```
Making sure to cover an important legality cornercase.

llvm-svn: 358439
```
af808ee2

[DEBUGINFO] Prevent Instcombine from dropping debuginfo when removing zexts · 4fe42214

Wolfgang Pieb authored Apr 15, 2019

Zexts can be treated like no-op casts when it comes to assessing whether their
removal affects debug info.

Reviewer: aprantl

Differential Revision: https://reviews.llvm.org/D60641

llvm-svn: 358431

4fe42214

[PGO] Profile guided code size optimization. · 09e539fc

Hiroshi Yamauchi authored Apr 15, 2019

Summary:
Enable some of the existing size optimizations for cold code under PGO.

A ~5% code size saving in big internal app under PGO.

The way it gets BFI/PSI is discussed in the RFC thread

http://lists.llvm.org/pipermail/llvm-dev/2019-March/130894.html 

Note it doesn't currently touch loop passes.

Reviewers: davidxl, eraman

Reviewed By: eraman

Subscribers: mgorny, javed.absar, smeenai, mehdi_amini, eraman, zzheng, steven_wu, dexonsmith, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D59514

llvm-svn: 358422

09e539fc

[EarlyCSE] add tests for selects with commuted operands (PR41101); NFC · 0e0bb0e2
Sanjay Patel authored Apr 15, 2019
```
llvm-svn: 358420
```
0e0bb0e2

[LoopPred] Hoist and of predicated checks where legal · fbe64a2c

Philip Reames authored Apr 15, 2019

If we have multiple range checks which can be predicated, hoist the and of the results outside the loop.  This minorly cleans up the resulting IR, but the main motivation is as a building block for D60093.

llvm-svn: 358419

fbe64a2c

[EarlyCSE] regenerate test checks; NFC · c7143333
Sanjay Patel authored Apr 15, 2019
```
llvm-svn: 358407
```
c7143333

[InstCombine] canonicalize fdiv after fmul if reassociation is allowed · 5e13cd2e

Sanjay Patel authored Apr 15, 2019

  (X / Y) * Z --> (X * Z) / Y

This can allow other optimizations/reassociations as shown in the test diffs.

llvm-svn: 358404

5e13cd2e

[NewPM] Add Option handling for SimplifyCFG · f5432837

Serguei Katkov authored Apr 15, 2019

This patch enables passing options to SimplifyCFGPass via the passes pipeline.

Reviewers: chandlerc, fedor.sergeev, leonardchan, philip.pfaffe
Reviewed By: fedor.sergeev
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/D60675

llvm-svn: 358379

f5432837

Apr 14, 2019

[Tests] Add tests for D60659, and make adjustments to others to make diff clear · 0eeb2cd4

Philip Reames authored Apr 13, 2019

Three related changes:
1) auto-gen several test files
2) Add the new tests at the bottom of said files
3) Adjust a couple of other test files not to use stores to constants when trying to test constexpr address handling

llvm-svn: 358344

0eeb2cd4

Apr 13, 2019
- [CVP] Add tests for range of with.overflow result; NFC · 040871db
  Nikita Popov authored Apr 13, 2019
```
Test range of with.overflow result in the no-overflow branch.

llvm-svn: 358341
```
  040871db
- [CVP] Fix inverted predicates in test; NFC · 41e284b9
  Nikita Popov authored Apr 13, 2019
```
Checked the wrong direction in the umul tests... fix predicated to
line up with the test name.

llvm-svn: 358331
```
  41e284b9
- [CVP] Add tests for with.overflow used as condition; NFC · 25c1aa15
  Nikita Popov authored Apr 13, 2019
```
llvm-svn: 358330
```
  25c1aa15
- [InstCombine] Canonicalize (-X srem Y) to -(X srem Y). · 87dd0e06
  Chen Zheng authored Apr 13, 2019
```
Differential Revision: https://reviews.llvm.org/D60647

llvm-svn: 358328
```
  87dd0e06
- [InstCombine] [NFC] add testcases for canonicalizing (-X srem Y) to -(X srem Y). · fc59a032
  Chen Zheng authored Apr 13, 2019
```
llvm-svn: 358327
```
  fc59a032
Apr 12, 2019

[InstCombine] Fix a nasty miscompile introduced w/masked.gather demanded elts · b091cc08

Philip Reames authored Apr 12, 2019

This fixes a miscompile which was introduced in r356510 (https://reviews.llvm.org/D57372).

The problem is that the original patch removed pointer operands where the load results we're demanded, but without considering the legality of the load itself. If the masked.gather had active, but undemanded, lanes, then we could end up creating a load which loaded from an undef address. The result could be a segfault, or, in theory, an arbitrary read from a random memory location into an used register.

llvm-svn: 358299

b091cc08

[CVP] Set NSW/NUW flags when simplifying with.overflow · 00a0d5d1

Nikita Popov authored Apr 12, 2019

When CVP determines that a with.overflow intrinsic cannot overflow,
it currently inserts a simple add/sub. As we already determined that
there can be no overflow, we should add the appropriate NUW/NSW flag.

Differential Revision: https://reviews.llvm.org/D60585

llvm-svn: 358298

00a0d5d1

[Tests] Checkin a test demonstrating a miscompile so that patch which fixes it shows a clear diff · 7a60cd38
Philip Reames authored Apr 12, 2019
```
llvm-svn: 358296
```
7a60cd38

[DebugInfo] Fix pr41175 Dead Store Elimination missing debug loc · 32afe6a1

Jeremy Morse authored Apr 12, 2019

Bug: https://bugs.llvm.org/show_bug.cgi?id=41175

In the bug test case the DSE pass is shortening the range of memory that a
memset is working on. A getelementptr is generated so that the new
starting address can be passed to memset. This instruction was not given
a DebugLoc.

To fix the bug, copy the DebugLoc from the memset instruction.

Patch by Orlando Cazalet-Hyams!

Differential Revision: https://reviews.llvm.org/D60556

llvm-svn: 358270

32afe6a1

[ConstantFold] Don't evaluate FP or FP vector casts or truncations when simplifying icmp · d5c40424
Fangrui Song authored Apr 12, 2019
```
Fix PR41476

llvm-svn: 358262
```
d5c40424

Apr 11, 2019

[CVP] Generate full test checks for overflows.ll; NFC · 6ffa1511
Nikita Popov authored Apr 11, 2019
```
llvm-svn: 358229
```
6ffa1511

[PGO] Better handling of profile hash mismatch · 959ef168

Rong Xu authored Apr 11, 2019

We currently assume profile hash conflicts will be caught by an upfront
check and we assert for the cases that escape the check. The assumption
is not always true as there are chances of conflict. This patch prints
a warning and skips annotating the function for the escaped cases,.

Differential Revision: https://reviews.llvm.org/D60154

llvm-svn: 358225

959ef168

[ConstantFold] ExtractConstantBytes - handle shifts on large integer types · 8d083c5e

Simon Pilgrim authored Apr 11, 2019

Use APInt instead of getZExtValue from the ConstantInt until we can confirm that the shift amount is in range.

Reduced from OSS-Fuzz #14169 - https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=14169

llvm-svn: 358192

8d083c5e

Fix a hang when lowering __builtin_dynamic_object_size · cb5c7bd9

Erik Pilkington authored Apr 10, 2019

If the ObjectSizeOffsetEvaluator fails to fold the object size call, then it may
litter some unused instructions in the function. When done repeatably in
InstCombine, this results in an infinite loop. Fix this by tracking the set of
instructions that were inserted, then removing them on failure.

rdar://49172227

Differential revision: https://reviews.llvm.org/D60298

llvm-svn: 358146

cb5c7bd9

Apr 10, 2019

[InstCombine] Handle ssubo always overflow · 0a8228fd

Nikita Popov authored Apr 10, 2019

Following D60483 and D60497, this adds support for AlwaysOverflows
handling for ssubo. This is the last case we can handle right now.

Differential Revision: https://reviews.llvm.org/D60518

llvm-svn: 358100

0a8228fd

[InstCombine] ssubo X, C -> saddo X, -C · 7a543c37

Nikita Popov authored Apr 10, 2019

ssubo X, C is equivalent to saddo X, -C. Make the transformation in
InstCombine and allow the logic implemented for saddo to fold prior
usages of add nsw or sub nsw with constants.

Patch by Dan Robertson.

Differential Revision: https://reviews.llvm.org/D60061

llvm-svn: 358099

7a543c37

[InstCombine] Handle saddo always overflow · ef23e884

Nikita Popov authored Apr 10, 2019

Followup to D60483: Handle AlwaysOverflow conditions for saddo as
well.

Differential Revision: https://reviews.llvm.org/D60497

llvm-svn: 358095

ef23e884