Commits · 197e47f5c8f3ac74df0460e56f91355e2d9006f4 · Lorenzo Albano / LLVM bpEVL

Feb 01, 2018

[LSR] Don't force bases of foldable formulae to the final type. · 6d06976e

Mikael Holmen authored Feb 01, 2018

Summary:
Before emitting code for scaled registers, we prevent
SCEVExpander from hoisting any scaled addressing mode
by emitting all the bases first. However, these bases
are being forced to the final type, resulting in some
odd code.

For example, if the type of the base is an integer and
the final type is a pointer, we will emit an inttoptr
for the base, a ptrtoint for the scale, and then a
'reverse' GEP where the GEP pointer is actually the base
integer and the index is the pointer. It's more intuitive
to use the pointer as a pointer and the integer as index.
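
As a rough illustration of why the two forms compute the same address, here is a minimal C++ sketch. The function names are hypothetical and this is not the SCEVExpander code; it assumes a flat address space where the pointer/integer round trip preserves the address (implementation-defined, but true on mainstream targets).

```cpp
#include <cstdint>

// Natural form: the pointer stays a pointer and the integer is the index,
// which corresponds to a GEP on the pointer with an integer index.
inline char *addressNatural(char *Ptr, std::uintptr_t Index) {
  return Ptr + Index;
}

// "Reverse" form, modelled with integer arithmetic: the pointer is converted
// to an integer (ptrtoint), added to the integer base, and the sum is
// converted back to a pointer (inttoptr).
inline char *addressReverse(std::uintptr_t IntBase, char *Ptr) {
  std::uintptr_t PtrAsInt = reinterpret_cast<std::uintptr_t>(Ptr);
  return reinterpret_cast<char *>(IntBase + PtrAsInt);
}
```

Both functions yield the same address for an in-bounds index; the natural form simply avoids the two conversions.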

Patch by: Bevin Hansson

Reviewers: atrick, qcolombet, sanjoy

Reviewed By: qcolombet

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D42103

llvm-svn: 323946

6d06976e

Jan 31, 2018

[AggressiveInstCombine] Fixed TruncCombine class to handle TruncInst leaf node correctly. · b86b771c

Amjad Aboud authored Jan 31, 2018

This covers the case where the TruncInst leaf node is a constant expression.
See PR36121 for more details.

Differential Revision: https://reviews.llvm.org/D42622

llvm-svn: 323926

b86b771c

Followup on Proposal to move MIR physical register namespace to '$' sigil. · 43e94b15

Puyan Lotfi authored Jan 31, 2018

Discussed here:

http://lists.llvm.org/pipermail/llvm-dev/2018-January/120320.html

In preparation for adding support for named vregs we are changing the sigil for
physical registers in MIR to '$' from '%'. This will prevent name clashes of
named physical registers with named vregs.
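
A minimal sketch of how a reader of MIR text could tell the two register kinds apart after this change. `RegKind` and `classifyRegister` are hypothetical names used only for illustration; they are not part of the MIR parser.

```cpp
#include <string>

enum class RegKind { Physical, Virtual, Unknown };

// Classify a register token purely by its leading sigil:
// '$' marks a physical register, '%' marks a virtual register.
inline RegKind classifyRegister(const std::string &Token) {
  if (Token.size() < 2)
    return RegKind::Unknown; // a sigil with no name, or an empty token
  if (Token[0] == '$')
    return RegKind::Physical;
  if (Token[0] == '%')
    return RegKind::Virtual;
  return RegKind::Unknown;
}
```

With distinct sigils, a named vreg such as `%rax` can no longer be confused with the physical register `$rax`.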

llvm-svn: 323922

43e94b15

[SeparateConstOffsetFromGEP] Fix up addrspace in the AMDGPU test · 8f2df9d2
Marek Olsak authored Jan 31, 2018
```
llvm-svn: 323913
```
8f2df9d2

AMDGPU: Add intrinsics llvm.amdgcn.cvt.{pknorm.i16, pknorm.u16, pk.i16, pk.u16} · 13e47412

Marek Olsak authored Jan 31, 2018

Reviewers: arsenm, nhaehnle

Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye

Differential Revision: https://reviews.llvm.org/D41663

llvm-svn: 323908

13e47412

[SeparateConstOffsetFromGEP] Preserve metadata when splitting GEPs · 8e7d149a

Marek Olsak authored Jan 31, 2018

Summary:
!amdgpu.uniform needs to be preserved for AMDGPU, otherwise bad things
happen.

Reviewers: arsenm, nhaehnle, jingyue, broune, majnemer, bjarke.roune, dblaikie

Subscribers: wdng, tpr, llvm-commits

Differential Revision: https://reviews.llvm.org/D42744

llvm-svn: 323907

8e7d149a

[CodeGenPrepare] Improve source and dest alignments of memory intrinsics independently · be58a220

Daniel Neilson authored Jan 31, 2018

Summary:
  This change is part of step five in the series of changes to remove alignment argument from
memcpy/memmove/memset in favour of alignment attributes. In particular, this changes the
CodeGenPrepare pass to be more aggressive in improving the source and destination alignments
of memcpy/memmove/memset by exploiting our new ability to record independent alignments
for each argument.

Steps:
Step 1) Remove alignment parameter and create alignment parameter attributes for
memcpy/memmove/memset. ( rL322965, rC322964, rL322963 )
Step 2) Expand the IRBuilder API to allow creation of memcpy/memmove with differing
source and dest alignments. ( rL323597 )
Step 3) Update Clang to use the new IRBuilder API. ( rC323617 )
Step 4) Update Polly to use the new IRBuilder API. ( rL323618 )
Step 5) Update LLVM passes that create memcpy/memmove calls to use the new IRBuilder API,
and those that use MemIntrinsicInst::[get|set]Alignment() to use [get|set]DestAlignment()
and [get|set]SourceAlignment() instead. ( rL323886 )
Step 6) Remove the single-alignment IRBuilder API for memcpy/memmove, and the
MemIntrinsicInst::[get|set]Alignment() methods.
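
To illustrate why independent alignments matter, here is a small sketch. `MemTransferAlign`, `commonAlignment` and `improve` are hypothetical names, not LLVM APIs; the sketch only models the bookkeeping.

```cpp
#include <algorithm>

// Hypothetical record of the alignments known for one memcpy/memmove.
struct MemTransferAlign {
  unsigned Dest;
  unsigned Source;
};

// With a single alignment value, that value has to hold for both pointers,
// so it can be no larger than the smaller of the two.
inline unsigned commonAlignment(MemTransferAlign A) {
  return std::min(A.Dest, A.Source);
}

// With independent alignments, each side can be raised on its own when a
// better alignment is proven; an existing alignment is never lowered.
inline MemTransferAlign improve(MemTransferAlign Cur, unsigned KnownDest,
                                unsigned KnownSource) {
  return {std::max(Cur.Dest, KnownDest), std::max(Cur.Source, KnownSource)};
}
```

For a 16-byte-aligned destination and a 4-byte-aligned source, the single-alignment form can only record 4, while the independent form keeps 16 for the destination.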

Reference
   http://lists.llvm.org/pipermail/llvm-dev/2015-August/089384.html
   http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html

llvm-svn: 323891

be58a220

[InstCombine] move related tests into the same file; NFC · fd58ade8
Sanjay Patel authored Jan 31, 2018
```
llvm-svn: 323882
```
fd58ade8
[InstCombine] add tests to show limit of canEvaluate* ; NFC · 8c74a9a1
Sanjay Patel authored Jan 31, 2018
```
llvm-svn: 323881
```
8c74a9a1

[AggressiveInstCombine] Make TruncCombine class ignore unreachable basic blocks. · d895bff5

Amjad Aboud authored Jan 31, 2018

Dead code may contain non-standard IR that causes infinite looping or crashes in the underlying analysis.
See PR36134 for more details.
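
A minimal sketch of the general idea of skipping unreachable blocks, assuming a CFG given as successor lists with block 0 as the entry. This is a hypothetical model, not the TruncCombine code, and the commit message does not say how the pass detects unreachable blocks.

```cpp
#include <vector>

// Mark every block reachable from the entry block (index 0) using a
// worklist. Blocks left unmarked are dead and can be skipped by a transform.
inline std::vector<bool>
reachableFromEntry(const std::vector<std::vector<int>> &Succs) {
  std::vector<bool> Reachable(Succs.size(), false);
  if (Succs.empty())
    return Reachable;
  std::vector<int> Worklist{0};
  Reachable[0] = true;
  while (!Worklist.empty()) {
    int BB = Worklist.back();
    Worklist.pop_back();
    for (int S : Succs[BB]) {
      if (!Reachable[S]) {
        Reachable[S] = true;
        Worklist.push_back(S);
      }
    }
  }
  return Reachable;
}
```

A block that only branches to itself and is never targeted from the entry stays unmarked, so a pass consulting this result would not look inside it.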

Differential Revision: https://reviews.llvm.org/D42683

llvm-svn: 323862

d895bff5

Jan 30, 2018

[SLP] Add extra test for extractelement shuffle, NFC. · 1c8f53f4
Alexey Bataev authored Jan 30, 2018
```
llvm-svn: 323815
```
1c8f53f4
[LoopStrengthReduce] add test to show potential macro-fusion-based diff (PR35681); NFC · ffb37a29
Sanjay Patel authored Jan 30, 2018
```
This is the baseline output for the test proposed with D42607.

llvm-svn: 323806
```
ffb37a29

[X86][XOP] Update isVectorShiftByScalarCheap with cases covered by XOP · 073f089c

Simon Pilgrim authored Jan 30, 2018

Similar to D42437, XOP supports variable shift for v16i8/v8i16/v4i32/v2i64 types.

Differential Revision: https://reviews.llvm.org/D42526

llvm-svn: 323797

073f089c

[DeadArgumentElimination] Preserve llvm.dbg.values's first argument · 9208e8fb

Petar Jovanovic authored Jan 30, 2018

When removing a return value, the Dead Argument Elimination pass clobbers the
first argument of llvm.dbg.value for live arguments of that function by
replacing it with nullptr. It is then deleted in the next pass, so the debug
location information for those arguments is lost. This change fixes it.

Patch by Djordje Todorovic.

Differential Revision: https://reviews.llvm.org/D42541

llvm-svn: 323784

9208e8fb

Re-commit : [PowerPC] Add handling for ColdCC calling convention and a pass to mark · 1f59ae31

Zaara Syeda authored Jan 30, 2018

candidates with coldcc attribute.

This recommits r322721 reverted due to sanitizer memory leak build bot failures.

Original commit message:
This patch adds support for the coldcc calling convention for Power.
This changes the set of non-volatile registers. It includes a pass to stress
test the implementation by marking all static directly called functions with
the coldcc attribute through the option -enable-coldcc-stress-test. It also
includes an option, -ppc-enable-coldcc, to add the coldcc attribute to
functions which are cold at all call sites based on BlockFrequencyInfo when
the containing function does not call any non cold functions.

Differential Revision: https://reviews.llvm.org/D38413

llvm-svn: 323778

1f59ae31

[RS4GC] Handle call/invoke instructions as base defining values of vectors · 594f443b

Daniel Neilson authored Jan 30, 2018

Summary:
There's an asymmetry in the definitions of findBaseDefiningValueOfVector() and
findBaseDefiningValue() of RS4GC. The latter handles call and invoke instructions,
and the former does not. This appears to be a simple oversight. This patch remedies
the oversight by adding the call and invoke cases to findBaseDefiningValueOfVector().

Reviewers: DaniilSuchkov, anna

Reviewed By: anna

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D42653

llvm-svn: 323764

594f443b

[DSE] make sure memory is not modified before partial store merging (PR36129) · 1aef27f5

Sanjay Patel authored Jan 30, 2018

We missed a critical check in D30703. We must make sure that no intermediate 
store is sitting between the stores that we want to merge.

This should fix:
https://bugs.llvm.org/show_bug.cgi?id=36129

Differential Revision: https://reviews.llvm.org/D42663

llvm-svn: 323759

1aef27f5

[InstSimplify] (X * Y) / Y --> X for relaxed floating-point ops · 83f05660

Sanjay Patel authored Jan 30, 2018

This is the FP counterpart that was mentioned in PR35709:
https://bugs.llvm.org/show_bug.cgi?id=35709
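
The following sketch shows why the fold needs relaxed semantics: under strict IEEE-754 evaluation, `(X * Y) / Y` does not always give back `X`. The helper names are hypothetical, and the sketch assumes `double` arithmetic is evaluated in binary64 without fast-math options.

```cpp
// Evaluate the unfolded expression exactly as written.
inline double mulThenDiv(double X, double Y) { return (X * Y) / Y; }

// True when the strict result equals X, i.e. when folding to X would not
// change the observable value for these inputs.
inline bool foldIsExact(double X, double Y) { return mulThenDiv(X, Y) == X; }
```

For example, `foldIsExact(3.0, 2.0)` holds, but `X = 1e308, Y = 10.0` overflows to infinity in the multiply, and `Y = 0.0` produces NaN, so neither gives back `X` under strict semantics.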

Differential Revision: https://reviews.llvm.org/D42385

llvm-svn: 323716

83f05660

Jan 29, 2018

[DSE] add test for PR36129; NFC · d023a9b7

Sanjay Patel authored Jan 29, 2018

We can miscompile because we're not checking if the memory might
be modified between the seemingly redundant store ops.

llvm-svn: 323704

d023a9b7

[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. · 9c5c1032

Alexey Bataev authored Jan 29, 2018

Summary:
If the same value is going to be vectorized several times in the same
tree entry, this entry is considered to be a gather entry and the cost of
this gather is counted as the cost of InsertElementInstrs for each gathered
value. But we can consider these elements as ShuffleInstr with
SK_PermuteSingle shuffle kind.

Reviewers: spatel, RKSimon, mkuper, hfinkel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38697

llvm-svn: 323662

9c5c1032

[SLP] Add a test with extract for PR32086, NFC. · 10f5c9e7
Alexey Bataev authored Jan 29, 2018
```
llvm-svn: 323661
```
10f5c9e7

[CVP] Don't Replace incoming values from unreachable blocks with undef. · 8b797a0f

Davide Italiano authored Jan 29, 2018

This pretty much reverts r322006, except that we keep the test,
because we work around the issue exposed in a different way (a
recursion limit in value tracking). There's still probably some
sequence that exposes this problem, and the proper way to fix that
for somebody who has time is outlined in the code review.

llvm-svn: 323630

8b797a0f

[NFC] fix trivial typos in comments and documents · c8e92458
Hiroshi Inoue authored Jan 29, 2018
```
"to to" -> "to"

llvm-svn: 323628
```
c8e92458

Jan 28, 2018

[InlineCost] Mark functions accessing varargs as not viable. · 1636651e

Florian Hahn authored Jan 28, 2018

This prevents functions accessing varargs from being inlined if they
have the alwaysinline attribute.

Reviewers: efriedma, rnk, davide

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D42556

llvm-svn: 323619

1636651e

Jan 27, 2018

Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle." · f86be121
Alexey Bataev authored Jan 27, 2018
```
This reverts commit r323530 to fix possible problems in users code.

llvm-svn: 323581
```
f86be121

Jan 26, 2018

[x86] auto-generate complete checks; NFC · 5bce08dd
Sanjay Patel authored Jan 26, 2018
```
llvm-svn: 323571
```
5bce08dd

[InstCombine] Preserve debug values for eliminable casts · e48597a5

Vedant Kumar authored Jan 26, 2018

A cast from A to B is eliminable if its result is cast to C, and if
the pair of casts could just be expressed as a single cast. E.g., here,
%c1 is eliminable:

  %c1 = zext i16 %A to i32
  %c2 = sext i32 %c1 to i64

InstCombine optimizes away eliminable casts. This patch teaches it to
insert a dbg.value intrinsic pointing to the final result, so that local
variables pointing to the eliminable result are preserved.
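
The cast pair in the example can be modelled in C++ to check that it collapses to one cast. This is only a sketch with hypothetical function names: because the zero-extended 32-bit value always has a clear sign bit, the following sign extension behaves like a zero extension, so the pair equals a single zext from i16 to i64.

```cpp
#include <cstdint>

// Models: %c1 = zext i16 %A to i32 ; %c2 = sext i32 %c1 to i64
inline std::int64_t twoCasts(std::uint16_t A) {
  std::uint32_t C1 = A;                             // zext i16 -> i32
  std::int64_t C2 = static_cast<std::int32_t>(C1);  // sext i32 -> i64
  return C2;
}

// Models the single replacement cast: zext i16 %A to i64
inline std::int64_t oneCast(std::uint16_t A) {
  return static_cast<std::int64_t>(A);
}
```

The two functions agree for every 16-bit input, which is what makes `%c1` removable.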

Differential Revision: https://reviews.llvm.org/D42566

llvm-svn: 323570

e48597a5

[SLP] Test for trunc vectorization, NFC. · 7ad4e31c
Alexey Bataev authored Jan 26, 2018
```
llvm-svn: 323556
```
7ad4e31c

[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. · 167003df

Alexey Bataev authored Jan 26, 2018

Summary:
If the same value is going to be vectorized several times in the same
tree entry, this entry is considered to be a gather entry and the cost of
this gather is counted as the cost of InsertElementInstrs for each gathered
value. But we can consider these elements as ShuffleInstr with
SK_PermuteSingle shuffle kind.

Reviewers: spatel, RKSimon, mkuper, hfinkel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38697

llvm-svn: 323530

167003df

[CallSiteSplitting] Fix infinite loop when recording conditions. · 212afb9f

Florian Hahn authored Jan 26, 2018

Fix infinite loop when recording conditions by correctly marking basic
blocks as visited.
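
A minimal sketch of the general technique, assuming a simplified model in which each block has at most one predecessor (stored as an index, or -1 for none). `walkSinglePredecessors` is a hypothetical name and this is not the CallSiteSplitting code.

```cpp
#include <unordered_set>
#include <vector>

// Walk up the chain of single predecessors starting at Start and return the
// blocks visited, in order. Every block is marked as visited, so a cycle in
// the chain ends the walk instead of looping forever.
inline std::vector<int> walkSinglePredecessors(const std::vector<int> &Pred,
                                               int Start) {
  std::vector<int> Order;
  std::unordered_set<int> Visited;
  Visited.insert(Start);
  int Cur = Start;
  while (Pred[Cur] != -1 && !Visited.count(Pred[Cur])) {
    Cur = Pred[Cur];
    Visited.insert(Cur);
    Order.push_back(Cur);
  }
  return Order;
}
```

On a cyclic chain such as 0 <- 2 <- 1 <- 0, the walk from block 0 visits blocks 2 and 1 and then stops when it would return to block 0.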

Fixes https://bugs.llvm.org/show_bug.cgi?id=36105

llvm-svn: 323515

212afb9f

[Debug] LCSSA: Insert dbg.value at the first available insertion point · 6394df9f

Vedant Kumar authored Jan 25, 2018

Inserting a dbg.value instruction at the start of a basic block with a
landingpad instruction triggers a verifier failure. We should be OK if
we insert the instruction a bit later.

Speculative fix for the bot failure described here:
https://reviews.llvm.org/D42551

llvm-svn: 323482

6394df9f

Jan 25, 2018

[Debug] Add dbg.value intrinsics for PHIs created during LCSSA. · 60f54084

Vedant Kumar authored Jan 25, 2018

This patch is an enhancement to propagate dbg.value information when
Phis are created on behalf of LCSSA.  I noticed a case where a value
carried across a loop was reported as <optimized out>.

Specifically this case:

  int bar(int x, int y) {
    return x + y;
  }

  int foo(int size) {
    int val = 0;
    for (int i = 0; i < size; ++i) {
      val = bar(val, i);  // Both val and i are correct
    }
    return val; // <optimized out>
  }

In the above case, after all of the interesting computation completes
our value is reported as "optimized out." This change will add a
dbg.value to correct this.

This patch also moves the dbg.value insertion routine from
LoopRotation.cpp into Local.cpp, so that we can share it in both places
(LoopRotation and LCSSA).

Patch by Matt Davis!

Differential Revision: https://reviews.llvm.org/D42551

llvm-svn: 323472

60f54084

Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle." · 102d4b59
Alexey Bataev authored Jan 25, 2018
```
This reverts commit r323441 to fix buildbots.

llvm-svn: 323447
```
102d4b59

[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. · c8cfa14b

Alexey Bataev authored Jan 25, 2018

Summary:
If the same value is going to be vectorized several times in the same
tree entry, this entry is considered to be a gather entry and the cost of
this gather is counted as the cost of InsertElementInstrs for each gathered
value. But we can consider these elements as ShuffleInstr with
SK_PermuteSingle shuffle kind.

Reviewers: spatel, RKSimon, mkuper, hfinkel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38697

llvm-svn: 323441

c8cfa14b

[InstCombine] narrow masked zexted binops (PR35792) · 1d68112c

Sanjay Patel authored Jan 25, 2018

This is guarded by shouldChangeType(), so the tests show that
we don't do the fold if the narrower type is not legal. Note
that there is a proposal (D42424) that would change the results
for the specific cases shown in these tests. That difference is
also discussed in PR35792:
https://bugs.llvm.org/show_bug.cgi?id=35792

Alive proofs for the cases handled here as well as the bitwise 
logic binops that we should already do better on:
https://rise4fun.com/Alive/c97
https://rise4fun.com/Alive/Lc5E
https://rise4fun.com/Alive/kdf
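
As an illustration of the kind of identity involved (the exact patterns are in the patch and the Alive links, not in this message), the sketch below checks one instance: an add of a zero-extended 8-bit value, masked to bits that fit in 8 bits, can be computed in the narrow type and extended afterwards. The constants 44 and 0xF and the function names are assumptions chosen for the example.

```cpp
#include <cstdint>

// Wide form: zext to 32 bits, add, then mask.
inline std::uint32_t maskedWide(std::uint8_t X) {
  return (static_cast<std::uint32_t>(X) + 44u) & 0xFu;
}

// Narrow form: add and mask in 8 bits, then zext the result.
inline std::uint32_t maskedNarrow(std::uint8_t X) {
  std::uint8_t Sum = static_cast<std::uint8_t>(X + 44); // wraps modulo 256
  std::uint8_t Masked = static_cast<std::uint8_t>(Sum & 0xF);
  return Masked;                                         // zext i8 -> i32
}
```

The low bits of a sum depend only on the low bits of its operands, so both forms agree for every 8-bit input.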

llvm-svn: 323437

1d68112c

[InstCombine] add tests for PR35792; NFC · 0f95dd23
Sanjay Patel authored Jan 25, 2018
```
llvm-svn: 323436
```
0f95dd23
Revert "[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle." · a0b2c78e
Alexey Bataev authored Jan 25, 2018
```
This reverts commit r323430 to fix buildbots.

llvm-svn: 323432
```
a0b2c78e

[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. · ad51fe36

Alexey Bataev authored Jan 25, 2018

Summary:
If the same value is going to be vectorized several times in the same
tree entry, this entry is considered to be a gather entry and the cost of
this gather is counted as the cost of InsertElementInstrs for each gathered
value. But we can consider these elements as ShuffleInstr with
SK_PermuteSingle shuffle kind.

Reviewers: spatel, RKSimon, mkuper, hfinkel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D38697

llvm-svn: 323430

ad51fe36

Another try to commit 323321 (aggressive instruction combine). · f1f57a31
Amjad Aboud authored Jan 25, 2018
```
llvm-svn: 323416
```
f1f57a31

Jan 24, 2018

[InstCombine] fix datalayout in test file · 60c13c77

Sanjay Patel authored Jan 24, 2018

The only part of the datalayout that should matter for these tests
is the part that specifies the legal int widths ('n*'). But there
was a bug - that part of the string was not correctly separated with
the expected '-' character, so we were testing as if there were no
legal int widths at all. Removed the leading cruft so we have some 
legal ints to test with.
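
To show how a missing '-' can hide the 'n' specification, here is a toy parser. It is a sketch only: `nativeIntWidths` is a hypothetical helper, not LLVM's DataLayout parser, and the strings in the usage note are made up, since the commit message does not quote the broken one.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Split the layout string on '-' and collect the widths listed in any
// specification that starts with 'n' (for example "n8:16:32:64").
inline std::vector<unsigned> nativeIntWidths(const std::string &Layout) {
  std::vector<unsigned> Widths;
  std::size_t Pos = 0;
  while (Pos <= Layout.size()) {
    std::size_t End = Layout.find('-', Pos);
    if (End == std::string::npos)
      End = Layout.size();
    std::string Spec = Layout.substr(Pos, End - Pos);
    if (!Spec.empty() && Spec[0] == 'n') {
      unsigned Value = 0;
      bool HaveDigit = false;
      // Scan one position past the end so the last number is flushed too.
      for (std::size_t I = 1; I <= Spec.size(); ++I) {
        if (I < Spec.size() && Spec[I] >= '0' && Spec[I] <= '9') {
          Value = Value * 10 + static_cast<unsigned>(Spec[I] - '0');
          HaveDigit = true;
        } else {
          if (HaveDigit)
            Widths.push_back(Value);
          Value = 0;
          HaveDigit = false;
        }
      }
    }
    Pos = End + 1;
  }
  return Widths;
}
```

With the separator present, `"e-p:64:64:64-n8:16:32:64"` yields 8, 16, 32 and 64. Without it, `"e-p:64:64:64n8:16:32:64"` yields nothing, because the 'n' part is swallowed by the preceding specification.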

I noticed this while testing a potential change to the way we 
transform shifts and sexts in D42424.

llvm-svn: 323377

60c13c77