- Jul 01, 2019
-
Roman Lebedev authored
Summary:
To be noted, this pattern is not unhandled by instcombine per se; it somehow does end up being folded when one runs opt -O3, but not with just -instcombine. Regardless, that fold is indirect, depends on some other folds, and is thus blind when there are extra uses.

This does address the regression exposed in D63992.

https://godbolt.org/z/7DGltU
https://rise4fun.com/Alive/EPO0

Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42459 | PR42459 ]]

Reviewers: spatel, nikic, huihuiz

Reviewed By: spatel

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D63993

llvm-svn: 364792
-
Roman Lebedev authored
Summary:
Given the pattern `icmp eq/ne (and ((x shift Q), (y oppositeshift K))), 0`, we should move the shifts to the same hand of the 'and', i.e. rewrite it as `icmp eq/ne (and (x shift (Q+K)), y), 0` iff `(Q+K) u< bitwidth(x)`.

It might be tempting to not restrict this to situations where we know we'd fold the two shifts together, but I'm not sure what rules there should be to avoid endless combine loops.

We pick the same shift that was originally used to shift the variable we picked to shift: https://rise4fun.com/Alive/6x1v

Should fix [[ https://bugs.llvm.org/show_bug.cgi?id=42399 | PR42399 ]].

Reviewers: spatel, nikic, RKSimon

Reviewed By: spatel

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D63829

llvm-svn: 364791
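The identity behind this fold can be brute-force checked at a small bit width. A minimal sketch, not the InstCombine code itself; the 8-bit width and the `shl`/`lshr` pairing are illustrative choices, with Python's unbounded shifts masked back to the modeled width:

```python
# Check: ((x << Q) & (y >> K)) == 0  <=>  ((x << (Q+K)) & y) == 0,
# for all 8-bit x, y, given the precondition (Q+K) u< bitwidth(x).
BITS = 8
MASK = (1 << BITS) - 1

def lhs(x, y, q, k):
    # icmp eq (and ((x shl Q), (y lshr K))), 0
    return (((x << q) & MASK) & (y >> k)) == 0

def rhs(x, y, q, k):
    # icmp eq (and ((x shl (Q+K)), y)), 0
    return (((x << (q + k)) & MASK) & y) == 0

def identity_holds(q, k):
    if q + k >= BITS:  # outside the precondition, nothing to check
        return True
    return all(lhs(x, y, q, k) == rhs(x, y, q, k)
               for x in range(1 << BITS) for y in range(1 << BITS))

assert all(identity_holds(q, k) for q in range(BITS) for k in range(BITS))
print("identity verified for all 8-bit x, y and all valid Q, K")
```

This mirrors the Alive proof linked above: shifting `x` across by `K` lines its bits up with `y` directly, so the `lshr` of `y` can be dropped.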
-
Krzysztof Parzyszek authored
llvm-svn: 364790
-
Matt Arsenault authored
llvm-svn: 364789
-
Nicolai Haehnle authored
Summary:
The stride should depend on the wave size, not the hardware generation. Also, the 32_FLOAT format is 0x16, not 16; though that shouldn't be relevant.

Change-Id: I088f93bf6708974d085d1c50967f119061da6dc6

Reviewers: arsenm, rampitec, mareko

Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D63808

llvm-svn: 364788
-
Matt Arsenault authored
This is easy to handle and avoids legalization artifacts which are likely to obscure combines. llvm-svn: 364787
-
Matt Arsenault authored
llvm-svn: 364786
-
Matt Arsenault authored
isVCC has the same bug, but isn't used in a context where it can cause a problem. llvm-svn: 364784
-
Matt Arsenault authored
llvm-svn: 364783
-
Matt Arsenault authored
llvm-svn: 364782
-
Diana Picus authored
Fix stack-use-after-scope errors from r364512. One instance was already fixed in r364611 - this patch simplifies that fix and addresses one more instance of similar code. Discussed in: https://reviews.llvm.org/D63905 llvm-svn: 364778
-
Jinsong Ji authored
Summary:
SCRUB_LOOP_COMMENT_RE was introduced in https://reviews.llvm.org/D31285. This works for some loops. However, we may generate lines that contain only a loop comment, and since we don't scrub leading whitespace, this leaves an empty check string behind, which FileCheck will complain about, e.g.:

llvm/test/CodeGen/PowerPC/PR35812-neg-cmpxchg.ll:27:15: error: found empty check string with prefix 'CHECK:'
; CHECK-NEXT:

This prevented us from using `update_llc_test_checks.py` in quite a few cases. We should still keep the comment token there, so that we can safely scrub the loop comment without breaking FileCheck.

Reviewers: timshen, hfinkel, lebedev.ri, RKSimon

Subscribers: nemanjai, jfb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D63957

llvm-svn: 364775
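The "keep the comment token" idea can be sketched as follows. This is a hypothetical stand-in, not the actual SCRUB_LOOP_COMMENT_RE from UpdateTestChecks; the regex and sample line are illustrative:

```python
import re

# Scrub the loop-comment text but keep the comment token, so a line that
# holds only a loop comment still leaves "#" behind and FileCheck never
# sees an empty check string.
SCRUB_LOOP_COMMENT_RE = re.compile(r'#\s*=>.*$|#\s*in Loop:.*$', flags=re.M)

def scrub_loop_comments(asm):
    return SCRUB_LOOP_COMMENT_RE.sub('#', asm)

line = '.LBB0_1:                        # =>This Inner Loop Header: Depth=1'
print(scrub_loop_comments(line))  # '.LBB0_1:                        #'
```

Substituting with `'#'` rather than `''` is the whole fix: the scrubbed line is never empty, so `; CHECK-NEXT:` always has something to match.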
-
Roman Lebedev authored
As discussed in https://reviews.llvm.org/D63829, *if* *both* shifts are one-use, we'd most likely want to produce `lshr` and not rely on ordering. Also, there should likely be a *separate* fold to do this reordering. llvm-svn: 364772
-
Krzysztof Parzyszek authored
Add code to catch pattern for commutative instructions for VLCR. Patch by Suyog Sarda. llvm-svn: 364770
-
Matt Arsenault authored
llvm-svn: 364769
-
Matt Arsenault authored
llvm-svn: 364768
-
Matt Arsenault authored
llvm-svn: 364767
-
Matt Arsenault authored
llvm-svn: 364766
-
Matt Arsenault authored
Select s64 eq/ne scalar icmp. llvm-svn: 364765
-
Roman Lebedev authored
So we do indeed have this fold, but only if the +1 is not the last operation. llvm-svn: 364764
-
Matt Arsenault authored
llvm-svn: 364763
-
Matt Arsenault authored
llvm-svn: 364762
-
Matt Arsenault authored
This was comparing the size of the register with the value of the size, which happens to be exec. Also fix assuming VCC is 64-bit, to fix wave32. Also remove some untested handling for physical registers, which is skipped. This doesn't insert the V_CNDMASK_B32 if SCC is the physical copy source. I'm not sure if this should be trying to handle this special case instead of dealing with it in copyPhysReg. llvm-svn: 364761
-
Matt Arsenault authored
Zext from s1 is the only case where this should do anything with the current legal extensions. llvm-svn: 364760
-
Matt Arsenault authored
llvm-svn: 364759
-
Matt Arsenault authored
llvm-svn: 364758
-
Simon Atanasyan authored
llvm-svn: 364757
-
Simon Atanasyan authored
llvm-svn: 364756
-
Simon Atanasyan authored
llvm-svn: 364755
-
Florian Hahn authored
isLoopExiting should only be called for blocks in the loop. A follow-up patch makes this requirement an assertion.

I've updated the usage here to match only actual exit blocks; previously, it would also match blocks not in the loop.

Reviewers: arsenm, nhaehnle

Reviewed By: nhaehnle

Differential Revision: https://reviews.llvm.org/D63980

llvm-svn: 364750
-
Roman Lebedev authored
To be noted, this pattern is not unhandled by instcombine per se; it somehow does end up being folded when one runs opt -O3, but not with just -instcombine. Regardless, that fold is indirect, depends on some other folds, and is thus blind when there are extra uses. https://bugs.llvm.org/show_bug.cgi?id=42459 https://rise4fun.com/Alive/EPO0 llvm-svn: 364749
-
Fangrui Song authored
As suggested by jrtc27 in the post-commit review of D60528. llvm-svn: 364746
-
Simon Pilgrim authored
CombineShuffleWithExtract no longer requires that both shuffle ops be extract_subvectors from the same type or of the same size. llvm-svn: 364745
-
Benjamin Kramer authored
The SDAGBuilder behavior stems from the days when we didn't have fast math flags available in SDAG. We do now and doing the transformation in the legalizer has the advantage that it also works for vector types. llvm-svn: 364743
-
Andrew Ng authored
Disabled CMake get_git_version as it is meaningless for this in-tree build, and hardcoded a null version. Not using get_git_version avoids the refresh of the git index that get_git_version executes; refreshing the index can take a considerable amount of time when it is needed (particularly with the monorepo). This situation can arise when building shared source on a host in VMs. Differential Revision: https://reviews.llvm.org/D63925 llvm-svn: 364742
-
Roman Lebedev authored
https://bugs.llvm.org/show_bug.cgi?id=42457 https://rise4fun.com/Alive/iFhE llvm-svn: 364739
-
Roman Lebedev authored
This was added to the backend in D63390 / rL364286, but it makes sense to also handle it in the middle-end. https://rise4fun.com/Alive/Zsln llvm-svn: 364738
-
Roman Lebedev authored
Was added to the backend in D63390 / rL364286, but it makes sense to also handle it here. https://rise4fun.com/Alive/Zsln llvm-svn: 364737
-
Jeremy Morse authored
This patch addresses PR41675, where a stack-pointer variable is dereferenced too many times by its location expression, presenting a value on the stack as the pointer to the stack.

The difference between a stack *pointer* DBG_VALUE and one that refers to a value on the stack is currently the indirect flag. However, the DWARF backend will also try to guess whether something is a memory location or not, based on whether there is any computation in the location expression. By simply prepending the stack offset to existing expressions, we can accidentally convert a register location into a memory location, which introduces a surprise (and unintended) dereference.

The solution is to add DW_OP_stack_value whenever we add a DIExpression computation to a stack *pointer*. It's an implicit location computed on the expression stack, and thus needs to be flagged as a stack_value.

For the edge case where the offset is zero and the location could be a register location, DIExpression::prepend will still generate opcodes, and thus DW_OP_stack_value must still be added.

Differential Revision: https://reviews.llvm.org/D63429

llvm-svn: 364736
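The fix can be sketched loosely as follows. This is not DIExpression::prepend itself; the function and its flat byte-list representation are illustrative, though the opcode encodings are the standard DWARF values:

```python
# DWARF operation encodings (values from the DWARF standard).
DW_OP_plus_uconst = 0x23
DW_OP_stack_value = 0x9f

def prepend_stack_offset(expr, offset):
    """Prepend an offset computation to a stack-*pointer* variable's
    expression, then terminate it with DW_OP_stack_value so the result
    is an implicit value on the DWARF expression stack rather than a
    memory location that the consumer would dereference. Note the
    terminator is added even when the offset is zero."""
    out = ([DW_OP_plus_uconst, offset] if offset else []) + list(expr)
    if not out or out[-1] != DW_OP_stack_value:
        out.append(DW_OP_stack_value)
    return out
```

Without the trailing DW_OP_stack_value, a consumer seeing any computation in the expression may classify it as a memory location and read through it, which is exactly the extra dereference described above.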
-
Yevgeny Rouban authored
Differential Revision: https://reviews.llvm.org/D60606 llvm-svn: 364734
-