- Dec 20, 2017
-
Matt Arsenault authored
When intrinsics are allowed to have mem operands, this can happen in two ways. The first is an intrinsic that is marked as having a mem operand but is not handled by getTgtMemIntrinsic. The second can occur even for intrinsics which do not have a mem operand: the selector table seems to do some kind of sorting based on the opcode, and the mem ref recording can happen in the same scope for intrinsics that both do and do not have mem refs. I haven't been able to figure out exactly why this happens (it happens even with the matcher optimizations disabled). I'm not sure it's worth trying to avoid hitting this for these nodes, since I think it's still reasonable to handle this in case getTgtMemIntrinsic is not implemented. llvm-svn: 321208
-
Nirav Dave authored
Prevent overlapping store elision when the overlapping store is pre-inc/dec, as the analysis is wrong in those cases. llvm-svn: 321204
-
Krzysztof Parzyszek authored
These functions simply call their counterparts in the associated SDNode, which do take an optional SelectionDAG. This change makes the legalization debug trace a little easier to read, since target-specific nodes will now have their names shown instead of "Unknown node #123". llvm-svn: 321180
-
- Dec 19, 2017
-
Adrian Prantl authored
llvm-svn: 321114
-
Nirav Dave authored
Summary: Extend overlapping store elision to handle overwrites of stores by larger stores. Nontemporal tests have been modified to add memory dependencies to prevent store elision. Reviewers: craig.topper, rnk, t.p.northover Subscribers: javed.absar, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40969 llvm-svn: 321089
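As a standalone sketch in plain C++ (not the DAG code itself; the buffer and constants are made up), this is the memory fact the extended elision relies on: once a later, larger store overwrites every byte an earlier, smaller store touched, dropping the earlier store is unobservable:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

int main() {
  uint8_t buf[4] = {0, 0, 0, 0};

  // Earlier, narrower store -- the elision candidate.
  buf[0] = 0xAA;

  // Later, wider store that overwrites every byte the first one touched.
  uint32_t wide = 0x11223344;
  std::memcpy(buf, &wide, sizeof(wide));

  // Final memory is identical whether or not the byte store ever ran,
  // so the DAG may delete it (absent intervening loads or dependencies).
  uint8_t expected[4];
  std::memcpy(expected, &wide, sizeof(wide));
  assert(std::memcmp(buf, expected, sizeof(buf)) == 0);
  return 0;
}
```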
-
- Dec 18, 2017
-
Sam Parker authored
Search from AND nodes to find whether they can be propagated back to loads, so that the AND and load can be combined into a narrow load. We search through OR, XOR and other AND nodes, and all but one of the leaves are required to be loads or constants. The exception node then needs to be masked off, meaning that the 'and' isn't removed, but the load(s) are still narrowed. Differential Revision: https://reviews.llvm.org/D41177 llvm-svn: 320962
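A minimal sketch of the equivalence the narrowing exploits, assuming a little-endian layout (the variable names are illustrative): masking a wide load down to its low byte reads exactly what a narrow load of the same address would:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

int main() {
  uint32_t word = 0xDEADBEEF;

  // The original pattern: a wide load followed by an AND.
  uint32_t wide;
  std::memcpy(&wide, &word, sizeof(wide));
  uint32_t masked = wide & 0xFF;

  // The combined form: a single narrow load of the same address.
  uint8_t narrow;
  std::memcpy(&narrow, &word, sizeof(narrow));

  assert(masked == narrow); // holds on little-endian hosts
  return 0;
}
```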
-
- Dec 15, 2017
-
Matthias Braun authored
llvm-svn: 320885
-
Matthias Braun authored
The Function can never be nullptr so we can return a reference. llvm-svn: 320884
-
Craig Topper authored
Summary: Currently we don't handle v32i1/v64i1 insert_vector_elt correctly, as we fail to look closely at the number of elements and assume it can only be v16i1 or v8i1. We also can't type legalize v64i1 insert_vector_elt correctly on KNL, because the type is not byte addressable as the legalize-through-memory path requires. For the first issue, the patch now tries to pick a 512-bit register with the correct number of elements and promotes to that. For the second issue, we now extend the vector to a byte addressable type, do the stores to memory, load the two halves, and then truncate the halves back to the original type. Technically, since we changed the type, we may not need two loads, but actually checking that is more work, and for the v64i1 case we do need them. Reviewers: RKSimon, delena, spatel, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D40942 llvm-svn: 320849
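A conceptual sketch in plain C++ (not the legalizer itself; the index 40 and the mask value are arbitrary) of the extend/store/truncate round trip described above for a v64i1 value:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  uint64_t mask = 0xFFULL; // a v64i1 value, one bit per element

  // Any-extend every i1 element to an i8 so memory can address it.
  uint8_t bytes[64];
  for (int i = 0; i < 64; ++i)
    bytes[i] = (mask >> i) & 1;

  // insert_vector_elt becomes an ordinary byte store.
  bytes[40] = 1;

  // Truncate the bytes back down to the original one-bit elements.
  uint64_t result = 0;
  for (int i = 0; i < 64; ++i)
    result |= (uint64_t)(bytes[i] & 1) << i;

  assert(result == (0xFFULL | (1ULL << 40)));
  return 0;
}
```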
-
Craig Topper authored
[SelectionDAG] Make getNode calls that take an ArrayRef of SDValue for operands call NewSDValueDbgMsg. This makes it work better with some build_vector and concat_vectors creations. Adjust the NewSDValueDbgMsg in getConstant to avoid duplicating the print when it calls getSplatBuildVector since getSplatBuildVector didn't trigger a print before. llvm-svn: 320783
-
- Dec 14, 2017
-
Adrian Prantl authored
While investigating LLVM PR22316 (http://llvm.org/bugs/show_bug.cgi?id=22316) I started wondering if it were not always preferable to emit the initial DBG_VALUEs for stack arguments as FI locations instead of describing the first register they get copied into. The advantage of doing this is that the arguments will be available as soon as the stack is setup. As illustrated by the testcase in the PR, the first copy of the FI into a register may be sunk by MachineSink.cpp into a later basic block. By describing the argument on the stack, we nicely circumvent this problem. <rdar://problem/19583723> Differential Revision: https://reviews.llvm.org/D41135 llvm-svn: 320758
-
Matt Arsenault authored
llvm-svn: 320756
-
Zachary Turner authored
Most of the -Wsign-compare warnings are due to the fact that enums are signed by default in the MS ABI, while the tautological comparison warnings trigger on x86 builds where sizeof(size_t) is 4 bytes, so N > numeric_limits<unsigned>::max() is always false. Differential Revision: https://reviews.llvm.org/D41256 llvm-svn: 320750
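A minimal reproduction of the tautological case, assuming a 32-bit build where sizeof(size_t) == 4 (the function name is made up):

```cpp
#include <cstddef>
#include <limits>

// On such a build, size_t is no wider than unsigned, so this comparison
// can never be true and the tautological-comparison warning fires.
bool overflowsUnsigned(std::size_t N) {
  return N > std::numeric_limits<unsigned>::max(); // always false when sizeof(size_t) == 4
}
```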
-
Matt Arsenault authored
Rather than adding more bits to express every MMO flag you could want, just directly use the MMO flags. Also fixes using a bunch of bool arguments to getMemIntrinsicNode. On AMDGPU, buffer and image intrinsics should always have MODereferenceable set, but currently there is no way to do that directly during the initial intrinsic lowering. llvm-svn: 320746
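A generic sketch of the API shape this moves toward (the names and signatures below are illustrative, not the actual LLVM ones): one flags word instead of a growing list of booleans:

```cpp
#include <cstdint>

// Illustrative flag bits; the real set lives on MachineMemOperand.
enum MemFlags : uint32_t {
  MOLoad = 1u << 0,
  MOStore = 1u << 1,
  MODereferenceable = 1u << 2,
  MOVolatile = 1u << 3,
};

// Before: every new memory property means yet another bool parameter.
void getMemNodeOld(bool isLoad, bool isStore, bool isVolatile) {
  (void)isLoad; (void)isStore; (void)isVolatile; // placeholder body
}

// After: one flags word expresses any combination, present or future.
void getMemNodeNew(uint32_t flags) { (void)flags; } // placeholder body

int main() {
  getMemNodeNew(MOLoad | MODereferenceable); // self-describing call site
  return 0;
}
```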
-
Benjamin Kramer authored
This reverts commit r320679. Causes miscompiles. llvm-svn: 320698
-
Sam Parker authored
Recommitting rL319773, which was reverted due to a recursion issue causing timeouts. This happened because I failed to check whether the discovered loads could be narrowed further. In the case of a tree with one or more narrow loads that could not be narrowed further, plus a node that would need masking, an AND could be introduced which could then be visited and recombined again with the same load. This could again create the masking load, which would be combined again... We now check that the load can be narrowed, so that this process stops. Original commit message: Search from AND nodes to find whether they can be propagated back to loads, so that the AND and load can be combined into a narrow load. We search through OR, XOR and other AND nodes, and all but one of the leaves are required to be loads or constants. The exception node then needs to be masked off, meaning that the 'and' isn't removed, but the load(s) are still narrowed. Differential Revision: https://reviews.llvm.org/D41177 llvm-svn: 320679
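An abstract sketch of the termination argument, using integer widths as stand-ins for DAG nodes: a combine that fires only when it makes the load strictly narrower can never revisit the same width, so repeated visits converge:

```cpp
#include <cassert>

// One "combine step": narrowing only fires when it makes strict progress.
int narrowLoad(int currentWidth, int neededWidth) {
  if (neededWidth >= currentWidth)
    return currentWidth; // the recommit's guard: no change, no re-queued work
  return neededWidth;
}

int main() {
  int width = 32;
  // Simulate repeated combiner visits; the width decreases monotonically
  // and then stays fixed, so the process cannot loop forever.
  for (int i = 0; i < 100; ++i)
    width = narrowLoad(width, 8);
  assert(width == 8);
  return 0;
}
```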
-
Craig Topper authored
A v32i1 CONCAT_VECTORS of v16i1 uses promotion to v32i8 to legalize the v32i1. This results in a bunch of extract_vector_elts and a build_vector that ultimately gets scalarized. This patch checks to see if v16i8 is legal and inserts an any_extend to that, so that we can concat v16i8 to v32i8 and avoid creating the extracts. llvm-svn: 320674
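A rough model of the cost difference in plain C++ (bitmask integers stand in for the i1 vectors): concatenating two 16-element masks as whole values versus rebuilding all 32 elements one at a time:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  uint16_t lo = 0x00FF, hi = 0xF0F0; // two v16i1 values, one bit per lane

  // Whole-register concatenation -- the cheap path the patch enables.
  uint32_t concat = (uint32_t)hi << 16 | lo;

  // Element-by-element rebuild -- the scalarized path it avoids.
  uint32_t scalarized = 0;
  for (int i = 0; i < 16; ++i)
    scalarized |= (uint32_t)((lo >> i) & 1) << i;
  for (int i = 0; i < 16; ++i)
    scalarized |= (uint32_t)((hi >> i) & 1) << (16 + i);

  assert(concat == scalarized);
  return 0;
}
```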
-
Craig Topper authored
[SelectionDAG] When legalizing the result type of CONCAT_VECTORS, take into account whether the input type also needs to be promoted. If so go ahead and get the promoted input vector to extract from. Previously, we would create a bunch of any_extends of extract_vector_elts with illegal input type that needs to be promoted. The legalization of those extract_vector_elts would then potentially introduce a truncate. So now we have a bunch of any_extends of truncates. By legalizing both parts together we avoid creating these extra nodes. The test changes seem to be because we were previously combining the build_vector with the any_extend before the any_extend got combined with the truncate. llvm-svn: 320669
-
- Dec 13, 2017
-
Michael Zolotukhin authored
llvm-svn: 320619
-
Roger Ferrer Ibanez authored
Add missing case that was not implemented yet. Differential Revision: https://reviews.llvm.org/D38942 llvm-svn: 320567
-
- Dec 11, 2017
-
Sanjay Patel authored
At first, I tried to thread the x86 needle and use a target hook (isVectorShiftByScalarCheap()) to disable the transform only for non-splat pow-of-2 constants, but not AVX2, but only some element types, but...it's difficult. Here we just avoid the loop with the x86 vector transform that conflicts with the general DAG combine and preserve all of the existing behavior AFAICT otherwise. Some tests that will probably fail if someone does try to restrict this in a more targeted way for x86-only may be found in: test/CodeGen/X86/combine-mul.ll test/CodeGen/X86/vector-mul.ll test/CodeGen/X86/widen_arith-5.ll This should prevent the infinite looping seen with: https://bugs.llvm.org/show_bug.cgi?id=35579 Differential Revision: https://reviews.llvm.org/D41040 llvm-svn: 320374
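The identity the two transforms were fighting over, sketched in plain C++ (lane values are arbitrary): a multiply by a power-of-2 splat is a per-lane shift, which is why both the generic combine and the x86 lowering wanted to rewrite the same node:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  int32_t lanes[4] = {1, -2, 30000, -7};
  for (int32_t x : lanes) {
    // mul by the splat constant 8 == shift left by 3 in each lane
    // (computed through uint32_t to keep the shift well-defined).
    assert(x * 8 == (int32_t)((uint32_t)x << 3));
  }
  return 0;
}
```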
-
Nemanja Ivanovic authored
This commit is the first part of https://reviews.llvm.org/D40348. In order to allow target combines to be performed on newly combined indexed loads, add them back to the worklist. The remainder of the above patch will be committed in subsequent revisions and will use this. Test cases will be included with those follow-up commits. llvm-svn: 320365
-
Roger Ferrer Ibanez authored
This is a preparatory step for D34515. This change:
- makes nodes ISD::ADDCARRY and ISD::SUBCARRY legal for i32
- lowering is done by first converting the boolean value into the carry flag using (_, C) ← (ARMISD::ADDC R, -1) and then converting it back to an integer value using (R, _) ← (ARMISD::ADDE 0, 0, C). An ARMISD::ADDE between the two operations does the actual addition.
- for subtraction, given that the second result of ISD::SUBCARRY is actually a borrow, we need to invert the value of the second operand and result before and after using ARMISD::SUBE. We need to invert the carry result of ARMISD::SUBE to preserve the semantics (see the sketch after this entry).
- given that the generic combiner may lower ISD::ADDCARRY and ISD::SUBCARRY into ISD::UADDO and ISD::USUBO, we need to update their lowering as well; otherwise i64 operations would now require branches. This implies updating the corresponding test for unsigned.
- add a new combiner to remove the redundant conversions from/to carry flags to/from boolean values: (ARMISD::ADDC (ARMISD::ADDE 0, 0, C), -1) → C
- fixes PR34045
- fixes PR34564
- fixes PR35103
Differential Revision: https://reviews.llvm.org/D35192 llvm-svn: 320355
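The carry/borrow mismatch mentioned in the subtraction bullet, sketched numerically in plain C++ (the values are arbitrary): ISD::SUBCARRY's borrow is the inverse of ARM's carry flag, which is why the extra inversions are needed:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  uint32_t a = 5, b = 9; // a - b wraps, so a borrow occurs

  // ISD::SUBCARRY semantics: the second result is a borrow, set on wrap.
  uint32_t borrow = a < b ? 1 : 0;

  // ARM flag semantics for SUBS: the carry is set when NO borrow occurred.
  uint32_t armCarry = a >= b ? 1 : 0;

  // Hence the lowering must invert the value when moving between the two.
  assert(borrow == (armCarry ^ 1));
  return 0;
}
```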
-
Alex Bradbury authored
Introduces the AddrFI "addressing mode", which is necessary simply because it's not possible to write a pattern that directly matches a frameindex. Ensure callee-saved registers are accessed relative to the stack pointer. This is necessary as callee-saved register spills are performed before the frame pointer is set. Move HexagonDAGToDAGISel::isOrEquivalentToAdd to SelectionDAGISel, so we can make use of it in the RISC-V backend. Differential Revision: https://reviews.llvm.org/D39848 llvm-svn: 320353
-
Craig Topper authored
We should probably also fold (mulhs/u X, 1) for vectors, but that's harder. llvm-svn: 320344
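What the new fold computes, sketched with 32-bit lanes in plain C++: the unsigned high half of X*1 is always zero, and the signed high half is just X's sign bit smeared across the lane:

```cpp
#include <cassert>
#include <cstdint>

int main() {
  int32_t xs[3] = {7, -7, INT32_MIN};
  for (int32_t x : xs) {
    // Unsigned high half of x * 1: the product fits in 32 bits, so it's 0.
    uint32_t hiU = (uint32_t)(((uint64_t)(uint32_t)x * 1u) >> 32);
    // Signed high half of x * 1: just the sign extension of x.
    int32_t hiS = (int32_t)(((int64_t)x * 1) >> 32);
    assert(hiU == 0);
    assert(hiS == (x < 0 ? -1 : 0)); // i.e. an arithmetic shift right by 31
  }
  return 0;
}
```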
-
Craig Topper authored
llvm-svn: 320343
-
- Dec 09, 2017
-
Dylan McKay authored
Summary: This relaxes an assertion inside SelectionDAGBuilder which is overly restrictive on targets which have no concept of alignment (such as AVR). On these architectures, all types are aligned to 8 bits. After this, LLVM will only assert that accesses are aligned on targets which actually require alignment. This patch follows from a discussion on llvm-dev a few months ago: http://llvm.1065342.n5.nabble.com/llvm-dev-Unaligned-atomic-load-store-td112815.html Reviewers: bogner, nemanjai, joerg, efriedma Reviewed By: efriedma Subscribers: efriedma, cactus, llvm-commits Differential Revision: https://reviews.llvm.org/D39946 llvm-svn: 320243
-
- Dec 08, 2017
-
Adrian Prantl authored
is mentioned in the documentation (inserting a deref before the plus_uconst). llvm-svn: 320203
-
- Dec 07, 2017
-
Sanjay Patel authored
I noticed this pattern in D38316 / D38388. We failed to combine a shuffle that either repeats a scalar insertion at the same position in a vector or translates it to a different element index. Like the earlier patch, this could be an instcombine too, but since we opted to make this a DAG transform earlier, I've made this one a DAG patch too. We do not need any legality checking because the new insert is identical to the existing insert, except that it may have a different constant insertion operand. The constant insertion test in test/CodeGen/X86/vector-shuffle-combining.ll was the motivation for D38756. Differential Revision: https://reviews.llvm.org/D40209 llvm-svn: 320050
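A toy model of the folded pattern in plain C++ (std::array stands in for a v4i32; the mask is made up): a shuffle that only relocates an inserted scalar is the same as inserting it at the new index directly:

```cpp
#include <array>
#include <cassert>

int main() {
  std::array<int, 4> V = {10, 20, 30, 40};

  // insertelement V, 99, 1
  std::array<int, 4> ins = V;
  ins[1] = 99;

  // shufflevector(ins, V) with mask {4, 5, 6, 1}: lanes 0-2 come from the
  // second input (V itself); lane 3 picks the inserted scalar from lane 1.
  std::array<int, 4> shuffled = {V[0], V[1], V[2], ins[1]};

  // The single insert the combine produces instead -- same scalar,
  // different index, no shuffle.
  std::array<int, 4> direct = V;
  direct[3] = 99;

  assert(shuffled == direct);
  return 0;
}
```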
-
Craig Topper authored
[SelectionDAG] In SplitVecOp_EXTRACT_VECTOR_ELT, simplify the code that makes the type byte addressable. We can just extend the original vector to vXi1 and trust that the legalization process will revisit it. llvm-svn: 320013
-
Craig Topper authored
[SelectionDAG] Use TLI.getVectorIdxTy to determine type for an EXTRACT_VECTOR_ELT index instead of hardcoding MVT::i8. llvm-svn: 320012
-
- Dec 06, 2017
-
Nirav Dave authored
Reenable post-legalize stores with constant merging computation and corresponding test case.
* Properly truncate store merge constants (see the sketch after this entry)
* Disable merging of truncated floating point stores
* Ensure merges of constant stores into a single vector are constructed from legal elements
Reviewers: eastig, efriedma Reviewed By: eastig Subscribers: spatel, rengolin, aemerson, javed.absar, kristof.beyls, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D40701 llvm-svn: 319899
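A standalone sketch of the merged-constant computation from the first bullet, assuming a little-endian host (buffer names and constants are made up): two adjacent i16 constant stores become one i32 store whose value is built from the properly truncated constants:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

int main() {
  uint8_t separate[4], merged[4];

  // Original pair of adjacent 16-bit constant stores.
  uint16_t lo = 0x1234, hi = 0xABCD;
  std::memcpy(separate + 0, &lo, sizeof(lo));
  std::memcpy(separate + 2, &hi, sizeof(hi));

  // Merged single 32-bit store: on little-endian, the low-address
  // constant occupies the low half of the merged value.
  uint32_t mergedVal = (uint32_t)hi << 16 | lo;
  std::memcpy(merged, &mergedVal, sizeof(mergedVal));

  assert(std::memcmp(separate, merged, 4) == 0);
  return 0;
}
```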
-
Vlad Tsyrklevich authored
This reverts commit r319773. It was causing some buildbots to hang, e.g. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-android/builds/5589 llvm-svn: 319867
-
Craig Topper authored
The condition operand should be promoted during operand promotion. llvm-svn: 319853
-
Craig Topper authored
If the mask needs to be promoted that should occur by the legalizer detecting the mask operand needs to be promoted not as a side effect of another action. llvm-svn: 319852
-
Craig Topper authored
If the mask needs to be promoted it should be handled by operand promotion after the result is legalized. llvm-svn: 319851
-
Craig Topper authored
[SelectionDAG] Don't promote mask operands of MGATHER and MLOAD to setcc result type while widening the result. Just widen the mask. The mask will be promoted if necessary when operands are promoted. It's possible the mask type is legal but the setcc result type is different. We shouldn't promote to the setcc result type unless the mask needs to be promoted. llvm-svn: 319850
-
Craig Topper authored
GetWidenedVector doesn't guarantee the widened elements are zero, which would break the intended behavior of the operation. llvm-svn: 319849
-
- Dec 05, 2017
-
Hans Wennborg authored
The patch originally broke Chromium (crbug.com/791714) due to its failing to specify that the new pseudo instructions clobber EFLAGS. This commit fixes that. > Summary: This strengthens the guard and matches MSVC. > > Reviewers: hans, etienneb > > Subscribers: hiraditya, JDevlieghere, vlad.tsyrklevich, llvm-commits > > Differential Revision: https://reviews.llvm.org/D40622 llvm-svn: 319824
-
Craig Topper authored
There's no such thing as a setcc with vector operands and a scalar result. And if we're trying to widen the result, we would have to already be looking at a vector result type. So this patch renames the VSETCC function to SETCC and deletes the original SETCC function. llvm-svn: 319799
-