Commits · 20b9189975e06ed36bdaed16cbd9a377c0b579e7 · Lorenzo Albano / LLVM bpEVL

Feb 12, 2019

[NFC] Rename DontDeleteUselessPHIs --> KeepOneInputPHIs · 20b91899
Max Kazantsev authored Feb 12, 2019
```
llvm-svn: 353801
```
20b91899
[Statepoint Lowering] Update misleading comments about chains · b6dc6eb8
Philip Reames authored Feb 12, 2019
```
llvm-svn: 353800
```
b6dc6eb8
[NFC] Add parameter for keeping one-input Phis in DeleteDeadBlock(s) · 0686d1ae
Max Kazantsev authored Feb 12, 2019
```
llvm-svn: 353799
```
0686d1ae

[X86] Collapse FP_TO_INT16_IN_MEM/FP_TO_INT32_IN_MEM/FP_TO_INT64_IN_MEM into a... · 7670ede4

Craig Topper authored Feb 12, 2019

[X86] Collapse FP_TO_INT16_IN_MEM/FP_TO_INT32_IN_MEM/FP_TO_INT64_IN_MEM into a single opcode using memory VT to distinquish. NFC

llvm-svn: 353798

7670ede4

[X86] Remove the value type operand from the floating point load/store... · d7303ecd

Craig Topper authored Feb 12, 2019

[X86] Remove the value type operand from the floating point load/store MemIntrinsicSDNodes. Use the MemoryVT instead. NFCI

We already have the memory VT, we can just match from that during isel.

llvm-svn: 353797

d7303ecd

[LoopReroll] Fix reroll root legality checking. · 806136f8

Eli Friedman authored Feb 12, 2019

The code checked that the first root was an appropriate distance from
the base value, but skipped checking the other roots. This could lead to
rerolling a loop that can't be legally rerolled (at least, not without
rewriting the loop in a non-trivial way).

Differential Revision: https://reviews.llvm.org/D56812

llvm-svn: 353779

806136f8

DebugInfo: Split DWARF + gmlt + no-split-dwarf-inlining shouldn't emit anything to the .dwo file · 104dcb34

David Blaikie authored Feb 12, 2019

This configuration (due to r349207) was intended not to emit any DWO CU,
but a degenerate CU was still being emitted - containing a header and a
DW_TAG_compile_unit with no attributes.

Under that situation, emit nothing to the .dwo file. (since this is a
dynamic property of the input the .dwo file is still emitted, just with
nothing in it (so a valid, but empty, ELF file) - if some other CU
didn't satisfy this criteria, its DWO CU would still go there, etc)

llvm-svn: 353771

104dcb34

Be conservative about unordered accesses for the moment · 5906a659

Philip Reames authored Feb 11, 2019

Background: As described in https://reviews.llvm.org/D57601, I'm working towards separating volatile and atomic in the MMO uses for atomic instructions.

In https://reviews.llvm.org/D57593, I fixed a bug where isUnordered was returning the wrong result, but didn't account for the fact I was getting slightly ahead of myself. While both uses of isUnordered are correct (as far as I can tell), we don't have tests to demonstrate this and being aggressive gets in the way of having the removal of volatile truly be non-functional. Once D57601 lands, I will return to these call sites, revert this patch, and add the appropriate tests to show the expected behaviour.

Differential Revision: https://reviews.llvm.org/D57802

llvm-svn: 353766

5906a659

Feb 11, 2019

GlobalISel: Verify G_EXTRACT · b2d24577
Matt Arsenault authored Feb 11, 2019
```
llvm-svn: 353759
```
b2d24577

[TargetLibraryInfo] Update run time support for Windows · f4a36959

Evandro Menezes authored Feb 11, 2019

It seems that, since VC19, the `float` C99 math functions are supported for all
targets, unlike the C89 ones.

According to the discussion at https://reviews.llvm.org/D57625.

llvm-svn: 353758

f4a36959

[LegalizeTypes] Expand FNEG to bitwise op for IEEE FP types · 9a3dc3e6

Ana Pazos authored Feb 11, 2019

Summary:
Except for custom floating point types x86_fp80 and ppc_fp128,
expand Y = FNEG(X) to Y = X ^ sign mask to avoid library call.
 Using bitwise operation can improve code size and performance.

Reviewers: efriedma

Reviewed By: efriedma

Subscribers: efriedma, kpn, arsenm, eli.friedman, javed.absar, rbar, johnrusso, simoncook, sabuasal, niosHD, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, asb, llvm-commits

Differential Revision: https://reviews.llvm.org/D57875

llvm-svn: 353757

9a3dc3e6

[IRReader] Expose getLazyIRModule · 72a0f4e8

Scott Linder authored Feb 11, 2019

Currently there is no way to lazy-load an in-memory IR module without
first writing it to disk. This patch just exposes the existing
implementation of getLazyIRModule.

This is effectively a revert of rL212364

Differential Revision: https://reviews.llvm.org/D56203

llvm-svn: 353755

72a0f4e8

GlobalISel: Implement moreElementsVector for implicit_def · 18ec3826
Matt Arsenault authored Feb 11, 2019
```
llvm-svn: 353754
```
18ec3826
GlobalISel: Fix not calling the observer when legalizing G_EXTRACT · 68fc38ce
Matt Arsenault authored Feb 11, 2019
```
llvm-svn: 353750
```
68fc38ce
[globalisel] Correct string emitted by GISelChangeObserver::erasingInstr() · 24e0af69
Daniel Sanders authored Feb 11, 2019
```
The API indicates that the MI is about to be erased rather than it has been erased.

llvm-svn: 353746
```
24e0af69

[X86] Correct the memory operand for the FLD emitted in FP_TO_INTHelper for 32-bit SSE targets. · 75eb0af8

Craig Topper authored Feb 11, 2019

We were using DstTy, but that represents the integer type we are converting to which is i64 in this
case. The FLD is part of an intermediate step to get from the SSE registers to the x87 registers.
If the floating point type is f32, the memory operand should reflect a 4 byte access not an 8 byte
access. The store we used to get from SSE to the stack is using the corect size.

While there, consistenly use TheVT in place of Op.getOperand(0).getValueType() throughout the function.

llvm-svn: 353745

75eb0af8

[MemorySSA] Remove verifyClobberSanity. · d77edc00

Alina Sbirlea authored Feb 11, 2019

Summary:
This verification may fail after certain transformations due to
BasicAA's fragility. Added a small explanation and a testcase that
triggers the assert in checkClobberSanity (before its removal).
Addresses PR40509.

Reviewers: george.burgess.iv

Subscribers: sanjoy, jlebar, llvm-commits, Prazek

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57973

llvm-svn: 353739

d77edc00

Refactor setAlreadyUnrolled() and setAlreadyVectorized(). · 77a614a6

Michael Kruse authored Feb 11, 2019

Loop::setAlreadyUnrolled() and
LoopVectorizeHints::setLoopAlreadyUnrolled() both add loop metadata that
stops the same loop from being transformed multiple times. This patch
merges both implementations.

In doing so we fix 3 potential issues:

 * setLoopAlreadyUnrolled() kept the llvm.loop.vectorize/interleave.*
   metadata even though it will not be used anymore. This already caused
   problems such as http://llvm.org/PR40546. Change the behavior to the
   one of setAlreadyUnrolled which deletes this loop metadata.

 * setAlreadyUnrolled() used to create a new LoopID by calling
   MDNode::get with nullptr as the first operand, then replacing it by
   the returned references using replaceOperandWith. It is possible
   that MDNode::get would instead return an existing node (due to
   de-duplication) that then gets modified. To avoid, use a fresh
   TempMDNode that does not get uniqued with anything else before
   replacing it with replaceOperandWith.

 * LoopVectorizeHints::matchesHintMetadataName() only compares the
   suffix of the attribute to set the new value for. That is, when
   called with "enable", would erase attributes such as
   "llvm.loop.unroll.enable", "llvm.loop.vectorize.enable" and
   "llvm.loop.distribute.enable" instead of the one to replace.
   Fortunately, function was only called with "isvectorized".

Differential Revision: https://reviews.llvm.org/D57566

llvm-svn: 353738

77a614a6

[InstCombine] Fix matchRotate bug when one operand is a ConstantExpr shift · 587fd849

Sanjay Patel authored Feb 11, 2019

This bug seems to be harmless in release builds, but will cause an error in UBSAN
builds or an assertion failure in debug builds.

When it gets to this opcode comparison, it assumes both of the operands are BinaryOperators,
but the prior m_LogicalShift will also match a ConstantExpr. The cast<BinaryOperator> will
assert in a debug build, or reading an invalid value for BinaryOp from memory with
((BinaryOperator*)constantExpr)->getOpcode() will cause an error in a UBSAN build.

The test I added will fail without this change in debug/UBSAN builds, but not in release.

Patch by: @AndrewScheidecker (Andrew Scheidecker)

Differential Revision: https://reviews.llvm.org/D58049

llvm-svn: 353736

587fd849

[SelectionDAGBuilder] Add restrictions to EmitFuncArgumentDbgValue · 4892f06e

Bjorn Pettersson authored Feb 11, 2019

Summary:
This patch fixes PR40587.

When a dbg.value instrinsic is emitted to the DAG
by using EmitFuncArgumentDbgValue the resulting
DBG_VALUE is hoisted to the beginning of the entry
block. I think the idea is to be able to locate
a formal argument already from the start of the
function.
However, EmitFuncArgumentDbgValue only checked that
the value that was used to describe a variable was
originating from a function parameter, not that the
variable itself actually was an argument to the
function. So when for example assigning a local
variable "local" the value from an argument "a",
the assocated DBG_VALUE instruction would be hoisted
to the beginning of the function, even if the scope
for "local" started somewhere else (or if "local"
was mapped to other values earlier in the function).

This patch adds some logic to EmitFuncArgumentDbgValue
to check that the variable being described actually
is an argument to the function. And that the dbg.value
being lowered already is in the entry block. Otherwise
we bail out, and the dbg.value will be handled as an
ordinary dbg.value (not as a "FuncArgumentDbgValue").

A tricky situation is when both the variable and
the value is related to function arguments, but not
neccessarily the same argument. We make sure that we
do not describe the same argument more than once as
a "FuncArgumentDbgValue". This solution works as long
as opt has injected a "first" dbg.value that corresponds
to the formal argument at the function entry.

Reviewers: jmorse, aprantl

Subscribers: jyknight, hiraditya, fedor.sergeev, dstenb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57702

llvm-svn: 353735

4892f06e

[LICM&MSSA] Limit store hoisting. · 605b2173

Alina Sbirlea authored Feb 11, 2019

Summary:
If there is no clobbering access for a store inside the loop, that store
can only be hoisted if there are no interfearing loads.
A more general verification introduced here: there are no loads that are
not optimized to an access outside the loop.
Addresses PR40586.

Reviewers: george.burgess.iv

Subscribers: sanjoy, jlebar, Prazek, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57967

llvm-svn: 353734

605b2173

[TargetLibraryInfo] Update run time support for Windows · 4b86c474

Evandro Menezes authored Feb 11, 2019

It seems that the run time for Windows has changed and supports more math
functions than it used to, especially on AArch64, ARM, and AMD64.

Fixes PR40541.

Differential revision: https://reviews.llvm.org/D57625

llvm-svn: 353733

4b86c474

[AArch64][GlobalISel] Add isel support for a couple vector exts/truncs · e57fe23f

Jessica Paquette authored Feb 11, 2019

Add support for

- v4s16 <-> v4s32
- v2s64 <-> v2s32

And update tests that use them to show that we generate the correct
instructions.

Differential Revision: https://reviews.llvm.org/D57832

llvm-svn: 353732

e57fe23f

[DebugInfo] Fix /usr/lib/debug llvm-symbolizer lookup with relative paths · 5b7ad427

Jordan Rupprecht authored Feb 11, 2019

Summary:
rL189250 added a realpath call, and rL352916 because realpath breaks assumptions with some build systems. However, the /usr/lib/debug case has been clarified, falling back to /usr/lib/debug is currently broken if the obj passed in is a relative path. Adding a call to use absolute paths when falling back to /usr/lib/debug fixes that while still not making any realpath assumptions.

This also adds a --fallback-debug-path command line flag for testing (since we probably can't write to /usr/lib/debug from buildbot environments), but was also verified manually:

```
$ rm -f path/to/dwarfdump-test.elf-x86-64
$ strace llvm-symbolizer --obj=relative/path/to/dwarfdump-test.elf-x86-64.debuglink 0x40113f |& grep dwarfdump
```

Lookups went to relative/path/to/dwarfdump-test.elf-x86-64, relative/path/to/.debug/dwarfdump-test.elf-x86-64, and then finally /usr/lib/debug/absolute/path/to/dwarfdump-test.elf-x86-64.

Reviewers: dblaikie, samsonov

Reviewed By: dblaikie

Subscribers: krytarowski, aprantl, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D57916

llvm-svn: 353730

5b7ad427

[MCA][Scheduler] Track resources that were found busy when issuing an instruction. · 23ff2aa4

Andrea Di Biagio authored Feb 11, 2019

This is a follow up of r353706. When the scheduler fails to issue a ready
instruction to the underlying pipelines, it now updates a mask of 'busy resource
units'. That information will be used in future to obtain the set of
"problematic" resources in the case of bottlenecks caused by resource pressure.
No functional change intended.

llvm-svn: 353728

23ff2aa4

[PowerPC] Avoid scalarization of vector truncate · 732fe224

Roland Froese authored Feb 11, 2019

The PowerPC code generator currently scalarizes vector truncates that would fit in a vector register, resulting in vector extracts, scalar operations, and vector merges. This patch custom lowers a vector truncate that would fit in a register to a vector shuffle instead.

Differential Revision: https://reviews.llvm.org/D56507

llvm-svn: 353724

732fe224

[GlobalISel][AArch64] Select G_FFLOOR · ebdb0210

Jessica Paquette authored Feb 11, 2019

This teaches the legalizer about G_FFLOOR, and lets us select G_FFLOOR in
AArch64.

It updates the existing floating point tests, and adds a select-floor.mir test.

Differential Revision: https://reviews.llvm.org/D57486

llvm-svn: 353722

ebdb0210

Recommit "[GlobalISel] Add IRTranslator support for G_FFLOOR" · f472f318

Jessica Paquette authored Feb 11, 2019

After the changes introduced in r353586, this instruction doesn't cause any
issues for any backend.

Original review: https://reviews.llvm.org/D57485

llvm-svn: 353720

f472f318

GlobalISel: Add G_FCANONICALIZE instruction · 9dba67f4
Matt Arsenault authored Feb 11, 2019
```
llvm-svn: 353719
```
9dba67f4
Move some classes into anonymous namespaces. NFC. · 711950c1
Benjamin Kramer authored Feb 11, 2019
```
llvm-svn: 353710
```
711950c1

[MCA] Return a mask of busy resources from method ResourceManager::checkAvailability(). NFCI · 83e68854

Andrea Di Biagio authored Feb 11, 2019

In case of bottlenecks caused by pipeline pressure, we want to be able to
correctly report the set of problematic pipelines. This is a first step towards
adding support for bottleneck hints in llvm-mca (see PR37494). No functional
change intended.

llvm-svn: 353706

83e68854

[AMDGPU] Remove unused variable · 582c1601
Benjamin Kramer authored Feb 11, 2019
```
llvm-svn: 353704
```
582c1601

[AMDGPU] Fix DPP sequence in atomic optimizer. · 8c10fa1a

Neil Henning authored Feb 11, 2019

This commit fixes the DPP sequence in the atomic optimizer (which was
previously missing the row_shr:3 step), and works around a read_register
exec bug by using a ballot instead.

Differential Revision: https://reviews.llvm.org/D57737

llvm-svn: 353703

8c10fa1a

Revert "[X86][SSE] Generalize X86ISD::BLENDI support to more value types" · e825ba91

Sam McCall authored Feb 11, 2019

This reverts commit r353610.
It causes a miscompile visible in macro expansion in a bootstrapped clang.

http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190211/626590.html

llvm-svn: 353699

e825ba91

[ARM] Add v8m.base pattern for add negative imm · 8ff14303

Sam Parker authored Feb 11, 2019

The v8m.base ISA contains movw, which can operate on an unsigned
16-bit value. Add the pattern that converts an add with a negative
value, that could fit into 16-bits when negated, into a sub with that
positive value.

Differential Revision: https://reviews.llvm.org/D57942

llvm-svn: 353692

8ff14303

[AMDGPU] Enable DPP combiner pass by default. · ded96df0

Valery Pykhtin authored Feb 11, 2019

Related revisions: https://reviews.llvm.org/D55444, https://reviews.llvm.org/D55314

llvm-svn: 353691

ded96df0

[ARM] LoadStoreOptimizer: reoder limit · 150ccb88

Sjoerd Meijer authored Feb 11, 2019

The whole design of generating LDMs/STMs is fragile and unreliable: it depends on
rescheduling here in the LoadStoreOptimizer that isn't register pressure aware
and regalloc that isn't aware of generating LDMs/STMs.
This patch adds a (hidden) option to control the total number of instructions that
can be re-ordered. I appreciate this looks only a tiny bit better than a hard-coded
constant, but at least it allows more easy experimentation with different values
for now. Ideally we calculate this reorder limit based on some heuristics, and take
register pressure into account. I might be looking into that next.

Differential Revision: https://reviews.llvm.org/D57954

llvm-svn: 353678

150ccb88

Move CFLGraph and the AA summary code over to the new `CallBase` · 9beadff6
Chandler Carruth authored Feb 11, 2019
```
instruction base class rather than the `CallSite` wrapper.

llvm-svn: 353676
```
9beadff6
Remove `CallSite` from the CodeMetrics analysis, moving it to the new · 2d2a4359
Chandler Carruth authored Feb 11, 2019
```
`CallBase` and simpler APIs therein.

llvm-svn: 353673
```
2d2a4359
[ARM] LoadStoreOptimizer: just a clean-up. NFC. · 0cc50c6b
Sjoerd Meijer authored Feb 11, 2019
```
Differential Revision: https://reviews.llvm.org/D57955

llvm-svn: 353670
```
0cc50c6b