Commits · 100e797adb433724a17c9b42b6533cd634cb796b · Lorenzo Albano / LLVM bpEVL

Nov 05, 2019

[LV] Apply sink-after & interleave-groups as VPlan transformations (NFC) · 100e797a

Gil Rapaport authored Oct 07, 2019

This recommits 2be17087 (reverted in
d3ec06d2 for heap-use-after-free) with a fix
in IAI's reset() which was not clearing the set of interleave groups after
deleting them.

100e797a

Fix uninitialized variable warning. NFCI. · 95a25d88
Simon Pilgrim authored Nov 05, 2019

95a25d88
[MCObjectFileInfo] Fix uninitialized variable warnings. NFCI. · dec21e44
Simon Pilgrim authored Nov 05, 2019

dec21e44
[MachineOutliner] Fix uninitialized variable warnings. NFCI. · c7f127d9
Simon Pilgrim authored Nov 05, 2019

c7f127d9
[OPENMP][DOCS]Fix coloring of the implemented features status, NFC. · 642916ad
Alexey Bataev authored Nov 05, 2019

642916ad

[ObjC][ARC] Ignore lifetime markers between *ReturnValue calls · 47d10297

Francis Visoiu Mistrih authored Nov 04, 2019

When eliminating a pair of

`llvm.objc.autoreleaseReturnValue`

followed by

`llvm.objc.retainAutoreleasedReturnValue`

we need to make sure that the instructions in between are safe to
ignore.

Other than bitcasts and useless GEPs, it's also safe to ignore lifetime
markers for both static allocas (lifetime.start/lifetime.end) and dynamic
allocas (stacksave/stackrestore).

These get added by the inliner as part of the return sequence and can
prevent the transformation from happening in practice.
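
As a rough illustration of the check described above, here is a toy model in Python. It is only a sketch: instructions are modelled as plain strings, and the names `IGNORABLE` and `can_eliminate_pair` are hypothetical, not part of the actual ObjC ARC optimizer.

```python
# Toy model: each instruction between the two calls is represented by a
# plain opcode string. These strings are illustrative, not real LLVM IR.
IGNORABLE = {
    "bitcast",
    "gep_zero",        # a "useless" GEP, i.e. one with all-zero indices
    "lifetime.start",  # lifetime markers for static allocas
    "lifetime.end",
    "stacksave",       # lifetime markers for dynamic allocas
    "stackrestore",
}


def can_eliminate_pair(between):
    """Return True if every instruction between autoreleaseReturnValue and
    retainAutoreleasedReturnValue is safe to ignore."""
    return all(op in IGNORABLE for op in between)
```

With this model, a sequence such as `["bitcast", "lifetime.end", "stackrestore"]` no longer blocks the pairing, while an unrelated `"call"` still does.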

Differential Revision: https://reviews.llvm.org/D69833

47d10297

[NFC][ObjC][ARC] Add tests for OptimizeRetainRVCall · 68f39de0
Francis Visoiu Mistrih authored Nov 04, 2019
```
Add tests for bitcasts + zero GEPs, and pre-commit tests for lifetime
markers.
```
68f39de0

[JumpThreading] Factor out common code to update the SSA form (NFC) · 0016c1f4

Kazu Hirata authored Nov 04, 2019

Summary:
This patch factors out common code to update the SSA form in
JumpThreading.cpp -- partly for readability and partly to facilitate
an upcoming patch of my own.

Reviewers: wmi

Subscribers: hiraditya, jfb, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69811

0016c1f4

[GVN] Fix uninitialized variable warnings. NFCI. · 77debf51
Simon Pilgrim authored Nov 05, 2019

77debf51

Add missing GVN =operator. NFCI. · 1842fe6b

Simon Pilgrim authored Nov 05, 2019

Fixes PVS Studio warning that the 'ValueTable' class implements a copy constructor, but lacks the '=' operator.

1842fe6b

[InstCombine] add tests for shift-logic-shift; NFC · 3ce0c785

Sanjay Patel authored Nov 05, 2019

This is based on existing CodeGen test files for x86 and AArch64.
The corresponding potential transform is shown in:
rL370617

3ce0c785

[lldb] Fix readline/libedit compat patch for py2 · d5904988
serge-sans-paille authored Nov 05, 2019
```
This is a follow-up to https://reviews.llvm.org/D69793
```
d5904988
[AtomicExpandPass] Silence static analyzer warnings about operator priority. NFCI. · 9f294fc4
Dávid Bolvanský authored Nov 05, 2019

9f294fc4

[MachineScheduler] Enable AA in PostRA Machine scheduler · f01b9aa8

David Green authored Nov 05, 2019

This adds AA to Post-RA Machine Scheduling, allowing the pass more
freedom when handling memory operations.

My understanding is that this was just never done, not that it is
inherently incorrect to do so. The older PostRA List scheduler already
makes use of AA, it's just that the MI PostRA Scheduler was never taught
to use it.

Differential Revision: https://reviews.llvm.org/D69814

f01b9aa8

[Docs] Add LangRef documentation for freeze instruction · 2d21068d

Nuno Lopes authored Nov 05, 2019

Summary:
 - Describe the new freeze instruction
 - Make it explicit that branch on undef/poison is UB

Reviewers: chandlerc, majnemer, efriedma, nikic, reames, jdoerfert, lebedev.ri, regehr

Subscribers: fhahn, bollu, lebedev.ri, delcypher, spatel, filcab, llvm-commits, aqjune

Differential Revision: https://reviews.llvm.org/D29121

2d21068d

[Clang FE] Recognize -mnop-mcount CL option (SystemZ only). · 93767143

Jonas Paulsson authored Nov 05, 2019

Recognize -mnop-mcount from the command line and add a function attribute
"mnop-mcount"="true" when passed.

When this option is used, a nop is added instead of a call to fentry. This
is used when building the Linux Kernel.

If this option is passed for any other target than SystemZ, an error is
generated.

Review: Ulrich Weigand
https://reviews.llvm.org/D67763

93767143

Fix PR40644: miscompile indexed FP constant store · 646896a4

Thomas Preud'homme authored Oct 03, 2019

Summary:
Functions replaceStoreOfFPConstant() and OptimizeFloatStore() both
replace a store of a float with a store of an integer unconditionally. However
this generates wrong code when the store that is replaced is an indexed
or truncating store. This commit solves this issue by adding an early
return in these functions when the store being considered is not a
normal store.

Bug was only observed on out of tree targets, hence the lack of testcase
in this commit.

Reviewers: efriedma

Subscribers: hiraditya, arphaman, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68420

646896a4

[ARM] Always enable UseAA in the arm backend · cf581d79

David Green authored Nov 05, 2019

This feature controls whether AA is used in the backend, and was
previously turned on for certain subtargets to help create less
constrained scheduling graphs. This patch turns it on for all
subtargets, so that they can all make use of the extra information to
produce better code.

Differential Revision: https://reviews.llvm.org/D69796

cf581d79

[Scheduling][ARM] Consistently enable PostRA Machine scheduling · 7d9af03f

David Green authored Nov 05, 2019

In the ARM backend, for historical reasons we have only some targets
using Machine Scheduling. The rest use the old list scheduler as they
are using itineraries and the list scheduler seems to produce better code
(and not crash running out of registers on v6m code). So whether to use
the MIScheduler or not is checked at runtime from the subtarget
features.

This is fine, except for post-ra scheduling. Whether to use the old
post-ra list scheduler or the post-ra machine schedule is decided as the
pass manager is set up, in ARM's case from a newly constructed subtarget.
Under some situations, like LTO, this won't include the correct CPU, so it
can pick the wrong option. This can have a surprising effect on
performance.

To fix that, this patch overrides targetSchedulesPostRAScheduling and
addPreSched2 in the ARM backend, adding _both_ post-ra schedulers and
picking at runtime which to execute. To pick between the two I've had to
add an enablePostRAMachineScheduler() method that normally returns
enableMachineScheduler() && enablePostRAScheduler(), which can be
overridden to enable just one of PostRAMachineScheduler vs
PostRAScheduler.
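
The default and the override can be sketched in Python as follows. This is a toy model, not the LLVM C++ code: the class names and the `pick_post_ra_scheduler` helper are hypothetical, and the order in which the two schedulers are checked is an assumption.

```python
class Subtarget:
    """Toy stand-in for a target subtarget; not the real LLVM class."""

    def enable_machine_scheduler(self):
        return False

    def enable_post_ra_scheduler(self):
        return False

    def enable_post_ra_machine_scheduler(self):
        # Default described in the commit message: both flags must be set.
        return self.enable_machine_scheduler() and self.enable_post_ra_scheduler()


class MachineSchedSubtarget(Subtarget):
    """Hypothetical subtarget that uses the MI scheduler everywhere."""

    def enable_machine_scheduler(self):
        return True

    def enable_post_ra_scheduler(self):
        return True


class ListSchedSubtarget(Subtarget):
    """Hypothetical subtarget that keeps the old post-RA list scheduler."""

    def enable_post_ra_scheduler(self):
        return True


def pick_post_ra_scheduler(subtarget):
    # Both passes are in the pipeline; each run decides from the subtarget.
    if subtarget.enable_post_ra_machine_scheduler():
        return "post-RA machine scheduler"
    if subtarget.enable_post_ra_scheduler():
        return "post-RA list scheduler"
    return None
```

The point of the sketch is that the decision is taken per subtarget when the pass runs, rather than once when the pass manager is set up.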

Thanks to David Penry for identifying this problem.

Differential Revision: https://reviews.llvm.org/D69775

7d9af03f

lldb/breakpad: add support for the "x86_64h" architecture · f71e35dc
Pavel Labath authored Nov 05, 2019

f71e35dc

Revert and patch "[Python] Remove readline module" · 9357b5d0

serge-sans-paille authored Nov 05, 2019

Fix https://bugs.llvm.org/show_bug.cgi?id=43830 while avoiding polluting the
global Python namespace.

This reverts r357277 to rebundle a version of Python's readline module
based on libedit.

However, this patch also provides three improvements over the previous
implementation:

1. use PyMem_RawMalloc instead of PyMem_Malloc, as expected by PyOS_Readline
   (prevents a segfault upon exit of an interactive session)
2. patch the readline module upon embedded interpreter loading, instead of
   patching it globally, which should prevent any side effect on other
   modules/packages
3. only activate the patched module if libedit is actually linked in lldb

Differential Revision: https://reviews.llvm.org/D69793

9357b5d0

[OpenCL] Group builtin functions by prototype · 0e56b0f9

Sven van Haastregt authored Nov 05, 2019

The TableGen-generated file containing the function definitions can be
reorganized to save some memory in the Clang binary. Functions having
the same prototype(s) will point to a shared list of prototype(s).

Patch by Pierre Gondois and Sven van Haastregt.

Differential Revision: https://reviews.llvm.org/D63557

0e56b0f9

[OpenCL] Add builtin function attribute handling · 9a8d477a

Sven van Haastregt authored Nov 05, 2019

Add handling for the "pure", "const" and "convergent" function
attributes for OpenCL builtin functions.

Patch by Pierre Gondois and Sven van Haastregt.

Differential Revision: https://reviews.llvm.org/D64319

9a8d477a

lldb/minidump: Add support for the alternate ARM64 constant · 4ecff91e
Pavel Labath authored Nov 05, 2019

4ecff91e

MemoryRegion: Print "don't know" permission values as such · 28cf9698

Pavel Labath authored Oct 16, 2019

Summary:
The permissions in a memory region have ternary states (yes, no, don't
know), but the memory region command only prints in binary, treating
"don't know" as "yes", which is particularly confusing as for instance
the unwinder will treat an unknown value as "no".

This patch makes it so that we distinguish all three states when
printing the values, using "?" to indicate the lack of information. It
is implemented via a special argument to the format provider for the
OptionalBool enumeration.
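
A minimal sketch of the three-state printing, written in Python rather than lldb's C++. Only the "?" for unknown values comes from the description above; the `r`/`w`/`x` letters, the `-` for "no", and the function name `format_permissions` are assumptions made for illustration.

```python
from enum import Enum


class OptionalBool(Enum):
    """Toy counterpart of the ternary permission state."""
    NO = 0
    YES = 1
    DONT_KNOW = 2


def format_permissions(readable, writable, executable):
    """Render three ternary flags, using '?' when the value is unknown."""

    def one(flag, letter):
        if flag is OptionalBool.YES:
            return letter
        if flag is OptionalBool.NO:
            return "-"
        return "?"

    return one(readable, "r") + one(writable, "w") + one(executable, "x")
```

For example, a region that is readable, not writable, and of unknown executability would print as `r-?` instead of being reported as executable.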

Reviewers: clayborg, jingham

Subscribers: lldb-commits

Differential Revision: https://reviews.llvm.org/D69106

28cf9698

[LoopUnroll] peel-loop-conditions.ll: add some 'is even/odd' peeling tests · 12c4a71c
Roman Lebedev authored Nov 05, 2019

12c4a71c

[InstCombine] dropRedundantMaskingOfLeftShiftInput(): truncation (PR42563) · ccf1a5f4

Roman Lebedev authored Nov 05, 2019

Summary:
That fold keeps growing and growing :(
I think this may be one of the last pieces for it.

Since D67677/D67725, the fold knows the general form
of the pattern - where some masking is needed:
https://rise4fun.com/Alive/F5R
https://rise4fun.com/Alive/gslRa

But there is one more huge piece missing - if you are extracting some bits,
it is not impossible that the origin is wider than the extraction,
i.e. there may be a truncation. And we don't deal with that yet.

But we can, and the generalization remains fully identical:
https://rise4fun.com/Alive/Uar
https://rise4fun.com/Alive/5SW

After a preparatory cleanup I think the diff looks rather clean.

One missing piece is that in some patterns (especially pat. b),
`-1` only needs to be `-1` in final type, but that is for later..

https://bugs.llvm.org/show_bug.cgi?id=42563

Reviewers: spatel, nikic

Reviewed By: spatel

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69125

ccf1a5f4

[RISCV] Add InstrInfo areMemAccessesTriviallyDisjoint hook · 0d47c7ab

Luís Marques authored Nov 05, 2019

Summary: Introduces the `InstrInfo::areMemAccessesTriviallyDisjoint`
hook. The test could check for instruction reorderings, but to avoid
being brittle it just checks instruction dependencies.
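
The commit message does not spell out the check itself. One common way to implement such a hook, shown here purely as an assumption, is to treat two accesses as trivially disjoint when they share a base register and their byte ranges do not overlap. The function and parameter names below are hypothetical.

```python
def trivially_disjoint(base_a, offset_a, width_a, base_b, offset_b, width_b):
    """Return True only when the two accesses provably do not overlap.

    Each access is described by a base register name, a byte offset from
    that base, and a width in bytes. Different bases give no cheap proof,
    so the answer is conservatively False.
    """
    if base_a != base_b:
        return False
    if offset_a <= offset_b:
        low_offset, low_width, high_offset = offset_a, width_a, offset_b
    else:
        low_offset, low_width, high_offset = offset_b, width_b, offset_a
    # Disjoint if the lower access ends at or before the higher one starts.
    return low_offset + low_width <= high_offset
```

Under this sketch, a 4-byte access at `sp+0` and a 4-byte access at `sp+4` are disjoint, while an 8-byte access at `sp+0` overlaps the one at `sp+4`.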

Reviewers: asb, lenary
Reviewed By: lenary
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D67046

0d47c7ab

DWARFDebugLoclists: Make it possible to read relocated addresses · b4c5b8f3

Pavel Labath authored Oct 31, 2019

Summary:
Handling relocations was not needed when the loclists section was a
DWO-only thing. But since DWARF5, it is possible to use it in regular
objects too, and the standard permits embedding addresses into the
section directly. These addresses need to be relocated in unlinked
files.

Reviewers: JDevlieghere, dblaikie, probinson

Subscribers: aprantl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68271

b4c5b8f3

[mips] Set __OCTEON__ macros · 0d14656b
Simon Atanasyan authored Nov 05, 2019

0d14656b
[mips] Fix `__mips_isa_rev` macro value for Octeon CPU · e578d0fd
Simon Atanasyan authored Nov 05, 2019

e578d0fd

Recommit "[HardwareLoops] Optimisation remarks" · 92164cf2

Sjoerd Meijer authored Nov 05, 2019

With a few things fixed:
- initialisation of the optimisation remark pass (this was causing the buildbot
  failures on PPC),
- a test case.

Differential Revision: https://reviews.llvm.org/D69660

92164cf2

[AArch64] Update test checks on merge-store-dependency.ll. NFC · edfb8eea
David Green authored Nov 05, 2019

edfb8eea
[lldb][NFC] Give some parameters in CommandInterpreter more descriptive names · db5074dc
Raphael Isemann authored Nov 04, 2019

db5074dc
[IR] Remove switch's default block that causes clang 8 to raise an error · 92ef101d
aqjune authored Nov 05, 2019

92ef101d

[X86] Lower the cost of avx512 horizontal bool and/or reductions to... · 103968d1

Craig Topper authored Nov 04, 2019

[X86] Lower the cost of avx512 horizontal bool and/or reductions to 2*log2(bitwidth)+1 for legal types.

This better represents the kshift+binop we'd get for each stage
before the final extract. It's likely we'll do even better by
doing a kmov and a cmp with a GPR, but this is a good start.

The default handling was costing a worst case single source
permute shuffle of the vector before the binop. This worst
case assumes the shuffle might have to be emulated with
extracts and inserts. But since we know we're doing a reduction
we can assume we'll get kshift lowering.

There's still some room for improvement here, but this is
much better than it was.
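
The formula can be worked through with a small Python helper. This is only arithmetic on the stated cost, not the actual cost-model code; `reduction_cost` is a hypothetical name, and `bit_width` is assumed to be the number of i1 elements in the mask and a power of two.

```python
def reduction_cost(bit_width):
    """Cost of a horizontal bool and/or reduction: 2*log2(bit_width)+1.

    Each of the log2(bit_width) stages costs one kshift plus one binop,
    and the final extract adds one more.
    """
    assert bit_width > 0 and bit_width & (bit_width - 1) == 0, "power of two expected"
    stages = bit_width.bit_length() - 1  # exact integer log2
    return 2 * stages + 1
```

For a 16-element mask this gives 4 stages and a cost of 2*4+1 = 9; for 8 elements it gives 7.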

103968d1

[IR] Add Freeze instruction · 58acbce3

aqjune authored Nov 05, 2019

Summary:
- Define Instruction::Freeze, let it be UnaryOperator
- Add support for freeze to LLLexer/LLParser/BitcodeReader/BitcodeWriter
  The format is `%x = freeze <ty> %v`
- Add support for freeze instruction to llvm-c interface.
- Add m_Freeze in PatternMatch.
- Erase freeze when lowering IR to SelDag.

Reviewers: deadalnix, hfinkel, efriedma, lebedev.ri, nlopes, jdoerfert, regehr, filcab, delcypher, whitequark

Reviewed By: lebedev.ri, jdoerfert

Subscribers: jfb, kristof.beyls, hiraditya, lebedev.ri, steven_wu, dexonsmith, xbolva00, delcypher, spatel, regehr, trentxintong, vsk, filcab, nlopes, mehdi_amini, deadalnix, llvm-commits

Differential Revision: https://reviews.llvm.org/D29011

58acbce3

[BPF] fix a use after free bug · 9f34447f

Yonghong Song authored Nov 04, 2019

Commit fff27212 ("[BPF] Fix CO-RE bugs with bitfields")
fixed CO-RE handling bitfield issues. But the implementation
introduced a use after free bug. The "Base" of the intrinsic
might be freed so later on accessing the Type of "Base"
might access the freed memory. The failed test case,
  CodeGen/BPF/CORE/offset-reloc-middle-chain.ll
is exactly used to test such a case.

Similar to the previous attempt to remember Metadata etc.,
remember "Base" pointee Alignment in advance to avoid
such use after free bug.

9f34447f

[X86] Teach X86MCInstLower to swap operands of commutable instructions to... · f65493a8

Craig Topper authored Nov 04, 2019

[X86] Teach X86MCInstLower to swap operands of commutable instructions to enable 2-byte VEX encoding.

Summary:
The two source operands of commutable instructions are encoded in the
VEX.VVVV field and the r/m field of the MODRM byte plus the VEX.B
field.

The VEX.B field is missing from the 2-byte VEX encoding. If the
VEX.VVVV source is 0-7 and the other register is 8-15 we can
swap them to avoid needing the VEX.B field. This works as long as
the VEX.W, VEX.mmmmm, and VEX.X fields are also not needed.
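
The swap condition can be written down as a small predicate. This Python sketch models registers as encoding numbers 0-15 and reduces the other VEX requirements to boolean flags; the function and parameter names are hypothetical and it is not the X86MCInstLower code.

```python
def should_swap_for_2byte_vex(vvvv_reg, rm_reg, needs_w=False,
                              needs_mmmmm=False, needs_x=False):
    """Return True if swapping the two sources enables the 2-byte VEX form.

    vvvv_reg is the register currently encoded in VEX.VVVV and rm_reg the
    one in the ModRM r/m field. A register numbered 8-15 in r/m needs
    VEX.B, which the 2-byte prefix lacks; VEX.VVVV can hold any of 0-15.
    """
    if needs_w or needs_mmmmm or needs_x:
        # These fields also exist only in the 3-byte prefix.
        return False
    return 0 <= vvvv_reg <= 7 and 8 <= rm_reg <= 15
```

When the r/m register is already in the 0-7 range no swap is needed, and when both registers are 8-15 a swap would not help, so the predicate returns False in both cases.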

Fixes PR36706.

Reviewers: RKSimon, spatel

Reviewed By: RKSimon

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68550

f65493a8

[analyzer] Require darwin for scan-build tests · abc04ff4
Devin Coughlin authored Nov 04, 2019
```
Let's at least get some coverage from these tests. We can generalize to
other platforms later.
```
abc04ff4