Commits · 65b4ab9921364d27eb0c4704d8259b22b0cdc53e · Lorenzo Albano / LLVM bpEVL

Feb 22, 2019

BreakCriticalEdges: Update PostDominatorTree · 65b4ab99
Matt Arsenault authored Feb 22, 2019
```
llvm-svn: 354673
```
65b4ab99

[mips][micromips] fix filling delay slots for PseudoIndirectBranch_MM · 6083106b

Petar Jovanovic authored Feb 22, 2019

Filling a delay slot in 32bit jump instructions with a 16bit instruction
can cause issues. According to the documentation such an operation is
unpredictable.
This patch adds opcode Mips::PseudoIndirectBranch_MM alongside
Mips::PseudoIndirectBranch and other instructions that are expanded to jr
instruction and do not allow a 16bit instruction in their delay slots.

Patch by Mirko Brkusanin.

Differential Revision: https://reviews.llvm.org/D58507

llvm-svn: 354672

6083106b

[LowerSwitch][AMDGPU] Do not handle impossible values · 99a6672b

Roman Tereshin authored Feb 22, 2019

This patch adds LazyValueInfo to LowerSwitch to compute the range of the
value being switched over and reduce the size of the tree LowerSwitch
builds to lower a switch.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D58096

llvm-svn: 354670

99a6672b

[DTU] Refine the interface and logic of applyUpdates · 70e97163

Chijun Sima authored Feb 22, 2019

Summary:
This patch separates two semantics of `applyUpdates`:
1. User provides an accurate CFG diff and the dominator tree is updated according to the difference of `the number of edge insertions` and `the number of edge deletions` to infer the status of an edge before and after the update.
2. User provides a sequence of hints. Updates mentioned in this sequence might never happened and even duplicated.

Logic changes:

Previously, removing invalid updates is considered a side-effect of deduplication and is not guaranteed to be reliable. To handle the second semantic, `applyUpdates` does validity checking before deduplication, which can cause updates that have already been applied to be submitted again. Then, different calls to `applyUpdates` might cause unintended consequences, for example,
```
DTU(Lazy) and Edge A->B exists.
1. DTU.applyUpdates({{Delete, A, B}, {Insert, A, B}}) // User expects these 2 updates result in a no-op, but {Insert, A, B} is queued
2. Remove A->B
3. DTU.applyUpdates({{Delete, A, B}}) // DTU cancels this update with {Insert, A, B} mentioned above together (Unintended)
```
But by restricting the precondition that updates of an edge need to be strictly ordered as how CFG changes were made, we can infer the initial status of this edge to resolve this issue.

Interface changes:
The second semantic of `applyUpdates`  is separated to `applyUpdatesPermissive`.
These changes enable DTU(Lazy) to use the first semantic if needed, which is quite useful in `transforms/utils`.

Reviewers: kuhar, brzycki, dmgreen, grosser

Reviewed By: brzycki

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58170

llvm-svn: 354669

70e97163

[ARM] Add some missing thumb1 opcodes to enable peephole optimisation of CMPs · acb628b2

David Green authored Feb 22, 2019

This adds a number of missing Thumb1 opcodes so that the peephole optimiser can
remove redundant CMP instructions.

Reapplying this after the first attempt broke non-thumb1 code as the t2ADDri
instruction can be used with frame indices. In thumb1 we use tADDframe.

Differential Revision: https://reviews.llvm.org/D57833

llvm-svn: 354667

acb628b2

[ARM GlobalISel] Support floating point for Thumb2 · 35e1c666

Diana Picus authored Feb 22, 2019

This is exactly the same as arm mode, so for the instruction selector
tests we just extract them to a new file and run with the same checks
for both arm and thumb mode.

For the legalizer we need to update the tests for soft float a bit, but
only because BL and tBL are slightly different. We could be pedantic and
check that we get a well-formed BL for arm mode and a tBL for thumb, but
for the purposes of the legalizer test it's sufficient to just skip over
the predicate operands in the checks. Also note that we have the
pedantic checks in the divmod test, so we're covered.

llvm-svn: 354665

35e1c666

Fix BB after r354661 · d22686b6
George Rimar authored Feb 22, 2019
```
Update 2 test cases after obj2yaml fix in r354661.

llvm-svn: 354663
```
d22686b6

[obj2yaml] - Do not miss section index for special symbols. · 11358dd6

George Rimar authored Feb 22, 2019

This fixes https://bugs.llvm.org/show_bug.cgi?id=40786 
("obj2yaml symbol output missing section index for SHN_ABS and SHN_COMMON symbols")

Since SHN_ABS and SHN_COMMON symbols are special, we should preserve
the st_shndx for them. The patch does this for them and the other special symbols.

The test case is based on the test provided by James Henderson at the bug page!

Differential revision: https://reviews.llvm.org/D58498

llvm-svn: 354661

11358dd6

[MemorySSA] Update test with minimized one. NFCI · 15110078
Alina Sbirlea authored Feb 22, 2019
```
llvm-svn: 354658
```
15110078

[WebAssembly] Remove getBottom function from CFGStackify (NFC) · 85631d8b

Heejin Ahn authored Feb 22, 2019

Summary:
This removes `getBottom` function and the bookeeping map of <begin
marker instruction, bottom BB>.

Reviewers: dschuff

Subscribers: sunfish, sbc100, jgravelle-google, jdoerfert, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58319

llvm-svn: 354657

85631d8b

[MemorySSA & LoopPassManager] Resolve PR40038. · 90d2e3a1

Alina Sbirlea authored Feb 22, 2019

The correct edge being deleted is not to the unswitched exit block, but to the
original block before it was split. That's the key in the map, not the
value.
The insert is correct. The new edge is to the .split block.

The splitting turns OriginalBB into:
OriginalBB -> OriginalBB.split.
Assuming the orignal CFG edge: ParentBB->OriginalBB, we must now delete
ParentBB->OriginalBB, not ParentBB->OriginalBB.split.

llvm-svn: 354656

90d2e3a1

[LegalizeVectorOps] Improve the placement of ANDs in the ExpandLoad path for non-byte-sized loads. · fa6187d2

Craig Topper authored Feb 22, 2019

When we need to merge two adjacent loads the AND mask for the low piece was still sized for the full src element size. But we didn't have that many bits. The upper bits are already zero due to the SRL. So we can skip the AND if we're going to combine with the high bits.

We do need an AND to clear out any bits from the high part. We were anding the high part before combining with the low part, but it looks like ANDing after the OR gets better results.

So we can just emit the final AND after the optional concatentation is done. That will handling skipping before the OR and get rid of extra high bits after the OR.

llvm-svn: 354655

fa6187d2

[LegalizeVectorOps] Simplify the non-byte sized load handling VectorLegalizer::ExpandLoad. NFCI · 069cf05e
Craig Topper authored Feb 22, 2019
```
Remove an if that should always be true. Merge the body of another into the only block that could make the if true.

llvm-svn: 354654
```
069cf05e

[X86] Add test cases to cover the path in VectorLegalizer::ExpandLoad for... · 0ca023b3

Craig Topper authored Feb 22, 2019

[X86] Add test cases to cover the path in VectorLegalizer::ExpandLoad for non-byte sized loads where bits from two loads need to be concatenated.

If the scalar type doesn't divide evenly into the WideVT then the code will need to take some bits from adjacent scalar loads and combine them.

But most of our testing is for i1 element type which always divides evenly.

llvm-svn: 354653

0ca023b3

[DTU] Deprecate insertEdge*/deleteEdge* · f131d611

Chijun Sima authored Feb 22, 2019

Summary: This patch converts all existing `insertEdge*/deleteEdge*` to `applyUpdates` and marks `insertEdge*/deleteEdge*` as deprecated.

Reviewers: kuhar, brzycki

Reviewed By: kuhar, brzycki

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58443

llvm-svn: 354652

f131d611

Fix a think-o in the disable-kaleidoscope-tests-on-windows predicate of r354646. · de9b30db
Lang Hames authored Feb 22, 2019
```
llvm-svn: 354650
```
de9b30db
DAG: Add helper for creating shifts with correct type · 0280a5e1
Matt Arsenault authored Feb 22, 2019
```
llvm-svn: 354649
```
0280a5e1

[X86] Add a DAG combine for (aext_vector_inreg (aext_vector_inreg X)) ->... · 3a391fc0

Craig Topper authored Feb 22, 2019

[X86] Add a DAG combine for (aext_vector_inreg (aext_vector_inreg X)) -> (aext_vector_inreg X) to fix a regression from my previous commit.

Type legalization is causing two nodes to be created here, but we can use a single node to extend from v8i16 to v2i64.

llvm-svn: 354648

3a391fc0

[LegalizeTypes] When promoting the result of EXTRACT_SUBVECTOR, also check if... · be22f329

Craig Topper authored Feb 22, 2019

[LegalizeTypes] When promoting the result of EXTRACT_SUBVECTOR, also check if the input needs to be promoted. Use that to determine the element type to extract.

Otherwise we end up creating extract_vector_elts that then each need to have their input promoted. This can lead to truncates needing to be emitted for each of those.

But we already emitted any_extends when we legalized the extract_subvector. So now we have pairs of any_extend+trunc that partially cancel. But depending on how DAGCombiner visits them we can get weird results.

By promoting the input at the same time we can create only a single any_extend or truncate.

There's one regression in the vector-narrow-binop.ll case, but that looks easy to fix with a follow up patch.

llvm-svn: 354647

be22f329

Add 'Windows' to the disabled platforms list for the Kaleidoscope tests. · 4a7db8cb
Lang Hames authored Feb 22, 2019
```
Expands on the check from r354645.

llvm-svn: 354646
```
4a7db8cb

Disable Kaleidoscope tests on Win32 -- looks like they're still failing there. · f4b86cb2

Lang Hames authored Feb 22, 2019

The Kaleidoscope tests were re-enabled in r354630, but are still failing on
Windows. This patch disables them on that platform until the failure can be
investigated.

llvm-svn: 354645

f4b86cb2

[X86] Fix some copy/paste mistakes that caused a VR128 to be used as the... · 427404c7

Craig Topper authored Feb 22, 2019

[X86] Fix some copy/paste mistakes that caused a VR128 to be used as the address of a load in an isel pattern

This was introduced in r354511.

Fixes PR40811.

llvm-svn: 354640

427404c7

[gn] Add LLVM_BUILD_EXAMPLES. · 2703ced6
Evgeniy Stepanov authored Feb 21, 2019
```
llvm-svn: 354636
```
2703ced6
AMDGPU: Remove debugger related subtarget features · aa6fb4c4
Matt Arsenault authored Feb 21, 2019
```
As far as I know these aren't needed anymore.

llvm-svn: 354634
```
aa6fb4c4

Feb 21, 2019

[Kaleidoscope] Re-enable Kaleidoscope tests. · ecfac6cd

Lang Hames authored Feb 21, 2019

These were disabled in r246267 (back in 2015). I suspect that the Win32 issues
that caused them to be disabled at the time have been resovlved, but if not
we can disable them again while we sort those out.

llvm-svn: 354630

ecfac6cd

Revert "[asan] Fix vfork handling.", +1 · 45eea20d
Evgeniy Stepanov authored Feb 21, 2019
```
Revert r354625, r354627 - multiple build failures.

llvm-svn: 354629
```
45eea20d

[X86] Remove hasSideEffects=1 from the X87 pseudos with folded load. · 2b34fdc6

Craig Topper authored Feb 21, 2019

This was done in r321424 to prevent scheduling from reordering things. But now that we model FPCW as a dependency, I don't think the same scheduling we were trying to prevent can occur.

llvm-svn: 354628

2b34fdc6

[hwasan,asan] Intercept vfork. · df913868

Evgeniy Stepanov authored Feb 21, 2019

Summary: AArch64 only for now.

Reviewers: vitalybuka, pcc

Subscribers: srhines, kubamracek, mgorny, javed.absar, krytarowski, kristof.beyls, jdoerfert, #sanitizers, llvm-commits, kcc

Tags: #sanitizers, #llvm

Differential Revision: https://reviews.llvm.org/D58313

llvm-svn: 354625

df913868

[MemorySSA & LoopPassManager] Update MemorySSA in formDedicatedExitBlocks. · 97468e92
Alina Sbirlea authored Feb 21, 2019
```
MemorySSA is now updated when forming dedicated exit blocks.
Resolves PR40037.

llvm-svn: 354623
```
97468e92
AMDGPU/NFC: Cleanup subtarget predicates · c2650178
Konstantin Zhuravlyov authored Feb 21, 2019
```
Differential Revision: https://reviews.llvm.org/D58522

llvm-svn: 354620
```
c2650178

[x86] vectorize more cast ops in lowering to avoid register file transfers · 234a5e8e

Sanjay Patel authored Feb 21, 2019

This is a follow-up to D56864.

If we're extracting from a non-zero index before casting to FP,
then shuffle the vector and optionally narrow the vector before doing the cast:

cast (extelt V, C) --> extelt (cast (extract_subv (shuffle V, [C...]))), 0

This might be enough to close PR39974:
https://bugs.llvm.org/show_bug.cgi?id=39974

Differential Revision: https://reviews.llvm.org/D58197

llvm-svn: 354619

234a5e8e

[gn] Add target flags to asmflags. · e497f3e4

Evgeniy Stepanov authored Feb 21, 2019

Reviewers: pcc, thakis

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58483

llvm-svn: 354618

e497f3e4

Re-land "[AArch64][GlobalISel] Implement partial support for G_SHUFFLE_VECTOR"" · 1abe05c0
Amara Emerson authored Feb 21, 2019
```
Thanks to Richard Trieu for pointing out that the failures were due to a
use-after-free of an ArrayRef.

llvm-svn: 354616
```
1abe05c0
[llvm] Fix typo: 's/ ot / to /' [NFC] · 096fae32
Mandeep Singh Grang authored Feb 21, 2019
```
llvm-svn: 354614
```
096fae32

[LoopSimplifyCFG] Update MemorySSA after r353911. · d2d32443

Alina Sbirlea authored Feb 21, 2019

Summary:
MemorySSA is not properly updated in LoopSimplifyCFG after recent changes. Use SplitBlock utility to resolve that and clear all updates once handleDeadExits is finished.
All updates that follow are removal of edges which are safe to handle via the removeEdge() API.
Also, deleting dead blocks is done correctly as is, i.e. delete from MemorySSA before updating the CFG and DT.

Reviewers: mkazantsev, rtereshin

Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58524

llvm-svn: 354613

d2d32443

[EarlyCSE] Cleanup deadcode. [NFCI] · 73446cd5

Alina Sbirlea authored Feb 21, 2019

Summary: Cleanup nop assignments.

Reviewers: george.burgess.iv, davide

Subscribers: sanjoy, jlebar, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D58308

llvm-svn: 354612

73446cd5

[Hexagon] Use misaligned load instead of trap0(#0) for __builtin_trap · f6e875ba

Krzysztof Parzyszek authored Feb 21, 2019

The trap instruction is intercepted by various runtime environments,
and instead of a crash it creates confusion.

This reapplies r354606 with a fix.

llvm-svn: 354611

f6e875ba

Revert r354606, it breaks asan tests · 948c9f93
Krzysztof Parzyszek authored Feb 21, 2019
```
llvm-svn: 354609
```
948c9f93

[Hexagon] Use misaligned load instead of trap0(#0) for __builtin_trap · 5f47fac3

Krzysztof Parzyszek authored Feb 21, 2019

The trap instruction is intercepted by various runtime environments,
and instead of a crash it creates confusion.

llvm-svn: 354606

5f47fac3

[AMDGPU] remove unused AssemblerPredicates · 599ce44d

Mark Searles authored Feb 21, 2019

An internal build is hitting asserts complaining about too many subtarget
features:
  llvm/utils/TableGen/Types.cpp:42:
    const char* llvm::getMinimalTypeForEnumBitfield(uint64_t):
    Assertion `MaxIndex <= 64 && "Too many bits"' failed.

  llvm/utils/TableGen/AsmMatcherEmitter.cpp:1476:
    void {anonymous}::AsmMatcherInfo::buildInfo():
    Assertion `SubtargetFeatures.size() <= 64 && "Too many subtarget features!"'
    failed.

The short-term solution is to remove a few unused AssemblerPredicates to get
under the limit.

The long-term solution seems to be to revisit these asserts. E.g., rather than
hardcoded '64', use the standard sized std::bitset like the other places that
track subtarget features.

Differential Revision: https://reviews.llvm.org/D58516

llvm-svn: 354604

599ce44d