Commits · f3063baa6ecb88a54d0832f2971ffeec28db6312 · Lorenzo Albano / LLVM bpEVL

Sep 13, 2018

Renovate CMake files in the `llvm-(cfi-verify|exegesis|mca)` tools. · f3063baa
Richard Diamond authored Sep 13, 2018
```
llvm-svn: 342148
```
f3063baa

[InstCombine] reorder folds to reduce chance of infinite loops · 6f00fc33

Sanjay Patel authored Sep 13, 2018

I don't have a test case for this, but it's motivated by
the discussion in D51964, and I've added TODO comments for
the better fix - move simplifications into instsimplify
because that's more efficient and reduces risk of infinite
loops in instcombine caused by transforms trying to do the
opposite folds.

In this case, we know that the transform that tries to move
'not' through min/max can be fooled by the multiple uses
of a value in another min/max, so try to squash the 
foldSPFofSPF() patterns first.

llvm-svn: 342147

6f00fc33

[ARM] Allow truncs as sources in ARM CGP · aaec3c62

Sam Parker authored Sep 13, 2018

We previously only allowed truncs as sinks, but now allow them as
sources too. We do this by checking that the result type is the
narrow type that we're trying to optimise for.

Differential Revision: https://reviews.llvm.org/D51978

llvm-svn: 342141

aaec3c62

[ARM] Fix FixConst for ARMCodeGenPrepare · 96f77f14

Sam Parker authored Sep 13, 2018

Part of FixConsts wrongly assumes either a 8- or 16-bit constant
which can result in the wrong constants being generated during
promotion.

Differential Revision: https://reviews.llvm.org/D52032

llvm-svn: 342140

96f77f14

[MC/Dwarf] Unclamp DWARF linetables format on Darwin. · 64c901d2

Jonas Devlieghere authored Sep 13, 2018

In r319995, we fixed the line table format to version 2 on Darwin
because dsymutil didn't yet understand the new format which caused test
failures for the LLDB bots. This has been resolved in the meantime so
there's no reason to keep this limitation.

rdar://problem/35968332

llvm-svn: 342136

64c901d2

AMDGPU: Fix not preserving alignent in call setups · ff987ac6

Matt Arsenault authored Sep 13, 2018

If an argument was passed on the stack, this
was using the default alignment.

I'm not sure there's an observable change from this. This
was observable due to bugs in expansion of unaligned
loads and stores, but since that is fixed I don't think
this matters much.

llvm-svn: 342133

ff987ac6

DAG: Fix expansion of unaligned FP loads and stores · 842cda63

Matt Arsenault authored Sep 13, 2018

This was trying to scalarizing a scalar FP type,
resulting in an assert.

Fixes unaligned f64 stack stores for AMDGPU.

llvm-svn: 342132

842cda63

AMDGPU: Fix some outdated datalayouts in tests · 9de2fb58
Matt Arsenault authored Sep 13, 2018
```
llvm-svn: 342131
```
9de2fb58
Fix unused variable warning. NFCI. · 5b65e41a
Simon Pilgrim authored Sep 13, 2018
```
llvm-svn: 342128
```
5b65e41a

ARM: align loops to 4 bytes on Cortex-M3 and Cortex-M4. · c15d47bb

Tim Northover authored Sep 13, 2018

The Technical Reference Manuals for these two CPUs state that branching
to an unaligned 32-bit instruction incurs an extra pipeline reload
penalty. That's bad.

This also enables the optimization at -Os since it costs on average one
byte per loop in return for 1 cycle per iteration, which is pretty good
going.

llvm-svn: 342127

c15d47bb

[XRay] Bug fixes for FDR custom event and arg-logging · 90a46bde

Dean Michael Berris authored Sep 13, 2018

Summary:
This change has a number of fixes for FDR mode in compiler-rt along with
changes to the tooling handling the traces in llvm.

In the runtime, we do the following:

- Advance the "last record" pointer appropriately when writing the
  custom event data in the log.

- Add XRAY_NEVER_INSTRUMENT in the rewinding routine.

- When collecting the argument of functions appropriately marked, we
  should not attempt to rewind them (and reset the counts of functions
  that can be re-wound).

In the tooling, we do the following:

- Remove the state logic in BlockIndexer and instead rely on the
  presence/absence of records to indicate blocks.

- Move the verifier into a loop associated with each block.

Reviewers: mboerger, eizan

Subscribers: llvm-commits, hiraditya

Differential Revision: https://reviews.llvm.org/D51965

llvm-svn: 342122

90a46bde

[AMDGPU] Load divergence predicate refactoring · 4d302f69

Alexander Timofeev authored Sep 13, 2018

    Differential revision: https://reviews.llvm.org/D51931

    Reviewers: rampitec

llvm-svn: 342120

4d302f69

[mips] Enable the mnemonic spell corrector · c49da2e4

Simon Atanasyan authored Sep 13, 2018

This implements suggesting alternative mnemonics when an invalid one is
specified. For example `addru $9, $6, 17767` leads to the following
error message:

error: unknown instruction, did you mean: add, addiu, addu, maddu?

Differential revision: https://reviews.llvm.org/D40646

llvm-svn: 342119

c49da2e4

[llvm-exegesis][NFC] Remove dead parameter. · 7958735e
Clement Courbet authored Sep 13, 2018
```
llvm-svn: 342118
```
7958735e

[llvm-exegesis][NFC] Split BenchmarkRunner class · d939f6d0

Clement Courbet authored Sep 13, 2018

Summary:
The snippet-generation part goes to the SnippetGenerator class.

This will allow benchmarking arbitrary code (see PR38437).

Reviewers: gchatelet

Subscribers: mgorny, tschuett, llvm-commits

Differential Revision: https://reviews.llvm.org/D51979

llvm-svn: 342117

d939f6d0

[AMDGPU] Preliminary patch for divergence driven instruction selection.... · 2fb44808

Alexander Timofeev authored Sep 13, 2018

    [AMDGPU] Preliminary patch for divergence driven instruction selection. Load offset inlining pattern changed.

    Differential revision: https://reviews.llvm.org/D51975

    Reviewers: rampitec

llvm-svn: 342115

2fb44808

[X86] Type legalize v2i32 div/rem by scalarizing rather than promoting · f107123a

Craig Topper authored Sep 13, 2018

Summary:
Previously we type legalized v2i32 div/rem by promoting to v2i64. But we don't support div/rem of vectors so op legalization would then scalarize it using i64 scalar ops since it doesn't know about the original promotion. 64-bit scalar divides on Intel hardware are known to be slow and in 32-bit mode they require a libcall.

This patch switches type legalization to do the scalarizing itself using i32.

It looks like the division by power of 2 optimization is still kicking in and leaving the code as a vector. The division by other constant optimization doesn't kick in pre type legalization since it ignores illegal types. And previously, after type legalization we scalarized the v2i64 since we don't have v2i64 MULHS/MULHU support.

Another option might be to widen v2i32 to v4i32 so we could do division by constant optimizations, but we'd have to be careful to only do that for constant divisors or we risk scalaring to 4 scalar divides.

Reviewers: RKSimon, spatel

Reviewed By: spatel

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D51325

llvm-svn: 342114

f107123a

ARM: correct the relocation type for `bl` on WoA · aaa72c54

Saleem Abdulrasool authored Sep 13, 2018

The `IMAGE_REL_ARM_BRANCH20T` applies only to a `b.w` instruction.  A
thumb-2 `bl` should be relocated using a `IMAGE_REL_ARM_BRANCH24T`.
Correct the relocation that we emit in such a case.

Resolves PR38620!  Based on the patch by Jordan Rhee!

llvm-svn: 342109

aaa72c54

[NFC] Add Requires: asserts where needed · b2724d9a
Max Kazantsev authored Sep 13, 2018
```
llvm-svn: 342108
```
b2724d9a
[NFC] Use expensive asserts in relevant LICM tests · 0e0e19c9
Max Kazantsev authored Sep 13, 2018
```
llvm-svn: 342107
```
0e0e19c9
Remove isAsCheapAsAMove from v128.const · 65825cd7
Thomas Lively authored Sep 13, 2018
```
llvm-svn: 342106
```
65825cd7
Remove isAsCheapAsAMove from mem ops · 17ba6bec
Thomas Lively authored Sep 13, 2018
```
llvm-svn: 342105
```
17ba6bec

[WebAssembly] Add missing SIMD instruction attributes · 56b34f6c

Thomas Lively authored Sep 13, 2018

Summary:
These attributes are copied from equivalent instructions in
WebAssemblyInstrInfo.td.

Reviewers: aheejin, dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D51518

llvm-svn: 342104

56b34f6c

STLExtras: Add some more algorithm wrappers · 911907ca
David Blaikie authored Sep 13, 2018
```
llvm-svn: 342102
```
911907ca
DebugInfo/PDB: Remove unused member · eee709f0
David Blaikie authored Sep 13, 2018
```
llvm-svn: 342101
```
eee709f0
dwarfdump: Improve performance on large DWP files · da36f3f4
David Blaikie authored Sep 12, 2018
```
llvm-svn: 342099
```
da36f3f4
[DAGCombiner] improve formatting for select+setcc code; NFC · 8a478b79
Sanjay Patel authored Sep 12, 2018
```
llvm-svn: 342095
```
8a478b79
fix 80-column violation with clang-format · 9a454529
Adrian Prantl authored Sep 12, 2018
```
llvm-svn: 342094
```
9a454529

[PDB] Remove all clone() methods. · c43d5560

Zachary Turner authored Sep 12, 2018

These are dead code and encourage poor usage patterns, so I'm
removing them.  They weren't called anywhere anyway.

llvm-svn: 342093

c43d5560

[Hexagon] Use shuffles when lowering "gather" shufflevectors · a6d4fc0e

Krzysztof Parzyszek authored Sep 12, 2018

Shufflevector instructions in LLVM IR that extract a subset of elements
of a longer input into a shorter vector can be done using VECTOR_SHUFFLEs.
This will avoid expanding them into constly extracts and inserts.

llvm-svn: 342091

a6d4fc0e

[Hexagon] Improve the selection algorithm in scalarizeShuffle · f8537411
Krzysztof Parzyszek authored Sep 12, 2018
```
Use topological ordering for newly generated nodes.

llvm-svn: 342090
```
f8537411

[Support] sys::fs::directory_entry includes the file_type. · 3a55d1ef

Kristina Brooks authored Sep 12, 2018

This is available on most platforms (Linux/Mac/Win/BSD) with no extra syscalls.
On other platforms (e.g. Solaris) we stat() if this information is requested.

This will allow switching clang's VFS to efficiently expose (path, type) when
traversing a directory. Currently it exposes an entire Status, but does so by
calling fs::status() on all platforms.
Almost all callers only need the path, and all callers only need (path, type).

Patch by sammccall (Sam McCall)

Differential Revision: https://reviews.llvm.org/D51918

llvm-svn: 342089

3a55d1ef

Sep 12, 2018

[llvm-cov] Delete custom JSON serialization code (NFC) · 2963c490

Vedant Kumar authored Sep 12, 2018

Teach llvm-cov to use the new llvm JSON library, and remove some
redundant/brittle JSON serialization tests.

llvm-svn: 342088

2963c490

[ORC] Merge ExecutionSessionBase with ExecutionSession by moving a couple of · 8be0d2e3

Lang Hames authored Sep 12, 2018

template methods in JITDylib out-of-line.

This also splits JITDylib::define into a pair of template methods, one taking an
lvalue reference and the other an rvalue reference. This simplifies the
templates at the cost of a small amount of code duplication.

llvm-svn: 342087

8be0d2e3

[ORC] Add a special 'main' JITDylib that is created on ExecutionSession · 13014d3c

Lang Hames authored Sep 12, 2018

construction, a new convenience lookup method, and add-to layer methods.

ExecutionSession now creates a special 'main' JITDylib upon construction. All
subsequently created JITDylibs are added to the main JITDylib's search order by
default (controlled by the AddToMainDylibSearchOrder parameter to
ExecutionSession::createDylib). The main JITDylib's search order will be used in
the future to properly handle cross-JITDylib weak symbols, with the first
definition in this search order selected.

This commit also adds a new ExecutionSession::lookup convenience method that
performs a blocking lookup using the main JITDylib's search order, as this will
be a very common operation for clients.

Finally, new convenience overloads of IRLayer and ObjectLayer's add methods are
introduced that add the given program representations to the main dylib, which
is likely to be the common case.

llvm-svn: 342086

13014d3c

[WebAssembly] Make tied inline asm operands work again · 300f42fb

Heejin Ahn authored Sep 12, 2018

Summary:
rL341389 broke code with tied register operands in inline assembly. For
example, `asm("" : "=r"(var) : "0"(var));`
The code above specifies the input operand to be in the same register
with the output operand, tying the two register. This patch makes this
kind of code work again.

Reviewers: dschuff

Subscribers: sbc100, jgravelle-google, eraman, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D51991

llvm-svn: 342084

300f42fb

revert r341288 - [Reassociate] swap binop operands to increase factoring potential · d341988c
Sanjay Patel authored Sep 12, 2018
```
This causes or exposes indeterminism that is visible in the output of -reassociate.

llvm-svn: 342083
```
d341988c
[InstCombine] add tests for unsigned add overflow; NFC · 31017cd1
Sanjay Patel authored Sep 12, 2018
```
llvm-svn: 342082
```
31017cd1

Guard FMF context by excluding some FP operators from FPMathOperator · 22a53cbc

Michael Berg authored Sep 12, 2018

Summary:
Some FPMathOperators succeed and the retrieve FMF context when they never have it, we should omit these cases to keep from removing FMF context.

For instance when we visit some FPMathOperator mapped Instructions which never have FMF flags and a Node was associated which does have FMF flags, that Node today will have all its flags cleared via the intersect operation. With this change, we exclude associating Nodes that never have FPMathOperator status under FMF.

Reviewers: spatel, wristow, arsenm, hfinkel, aemerson

Reviewed By: spatel

Subscribers: llvm-commits, wdng

Differential Revision: https://reviews.llvm.org/D51145

llvm-svn: 342081

22a53cbc

[PDB] Emit old fpo data to the PDB file. · a1f85f8b

Zachary Turner authored Sep 12, 2018

r342003 added support for emitting FPO data from the
DEBUG_S_FRAMEDATA subsection of the .debug$S section to the PDB
file.  However, that is not the end of the story.  FPO can end
up in two different destinations in a PDB, each corresponding to
a different FPO data source.

The case handled by r342003 involves copying data from the
DEBUG_S_FRAMEDATA subsection of the .debug$S section to the
"New FPO" stream in the PDB, which is then referred to by the
DBI stream.  The case handled by this patch involves copying
records from the .debug$F section of an object file to the "FPO"
stream (or perhaps more aptly, the "Old FPO" stream) in the PDB
file, which is also referred to by the DBI stream.

The formats are largely similar, and the difference is mostly
only visible in masm generated object files, such as some of the
low-level CRT object files like memcpy.  MASM doesn't appear to
support writing the DEBUG_S_FRAMEDATA subsection, and instead
just writes these records to the .debug$F section.

Although clang-cl does not emit a .debug$F section ever, lld still
needs to support it so we have good debugging for CRT functions.

Differential Revision: https://reviews.llvm.org/D51958

llvm-svn: 342080

a1f85f8b