Commits · 6a94134b1167d1245058ce65b3c8c33956d37ec3 · Lorenzo Albano / LLVM bpEVL

Jun 26, 2018

[ORC] Add LLJIT and LLLazyJIT, and replace OrcLazyJIT in LLI with LLLazyJIT. · 6a94134b

Lang Hames authored Jun 26, 2018

LLJIT is a prefabricated ORC based JIT class that is meant to be the go-to
replacement for MCJIT. Unlike OrcMCJITReplacement (which will continue to be
supported) it is not API or bug-for-bug compatible, but targets the same
use cases: Simple, non-lazy compilation and execution of LLVM IR.

LLLazyJIT extends LLJIT with support for function-at-a-time lazy compilation,
similar to what was provided by LLVM's original (now long deprecated) JIT APIs.

This commit also contains some simple utility classes (CtorDtorRunner2,
LocalCXXRuntimeOverrides2, JITTargetMachineBuilder) to support LLJIT and
LLLazyJIT.

Both of these classes are works in progress. Feedback from JIT clients is very
welcome!

llvm-svn: 335670

6a94134b

AMDGPU: Silence unused warnings in waitcnt insertion pass in release build · 77747770
Konstantin Zhuravlyov authored Jun 26, 2018
```
Differential Revision: https://reviews.llvm.org/D48607

llvm-svn: 335669
```
77747770

[X86][AsmParser] Recommit r335658 · 67599c2e

Jessica Paquette authored Jun 26, 2018

Recommit of r335658 so that it does not change the behaviour of any
existing error output.

llvm-svn: 335668

67599c2e

Rename skipDebugInfo -> skipDebugIntrinsics, NFC · 1cb63dc2

Vedant Kumar authored Jun 26, 2018

This addresses post-commit feedback about the name 'skipDebugInfo' being
misleading. This name could be interpreted as meaning 'a function that
skips instructions with debug locations'.

The new name, 'skipDebugIntrinsics', makes it clear that this function
only skips debug info intrinsics.

Thanks to Adrian Prantl for pointing this out!

llvm-svn: 335667

1cb63dc2

[ORC] Allow IRTransformLayer2's transform to be modified after initialization. · afc2758f
Lang Hames authored Jun 26, 2018
```
Also give the constructor's transform parameter a default no-op transform value.

llvm-svn: 335665
```
afc2758f

[ORC] Reset AsynchronousSymbolQuery's NotifySymbolsResolved callback on error. · 2795a0a0

Lang Hames authored Jun 26, 2018

AsynchronousSymbolQuery::canStillFail checks the value of the callback to
prevent sending it redundant error notifications, so we need to reset it after
running it.

llvm-svn: 335664

2795a0a0

[ORC] Move the VSOList typedef out of VSO. · 831c5758
Lang Hames authored Jun 26, 2018
```
llvm-svn: 335663
```
831c5758
[ORC] Add a FIXME. · 9725cf85
Lang Hames authored Jun 26, 2018
```
llvm-svn: 335662
```
9725cf85
[ORC] Fix a FIXME by moving MangleAndInterner to Core.h. · ec8f5c8e
Lang Hames authored Jun 26, 2018
```
llvm-svn: 335661
```
ec8f5c8e

Revert "[X86][AsmParser] Emit an error when RIP-relative instructions are used in 32-bit mode" · 0a80af07

Jessica Paquette authored Jun 26, 2018

This reverts commit 4850a9aae8b38c7deadc103d634ec7397e6c323b.

It caused MC/X86/x86_errors.s to fail. Will fix and recommit shortly.

llvm-svn: 335660

0a80af07

[X86][AsmParser] Emit an error when RIP-relative instructions are used in 32-bit mode · 0e40d4bf

Jessica Paquette authored Jun 26, 2018

Right now, when we use RIP-relative instructions in 32-bit mode, we'll just
assert and crash.

This adds an error message which tells the user that they can't do that in
32-bit mode, so that we don't crash (and also can see the issue outside of
assert builds).

llvm-svn: 335658

0e40d4bf

[AMDGPU] Add llvm.amdgcn.fmad.ftz intrinsic · dacda79e

Stanislav Mekhanoshin authored Jun 26, 2018

This intrinsic selects v_mad_f32 regardless of fp32 denorm support.

Differential Revision: https://reviews.llvm.org/D48573

llvm-svn: 335654

dacda79e

[DAGCombiner] use isBitwiseNot to simplify code; NFC · fb9c440b
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335652
```
fb9c440b

AMDGPU: Add pass to lower kernel arguments to loads · 8c4a3523

Matt Arsenault authored Jun 26, 2018

This replaces most argument uses with loads, but for
now not all.

The code in SelectionDAG for calling convention lowering
is actively harmful for amdgpu_kernel. It attempts to
split the argument types into register legal types, which
results in low quality code for arbitary types. Since
all kernel arguments are passed in memory, we just want the
raw types.

I've tried a couple of methods of mitigating this in SelectionDAG,
but it's easier to just bypass this problem alltogether. It's
possible to hack around the problem in the initial lowering,
but the real problem is the DAG then expects to be able to use
CopyToReg/CopyFromReg for uses of the arguments outside the block.

Exposing the argument loads in the IR also has the advantage
that the LoadStoreVectorizer can merge them.

I'm not sure the best approach to dealing with the IR
argument list is. The patch as-is just leaves the IR arguments
in place, so all the existing code will still compute the same
kernarg size and pointlessly lowers the arguments.

Arguably the frontend should emit kernels with an empty argument
list in the first place. Alternatively a dummy array could be
inserted as a single argument just to reserve space.

This does have some disadvantages. Local pointer kernel arguments can
no longer have AssertZext placed  on them as the equivalent !range
metadata is not valid on pointer  typed loads. This is mostly bad
for SI which needs to know about the known bits in order to use the
DS instruction offset, so in this case this is not done.

More importantly, this skips noalias arguments since this pass
does not yet convert this to the equivalent !alias.scope and !noalias
metadata. Producing this metadata correctly seems to be tricky,
although this logically is the same as inlining into a function which
doesn't exist. Additionally, exposing these loads to the vectorizer
may result in degraded aliasing information if a pointer load is
merged with another argument load.

I'm also not entirely sure this is preserving the current clover
ABI, although I would greatly prefer if it would stop widening
arguments and match the HSA ABI. As-is I think it is extending
< 4-byte arguments to 4-bytes but doesn't align them to 4-bytes.

llvm-svn: 335650

8c4a3523

ConstantFold: Don't fold global address vs. null for addrspace != 0 · 7e991d30

Matt Arsenault authored Jun 26, 2018

Not sure why this logic seems to be repeated in 2 different places,
one called by the other.

On AMDGPU addrspace(3) globals start allocating at 0, so these
checks will be incorrect (not that real code actually tries
to compare these addresses)

llvm-svn: 335649

7e991d30

Use a variable to appease a no-asserts bot, NFC · 78ff0f1b

Vedant Kumar authored Jun 26, 2018

Failure URL:
http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/22836

llvm-svn: 335648

78ff0f1b

[Debugify] Don't treat missing dbg.values as an error (PR37942) · 2e6c5f96

Vedant Kumar authored Jun 26, 2018

When checking the debug info in a module, don't treat a missing
dbg.value as an error. The dbg.value may simply have been DCE'd, in
which case the debugger has enough information to display the variable
as <optimized out>.

llvm-svn: 335647

2e6c5f96

[ConstantRange] Add support of mul in makeGuaranteedNoWrapRegion. · b32823cb

Tim Shen authored Jun 26, 2018

Summary: This is trying to add support for r334428.

Reviewers: sanjoy

Subscribers: jlebar, hiraditya, bixia, llvm-commits

Differential Revision: https://reviews.llvm.org/D48399

llvm-svn: 335646

b32823cb

LoopUnroll: Allow analyzing intrinsic call costs · 2c1a570a

Matt Arsenault authored Jun 26, 2018

I'm not sure why the code here is skipping calls since
TTI does try to do something for general calls, but it
at least should allow intrinsics.

Skip intrinsics that should not be omitted as calls, which
is by far the most common case on AMDGPU.

llvm-svn: 335645

2c1a570a

[Local] Add a convenient insertReplacementDbgValues overload, NFC · c85ca4cd

Vedant Kumar authored Jun 26, 2018

Add an overload for the common case where the replacement dbg.values
have the same DIExpressions as the originals.

llvm-svn: 335643

c85ca4cd

[Local] Sink salvageDI's early exit into helper functions, NFC · de46f65b

Vedant Kumar authored Jun 26, 2018

salvageDebugInfo() performs a check that allows it to exit early without
doing a DenseMap lookup. It's a bit neater and marginally more useful to
sink this early exit into the findDbg{Addr,Users,Values} helpers.

llvm-svn: 335642

de46f65b

[Hexagon] Add a "generic" cpu · b7169c43

Brendon Cahoon authored Jun 26, 2018

Add the generic processor for Hexagon so that it can be used
with 3rd party programs that create a back-end with the
"generic" CPU. This patch also enables the JIT for Hexagon.

Differential Revision: https://reviews.llvm.org/D48571

llvm-svn: 335641

b7169c43

[DAGCombiner] Don't accept -1 sdiv divisors in sdiv-by-pow2 vector expansion (PR37119) · 7f55af37
Simon Pilgrim authored Jun 26, 2018
```
Temporary fix until I've managed to get D45806 updated - both +1 and -1 special cases need to be properly supported.

llvm-svn: 335637
```
7f55af37
Move `REQUIRES:` line to the top · ee15d3dc
Fangrui Song authored Jun 26, 2018
```
llvm-svn: 335635
```
ee15d3dc
[InstSimplify] fold shifts by sext bool · ad0bfb84
Sanjay Patel authored Jun 26, 2018
```
https://rise4fun.com/Alive/c3Y

llvm-svn: 335633
```
ad0bfb84
[InstSimplify] add tests for shifts by sext bool; NFC · 3d1e4d6f
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335631
```
3d1e4d6f
[X86][SSE] Add another sdiv by (nonuniform) minus one test (PR37119) · 1576df53
Simon Pilgrim authored Jun 26, 2018
```
Include a test that divides by -1 but not by 1 (another special case)

llvm-svn: 335629
```
1576df53
[InstCombine] simplify code for urem fold; NFCI · 9adea01c
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335623
```
9adea01c

[InstCombine] fold urem with sext bool divisor · 3575f0c0

Sanjay Patel authored Jun 26, 2018

Similar to other patches in this series:
https://reviews.llvm.org/rL335512
https://reviews.llvm.org/rL335527
https://reviews.llvm.org/rL335597
https://reviews.llvm.org/rL335616

...this is filling a gap in analysis that is exposed by an unrelated select-of-constants transform.
I didn't see a way to unify the sext cases because each div/rem opcode results in a different fold.

Note that in this case, the backend might want to convert the select into math:
Name: sext urem
%e = sext i1 %x to i32
%r = urem i32 %y, %e
=>
%c = icmp eq i32 %y, -1
%z = zext i1 %c to i32
%r = add i32 %z, %y

llvm-svn: 335622

3575f0c0

[SLPVectorizer] Recognise non uniform power of 2 constants · bbfc18b5

Simon Pilgrim authored Jun 26, 2018

Since D46637 we are better at handling uniform/non-uniform constant Pow2 detection; this patch tweaks the SLP argument handling to support them.

As SLP works with arrays of values I don't think we can easily use the pattern match helpers here.

Differential Revision: https://reviews.llvm.org/D48214

llvm-svn: 335621

bbfc18b5

[InstCombine] add tests for urem with sext bool divisor; NFC · 0f44759b
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335619
```
0f44759b
[DAGCombiner] Pull out VT bitwidth in visitSDIV. NFCI. · 133b1cdf
Simon Pilgrim authored Jun 26, 2018
```
llvm-svn: 335617
```
133b1cdf
[InstSimplify] fold srem with sext bool divisor · 2b7e3109
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335616
```
2b7e3109
Fix doc title underlining. · c307b003
James Henderson authored Jun 26, 2018
```
llvm-svn: 335615
```
c307b003

[FileCheck] Add CHECK-EMPTY directive for checking for blank lines · 5507f668

James Henderson authored Jun 26, 2018

Prior to this change, there was no clean way of getting FileCheck to
check that a line is completely empty. The expected way of using
"CHECK: {{^$}}" does not work because the '^' matches the end of the
previous match (this behaviour may be desirable in certain instances).
For the same reason, "CHECK-NEXT: {{^$}}" will fail when the previous
match was at the end of the line, as the pattern will match there.
Using the recommended [[:space:]] to match an explicit new line could
also match a space, and thus is not always desired. Literal '\n'
matches also do not work. A workaround was suggested in the review, but
it is a little clunky.

This change adds a new directive that behaves the same as CHECK-NEXT,
except that it only matches against empty lines (nothing, not even
whitespace, is allowed). As with CHECK-NEXT, it will fail if more than
one newline occurs before the next blank line. Example usage:
; test.txt
foo

bar
; CHECK: foo
; CHECK-EMPTY:
; CHECK-NEXT: bar

Differential Revision: https://reviews.llvm.org/D28896

Reviewed by: probinson

llvm-svn: 335613

5507f668

Silence "unused variable" warning in LiveIntervals.cpp after r335607 · 9f199ebe
Krzysztof Parzyszek authored Jun 26, 2018
```
llvm-svn: 335610
```
9f199ebe
[InstSimplify] add tests for srem with sext bool divisor; NFC · 0e0dbebe
Sanjay Patel authored Jun 26, 2018
```
llvm-svn: 335609
```
0e0dbebe
Fix LLVM_ENABLE_THREADS=0 builds after r335440. · a7f9e66c
Nico Weber authored Jun 26, 2018
```
llvm-svn: 335608
```
a7f9e66c

Account for undef values from predecessors in extendSegmentsToUses · 70f02702

Krzysztof Parzyszek authored Jun 26, 2018

It is legal for a PHI node not to have a live value in a predecessor
as long as the end of the predecessor is jointly dominated by an undef
value.

llvm-svn: 335607

70f02702

[TargetLowering] isVectorClearMaskLegal - use ArrayRef<int> instead of const SmallVectorImpl<int>& · aa2bf2be
Simon Pilgrim authored Jun 26, 2018
```
This is more generic and matches isShuffleMaskLegal.

Differential Revision: https://reviews.llvm.org/D48591

llvm-svn: 335605
```
aa2bf2be