Commits · ed4dbe63260f5a0a3410995b85b32f0ec34b0076 · Roger Ferrer / llvm-epi

May 14, 2019

DWARF v5: emit DW_AT_addr_base if DW_AT_low_pc references .debug_addr · 2f6ef2fc

Fangrui Song authored May 14, 2019

The condition !AddrPool.empty() is tested before attachRangesOrLowHighPC(), which may add an entry to AddrPool. We emit DW_AT_low_pc (DW_FORM_addrx) but may incorrectly omit DW_AT_addr_base for LineTablesOnly. This can be easily reproduced:

clang -gdwarf-5 -gmlt -c a.cc

Fix this by moving !AddrPool.empty() below.

This was discovered while investigating an lld crash (fixed by D61889) on such object files: ld.lld --gdb-index a.o

Reviewed By: probinson

Differential Revision: https://reviews.llvm.org/D61891

llvm-svn: 360678

2f6ef2fc

[PowerPC] Custom lower known CR bit spills · 22561972

Lei Huang authored May 14, 2019

For known CRBit spills, CRSET/CRUNSET, it is more efficient to load and spill
the known value instead of extracting the bit.

eg. This sequence is currently used to spill a CRUNSET:
    crclr   4*cr5+lt
    mfocrf  r3,4
    rlwinm  r3,r3,20,0,0
    stw     r3,132(r1)

This patch custom lower it to:
    li  r3,0
    stw r3,132(r1)

Differential Revision: https://reviews.llvm.org/D61754

llvm-svn: 360677

22561972

[lit][tests]Add feature libcxx-used and use it in llvm-*-fuzzer tests · fe4f6d53

Xing Xue authored May 14, 2019

When a LLVM binary such as llvm-*-fuzzer is built with libc++, it has dependency on libc++. The path to find shared libraries specified in llvm-*-fuzzer is relative. As a result, these binaries cannot be copied to an arbitrary directory and launched from there. Changes in this patch add a LIT feature to indicate that libc++ is used to build and, based on the feature exclude test cases that test by copying llvm-*-fuzzer binaries to a directory.

Reviewers: hubert.reinterpretcast, dberris, amyk, jasonliu, EricWF

Reviewed By: hubert.reinterpretcast, amyk

Subscribers: javed.absar, jsji, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61265

llvm-svn: 360672

fe4f6d53

TableGen: support #ifndef in addition to #ifdef. · 717b62a1
Tim Northover authored May 14, 2019
```
TableGen has a limited preprocessor, which only really supports
easier.

llvm-svn: 360670
```
717b62a1

Reinstate "FileCheck [5/12]: Introduce regular numeric variables" · 7b4ecdd3

Thomas Preud'homme authored May 14, 2019

This reinstates r360578 (git e47362c1),
reverted in r360653 (git 00439368),
with a fix for the list added in FileCheck.rst to build without error.

Copyright:
    - Linaro (changes up to diff 183612 of revision D55940)
    - GraphCore (changes in later versions of revision D55940 and
                 in new revision created off D55940)

Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar,
arichardson, rnk

Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar,
arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60385

llvm-svn: 360665

7b4ecdd3

AArch64: support binutils-like things on arm64_32. · ff6875ac

Tim Northover authored May 14, 2019

This adds support for the arm64_32 watchOS ABI to LLVM's low level tools,
teaching them about the specific MachO choices and constants needed to
disassemble things.

llvm-svn: 360663

ff6875ac

GlobalOpt: do not promote globals used atomically to constants. · ed9117f8

Tim Northover authored May 14, 2019

Some atomic loads are implemented as cmpxchg (particularly if large or
floating), and that usually requires write access to the memory involved
or it will segfault.

We can still propagate the constant value to users we understand though.

llvm-svn: 360662

ed9117f8

[test]Make test work on Windows · ce0da8ba

James Henderson authored May 14, 2019

Previously, the test didn't work because '\' characters appeared in the
sed string, causing bogus escape characters to form in the substituted
string literal. Switching to using '%/p' causes the path to be emitted
with '/' characters instead, so that there are are no escaping issues.

Reviewed by: kzhuravl, grimar

Differential Revision: https://reviews.llvm.org/D61856

llvm-svn: 360660

ce0da8ba

[IRTranslator] Don't hardcode GEP index type · a568222d

Diana Picus authored May 14, 2019

When breaking up loads and stores of aggregates, the IRTranslator uses
LLT::scalar(64) for the index type of the G_GEP instructions that
compute the addresses. This is unnecessarily large for 32-bit targets.
Use the int ptr type provided by the DataLayout instead.

Note that we're already doing the right thing when translating
getelementptr instructions from the IR. This is just an oversight when
generating new ones while translating loads/stores.

Both x86 and AArch64 already have tests confirming that the old
behaviour is preserved for 64-bit targets.

Differential Revision: https://reviews.llvm.org/D61852

llvm-svn: 360656

a568222d

Revert "FileCheck [5/12]: Introduce regular numeric variables" · 00439368

Thomas Preud'homme authored May 14, 2019

This reverts r360578 (git e47362c1) to
solve the sphinx build failure on
http://lab.llvm.org:8011/builders/llvm-sphinx-docs buildbot.

llvm-svn: 360653

00439368

[X86] Prefer locked stack op over mfence for seq_cst 64-bit stores on 32-bit targets · 3098e44d

Philip Reames authored May 14, 2019

This is a follow on to D58632, with the same logic. Given a memory operation which needs ordering, but doesn't need to modify any particular address, prefer to use a locked stack op over an mfence.

Differential Revision: https://reviews.llvm.org/D61863

llvm-svn: 360649

3098e44d

[PowerPC][NFC] Fix typos in triples · b7b3d866
Jinsong Ji authored May 14, 2019
```
Found by bzEq (Kai Luo).

llvm-svn: 360643
```
b7b3d866

[X86] Use X86 instead of X32 as a check prefix in atomic-idempotent.ll. NFC · cc761e6f

Craig Topper authored May 14, 2019

X32 can refer to a 64-bit ABI that uses 32-bit ints, longs, and pointers.

I plan to add gnux32 command lines to this test so this prepares for that.

Also remove some check lines that have a prefix that is not in any run lines.

llvm-svn: 360642

cc761e6f

[SDAG, x86] allow targets to override test for binop opcodes · 3a13d970

Sanjay Patel authored May 14, 2019

This follows the pattern of the existing isCommutativeBinOp().

x86 shows improvements from vector narrowing for the min/max opcodes.

llvm-svn: 360639

3a13d970

[coroutines] Fix spills of static array allocas · d64455cd

Gor Nishanov authored May 13, 2019

Summary:
CoroFrame was not considering static array allocas, and was only ever reserving a single element in the coroutine frame.
This meant that stores to the non-zero'th element would corrupt later frame data.

Store static array allocas as field arrays in the coroutine frame.

Added test.

Committed by Gor Nishanov on behalf of ben-clayton
Reviewers: GorNishanov, modocache

Reviewed By: GorNishanov

Subscribers: Orlando, capn, EricWF, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61372

llvm-svn: 360636

d64455cd

May 13, 2019

[Pass Pipeline][NFC] Add a test prior to committing D61726 · 1d662316

Nemanja Ivanovic authored May 13, 2019

This patch just adds a test case to show the differences in code emitted
by opt before and after https://reviews.llvm.org/D61726.

Previous attempt to commit this did not include the registered target
requirement so it caused buildbot breaks.

llvm-svn: 360620

1d662316

[JITLink][MachO] Honor the no-dead-strip flag on nlist entries. · 56baade1
Lang Hames authored May 13, 2019
```
llvm-svn: 360618
```
56baade1

[WebAssembly] Don't assume that zext/sext result is i32/i64 in fast isel (PR41841) · 323dc634

Nikita Popov authored May 13, 2019

Usually this will abort fast-isel at the instruction using the
non-legal result, but if the only use is in a different basic block,
we'll incorrectly assume that the zext/sext is to i32 (rather than
i128 in this case).

Differential Revision: https://reviews.llvm.org/D61823

llvm-svn: 360616

323dc634

[AMDGPU] gfx1010 tests. NFC. · d9930d49
Stanislav Mekhanoshin authored May 13, 2019
```
llvm-svn: 360615
```
d9930d49
Revert [X86] Avoid SFB - Fix inconsistent codegen with/without debug info · 91a9d4ef
Robert Lougher authored May 13, 2019
```
Revert r360436 as it is causing clang-x64-windows-msvc buildbot to fail.

llvm-svn: 360606
```
91a9d4ef

[InstCombine] try harder to form rotate (funnel shift) (PR20750) · 760f61ab

Sanjay Patel authored May 13, 2019

We have a similar match for patterns ending in a truncate. This
should be ok for all targets because the default expansion would
still likely be better from replacing 2 'and' ops with 1.

Attempt to show the logic equivalence in Alive (which doesn't
currently have funnel-shift in its vocabulary AFAICT):

  %shamt = zext i8 %i to i32
  %m = and i32 %shamt, 31
  %neg = sub i32 0, %shamt
  %and4 = and i32 %neg, 31
  %shl = shl i32 %v, %m
  %shr = lshr i32 %v, %and4
  %or = or i32 %shr, %shl
  =>
  %a = and i8 %i, 31
  %shamt2 = zext i8 %a to i32
  %neg2 = sub i32 0, %shamt2
  %and4 = and i32 %neg2, 31
  %shl = shl i32 %v, %shamt2
  %shr = lshr i32 %v, %and4
  %or = or i32 %shr, %shl

https://rise4fun.com/Alive/V9r

llvm-svn: 360605

760f61ab

[TargetLowering] Handle multi depth GEPs w/ inline asm constraints · c33f754e

Nick Desaulniers authored May 13, 2019

Summary:
X86TargetLowering::LowerAsmOperandForConstraint had better support than
TargetLowering::LowerAsmOperandForConstraint for arbitrary depth
getelementpointers for "i", "n", and "s" extended inline assembly
constraints. Hoist its support from the derived class into the base
class.

Link: https://github.com/ClangBuiltLinux/linux/issues/469

Reviewers: echristo, t.p.northover

Reviewed By: t.p.northover

Subscribers: t.p.northover, E5ten, kees, jyknight, nemanjai, javed.absar, eraman, hiraditya, jsji, llvm-commits, void, craig.topper, nathanchance, srhines

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61560

llvm-svn: 360604

c33f754e

[InstCombine] add tests for rotates with narrow shift amount (PR20750); NFC · cb8957f7
Sanjay Patel authored May 13, 2019
```
llvm-svn: 360601
```
cb8957f7

[X86][SSE] LowerBuildVectorv4x32 - don't insert MOVQ for undef elts · 73aee290

Simon Pilgrim authored May 13, 2019

Fixes the regression noted in D61782 where a VZEXT_MOVL was being inserted because we weren't discriminating between 'zeroable' and 'all undef' for the upper elts.

Differential Revision: https://reviews.llvm.org/D61782

llvm-svn: 360596

73aee290

[X86][SSE] Relax use limits for lowerAddSubToHorizontalOp (PR32433) · cf5a8eb7

Simon Pilgrim authored May 13, 2019

Now that we can use HADD/SUB for scalar additions from any pair of extracted elements (D61263), we can relax the one use limit as we will be able to merge multiple uses into using the same HADD/SUB op.

This exposes a couple of missed opportunities in LowerBuildVectorv4x32 which will be committed separately.

Differential Revision: https://reviews.llvm.org/D61782

llvm-svn: 360594

cf5a8eb7

[TargetLowering] Add SimplifyDemandedBits support for ZERO_EXTEND_VECTOR_INREG · d3cedee3
Simon Pilgrim authored May 13, 2019
```
More work for PR39709.

llvm-svn: 360592
```
d3cedee3
[X86] Add test case for mask register variant of PR41619 which should be fixed after r360552 · c6a6c107
Craig Topper authored May 13, 2019
```
llvm-svn: 360591
```
c6a6c107

[DAGCombiner] narrow vector binop with inserts/extract · 05dafb1c

Sanjay Patel authored May 13, 2019

We catch most of these patterns (on x86 at least) by matching
a concat vectors opcode early in combining, but the pattern may
emerge later using insert subvector instead.

The AVX1 diffs for add/sub overflow show another missed narrowing
pattern. That one may be falling though the cracks because of
combine ordering and multiple uses.

llvm-svn: 360585

05dafb1c

[x86] add test for insert/extract binop; NFC · 83e61bc5
Sanjay Patel authored May 13, 2019
```
This pattern is visible in the c-ray benchmark with an AVX target.

llvm-svn: 360582
```
83e61bc5

Add constrained fptrunc and fpext intrinsics. · 5987749e

Kevin P. Neal authored May 13, 2019

The new fptrunc and fpext intrinsics are constrained versions of the
regular fptrunc and fpext instructions.

Reviewed by:	Andrew Kaylor, Craig Topper, Cameron McInally, Conner Abbot
Approved by:	Craig Topper
Differential Revision: https://reviews.llvm.org/D55897

llvm-svn: 360581

5987749e

FileCheck [5/12]: Introduce regular numeric variables · e47362c1

Thomas Preud'homme authored May 13, 2019

Summary:
This patch is part of a patch series to add support for FileCheck
numeric expressions. This specific patch introduces regular numeric
variables which can be set on the command-line.

This commit introduces regular numeric variable that can be set on the
command-line with the -D option to a numeric value. They can then be
used in CHECK patterns in numeric expression with the same shape as
@LINE numeric expression, ie. VAR, VAR+offset or VAR-offset where offset
is an integer literal.

The commit also enable strict whitespace in the verbose.txt testcase to
check that the position or the location diagnostics. It fixes one of the
existing CHECK in the process which was not accurately testing a
location diagnostic (ie. the diagnostic was correct, not the CHECK).

Copyright:
    - Linaro (changes up to diff 183612 of revision D55940)
    - GraphCore (changes in later versions of revision D55940 and
                 in new revision created off D55940)

Reviewers: jhenderson, chandlerc, jdenny, probinson, grimar, arichardson, rnk

Subscribers: hiraditya, llvm-commits, probinson, dblaikie, grimar, arichardson, tra, rnk, kristina, hfinkel, rogfer01, JonChesterfield

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D60385

llvm-svn: 360578

e47362c1

[ThinLTO] Don't internalize weak writeable variables · 053c6fc2

Eugene Leviant authored May 13, 2019

Variables with linkonce_odr and weak_odr linkage shouldn't be internalized
if they're not readonly. Otherwise we may end up with multiple copies of
such variable, so reads and writes will become inconsistent

Differential revision: https://reviews.llvm.org/D61255

llvm-svn: 360577

053c6fc2

[SystemZ] Model floating-point control register · 8e42f6dd

Ulrich Weigand authored May 13, 2019

This adds the FPC (floating-point control register) as a reserved
physical register and models its use by SystemZ instructions.

Note that only the current rounding modes and the IEEE exception
masks are modeled.  *Changes* of the FPC due to exceptions (in
particular the IEEE exception flags and the DXC) are not modeled.

At this point, this patch is mostly NFC, but it will prevent
scheduling of floating-point instructions across SPFC/LFPC etc.

llvm-svn: 360570

8e42f6dd

[ARM][ParallelDSP] Relax alias checks · a33e311a

Sam Parker authored May 13, 2019

When deciding the safety of generating smlad, we checked for any
writes within the block that may alias with any of the loads that
need to be widened. This is overly conservative because it only
matters when there's a potential aliasing write to a location
accessed by a pair of loads.

Now we check for aliasing writes only once, during setup. If two
loads are found to have an aliasing write between them, we don't add
these loads to LoadPairs. This means that later during the transform,
we can safely widened a pair without worrying about aliasing.

However, to maintain correctness, we also need to change the way that
wide loads are inserted because the order is now important.

The MatchSMLAD method has also been changed, absorbing
MatchReductions and AddMACCandidate to hopefully improve readability.

Differential Revision: https://reviews.llvm.org/D6102

llvm-svn: 360567

a33e311a

[DAGCombiner] Fix invalid alias analysis. · 9afc4764

Clement Courbet authored May 13, 2019

Summary:
When we know for sure whether two addresses do or do not alias, we
should immediately return from DAGCombiner::isAlias().

I think this comes from a bad copy/paste, Sorry for not catching that during the
code review.

Fixes PR41855.

Reviewers: niravd, gchatelet, EricWF

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D61846

llvm-svn: 360566

9afc4764

[DAGCombiner][NFC] Commit test to show fix in D61846. · c4e37fd9
Clement Courbet authored May 13, 2019
```
llvm-svn: 360561
```
c4e37fd9

[BPF] emit BTF sections only if debuginfo available · 98fe9c98

Yonghong Song authored May 13, 2019



Currently, without -g, BTF sections may still be emitted with
data sections, e.g., for linux kernel bpf selftest
test_tcp_check_syncookie_kern.c issue discovered by Martin
as shown below.

-bash-4.4$ bpftool btf dump file test_tcp_check_syncookie_kern.o
[1] VAR 'results' type_id=0, linkage=global-alloc
[2] VAR '_license' type_id=0, linkage=global-alloc
[3] DATASEC 'license' size=0 vlen=1
        type_id=2 offset=0 size=4
[4] DATASEC 'maps' size=0 vlen=1
        type_id=1 offset=0 size=28

Let disable BTF generation if no debuginfo, which is
the original design.

Signed-off-by: Yonghong Song <yhs@fb.com>

Differential Revision: https://reviews.llvm.org/D61826

llvm-svn: 360556

98fe9c98

[JITLink] Track section alignment and make sure it is respected during layout. · 45139290

Lang Hames authored May 13, 2019

Previously we had only honored alignments on individual atoms, but
tools/runtimes may assume that the section alignment is respected too.

llvm-svn: 360555

45139290

Recommit r358887 "[TargetLowering][AMDGPU][X86] Improve SimplifyDemandedBits bitcast handling" · 61e556d2

Craig Topper authored May 13, 2019

I've included a new fix in X86RegisterInfo to prevent PR41619 without
reintroducing r359392. We might be able to improve that in the base class
implementation of shouldRewriteCopySrc somehow. But this hopefully enables
forward progress on SimplifyDemandedBits improvements for now.

Original commit message:

This patch adds support for BigBitWidth -> SmallBitWidth bitcasts, splitting the DemandedBits/Elts accordingly.

The AMDGPU backend needed an extra (srl (and x, c1 << c2), c2) -> (and (srl(x, c2), c1) combine to encourage BFE creation, I investigated putting this in DAGComb
but it caused a lot of noise on other targets - some improvements, some regressions.

The X86 changes are all definite wins.

llvm-svn: 360552

61e556d2

[JITLink] Add a test for zero-filled content. · 23085ec3

Lang Hames authored May 12, 2019

Also updates RuntimeDyldChecker and llvm-rtdyld to support zero-fill tests by
returning a content address of zero (but no error) for zero-fill atoms, and
treating loads from zero as returning zero.

llvm-svn: 360547

23085ec3