Commits · ad60ff70eb56a7d198e613152f9974d5d4baabd4 · Lorenzo Albano / LLVM bpEVL

May 13, 2020

[NFC] Code cleanup in TargetInfo.cpp · ad60ff70
Shengchen Kan authored May 13, 2020
```
Fix the signed/unsigned mismatch issue
```
ad60ff70

[PowerPC] Exploit VSX neg, abs and nabs for f32 · 8ffe8891

Qiu Chaofan authored May 12, 2020

xsnegdp, xsabsdp and xsnabsdp can be used to operate on f32 operand.

This patch adds the missing patterns since we prefer VSX instructions
when available.

Reviewed By: steven.zhang

Differential Revision: https://reviews.llvm.org/D75344

8ffe8891

[CostModel] Modify BasicTTI getCastInstrCost · 6bbad728

Sam Parker authored May 13, 2020

Fix the assumption that all bitcasts of the same type sizes are free.
We now only assume that bitcasts between ints and ptrs of the same
size are free. This allows TTImpl to just call the concrete
implementation of getCastInstrCost.

Differential Revision: https://reviews.llvm.org/D78918

6bbad728

[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. · 49e6c191
MaheshRavishankar authored May 12, 2020
```
Differential Revision: https://reviews.llvm.org/D79505
```
49e6c191

[PowerPC] Respect SDNodeFlags in lowering SELECT_CC · e9753822

Qiu Chaofan authored May 13, 2020

Legalizer should respect both command-line options or SDNode-level
fast-math flags.

Also, this patch propagates other flags during custom simplifying.

Reviewed By: steven.zhang

Differential Revision: https://reviews.llvm.org/D79074

e9753822

[mlir][Linalg] Add folders and canonicalizers for · 5440d0a1

MaheshRavishankar authored May 12, 2020

linalg.reshape/linalg.tensor_reshape operations.

Differential Revision: https://reviews.llvm.org/D79765

5440d0a1

[mlir][Linalg] Allow reshapes to collapse to a zero-rank tensor. · d2a95698

MaheshRavishankar authored May 12, 2020

This is only valid if the source tensors (result tensor) is static
shaped with all unit-extents when the reshape is collapsing
(expanding) dimensions.

Differential Revision: https://reviews.llvm.org/D79764

d2a95698

[PowerPC] Use add instead of addReg in ppc-early-ret pass · 782a4dd1

Kang Zhang authored May 13, 2020

Summary:
The ppc-early-ret pass use the addReg() to add operand to the new
instruction, it can't reserve the flag of old operand. This has caused
machine verfications failed.
This patch use add() to instead of addReg().

Reviewed By: steven.zhang

Differential Revision: https://reviews.llvm.org/D77997

782a4dd1

[cmake] Update creation of object library dependencies for LINK_LIBS PUBLIC · 085234be

Stephen Neuendorffer authored May 12, 2020

We need to avoid declaring dependencies on strings which are valid
LINK_LIBS and not valid targets.  Previously, we used if(TARGET) to
check this condition.  However, if(TARGET) checks whether a target has
been created (in the cmake subdirectory traversal order) and not
whether it *will* be created.  This results in annoying directory
ordering problems.

This patch changes the check to more explicitly eliminate problematic
libraries (namely -lpthread) using a REGEX.

Differential Revision: https://reviews.llvm.org/D79837

085234be

[gcov] Fix simultaneous .gcda creation/lock · 7d416743

KAWASHIMA Takahiro authored May 07, 2020

Fixes PR45673

The commit 9180c14f (D76206) resolved only a part of the problem
of concurrent .gcda file creation. It ensured that only one process
creates the file but did not ensure that the process locks the
file first. If not, the process which created the file may clobber
the contents written by a process which locked the file first.
This is the cause of PR45673.

This commit prevents the clobbering by revising the assumption
that a process which creates the file locks the file first.
Regardless of file creation, a process which locked the file first
uses fwrite (new_file==1) and other processes use mmap (new_file==0).

I also tried to keep the creation/first-lock process same by using
mkstemp/link/unlink but the code gets long. This commit is more
simple.

Note: You may be confused with other changes which try to resolve
concurrent file access. My understanding is (may not be correct):

D76206:   Resolve race of .gcda file creation (but not lock)
This one: Resolve race of .gcda file creation and lock
D54599:   Same as D76206 but abandoned?
D70910:   Resolve race of multi-threaded counter flushing
D74953:   Resolve counter sharing between parent/children processes
D78477:   Revision of D74953

Differential Revision: https://reviews.llvm.org/D79556

7d416743

[LoopReroll] Fix rerolling loop with use outside the loop · 272bc25b

KAWASHIMA Takahiro authored May 07, 2020

Fixes PR41696

The loop-reroll pass generates an invalid IR (or its assertion
fails in debug build) if values of the base instruction and
other root instructions (terms used in the loop-reroll pass)
are used outside the loop block. See IRs written in PR41696
as examples.

The current implementation of the loop-reroll pass can reroll
only loops that don't have values that are used outside the
loop, except reduced values (the last values of reduction chains).
This is described in the comment of the `LoopReroll::reroll`
function.
https://github.com/llvm/llvm-project/blob/llvmorg-10.0.0/llvm/lib/Transforms/Scalar/LoopRerollPass.cpp#L1600

This is checked in the `LoopReroll::DAGRootTracker::validate`
function.
https://github.com/llvm/llvm-project/blob/llvmorg-10.0.0/llvm/lib/Transforms/Scalar/LoopRerollPass.cpp#L1393

However, the base instruction and other root instructions skip
this check in the validation loop.
https://github.com/llvm/llvm-project/blob/llvmorg-10.0.0/llvm/lib/Transforms/Scalar/LoopRerollPass.cpp#L1229

Moving the check in front of the skip is the logically simplest
fix. However, inserting the check in an earlier stage is better
in terms of compilation time of unrerollable loops. This fix
inserts the check for the base instruction into the function
to validate possible base/root instructions. Check for other
root instructions is unnecessary because they don't match any
base instructions if they have uses outside the loop.

Differential Revision: https://reviews.llvm.org/D79549

272bc25b

[LLDB] Fix typo in xfail decorator assert.test · 67087a7b
Muhammad Omair Javaid authored May 13, 2020
```
Fix a typo in earlier xfailed assert.test replace // with #.
```
67087a7b

[LLDB] Mark some xfails for arm-linux · 6805a77e

Muhammad Omair Javaid authored May 13, 2020

This patch marks following tests as xfail for arm-linux target.

lldb/test/API/functionalities/load_using_paths/TestLoadUsingPaths.py
lldb/test/API/python_api/thread/TestThreadAPI.py
lldb/test/Shell/Recognizer/assert.test

Bugs have been filed for all of them for the corresponding failing
component.

6805a77e

[mlir] [VectorOps] Implement vector.constant_mask lowering to LLVM IR · fb2c4d50

aartbik authored May 12, 2020

Summary:
Makes this operation runnable on CPU by generating MLIR instructions
that are eventually folded into an LLVM IR constant for the mask.

Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, andydavis1

Reviewed By: nicolasvasilache, ftynse, andydavis1

Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79815

fb2c4d50

[LLDB] Fix minidebuginfo-set-and-hit-breakpoint.test for arm 32-bit · 302c492c

Muhammad Omair Javaid authored May 13, 2020

This patch fixes minidebuginfo-set-and-hit-breakpoint.test for arm-linux
targets. 32-bit elf executables use .rel.dyn and 64-bit uses .rela.dyn for
relocation entries for dynamic symbols.

302c492c

[Attributor][FIX] Stabilize the state of AAReturnedValues each update · af48351c

Johannes Doerfert authored May 12, 2020

For AAReturnedValues we treated new and existing information differently
in the updateImpl. Only the latter was properly analyzed and
categorized. The former was thought to be analyzed in the subsequent
update. Since the Attributor does not support "self-updates" we need to
make sure the state is "stable" after each updateImpl invocation. That
is, if the surrounding information does not change, the state is valid.
Now we make sure all return values have been handled and properly
categorized each iteration. We might not update again if we have not
requested a non-fix attribute so we cannot "wait" for the next update to
analyze a new return value.

Bug reported by @sdmitriev.

af48351c

[libcxx] Constrain function assignment operator (2574). · 8aa2266f
zoecarver authored May 12, 2020
```
This patch fixes LWG issue 2574.

Differential Review: https://reviews.llvm.org/D62928
```
8aa2266f
test commit · 96282b1a
Zequan Wu authored May 12, 2020

96282b1a

[ValueTracking] Fix crash in isGuaranteedNotToBeUndefOrPoison when V is in an unreachable block · d3eb51f0

Juneyoung Lee authored May 13, 2020

Summary:
This fixes PR45885 by fixing isGuaranteedNotToBeUndefOrPoison so it does not look into dominating
branch conditions of V when V is an instruction in an unreachable block.

Reviewers: spatel, nikic, lebedev.ri

Reviewed By: nikic

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79790

d3eb51f0

Fix error in TestNumThreads.py when frame.GetFunctionName returns none · 0796b170

Muhammad Omair Javaid authored May 13, 2020

Summary:
This patch fixes an error happening in TestNumThreads.py when it encounters frame.GetFunctionName none for address only locations in stripped libc.

This error was showing up on arm-linux docker container running lldb buildbot.

Reviewers: labath

Reviewed By: labath

Subscribers: kristof.beyls, lldb-commits

Differential Revision: https://reviews.llvm.org/D79777

0796b170

[mlir] Revisit std.subview handling of static information. · 63c0e72b

Nicolas Vasilache authored May 12, 2020

The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp.

In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation.

The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes.

The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type.

Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form.

Lowering to LLVM is updated, simplified and now supports all cases.
A mixed static-dynamic mode test that wouldn't previously lower is added.

It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic.

Differential Revision: https://reviews.llvm.org/D79662

63c0e72b

Add nomerge function attribute to supress tail merge optimization in simplifyCFG · cb22ab74

Zequan Wu authored May 12, 2020

We want to add a way to avoid merging identical calls so as to keep the
separate debug-information for those calls. There is also an asan
usecase where having this attribute would be beneficial to avoid
alternative work-arounds.

Here is the link to the feature request:
https://bugs.llvm.org/show_bug.cgi?id=42783.

`nomerge` is different from `noline`. `noinline` prevents function from
inlining at callsites, but `nomerge` prevents multiple identical calls
from being merged into one.

This patch adds `nomerge` to disable the optimization in IR level. A
followup patch will be needed to let backend understands `nomerge` and
avoid tail merge at backend.

Reviewed By: asbirlea, rnk

Differential Revision: https://reviews.llvm.org/D78659

cb22ab74

[lld-macho] Ignore -platform_version and -syslibroot flags. · 759bae95

Nico Weber authored May 12, 2020

clang passes these flags; this makes it easier to try `clang -v`
output with `ld -flavor darwinnew`.

Differential Revision: https://reviews.llvm.org/D79797

759bae95

[libc][Obvious] Fix deps of few threads targets. · e17a47b2
Siva Chandra Reddy authored May 12, 2020
```
A missing dep has been added, and a few redundent deps have been
removed.
```
e17a47b2
[libc++][test] Properly mark libc++-only XFAILs · 2c861e8a
Casey Carter authored May 12, 2020
```
These tests PASS on libstdc++ and MSVC.
```
2c861e8a

[AMDGPU] Make v4i64/v4f64/v8i64/v8f64 legal · 71ed66d9

Stanislav Mekhanoshin authored May 12, 2020

We can produce such vectors in the Promote Alloca pass,
but we are unable to use movrel to operate it and lower
via scratch. Making it legal makes SI_INDIRECT patterns
work.

There is more work to do in subsequent changes:

1. We initialize m0 twice to access each dword. It shall
be possible to only do it once and increment base register
number instead.
2. We also need v16i64/v16f64 but these first need to be
added to tablegen.

Differential Revision: https://reviews.llvm.org/D79808

71ed66d9

[lldb/Reproducers] Also record directories FileSystem::Collect. · ab22f71d

Jonas Devlieghere authored May 12, 2020

Now that the FileCollector knows how to deal with directories we no
longer have to ignore them in the FileSystem class.

ab22f71d

Revert of Revert of [mlir][shape] Tidy up shape.shape_of · 452e2fc4

Sean Silva authored May 12, 2020

Summary:

- Mark it NoSideEffect
- Add custom parser/printer

This reverts the temporary revert in
https://reviews.llvm.org/rG84a9c725742d26df04808a3c7349dbd98684c6cb
That was a false alarm. A downstream test actually needed to be updated.

452e2fc4

[YAMLVFSWriter] Fix for delimiters · 759465ee
Jan Korous authored May 12, 2020
```
Differential Revision: https://reviews.llvm.org/D79809
```
759465ee

[x86][CGP] enable target hook to sink funnel shift intrinsic's splatted shift amount · f490ca76

Sanjay Patel authored May 12, 2020

SDAG suffers when it can't see that a funnel operand is a splat value
(due to single-basic-block visibility), so invert the normal loop
hoisting rules to move a splat op closer to its use.

This would be part 1 of an enhancement similar to D63233.

This is needed to re-fix PR37426:
https://bugs.llvm.org/show_bug.cgi?id=37426
...because we got better at canonicalizing IR to funnel shift intrinsics.

The existing CGP code for shift opcodes is likely overstepping what it was
intended to do, so that will be fixed in a follow-up.

Differential Revision: https://reviews.llvm.org/D79718

f490ca76

[GIsel] Update a comment and make it more precise. · a9e85626
Davide Italiano authored May 12, 2020
```
This only covers ANYEXT/ZEXT. SEXT is covered in another test
I just checked in.
```
a9e85626

[mlir] Move Conversion/StandardToStandard to Dialect/StandardOps/Transforms/FuncConversions · 473bdaf2

Alex Zinenko authored May 13, 2020

Conversion/ folders were originally intended to store patterns for
DialectA->DialectB conversions that depend on both dialects and do not
conceptually belong to either of the dialects. As such, DialectA->DialectA
conversion does not make sense under Conversion/ and should rather live with
the dialect it operates on.

Differential Revision: https://reviews.llvm.org/D79569

473bdaf2

[GlobalISel] Assign the correct location when combining G_SEXT. · 99d60a1d
Davide Italiano authored May 12, 2020
```
<rdar://problem/62991635>
```
99d60a1d
Fix buildbots #2 after aa1eb515 . · 1c44430e
Alexey Lapshin authored May 13, 2020

1c44430e

PowerPC: Treat llvm.fma.f* intrinsic as using CTR with SPE · 0138cc01

Justin Hibbits authored Apr 18, 2020

Summary:
The SPE doesn't have a 'fma' instruction, so the intrinsic becomes a
libcall.  It really should become an expansion to two instructions, but
for some reason the compiler doesn't think that's as optimal as a
branch.  Since this lowering is done after CTR is allocated for loops,
tell the optimizer that CTR may be used in this case.  This prevents a
"Invalid PPC CTR loop!" assertion in the case that a fma() function call
is used in a C/C++ file, and clang converts it into an intrinsic.

Reviewed By: shchenz
Differential Revision: https://reviews.llvm.org/D78668

0138cc01

Fix buildbots after aa1eb515 . · 293c6d38
Alexey Lapshin authored May 13, 2020

293c6d38

[SampleFDO] Rename llvm-profdata flag -partial-profile to -gen-partial-profile. · 56926ae0

Wei Mi authored May 12, 2020

The internal flag -partial-profile in llvm conflicts with the flag with
the same name in llvm-profdata. The conflict happens in builds with
LLVM_LINK_LLVM_DYLIB enabled. In this case the tools are linked with libLLVM
and we end up with two definitions for the same cl::opt.

The patch renames llvm-profdata flag -partial-profile to -gen-partial-profile.

56926ae0

May 12, 2020

[VirtualFileSystem] Add unit test that showcases another YAMLVFSWriter bug · 58bc507b

Jonas Devlieghere authored May 12, 2020

This scenario generates another broken YAML mapping as illustrated below.

  {
    'type': 'directory',
    'name': "c",
    'contents': [
      ,
      {
        'type': 'directory',
        'name': "d",
        'contents': [
          ,
          {
            'type': 'directory',
            'name': "e",
            'contents': [
              {
                'type': 'file',
                'name': "f",
                'external-contents': "//root/a/c/d/e/f"
              }                    {
                'type': 'file',
                'name': "g",
                'external-contents': "//root/a/c/d/e/g"
              }
            ]
          }
        ]
      }
    ]
  },

58bc507b

[VirtualFileSystem] Add unit test that showcases YAMLVFSWriter bug · 59ba19c5

Jonas Devlieghere authored May 12, 2020

This scenario generates a broken YAML mapping as illustrated below.

 {
   'type': 'directory',
   'name': "c",
   'contents': [
     {
       'type': 'file',
       'name': "d",
       'external-contents': "//root/a/c/d"
     }            {
       'type': 'file',
       'name': "e",
       'external-contents': "//root/a/c/e"
     }            {
       'type': 'file',
       'name': "f",
       'external-contents': "//root/a/c/f"
     }
   ]
 },

59ba19c5

[X86][ISelLowering] refactor Varargs handling in X86ISelLowering.cpp · aa1eb515

Alexey Lapshin authored Feb 12, 2020

Summary:
This patch refactors handling of VarArgs in
X86TargetLowering::LowerFormalArguments.
That refactoring was requested while reviewing
D69372. Code related to varargs handling is removed
from X86TargetLowering::LowerFormalArguments and
is divided into smaller routines.

Reviewed By: aeubanks

Differential Revision: https://reviews.llvm.org/D74794

aa1eb515