Commits · 8263975c3294a62864cad99dc9d474785f074256 · Lorenzo Albano / LLVM bpEVL

Jul 10, 2018

[VPlan] Add VPlanTestBase.h with helper class to build VPlan for tests. · 8263975c

Florian Hahn authored Jul 10, 2018

Reviewers: dcaballe, hsaito, rengolin

Reviewed By: dcaballe

Differential Revision: https://reviews.llvm.org/D49032

llvm-svn: 336653

8263975c

Fix MSVC "signed/unsigned mismatch" warning. NFCI. · c048599f
Simon Pilgrim authored Jul 10, 2018
```
llvm-svn: 336649
```
c048599f
[PM/Unswitch] Fix unused variable in r336646. · 148861f5
Chandler Carruth authored Jul 10, 2018
```
llvm-svn: 336647
```
148861f5

[PM/Unswitch] Fix a collection of closely related issues with trivial · 47dc3a34

Chandler Carruth authored Jul 10, 2018

switch unswitching.

The core problem was that the way we handled unswitching trivial exit
edges through the default successor of a switch. For some reason
I thought the right way to do this was to add a block containing
unreachable and point the default successor at this block. In
retrospect, this has an amazing number of problems.

The first issue is the one that this pass has always worked around -- we
have to *detect* such edges and avoid unswitching them again. This
seemed pretty easy really. You juts look for an edge to a block
containing unreachable. However, this pattern is woefully unsound. So
many things can break it. The amazing thing is that I found a test case
where *simple-loop-unswitch itself* breaks this! When we do
a *non-trivial* unswitch of a switch we will end up splitting this exit
edge. The result will be a default successor that is an exit and
terminates in ... a perfectly normal branch. So the first test case that
I started trying to fix is added to the nontrivial test cases. This is
a ridiculous example that did just amazing things previously. With just
unswitch, it would create 10+ copies of this stuff stamped out. But if
you combine it *just right* with a bunch of other passes (like
simplify-cfg, loop rotate, and some LICM) you can get it to do this
infinitely. Or at least, I never got it to finish. =[

This, in turn, uncovered another related issue. When we are manipulating
these switches after doing a trivial unswitch we never correctly updated
PHI nodes to reflect our edits. As soon as I started changing how these
edges were managed, it became obvious there were more issues that
I couldn't realistically leave unaddressed, so I wrote more test cases
around PHI updates here and ensured all of that works now.

And this, in turn, required some adjustment to how we collect and manage
the exit successor when it is the default successor. That showed a clear
bug where we failed to include it in our search for the outer-most loop
reached by an unswitched exit edge. This was actually already tested and
the test case didn't work. I (wrongly) thought that was due to SCEV
failing to analyze the switch. In fact, it was just a simple bug in the
code that skipped the default successor. While changing this, I handled
it correctly and have updated the test to reflect that we now get
precise SCEV analysis of trip counts for the outer loop in one of these
cases.

llvm-svn: 336646

47dc3a34

[X86] Fast-isel tests for lowered truncation intrinsics · 89c919c2

Mikhail Dvoretckii authored Jul 10, 2018

This patch adds fast-isel tests for the IR patterns produced for truncation
intrinsics in rC336643.

Differential Revision: https://reviews.llvm.org/D48822

llvm-svn: 336645

89c919c2

[X86][SSE] Prefer BLEND(SHL(v,c1),SHL(v,c2)) over MUL(v, c3) · d32ca2c0

Simon Pilgrim authored Jul 10, 2018

Now that rL336250 has landed, we should prefer 2 immediate shifts + a shuffle blend over performing a multiply. Despite the increase in instructions, this is quicker (especially for slow v4i32 multiplies), avoid loads and constant pool usage. It does mean however that we increase register pressure. The code size will go up a little but by less than what we save on the constant pool data.

This patch also adds support for v16i16 to the BLEND(SHIFT(v,c1),SHIFT(v,c2)) combine, and also prevents blending on pre-SSE41 shifts if it would introduce extra blend masks/constant pool usage.

Differential Revision: https://reviews.llvm.org/D48936

llvm-svn: 336642

d32ca2c0

[X86] Regenerate vector-shuffle-512-v8.ll so the script will merge the 32 and... · 5fd020c0
Craig Topper authored Jul 10, 2018
```
[X86] Regenerate vector-shuffle-512-v8.ll so the script will merge the 32 and 64 bit checks together. NFC

llvm-svn: 336641
```
5fd020c0

[X86] Use IsProfitableToFold to block vinsertf128rm in favor of insert_subreg... · 08b81a55

Craig Topper authored Jul 10, 2018

[X86] Use IsProfitableToFold to block vinsertf128rm in favor of insert_subreg instead of artifically increasing pattern complexity to give priority.

This is a much more direct way to solve the issue than just giving extra priority.

llvm-svn: 336639

08b81a55

[X86] Remove some seemingly unnecessary patterns. · db73f564

Craig Topper authored Jul 10, 2018

We're missing the EVEX equivalents of these patterns and seem to get along fine.

I think we end up with X86vzload for the obvious IR cases that would produce this DAG.

llvm-svn: 336638

db73f564

[X86] Add back GCCBuiltin on mask_div_ss/sd_round. · 3e7406b4
Craig Topper authored Jul 10, 2018
```
We no longer need custom handling in clang.

llvm-svn: 336627
```
3e7406b4

[X86] Correct vfixupimm load patterns to look for an integer load, not a... · 866a377e

Craig Topper authored Jul 10, 2018

[X86] Correct vfixupimm load patterns to look for an integer load, not a floating point load bitcasted to integer.

DAG combine wouldn't let a floating point load bitcasted to integer exist. It would just be an integer load.

llvm-svn: 336626

866a377e

[X86] Add test cases that show failure to fold load into vfixupimm... · 59fd2f4c

Craig Topper authored Jul 10, 2018

[X86] Add test cases that show failure to fold load into vfixupimm instructions due to bad isel pattern.

llvm-svn: 336625

59fd2f4c

[X86] Remove FloatVT from X86VectorVTInfo in X86InstrAVX512.td · e4f46e4f

Craig Topper authored Jul 10, 2018

The only places it was used where places where VT was the same as FloatVT. So switch those uses to VT and drop it.

llvm-svn: 336624

e4f46e4f

Revert "AMDGPU: Force inlining if LDS global address is used" · 688e7522
Vlad Tsyrklevich authored Jul 10, 2018
```
This reverts commit r336587, it was causing test failures on the
sanitizer bots.

llvm-svn: 336623
```
688e7522

[DWARF][NFC] Refactor range list emission to use a static helper · e194f73e

Wolfgang Pieb authored Jul 10, 2018

This is prep for DWARF v5 range list emission. Emission of a single range list is moved
to a static helper function.

Reviewer: jdevlieghere

Differential Revision: https://reviews.llvm.org/D49098

llvm-svn: 336621

e194f73e

[InstCombine] allow more shuffle folds using safe constants · 69faf464

Sanjay Patel authored Jul 09, 2018

getSafeVectorConstantForBinop() was calling getBinOpIdentity() assuming
that the constant we wanted was operand 1 (RHS). That's wrong, but I
don't think we could expose a bug or even a suboptimal fold from that
because the callers have other guards for any binop that would have
been affected.

llvm-svn: 336617

69faf464

[WebAssembly] Support for binary atomic RMW instructions · fed7382e

Heejin Ahn authored Jul 09, 2018

Summary:
This adds support for binary atomic read-modify-write instructions:
add, sub, and, or, xor, and xchg.

This does not yet support translations of some of LLVM IR atomicrmw
instructions (nand, max, min, umax, and umin) that do not have a direct
counterpart in wasm instructions.

Reviewers: dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D49088

llvm-svn: 336615

fed7382e

llvm: Add support for "-fno-delete-null-pointer-checks" · 77eeac3d

Manoj Gupta authored Jul 09, 2018

Summary:
Support for this option is needed for building Linux kernel.
This is a very frequently requested feature by kernel developers.

More details : https://lkml.org/lkml/2018/4/4/601

GCC option description for -fdelete-null-pointer-checks:
This Assume that programs cannot safely dereference null pointers,
and that no code or data element resides at address zero.

-fno-delete-null-pointer-checks is the inverse of this implying that
null pointer dereferencing is not undefined.

This feature is implemented in LLVM IR in this CL as the function attribute
"null-pointer-is-valid"="true" in IR (Under review at D47894).
The CL updates several passes that assumed null pointer dereferencing is
undefined to not optimize when the "null-pointer-is-valid"="true"
attribute is present.

Reviewers: t.p.northover, efriedma, jyknight, chandlerc, rnk, srhines, void, george.burgess.iv

Reviewed By: efriedma, george.burgess.iv

Subscribers: eraman, haicheng, george.burgess.iv, drinkcat, theraven, reames, sanjoy, xbolva00, llvm-commits

Differential Revision: https://reviews.llvm.org/D47895

llvm-svn: 336613

77eeac3d

Use StringRef instead of `const char *`. · 0230f7c7

Rui Ueyama authored Jul 09, 2018

I don't think there's a need to use `const char *`. In most (probably all?)
cases, we need a length of a name later, so discarding a length will
lead to a wasted effort.

Differential Revision: https://reviews.llvm.org/D49046

llvm-svn: 336612

0230f7c7

Make llvm.objectsize more conservative with null · 3fbfa9c4

George Burgess IV authored Jul 09, 2018

In non-zero address spaces, we were reporting that an object at `null`
always occupies zero bytes. This is incorrect in many cases, so just
return `unknown` in those cases for now.

Differential Revision: https://reviews.llvm.org/D48860

llvm-svn: 336611

3fbfa9c4

Jul 09, 2018

[ORC] Rename MaterializationResponsibility::delegate to replace and add a new · f07dad3d

Lang Hames authored Jul 09, 2018

delegate method (and unit test).

The name 'replace' better captures what the old delegate method did: it
returned materialization responsibility for a set of symbols to the VSO.

The new delegate method delegates responsibility for a set of symbols to a new
MaterializationResponsibility instance. This can be used to split responsibility
between multiple threads, or multiple materialization methods.

llvm-svn: 336603

f07dad3d

Fix line endings. NFCI. · 017c68c1
Simon Pilgrim authored Jul 09, 2018
```
llvm-svn: 336602
```
017c68c1

[Power9] Add __float128 builtins for Rounding Operations · 133acb22

Stefan Pintilie authored Jul 09, 2018

Added __float128 support for a number of rounding operations:

trunc
rint
nearbyint
round
floor
ceil

Differential Revision: https://reviews.llvm.org/D48415

llvm-svn: 336601

133acb22

[WebAssembly] Improve readability of load/stores and tests. NFC. · d31bc986

Heejin Ahn authored Jul 09, 2018

Summary:
- Changed variable/function names to be more consistent
- Improved comments in test files
- Added more tests
- Fixed a few typos
- Misc. cosmetic changes

Reviewers: dschuff

Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits

Differential Revision: https://reviews.llvm.org/D49087

llvm-svn: 336598

d31bc986

[Power9] [LLVM] Add __float128 support for trunc to double round to odd · 58e3e0a8

Stefan Pintilie authored Jul 09, 2018

Add support for this builtin:
double builtin_truncf128_round_to_odd(float128)

Differential Revision: https://reviews.llvm.org/D48483

llvm-svn: 336595

58e3e0a8

RenameIndependentSubregs: Fix handling of undef tied operands · 7139dea6

Mark Searles authored Jul 09, 2018

Ensure that, if updating a tied operand pair, to only update
that pair.

Differential Revision: https://reviews.llvm.org/D49052

llvm-svn: 336593

7139dea6

[globalisel][irtranslator] Add support for atomicrmw and (strong) cmpxchg · 9481399c

Daniel Sanders authored Jul 09, 2018

Summary:
This patch adds support for the atomicrmw instructions and the strong
cmpxchg instruction to the IRTranslator.

I've left out weak cmpxchg because LangRef.rst isn't entirely clear on what
difference it makes to the backend. As far as I can tell from the code, it
only matters to AtomicExpandPass which is run at the LLVM-IR level.

Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar, volkan, javed.absar

Reviewed By: qcolombet

Subscribers: kristof.beyls, javed.absar, igorb, llvm-commits

Differential Revision: https://reviews.llvm.org/D40092

llvm-svn: 336589

9481399c

[AMDGPU][Waitcnt] fix "comparison of integers of different signs" build error · 5bfd8d89

Mark Searles authored Jul 09, 2018

Build error on Android; reported by and fix provided by (thanks) by Mauro Rossi <issor.oruam@gmail.com>

Fixes the following building error:

external/llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1903:61:
error: comparison of integers of different signs:
'typename iterator_traits<__wrap_iter<MachineBasicBlock **> >::difference_type'
(aka 'int') and 'unsigned int' [-Werror,-Wsign-compare]
                      BlockWaitcntProcessedSet.end(), &MBB) < Count)) {
                      ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^ ~~~~~
1 error generated.

Differential Revision: https://reviews.llvm.org/D49089

llvm-svn: 336588

5bfd8d89

AMDGPU: Force inlining if LDS global address is used · 40cb6cab

Matt Arsenault authored Jul 09, 2018

These won't work for the forseeable future. These aren't allowed
from OpenCL, but IPO optimizations can make them appear.

Also directly set the attributes on functions, regardless
of the linkage rather than cloning functions like before.

llvm-svn: 336587

40cb6cab

[X86][TLI] DAGCombine: Unfold variable bit-clearing mask to two shifts. · 5ccae175

Roman Lebedev authored Jul 09, 2018

Summary:
This adds a reverse transform for the instcombine canonicalizations
that were added in D47980, D47981.

As discussed later, that was worse at least for the code size,
and potentially for the performance, too.

https://rise4fun.com/Alive/Zmpl

Reviewers: craig.topper, RKSimon, spatel

Reviewed By: spatel

Subscribers: reames, llvm-commits

Differential Revision: https://reviews.llvm.org/D48768

llvm-svn: 336585

5ccae175

[Utils] Fix gdb pretty printers to work with Python 3. · 0566f235

Philip Pfaffe authored Jul 09, 2018

Reiterate D23202 for container printers added after the change landed.

Differential Revision: https://reviews.llvm.org/D46578

llvm-svn: 336580

0566f235

[Power9] Add __float128 builtins for Round To Odd · 83a5fe14

Stefan Pintilie authored Jul 09, 2018

GCC has builtins for these round to odd instructions:

__float128 __builtin_sqrtf128_round_to_odd (__float128)
__float128 __builtin_{add,sub,mul,div}f128_round_to_odd (__float128, __float128)
__float128 __builtin_fmaf128_round_to_odd (__float128, __float128, __float128)

Differential Revision: https://reviews.llvm.org/D47550

llvm-svn: 336578

83a5fe14

[DebugInfo] Change default value of FDEPointerEncoding · fa762cc1

Maksim Panchenko authored Jul 09, 2018

Summary:
If the encoding is not specified in CIE augmentation string, then it
should be DW_EH_PE_absptr instead of DW_EH_PE_omit.

Reviewers: ruiu, MaskRay, plotfi, rafauler

Reviewed By: MaskRay

Subscribers: rafauler, JDevlieghere, llvm-commits

Differential Revision: https://reviews.llvm.org/D49000

llvm-svn: 336577

fa762cc1

[SelectionDAG] Add VT consistency checks to the creation of ISD::FMA. · e3b0c7e5

Craig Topper authored Jul 09, 2018

This is similar to what is done for binops. I don't know if this would have helped us catch the bug fixed in r336566 earlier or not, but I figured it couldn't hurt.

llvm-svn: 336576

e3b0c7e5

Add bitcode compatibility test for 6.0 · a1a8e66a

Steven Wu authored Jul 09, 2018

Summary:
Add bitcode compatibility test for 6.0. On top of the normal disassemble
test, also runs the verifier to make sure simple 6.0 bitcode can pass
the current IR verifier.

Reviewers: vsk

Reviewed By: vsk

Subscribers: dexonsmith, llvm-commits

Differential Revision: https://reviews.llvm.org/D49086

llvm-svn: 336574

a1a8e66a

[LoopInfo] Port loop exit interfaces from Loop to LoopBase · 29a07b37

Diego Caballero authored Jul 09, 2018

This patch ports hasDedicatedExits, getUniqueExitBlocks and
getUniqueExitBlock in Loop to LoopBase so that they can be used
from other LoopBase sub-classes.

Reviewers: chandlerc, sanjoy, hfinkel, fhahn

Reviewed By: chandlerc

Differential Revision: https://reviews.llvm.org/D48817

llvm-svn: 336572

29a07b37

[InstCombine] correct test comments; NFC · 651438c2
Sanjay Patel authored Jul 09, 2018
```
llvm-svn: 336570
```
651438c2

[X86] In combineFMA, make sure we bitcast the result of isFNEG back the... · 47170b31

Craig Topper authored Jul 09, 2018

[X86] In combineFMA, make sure we bitcast the result of isFNEG back the expected type before creating the new FMA node.

Previously, we were creating malformed SDNodes, but nothing noticed because the type constraints prevented isel from noticing.

llvm-svn: 336566

47170b31

[X86][AVX] Regenerate AVX1 fast-isel tests. · d0706592
Simon Pilgrim authored Jul 09, 2018
```
Let the update script merge 32/64 tests where possible

llvm-svn: 336565
```
d0706592

[InstCombine] avoid extra poison when moving shift above shuffle · 7cd32419

Sanjay Patel authored Jul 09, 2018

As discussed in D49047 / D48987, shift-by-undef produces poison,
so we can't use undef vector elements in that case..

Note that we need to extend this for poison-generating flags,
and there's a proposal to create poison from FMF in D47963,

llvm-svn: 336562

7cd32419