Commits · 7f2db993500923a51c0b0aed650a3e0d4241205b · Lorenzo Albano / LLVM bpEVL

May 12, 2020

[PATCH] #pragma float_control should be permitted in namespace scope. · 7f2db993

Melanie Blower authored May 08, 2020

Summary: Erroneous error diagnostic observed in VS2017 <numeric> header
Also correction to propagate usesFPIntrin from template func to instantiation.

Reviewers: rjmccall, erichkeane (no feedback received)

Differential Revision: https://reviews.llvm.org/D79631

[X86] combineX86ShuffleChain - use narrowShuffleMaskElts scale == 1 builtin handling. NFC. · 0387df7f
Simon Pilgrim authored May 12, 2020
```
narrowShuffleMaskElts already has the fast-path for scale == 1, no need to reimplement it here.
```

[CUDA][HIP] Workaround for resolving host device function against wrong-sided function · e03394c6

Yaxun (Sam) Liu authored Apr 24, 2020

recommit c77a4078 with fix

https://reviews.llvm.org/D77954 caused regressions due to diagnostics in implicit
host device functions.

For now, it seems the most feasible workaround is to treat implicit host device functions and explicit host
device functions differently. Basically, in device compilation for implicit host device functions, keep the
old behavior, i.e. give host device candidates and wrong-sided candidates equal preference. For explicit
host device functions, favor host device candidates over wrong-sided candidates.

The rationale is that explicit host device functions are blessed by the user to be valid host device functions,
that is, they should not cause diagnostics in either host or device compilation. If diagnostics occur, the user is
able to fix them. However, there is no guarantee that an implicit host device function can be compiled in
device compilation, therefore we need to preserve its overload resolution in device compilation.

Differential Revision: https://reviews.llvm.org/D79526

[NFC][AArch64] More casts tests... · f1f8cffc
Sam Parker authored May 12, 2020
```
Don't use truncs as users because sometimes they're free too.
```
[X86][AVX] Use X86ISD::VPERM2X128 for blend-with-zero if optimizing for size · 45aa1b88
Simon Pilgrim authored May 12, 2020
```
Last part of PR22984 - avoid the zero-register dependency if optimizing for size
```
FuzzerCLI.h - reduce StringRef.h include to forward declaration. NFC. · 24ac6a2d
Simon Pilgrim authored May 10, 2020

DebugCounter.h - remove unused includes. NFC. · e143253f

Simon Pilgrim authored May 10, 2020

Added explicit StringRef.h include as we need the full definition for several inline functions in DebugCounter.h.

[Target][ARM] Replace outdated getARMVPTBlockMask function · 24bf8063

Pierre-vh authored Apr 08, 2020

getARMVPTBlockMask was an outdated function that only handled basic
block masks: T, TT, TTT and TTTT. This worked fine before the MVE
VPT Block Insertion Pass improvements, as those were the only kinds of
masks that the pass could generate, but now it can generate more complex
masks that use E predicates, so it's dangerous to use that function
to calculate VPT/VPST block masks.

I replaced it with 2 different functions:
  - expandPredBlockMask, in ARMBaseInfo. This adds an "E" or "T" at
    the end of an existing PredBlockMask.
  - recomputeVPTBlockMask, in Thumb2InstrInfo. This takes an iterator
    to a VPT/VPST instruction and recomputes its block mask by looking
    at the predicated instructions that follow it. This should be
    used to recompute a block mask after removing/adding a predicated
    instruction to the block.

The expandPredBlockMask function is pretty much imported from the MVE
VPT Blocks pass.

I had to change the ARMLowOverheadLoops and MVEVPTBlocks passes as well
so they could use these new functions.
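
The two operations described above can be sketched on a simplified model.
The sketch below assumes a mask is just a string such as "T" or "TET"; this
is not LLVM's bit encoding, and the names expandMask and recomputeMask are
hypothetical stand-ins rather than the functions added by the patch.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical string model of a predicate block mask such as "T", "TE" or
// "TTET". This is not LLVM's bit encoding; it only illustrates the operations.

// Append one predicate ('T' = then, 'E' = else) to a non-full mask.
std::string expandMask(const std::string &mask, char kind) {
  assert((kind == 'T' || kind == 'E') && "kind must be 'T' or 'E'");
  assert(!mask.empty() && mask.size() < 4 && "mask must hold 1 to 3 entries");
  return mask + kind;
}

// Rebuild a mask from the predicates of the instructions that follow the
// block header. Any other character (here '-') marks an unpredicated
// instruction and ends the block; a block holds at most four instructions.
std::string recomputeMask(const std::vector<char> &followingPredicates) {
  std::string mask;
  for (char p : followingPredicates) {
    if ((p != 'T' && p != 'E') || mask.size() == 4)
      break;
    mask += p;
  }
  return mask;
}
```

In this model, expanding "T" with 'E' gives "TE", and recomputing over the
predicates T, E, T followed by an unpredicated instruction gives "TET".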

Differential Revision: https://reviews.llvm.org/D78201

[Target][ARM] Replace re-uses of old VPR values with VPNOTs · bf218337
Pierre-vh authored Apr 02, 2020
```
Differential Revision: https://reviews.llvm.org/D76847
```

[libcxx testing] Remove ALLOW_RETRIES from sleep_for.pass.cpp · 9e32bf55

David Zarzycki authored May 12, 2020

Operating systems are best effort by default, so we cannot assume that
sleep-like APIs return as soon as we'd like.

Even if a sleep-like API returns when we want it to, the potential for
preemption means that attempts to measure time are subject to delays.
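
As an illustration of the point above, a test can portably check only the
lower bound of a sleep. The sketch below is not the libcxx test itself; the
helper names timedSleep and checkSleepLowerBound are hypothetical.

```cpp
#include <cassert>
#include <chrono>
#include <thread>

// Sleep for `requested` and return how long the call actually took,
// measured with a monotonic clock.
std::chrono::steady_clock::duration
timedSleep(std::chrono::milliseconds requested) {
  auto start = std::chrono::steady_clock::now();
  std::this_thread::sleep_for(requested);
  return std::chrono::steady_clock::now() - start;
}

// Only the lower bound is checked. No upper bound is asserted, because the
// scheduler may wake the thread late or preempt it before the second
// clock read.
void checkSleepLowerBound(std::chrono::milliseconds requested) {
  assert(timedSleep(requested) >= requested);
}
```

Calling checkSleepLowerBound(std::chrono::milliseconds(5)) should pass on a
loaded machine as well as an idle one, since lateness is never treated as a
failure.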

[CodeGen][SVE] Add patterns for whole vector predicate select · 077d2d68

Sander de Smalen authored May 12, 2020

Added patterns to implement `select i1 %p, <vty> %a, <vty> %b`

Reviewed By: efriedma

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79356

Revert "[RISCV] Make CanLowerReturn protected for downstream maintenance" · 9d6064ec
Jim Lin authored May 12, 2020
```
This reverts commit d775841d.
```
[NFC][AArch64] More cast cost tests · e114bdf0
Sam Parker authored May 12, 2020
```
Add truncating stores and casts with users.
```
[SveEmitter] Add builtins for svdup and svindex · d6936be2
Sander de Smalen authored May 12, 2020
```
Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D79357
```

[ARM] Refactor lower to S[LR]I optimization · 9682d0d5

Petre-Ionut Tudor authored Apr 21, 2020

Summary:
The optimization has been refactored to fix certain bugs and
limitations. The condition for lowering to S[LR]I has been changed
to reflect the manual pseudocode description of SLI and SRI operation.
The optimization can now handle more cases of operand type and order.

Subscribers: kristof.beyls, hiraditya, danielkiss, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79233

[ARM][CostModel] Improve getCastInstrCost · b4a8091a

Sam Parker authored May 07, 2020

- Specifically check for sext/zext users which have 'long' form NEON
  instructions.
- Add more entries to the table for sext/zexts so that we can report
  more accurately the number of vmovls required for NEON.
- Pass the instruction to the base implementation.

Differential Revision: https://reviews.llvm.org/D79561

[AArch64][CostModel] getCastInstrCost · 1952c86d

Sam Parker authored May 12, 2020

Pass the instruction to the base implementation.

Differential Revision: https://reviews.llvm.org/D79562

[Openmp][VE] Libomptarget plugin for NEC SX-Aurora · 6b9e43c6

Manoel Roemmer authored May 12, 2020

This patch adds a libomptarget plugin for the NEC SX-Aurora TSUBASA Vector
Engine (VE target). The code is largely based on the existing generic-elf
plugin and uses the NEC VEO and VEOSINFO libraries for offloading.

Differential Revision: https://reviews.llvm.org/D76843

get rid of the NDEBUG usage in RecoveryExpr, NFC. · 40ef4274
Haojian Wu authored May 12, 2020
```
use the llvm::all_of, per dblaikie's suggestion.
```
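
A rough sketch of the pattern, assuming the check in question is a "no null
sub-expressions" assertion: a loop that exists only for assertions needs an
NDEBUG guard, while a single assert over an all_of predicate does not. The
sketch uses std::all_of as a stand-in for llvm::all_of, and the Expr type
and function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

struct Expr {}; // hypothetical placeholder for an AST node type

// True if no sub-expression pointer is null.
bool allNonNull(const std::vector<Expr *> &subExprs) {
  return std::all_of(subExprs.begin(), subExprs.end(),
                     [](const Expr *e) { return e != nullptr; });
}

// The whole check lives inside one assert, so it vanishes in release builds
// without an explicit #ifndef NDEBUG block around a loop.
std::size_t createRecovery(const std::vector<Expr *> &subExprs) {
  assert(allNonNull(subExprs) && "sub-expressions must be non-null");
  return subExprs.size();
}
```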
[NFC][AArch64] Update tests · 494c7ece
Sam Parker authored May 12, 2020
```
Add cost model tests for extending loads.
```
Fix typos encountered while working on pass pipeline for O1. · a42e53cc
Eric Christopher authored May 12, 2020

Revert "[NFC][DwarfDebug] Prefer explicit to auto type deduction" · 8b7b84e9

Djordje Todorovic authored May 12, 2020

This wasn't proposed by the LLVM Style Guide.
Please see https://reviews.llvm.org/D79624.

This reverts commit rG2552dc5317e0.

Revert "[NFC][DwarfDebug] Avoid default capturing when using lambdas" · 41ca6058

Djordje Todorovic authored May 12, 2020

Reverting this because we found it isn't that useful.
Please see https://reviews.llvm.org/D79616.

This reverts commit rG45e5a32a8bd3.

[SystemZ] Improve foldMemoryOperandImpl: vec->FP conversions · 57feff93

Jonas Paulsson authored Mar 18, 2020

Use FP-mem instructions when folding reloads into single lane (W..) vector
instructions.

Only do this when all other operands of the instruction have already been
allocated to an FP (F0-F15) register.

Review: Ulrich Weigand

Differential Revision: https://reviews.llvm.org/D76705

[CodeGen] Fix incorrect uses of getVectorNumElements() · 42c7a6d5

David Sherwood authored May 05, 2020

I have fixed up some places in SelectionDAG::getNode() where we
used to assert that the number of vector elements for two types
is the same. I have changed such cases to assert that the
element counts are the same instead. I've added new tests that
exercise the code paths for all the truncations. All the extend
operations are covered by this existing test:

  CodeGen/AArch64/sve-sext-zext.ll

For the ISD::SETCC case I fixed, this code path is exercised by
these existing tests:

  CodeGen/AArch64/sve-fcmp.ll
  CodeGen/AArch64/sve-intrinsics-int-compares-with-imm.ll
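
One way to see the difference between comparing a plain number of elements
and comparing element counts is the simplified model below. It assumes an
element count is a minimum number of elements plus a "scalable" flag; the
struct is a hypothetical stand-in, not LLVM's own class.

```cpp
// Simplified stand-in for an element count: a minimum number of elements
// plus a flag saying whether the real number is a runtime multiple of it.
struct ElementCount {
  unsigned Min;
  bool Scalable;
};

// Two counts match only if both the minimum and the scalable flag agree.
bool operator==(const ElementCount &a, const ElementCount &b) {
  return a.Min == b.Min && a.Scalable == b.Scalable;
}

// Comparing only the minimum would treat a fixed 4-element vector and a
// scalable vector with minimum 4 as if they had the same shape.
bool sameMinElements(const ElementCount &a, const ElementCount &b) {
  return a.Min == b.Min;
}
```

With fixed = {4, false} and scalable = {4, true}, sameMinElements returns
true while operator== returns false, which is the distinction an assertion
on element counts preserves.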

Differential Revision: https://reviews.llvm.org/D79399

[LLDB] Disable TestBasicEntryValues.py for arm · 054ed1fd

Muhammad Omair Javaid authored May 12, 2020

TestBasicEntryValues.py fails on arm 32 bit. Currently running on silent master here:
http://lab.llvm.org:8014/builders/lldb-arm-ubuntu/

[clangd] Have suppression comments take precedence over warning-as-error · 5a7276b3

Nathan Ridge authored May 10, 2020

Summary: This matches the clang-tidy behaviour.

Fixes https://github.com/clangd/clangd/issues/375

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D79691

Temporarily Revert "[mlir][shape] Tidy up shape.shape_of" as it's breaking a few tests. · 84a9c725
Eric Christopher authored May 11, 2020
```
This reverts commit b6045448.

Followed up offline with a testcase.
```

[RISCV] Make CanLowerReturn protected for downstream maintenance · d775841d

Jim Lin authored May 12, 2020

Summary: For the downstream RISCV maintenance, it would be easier to override and reuse CanLowerReturn for customizing.

Reviewers: asb, lenary, luismarques

Reviewed By: lenary

Subscribers: hiraditya, rbar, johnrusso, simoncook, sabuasal, niosHD, kito-cheng, shiva0217, jrtc27, MaskRay, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, rkruppe, PkmX, jocewei, psnobl, benna, s.egerton, pzheng, sameer.abuasal, apazos, evandro, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D78545

[PowerPC] Add fma/fsqrt/fmax strict-fp intrinsics · e8d2ff22

Qiu Chaofan authored May 12, 2020

This patch adds strict-fp intrinsics support for fma, fsqrt, fmaxnum and
fminnum on PowerPC.

Reviewed By: hfinkel

Differential Revision: https://reviews.llvm.org/D72749

Revert "[libcxx] shared_ptr changes from library fundamentals (P0414R2)." · 5eb55483
zoecarver authored May 11, 2020
```
This reverts commit e8c13c18.
```

[gcov] Fix big-endian problems · f98709a9

Fangrui Song authored May 11, 2020

In a big-endian .gcda file, the first four bytes are "gcda" instead of "adcg".
All 32-bit values are in big-endian.

With this change, libclang_rt.profile can hopefully produce gcov
compatible output.
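
A reader of such files could use the first four bytes to pick a byte order,
roughly as sketched below. The function names are hypothetical and this is
not the libclang_rt.profile or llvm-cov code; it only illustrates the
"gcda" versus "adcg" distinction described above.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

enum class ByteOrder { Little, Big, Unknown };

// Guess the byte order of a .gcda file from its first four bytes.
ByteOrder detectGcdaByteOrder(const unsigned char *magic) {
  if (std::memcmp(magic, "adcg", 4) == 0)
    return ByteOrder::Little;
  if (std::memcmp(magic, "gcda", 4) == 0)
    return ByteOrder::Big;
  return ByteOrder::Unknown;
}

// Decode one 32-bit value stored in the given (known) byte order.
std::uint32_t decode32(const unsigned char *b, ByteOrder order) {
  assert(order != ByteOrder::Unknown && "byte order must be known");
  if (order == ByteOrder::Big)
    return (std::uint32_t(b[0]) << 24) | (std::uint32_t(b[1]) << 16) |
           (std::uint32_t(b[2]) << 8) | std::uint32_t(b[3]);
  return std::uint32_t(b[0]) | (std::uint32_t(b[1]) << 8) |
         (std::uint32_t(b[2]) << 16) | (std::uint32_t(b[3]) << 24);
}
```

Either spelling of the magic decodes to the same 32-bit value once the
detected order is applied.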

Revert part of D49132 "[gcov] Fix gcov profiling on big-endian machines" · 4c684b91

Fangrui Song authored May 11, 2020

D49132 is partially correct. For 64-bit values, the lower 32-bit part comes
before the higher 32-bit part (in a little-endian manner).

For 32-bit values, libgcov reads/writes 32-bit values in native endianness.
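
The layout described above can be sketched as follows: each 32-bit word is
stored in the host's native byte order, and a 64-bit value is stored as its
low word followed by its high word. The helper names are hypothetical and
the buffer is a plain byte vector rather than a real .gcda stream.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Append a 32-bit value in the host's native byte order.
void write32(std::vector<unsigned char> &out, std::uint32_t v) {
  unsigned char bytes[4];
  std::memcpy(bytes, &v, sizeof v);
  out.insert(out.end(), bytes, bytes + 4);
}

// Append a 64-bit value as two 32-bit words, low word first.
void write64(std::vector<unsigned char> &out, std::uint64_t v) {
  write32(out, static_cast<std::uint32_t>(v));       // lower 32 bits
  write32(out, static_cast<std::uint32_t>(v >> 32)); // upper 32 bits
}

// Read a native-endian 32-bit value at the given byte offset.
std::uint32_t read32(const std::vector<unsigned char> &in, std::size_t offset) {
  assert(offset + 4 <= in.size() && "read past end of buffer");
  std::uint32_t v;
  std::memcpy(&v, in.data() + offset, sizeof v);
  return v;
}

// Read a 64-bit value stored as low word then high word.
std::uint64_t read64(const std::vector<unsigned char> &in, std::size_t offset) {
  std::uint64_t low = read32(in, offset);
  std::uint64_t high = read32(in, offset + 4);
  return (high << 32) | low;
}
```

Because the word order is fixed and only the bytes inside each word follow
the host, the same code round-trips values on both little-endian and
big-endian machines.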

Partially revert "[CMake] Fix building with -DBUILD_SHARED_LIBS=ON on mingw" · 1f707cc9

Martin Storsjö authored May 12, 2020

This reverts parts of commit 609ef948,
as it caused build failures on windows if LLVM_BUILD_EXAMPLES was
enabled, due to Bye being added as a dependency of the lit tests.

[DWARF5]: Added support for dumping strx forms in llvm-dwarfdump · 93aee9ca

Sourabh Singh Tomar authored Apr 27, 2020

This patch adds support for dumping DW_MACRO_define_strx and
DW_MACRO_undef_strx in llvm-dwarfdump. These forms are currently
supported only in the debug_macro section.

Reviewed By: ikudrin, dblaikie

Differential Revision: https://reviews.llvm.org/D78736

[gcov] Emit GCOV_TAG_OBJECT_SUMMARY/GCOV_TAG_PROGRAM_SUMMARY correctly and fix... · 013f0670

Fangrui Song authored May 11, 2020

[gcov] Emit GCOV_TAG_OBJECT_SUMMARY/GCOV_TAG_PROGRAM_SUMMARY correctly and fix llvm-cov's decoding of runcount

gcov 9 (r264462) started to use GCOV_TAG_OBJECT_SUMMARY. Before,
GCOV_TAG_PROGRAM_SUMMARY was used.
libclang_rt.profile should emit just one tag according to the version.

Another bug introduced by rL194499 is that the wrong runcount field was
selected.

Fix the two bugs so that gcov can correctly decode "Runs:" from
libclang_rt.profile produced .gcda files, and llvm-cov gcov can
correctly decode "Runs:" from libgcov produced .gcda files.

[x86/SLH][NFC] Add a test to produce a failed generation. · 2e9f1153
Wang, Pengfei authored May 12, 2020

[mlir] [VectorOps] Replace zero-scalar + splat into direct zero vector constant · 40f56c8c

aartbik authored May 11, 2020

Summary:
The scalar zero + splat yields more intermediate code than the direct
dense zero constant, and ultimately is lowered to exactly the same
LLVM IR operations, so no point wasting the intermediate code.

Reviewers: nicolasvasilache, andydavis1, reidtatge

Reviewed By: nicolasvasilache

Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79758

Quote error string from qLaunchSuccess · 2b8b783b

Jason Molenda authored May 11, 2020

If the error message from qLaunchSuccess included a gdb RSP
metacharacter, it could crash lldb.  Apply the binary
escaping to the string before sending it to lldb; lldb
promiscuously applies the binary escaping protocol on
packets it receives.

Also fix a small bug in cstring_to_asciihex_string where
a high bit character (e.g. UTF-8 chars) would not be
quoted correctly due to signed char fun.
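
The signed char pitfall can be illustrated with a small hex encoder. This is
a sketch with a hypothetical helper name, not the actual
cstring_to_asciihex_string code: where plain char is signed, a byte with
the high bit set is promoted to a negative int, so it must be converted to
unsigned char before formatting.

```cpp
#include <cstdio>
#include <string>

// Hex-encode every byte of `s` as two lowercase hex digits.
std::string toAsciiHex(const std::string &s) {
  std::string out;
  char buf[3];
  for (char c : s) {
    // Go through unsigned char first: with a signed char, 0xC3 would be
    // promoted to a negative int and format as more than two digits.
    unsigned value = static_cast<unsigned char>(c);
    std::snprintf(buf, sizeof buf, "%02x", value);
    out += buf;
  }
  return out;
}
```

With the cast in place, the two UTF-8 bytes 0xC3 0xA9 encode as "c3a9".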

Differential Revision: https://reviews.llvm.org/D79614

rdar://problem/62873581

Fix a release+noasserts werror for unused variable. · 59a299cb
Eric Christopher authored May 11, 2020
