Commits · 93a9d2de8f4f73b5785d539db4dfa3fb5bbffedc · Lorenzo Albano / LLVM bpEVL

Mar 19, 2021

[VPlan] Add plain text (not DOT's digraph) dumps · 93a9d2de

Andrei Elovikov authored Mar 18, 2021

I foresee two uses for this:
1) It's easier to use those in debugger.
2) Once we start implementing more VPlan-to-VPlan transformations (especially
   inner loop massaging stuff), using the vectorized LLVM IR as CHECK targets in
   LIT test would become too obscure. I can imagine that we'd want to CHECK
   against VPlan dumps after multiple transformations instead. That would be
   easier with plain text dumps than with DOT format.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D96628

93a9d2de

[RISCV] Lower scalable vector masked loads to intrinsics to match fixed... · 85f3f6b3

Craig Topper authored Mar 19, 2021

[RISCV] Lower scalable vector masked loads to intrinsics to match fixed vectors and reduce isel patterns.

Reviewed By: frasercrmck

Differential Revision: https://reviews.llvm.org/D98840

85f3f6b3

[RISCV] Maintain fixed-length info when optimizing BUILD_VECTORs · d399b82e

Fraser Cormack authored Mar 19, 2021

I'm not sure how I failed to notice this before, but when optimizing
dominant-element BUILD_VECTORs we would lower via the scalable container type,
which lost us the information about the fixed length of the vector types. By
lowering via the fixed-length type we can preserve that information and
eliminate redundant vsetvli instructions.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D98938

d399b82e

[SCEV] Factor out a lambda for strict condition splitting [NFC] · 00d0315a
Philip Reames authored Mar 19, 2021

00d0315a

[RISCV] Add missing CHECKs to vector test · 3bffa2c2

Fraser Cormack authored Mar 19, 2021

Since the "LMUL-MAX=2" output for some test functions differed between
RV32 and RV64, the update_llc_test_checks script failed to emit a
unified LMULMAX2 check for them. I'm not sure why it didn't warn about
this.

This patch also takes the opportunity to add unified RV32/RV64 checks to
help shorten the test file when the output for LMULMAX1 and LMULMAX2 is
identical but differs between the two ISAs.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D98944

3bffa2c2

[RISCV] Fix missing scalable->fixed-length vector conversion · 550292ec

Fraser Cormack authored Mar 17, 2021

Returning the scalable-vector container type would present problems when
the fixed-length INSERT_VECTOR_ELT was used by later operations.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D98776

550292ec

[cmake] Enable Clang warnings about redundant semicolons · cfa65f77

Martin Storsjö authored Mar 19, 2021

This matches what GCC warns about when -pedantic is enabled.

This should avoid such redundant semicolons creeping into the codebase.

Differential Revision: https://reviews.llvm.org/D98941

cfa65f77

[AMDGPU] Rationalize some check prefixes and use more common prefixes. NFC. · 87248e85
Jay Foad authored Mar 19, 2021

87248e85
[AMDGPU] Remove weird target triples from tests. NFC. · 5df52f77
Jay Foad authored Mar 19, 2021

5df52f77

[RGT] Recode more unreachable assertions and tautologies · fb4f6057

Paul Robinson authored Mar 19, 2021

Count iterations of zero-trip loops and assert the count is zero,
rather than asserting inside the loop.
Unreachable functions should use llvm_unreachable.
Remove tautological 'if' statements, even when they're following a
pattern of checks.

Found by the Rotten Green Tests project.

fb4f6057

[DAG] computeKnownBits - add ISD::MULHS/MULHU/SMUL_LOHI/UMUL_LOHI handling · 9d2df964

Simon Pilgrim authored Mar 19, 2021

Reuse the existing KnownBits multiplication code to handle the 'extend + multiply + extract high bits' pattern for multiply-high ops.

Noticed while looking at the codegen for D88785 / D98587 - the patch helps division-by-constant expansion code in particular, which suggests that we might have some further KnownBits div/rem cases we could handle - but this was far easier to implement.

Differential Revision: https://reviews.llvm.org/D98857

9d2df964

[AMDGPU] Add atomic optimizer nouse tests · b8616e40

Jay Foad authored Mar 18, 2021

Add some atomic optimizer tests where there is no use of the result of
the atomic operation, which is a common case in real code. NFC.

Differential Revision: https://reviews.llvm.org/D98952

b8616e40

[AMDGPU] Remove dead glc1 handing in asm parser. NFC. · 57effe22
Stanislav Mekhanoshin authored Mar 19, 2021

57effe22

propose Chocolately as package manager · 4532ab76

Christian Kühnel authored Feb 24, 2021

Installing the Unix tools on Windows is quite painful. To make things easier,
I explained how to use a package manager or a Docker image.

Note: This still uses the GNUWin tools as explained on this page. Once we
replace these with something else, we would also need to update the
installation commands.

Differential Revision: https://reviews.llvm.org/D97387

4532ab76

[DAG] Fold shuffle(bop(shuffle(x,y),shuffle(z,w)),undef) -> bop(shuffle'(x,y),shuffle'(z,w)) · ffb28871

Simon Pilgrim authored Mar 19, 2021

Followup to D96345, handle unary shuffles of binops (as well as binary shuffles) if we can merge the shuffle with inner operand shuffles.

Differential Revision: https://reviews.llvm.org/D98646

ffb28871

[TableGen] Improve handling of template arguments · a9fc44c5

Paul C. Anagnostopoulos authored Feb 25, 2021

This requires changes to TableGen files and some C++ files due to
incompatible multiclass template arguments that slipped through
before the improved handling.

a9fc44c5

[M68k] Replace unknown operand with explicit type · 028d6250

Ricky Taylor authored Mar 17, 2021

Replace the unknown operand used for immediate operands for DIV/MUL with a fixed 16-bit immediate.

This is required since the assembly parser generator requires that all operands are typed.

Differential Revision: https://reviews.llvm.org/D98819

028d6250

Support intrinsic overloading on unnamed types · 04790d9c

Jeroen Dobbelaere authored Mar 19, 2021

This patch adds support for intrinsic overloading on unnamed types.

This fixes PR38117 and PR48340 and will also be needed for the Full Restrict Patches (D68484).

The main problem is that the intrinsic overloading name mangling is using 's_s' for unnamed types.
This can result in identical intrinsic mangled names for different function prototypes.

This patch changes this by adding a '.XXXXX' to the intrinsic mangled name when at least one of the types is based on an unnamed type, ensuring that we get a unique name.

Implementation details:
- The mapping is created on demand and kept in Module.
- It also checks for existing clashes and recycles potentially existing prototypes and declarations.
- Because of extra data in Module, Intrinsic::getName needs an extra Module* argument and, for speed, an optional FunctionType* argument.
- I still kept the original two-argument 'Intrinsic::getName' around which keeps the original behavior (providing the base name).
-- Main reason is that I did not want to change the LLVMIntrinsicGetName version, as I don't know how acceptable such a change is
-- The current situation already has a limitation. So that should not get worse with this patch.
- Intrinsic::getDeclaration and the verifier are now using the new version.

Other notes:
- As far as I see, this should not suffer from stability issues. The count is only added for prototypes depending on at least one anonymous struct
- The initial count starts from 0 for each intrinsic mangled name.
- In case of name clashes, existing prototypes are remembered and reused when that makes sense.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D91250

04790d9c

[PowerPC] Fix the check for 16-bit signed field in peephole · a8697c57

Nemanja Ivanovic authored Mar 19, 2021

When a D-Form instruction is fed by an add-immediate, we attempt
to merge the two immediates to form a single displacement so we
can remove the add-immediate.

However, we don't check whether the new displacement fits into
a 16-bit signed immediate field early enough. Namely, we do a
sign-extend from 16 bits first which will discard high bits and
then we check whether the result is a 16-bit signed immediate.
It of course will always be.

Move the check prior to the sign extend to ensure we are checking
the correct value.

Fixes https://bugs.llvm.org/show_bug.cgi?id=49640

a8697c57

[SystemZ][z/OS] Distinguish between text and binary files on z/OS · 4f750f6e

Abhina Sreeskantharajan authored Mar 19, 2021

This patch consists of the initial changes to help distinguish between text and binary content correctly on z/OS. I would like to get feedback from Windows users on setting OF_None for all ToolOutputFiles. This seems to have been done as an optimization to prevent CRLF translation on Windows in the past.

Reviewed By: zibi

Differential Revision: https://reviews.llvm.org/D97785

4f750f6e

[X86, NFC] Update stack-clash tests using the automated tooling · c2313a45

Simonas Kazlauskas authored Mar 19, 2021

This is in preparation of changes in this area (such as D98789 and D98906).

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D98909

c2313a45

[M68k] Convert register Aliases to AltNames · cd442157

Ricky Taylor authored Mar 11, 2021

This makes it simpler to determine when two registers are actually the
same vs just partially aliasing.

The only real caveat is that it becomes impossible to know which name
was used for the register previously. (i.e. parsing assembly and then
disassembling it can result in the register name changing.)

Differential Revision: https://reviews.llvm.org/D98536

cd442157

[M68k] Introduce DReg bead · 51884c6b

Ricky Taylor authored Mar 11, 2021

This is required in order to determine during disassembly whether a
Reg bead without associated DA bead is referring to a data register.

Differential Revision: https://reviews.llvm.org/D98534

51884c6b

[AMDGPU] Remove some redundant code. NFC. · 5a5a5312

Jay Foad authored Mar 19, 2021

This is redundant because we have already checked that we can't handle
divergent 64-bit atomic operands.

5a5a5312

[AMDGPU] Skip building some IR if it won't be used. NFC. · 5dd5ddcb
Jay Foad authored Mar 18, 2021

5dd5ddcb
[AMDGPU] Remove duplicate test functions. NFC. · 685335a0
Jay Foad authored Mar 18, 2021

685335a0
[AMDGPU] Sink Intrinsic::getDeclaration calls to where they are used. NFC. · c96dfe0d
Jay Foad authored Mar 18, 2021

c96dfe0d

Revert "[lit] Handle plain negations directly in the internal shell" · f3dd783b

Martin Storsjö authored Mar 19, 2021

This reverts commit d09adfd3.

That commit caused failures in
clang-tidy/infrastructure/validate-check-names.cpp on windows
buildbots.

That change exposed a surprising issue, not directly related to
this change in itself, but in how TestRunner quotes command line
arguments that later are going to be interpreted by a msys based
tool (like grep.exe, when provided by Git for Windows). This
worked accidentally before, when grep was invoked via not.exe
which took a more conservative approach to windows argument quoting.

f3dd783b

[docs] Add calendar info for SVE sync-ups · 1d7cf550
Kristof Beyls authored Mar 19, 2021

1d7cf550
[KnownBits] Add knownbits analysis for mulhs/mulu 'multiply high' instructions · a9689721
Simon Pilgrim authored Mar 18, 2021
```
Split off from D98857

https://reviews.llvm.org/D98866
```
a9689721

[NVPTX] Fix warning, remove extra ";" [NFC] · 6d22ba48

Mikael Holmen authored Mar 19, 2021

gcc complained with
../lib/Target/NVPTX/NVPTXLowerArgs.cpp:203:2: warning: extra ';' [-Wpedantic]
  203 | };
      |  ^

6d22ba48

[InstCombine] Add unit test with @llvm.annotation. · 926cca96
Clement Courbet authored Mar 19, 2021
```
In preparation for https://reviews.llvm.org/D98925
```
926cca96

[lit] Pass the USERPROFILE variable through on Windows · 9de63b2e

Martin Storsjö authored Mar 18, 2021

When running in a Windows Container, the Git for Windows Unix tools
(C:\Program Files\Git\usr\bin) just hang if this variable isn't
passed through.

Currently, running the LLVM/clang tests in a Windows Container fails
if that directory is added to the path, but succeeds after this change.
(After this change, the previously used GnuWin tools can be left out
entirely, too, as lit automatically picks up the Git for Windows tools
if necessary.)

Differential Revision: https://reviews.llvm.org/D98858

9de63b2e

[lit] Handle plain negations directly in the internal shell · d09adfd3

Martin Storsjö authored Mar 18, 2021

Keep running "not --crash" via the external "not" executable, but
for plain negations, and for cases that use the shell "!" operator,
just skip that argument and invert the return code.

The libcxx tests only use the shell operator "!" for negations,
never the "not" executable, because libcxx tests can be run without
having a fully built llvm tree available providing the "not"
executable.

This allows using the internal shell for libcxx tests.

Differential Revision: https://reviews.llvm.org/D98859

d09adfd3

[Test] Precommit one more test · a1d6c652
Max Kazantsev authored Mar 19, 2021

a1d6c652
[Test] Precommit test · 4ee4f9bf
Max Kazantsev authored Mar 19, 2021

4ee4f9bf
[NFC] Move function up in code · 8eefa07f
Max Kazantsev authored Mar 19, 2021

8eefa07f
[NFC] Factor out utility function for finding common dom of user set · 8bb952b5
Max Kazantsev authored Mar 19, 2021

8bb952b5
[X86] Fix -Wunused-function in -DLLVM_ENABLE_ASSERTIONS=off builds · c241659d
Fangrui Song authored Mar 18, 2021

c241659d

[IndVars] Provide eliminateIVComparison with context · 16370e02

Max Kazantsev authored Mar 19, 2021

We can prove more predicates when we have a context when eliminating ICmp.
As first (and very obvious) approximation we can use the ICmp instruction itself,
though in the future we are going to use a common dominator of all its users.
Need some refactoring before that.

Observed ~0.5% negative compile time impact.

Differential Revision: https://reviews.llvm.org/D98697
Reviewed By: lebedev.ri

16370e02