- Oct 12, 2021
-
-
Jay Foad authored
With the -early-live-intervals command line flag, TwoAddressInstructionPass::runOnMachineFunction would call MachineFunction::verify before returning, to check the live intervals. But there was not much benefit to doing this, since -verify-machineinstrs and LLVM_ENABLE_EXPENSIVE_CHECKS provide a more general way of scheduling machine verification after every pass. It also caused problems on targets like Lanai which are marked as "not machine verifier clean", since verification would fail for known target-specific problems that have nothing to do with LiveIntervals. Differential Revision: https://reviews.llvm.org/D111618
-
Jeremy Morse authored
This patch shifts the InstrRefBasedLDV class declaration to a header. Partially because it's already massive, but mostly so that I can start writing some unit tests for it. This patch also adds the boilerplate for said unit tests. Differential Revision: https://reviews.llvm.org/D110165
-
Yonghong Song authored
Per discussion in https://reviews.llvm.org/D111199, the existing btf_tag attribute will be renamed to btf_decl_tag. This patch mostly updates the Bitcode and DebugInfo test cases with the new attribute name. Differential Revision: https://reviews.llvm.org/D111591
-
Amara Emerson authored
We shouldn't call APInt::getSExtValue() on a >64b value.
-
Guozhi Wei authored
This patch contains the following enhancements to SrcRegMap and DstRegMap:

1. In findOnlyInterestingUse, not only check whether the Reg is a two-address use, but also check whether it can become a two-address use after commutation.
2. If a physical register is clobbered, remove the SrcRegMap entries that are mapped to it.
3. In processTiedPairs, when creating a new COPY instruction, add a SrcRegMap entry only when the COPY instruction is coalescable (the COPY src is killed).

With these enhancements isProfitableToCommute can make better commute decisions, and ultimately more register copies are removed. Differential Revision: https://reviews.llvm.org/D108731
-
- Oct 11, 2021
-
-
Roman Lebedev authored
-
Jay Foad authored
PHI elimination updates LiveVariables info as described here:

  // We only need to update the LiveVariables kill of SrcReg if this was the
  // last PHI use of SrcReg to be lowered on this CFG edge and it is not live
  // out of the predecessor. We can also ignore undef sources.

Unfortunately, if the last use also happened to be an undef use then it would fail to update the LiveVariables at all. Fix this by not counting undef uses in the VRegPHIUse map. Thanks to Mikael Holmén for the test case! Differential Revision: https://reviews.llvm.org/D111552
-
- Oct 10, 2021
-
-
Amara Emerson authored
If the wide store we'd generate is not a multiple of the memory type of the narrow stores (e.g. s48 and s32), we'd assert. Fix that.
-
- Oct 09, 2021
-
-
Dávid Bolvanský authored
-
Dávid Bolvanský authored
-
Arthur Eubanks authored
Followup to D110451.
-
- Oct 08, 2021
-
-
Reid Kleckner authored
This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454
-
Amara Emerson authored
-
Craig Topper authored
[RegisterScavenging] Use a Twine in a call to report_fatal_error instead of going from std::string to c_str. NFC

The std::string was built on the line above. Might as well just build it as a Twine in the call.
-
Bradley Smith authored
This reverts commit 3e8d2008. The code removed in that commit is actually required for extracting fixed types from illegal scalable types, hence it caused assertion failures in such extracts.
-
Mirko Brkusanin authored
Differential Revision: https://reviews.llvm.org/D111319
-
Amara Emerson authored
This was using MachineFunction::createExternalSymbolName() before, which seems reasonable, but in fact that string is freed before the asm emitter tries to access the function name. Switching it to use the string returned by the attribute seems to fix the problem.
-
Amara Emerson authored
In order to not generate an unnecessary G_CTLZ, I extended the constant folder in the CSEMIRBuilder to handle G_CTLZ. I also added some extra handling of vector constants. It seems we don't have any support for constant folding vector constants, so the tests show some other useless G_SUB instructions too. Differential Revision: https://reviews.llvm.org/D111036
-
Jay Foad authored
The "coloring with register" functionality it refers to was removed ten years ago in svn r144481 "Remove the -color-ss-with-regs option".
-
Itay Bookstein authored
The shift libcalls have a shift amount parameter of MVT::i32, but sometimes ExpandIntRes_Shift may be called with a node whose second operand is a type larger than that. This leads to an ABI mismatch, and for example causes a spurious zeroing of a register in RV32 for 64-bit shifts. Note that at present regular shift instructions already have their shift amount operand adapted at SelectionDAGBuilder::visitShift time, while funnel shifts bypass that. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D110508
-
Wang, Pengfei authored
Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D110557
-
- Oct 07, 2021
-
-
Adrian Prantl authored
This change fixes a bug where the compiler generates a prologue_end for line 0 locs. Line 0 is not associated with any source location, so there should not be a prologue_end at a location that doesn't correspond to one. Some LLVM tests were explicitly checking for line 0 prologue_end's; since I believe that to be incorrect, I had to change those tests as well. Patch by Shubham Rastogi! Differential Revision: https://reviews.llvm.org/D110740
-
Jay Foad authored
In a couple of places machine verification was disabled for no apparent reason, probably just because an "addPass(..., false)" line was cut and pasted from elsewhere. After this patch, the only remaining place where machine verification is disabled in the generic TargetPassConfig code is after addPreEmitPass.
-
Jay Foad authored
Differential Revision: https://reviews.llvm.org/D111007
-
Jay Foad authored
Differential Revision: https://reviews.llvm.org/D111006
-
Jay Foad authored
When PHIElimination adds kills after lowering PHIs to COPYs it knows that some instructions after the inserted COPY might use the same SrcReg, but it was only looking at the terminator instructions at the end of the block, not at other instructions like INLINEASM_BR that can appear after the COPY insertion point. Since we have already called findPHICopyInsertPoint, which knows about INLINEASM_BR, we might as well reuse the insertion point that it calculated when looking for instructions that might use SrcReg. This fixes a machine verification failure if you force machine verification to run after PHIElimination (currently it is disabled for other reasons) when running test/CodeGen/X86/callbr-asm-phi-placement.ll. Differential Revision: https://reviews.llvm.org/D110834
-
Amara Emerson authored
This is a straight port from the equivalent DAG combine. Differential Revision: https://reviews.llvm.org/D110890
-
Jay Foad authored
Print this basic block flag as inlineasm-br-indirect-target and parse it. This allows you to write MIR test cases for INLINEASM_BR. The test case I added is one that I wanted to precommit anyway for D110834. Differential Revision: https://reviews.llvm.org/D111291
-
Bradley Smith authored
Differential Revision: https://reviews.llvm.org/D111135
-
Jack Andersen authored
Based on the reasoning of D53903, register operands of DBG_VALUE are invariably treated as RegState::Debug operands. This change enforces this invariant as part of MachineInstr::addOperand so that all passes emit this flag consistently. RegState::Debug is inconsistently set on DBG_VALUE registers throughout LLVM. This runs the risk of a filtering iterator such as MachineRegisterInfo::reg_nodbg_iterator processing these operands erroneously when they are not parsed from MIR sources. This issue was observed in the development of the llvm-mos fork, which adds a backend that relies on physical register operands much more than existing targets. Physical RegUnit 0 has the same numeric encoding as $noreg (indicating an undef for DBG_VALUE). Allowing debug operands into the machine scheduler correlates $noreg with RegUnit 0 (i.e. a collision of register numbers with different zero semantics). Eventually, this causes an assert where DBG_VALUE instructions are prohibited from participating in live register ranges. Reviewed By: MatzeB, StephenTozer Differential Revision: https://reviews.llvm.org/D110105
-
Carl Ritson authored
When propagating undefined copies the undef flag must also be propagated. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D111219
-
Jay Foad authored
Fix the calculation of ReplacedAllUntiedUses when any of the tied defs are early-clobber. The effect of this is to fix the placement of kill flags on an instruction like this (from @f2 in test/CodeGen/SystemZ/asm-18.ll):

  INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], killed %4:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit

After TwoAddressInstruction without this patch:

  %3:grh32bit = COPY killed %4:grh32bit
  INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], %3:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit

Note that the COPY kills %4, even though there is a later use of %4 in the INLINEASM. This fails machine verification if you force it to run after TwoAddressInstruction (currently it is disabled for other reasons).

After TwoAddressInstruction with this patch:

  %3:grh32bit = COPY %4:grh32bit
  INLINEASM &"stepb $1, $2" [attdialect], $0:[regdef-ec:GRH32Bit], def early-clobber %3:grh32bit, $1:[reguse tiedto:$0], %3:grh32bit(tied-def 3), $2:[reguse:GRH32Bit], %4:grh32bit

Differential Revision: https://reviews.llvm.org/D110848
-
Mikael Holmen authored
-
Itay Bookstein authored
To better reflect the meaning of the now-disambiguated {GlobalValue, GlobalAlias}::getBaseObject after breaking off GlobalIFunc::getResolverFunction (D109792), the function is renamed to getAliaseeObject.
-
David Blaikie authored
This reverts c7f16ab3 / r109694, which suggested this was done to improve consistency with the gdb test suite. Possibly at the time GCC did not canonicalize integer types, and so matching types was important for cross-compiler validity, or it was only a case of over-constrained test cases that printed out/tested the exact names of integer types.

In any case neither issue seems to exist today based on my limited testing - both gdb and lldb canonicalize integer types (in a way that happens to match Clang's preferred naming, incidentally) and so never print the original text name produced in the DWARF by GCC or Clang. This canonicalization appears to be in `integer_types_same_name_p` for GDB and in `TypeSystemClang::GetBasicTypeEnumeration` for lldb.

(I tested this with one translation unit defining 3 variables - `long`, `long (*)()`, and `int (*)()` - and another translation unit that had main, and a function that took `long (*)()` as a parameter, then compiled them with mismatched compilers (either GCC+Clang, or Clang+(Clang with this patch applied)). No matter the combination, despite the debug info for one CU naming the type "long int" and the other naming it "long", both debuggers printed out the name as "long" and were able to correctly perform overload resolution and pass the `long int (*)()` variable to the `long (*)()` function parameter.)

Did find one hiccup, identified by the lldb test suite: CodeView was relying on these names to map them to builtin types in that format, so added some handling for that in LLVM. (These could be split out into separate patches, but it seems small enough not to warrant it - will do that if there ends up needing any reverting/revisiting.)

Differential Revision: https://reviews.llvm.org/D110455
-
- Oct 06, 2021
-
-
Arthur Eubanks authored
Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear, which can be used for some optimizations, e.g. https://crbug.com/1016945.

This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now.

The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field.

Updating clang's max allowed alignment will come in a future patch.

Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451
-
Arthur Eubanks authored
This reverts commit 8d64314f.
-
Arthur Eubanks authored
Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear, which can be used for some optimizations, e.g. https://crbug.com/1016945.

This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now.

The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field.

Updating clang's max allowed alignment will come in a future patch.

Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451
-
Arthur Eubanks authored
This reverts commit df84c1fe. Breaks some bots
-
Arthur Eubanks authored
Currently the max alignment representable is 1GB, see D108661. Setting the align of an object to 4GB is desirable in some cases to make sure the lower 32 bits are clear, which can be used for some optimizations, e.g. https://crbug.com/1016945.

This uses an extra bit in instructions that carry an alignment. We can store 15 bits of "free" information, and with this change some instructions (e.g. AtomicCmpXchgInst) use 14 bits. We can increase the max alignment representable above 4GB (up to 2^62) since we're only using 33 of the 64 values, but I've just limited it to 4GB for now.

The one place we have to update the bitcode format is for the alloca instruction. It stores its alignment into 5 bits of a 32 bit bitfield. I've added another field which is 8 bits and should be future proof for a while. For backward compatibility, we check if the old field has a value and use that, otherwise use the new field.

Updating clang's max allowed alignment will come in a future patch.

Reviewed By: hans Differential Revision: https://reviews.llvm.org/D110451
-