Commits · 87ddf1f4fad01bccb70f10a3ee5c5ad5b20e4de4 · Roger Ferrer / llvm-epi-0.8

Feb 10, 2020
- [Attributor] Simple casts preserve no-alias property · 87ddf1f4
  Johannes Doerfert authored Jan 25, 2020
```
This is a minimal but important advancement over the existing code. A
cast with an operand that is only used in the cast retains the no-alias
property of the operand.
```
  87ddf1f4
- [Attributor][Tests] Run the CGSCC versions on the range.ll test · 1c0ebcca
  Johannes Doerfert authored Feb 09, 2020
  
  1c0ebcca
- [llvm-dwarfdump][Stats] Fix the License header · d180899c
  Djordje Todorovic authored Feb 07, 2020
```
Fix the added License.

Differential Revision: https://reviews.llvm.org/D74207
```
  d180899c
- [GlobalISel][CallLowering] Tighten constantexpr check for callee. · 21c9d9ad
  Amara Emerson authored Feb 09, 2020
```
I'm not sure there's a test case for this, but it's better to be safe.
```
  21c9d9ad
- [Attributor] Allow PHI nodes in AAValueConstantRangeFloating · 81554393
  Johannes Doerfert authored Feb 09, 2020
```
Traversing PHI nodes is natural with the genericValueTraversal but also
a bit tricky. The problem is similar to the ones we have seen in AAAlign
and AADereferenceable, namely that we continue to increase the range in
each iteration. We use a pessimistic approach here to stop the
iterations. Nevertheless, optimistic information can now be propagated
through a PHI node.
```
  81554393
- [Attributor][FIX] Remove FIXME that seems outdated · 63adbb9a
  Johannes Doerfert authored Feb 09, 2020
```
The change is performed as stated by the FIXME and the tests are
adjusted. All changes look fine to me and values can be inferred as
undef without it being an error.
```
  63adbb9a
- [Attributor] Allow SelectInst in AAValueConstantRangeFloating · 7e7e6594
  Johannes Doerfert authored Feb 09, 2020
```
The genericValueTraversal will already handle SelectInst properly and we
just needed to allow them in the initialize method.
```
  7e7e6594
- [Attributor] Look through (some) casts in AAValueConstantRangeFloating · ffdbd2a0
  Johannes Doerfert authored Feb 09, 2020
```
Casts can be handled natively by the ConstantRange class. We do limit it
to extends for now as we assume an integer type in different locations.
A TODO and a test case with a FIXME was added to remove that restriction
in the future.
```
  ffdbd2a0
- [Attributor][FIX] Call right base method in AAValueConstantRangeFloating · 028db8c4
  Johannes Doerfert authored Feb 09, 2020
```
We now call the base class method as we should.
```
  028db8c4
- [X86] Autogenerate complete checks. NFC · d0a6b32b
  Craig Topper authored Feb 09, 2020
  
  d0a6b32b
- [Attributor][Tests][NFC] Add more range tests · 103364b4
  Johannes Doerfert authored Feb 09, 2020
```
Inspired by https://llvm.discourse.group/t/impossible-condition-optimization/461
```
  103364b4
- [Attributor][NFC] Use existing constant instead of magic one · d0749cc7
  Johannes Doerfert authored Jan 25, 2020
  
  d0749cc7
- [X86] Make (insert_vector_elt (v8i16 zerovec), i16 %x, 0) generate the same... · 06ba969c
  Craig Topper authored Feb 09, 2020
```
[X86] Make (insert_vector_elt (v8i16 zerovec), i16 %x, 0) generate the same code as (v8i16 (build_vector %x, 0, 0, 0, 0, 0, 0, 0)).

Instead of using a insrw to element 0, use movzx and movd.

Same for v16i8.
```
  06ba969c
- Fix `-Wparentheses` warning. NFC. · ab3da5dd
  Michael Liao authored Feb 10, 2020
  
  ab3da5dd
- [clang][codegen] Fix another lifetime emission on alloca on non-default address space. · a0678913
  Michael Liao authored Feb 09, 2020
```
- Lifetime intrinsics expect the pointer directly from alloca. Need
  extra handling for targets with alloca on non-default (or non-zero)
  address space.
```
  a0678913
- [X86] Autogenerate complete checks. NFC · f24c43c0
  Craig Topper authored Feb 09, 2020
  
  f24c43c0
- [X86] Use MOVZX instead of MOVSX in f16_to_fp isel patterns. · 05d44204
  Craig Topper authored Feb 09, 2020
```
Using sign extend forces the adjacent element to either all zeros
or all ones. But all ones is a NAN. So that doesn't seem like a
great idea.

Trying to work on supporting this with strict FP where NAN would
definitely be bad.
```
  05d44204
- [RISCV] Fix incorrect FP base CFI offset for variable argument functions · 64f41720
  Shiva Chen authored Feb 03, 2020
```
When the FP exists, the FP base CFI directive offset should take the size of variable arguments into account.

Differential Revision: https://reviews.llvm.org/D73862
```
  64f41720
- [DebugInfo] Add a DWARFDataExtractor constructor that takes ArrayRef<uint8_t> · 512c03ba
  Fangrui Song authored Feb 09, 2020
```
Similar to D67797 (DataExtractor).
```
  512c03ba
- GlobalISel: Fix narrowScalar for G_{CTLZ|CTTZ}_ZERO_UNDEF · 312a9d1b
  Matt Arsenault authored Feb 07, 2020
```
Narrow these for 64-bit VALU for AMDGPU.
```
  312a9d1b
- AMDGPU/GlobalISel: Split 64-bit G_CTPOP in RegBankSelect · c437f6c6
  Matt Arsenault authored Jan 25, 2020
  
  c437f6c6
- GlobalISel: Fix narrowing of G_CTLZ/G_CTTZ · 6135f5ed
  Matt Arsenault authored Feb 07, 2020
```
The result type is separate from the source type.
```
  6135f5ed
- AMDGPU/GlobalISel: Don't mis-select vector index on a constant · 2126c70e
  Matt Arsenault authored Feb 06, 2020
```
Vector indexing with a constant index should be folded out in the
legalizer, but this was accidentally falling through. This would
produce the indexing operation with $noreg. Handle this case as a
dynamic index just in case a bug like this happens again in the
future.
```
  2126c70e
- AMDGPU/GlobalISel: Look through casts when legalizing vector indexing · f4a38c11
  Matt Arsenault authored Feb 06, 2020
```
We were failing to find constants that were casted. I feel like the
artifact combiner should have folded the constant in the trunc before
the custom lowering, but that doesn't happen.
```
  f4a38c11
Feb 09, 2020

AMDGPU: Remove dead kill handling · 00115d76

Matt Arsenault authored Jan 06, 2020

At one point a custom node was used for kill handling, but now the
intrinsic is directly selected. Remove leftover pattern machinery.

00115d76

AMDGPU: Fix SI_IF lowering when the save exec reg has terminator uses · 6e177082

Matt Arsenault authored Dec 27, 2019

Reverts part of 6524a7a2. Since that
commit, the expansion was ignoring the actual save exec register
produced by the instruction, and looking at other instructions. I do
not understand why it was looking at other instructions, but relying
on this scan was wrong.

Fixes verifier errors after SI_IF is tail duplicated, which should be
correct to do. The results were fed into a phi, which was lowered to
the S_MOV_B64_term instructions.

6e177082

[X86] combineConcatVectorOps - combine VROTLI/VROTRI ops · 29e646fe

Simon Pilgrim authored Feb 09, 2020

Fix issue mentioned on rGe82e17d4d4ca - non-AVX512BW targets failed to concatenate 256-bit rotations back to 512-bits (split during shuffle lowering as they don't have v32i16/v64i8 types).

29e646fe

[X86] Use custom isel for (X86sbb_flag 0, 0) so we can use 32-bit SBB for i8/i16. · 656d66f5

Craig Topper authored Feb 09, 2020

We were using MOV32r0 and an extract_subreg as an input. By using
custom isel we can move the extract_subreg to after the SBB instead
of on the input.

656d66f5

[X86] Add flag result VT to a MOV32r0 created in X86DAGToDAGISel::Select · e1cbfecd

Craig Topper authored Feb 09, 2020

The flag isn't used, but I believe this matches the MOV32r0 that
would be created by the table emitter. This should allow this node
to be CSEed with any others created by the table.

e1cbfecd

[X86] Add lowerShuffleAsBitRotate (PR44379) · e82e17d4

Simon Pilgrim authored Feb 09, 2020

As noted on PR44379, we didn't attempt to lower vector shuffles using bit rotations on XOP/AVX512F targets.

This patch lowers to uniform ISD:ROTL nodes - ROTR isn't supported by XOP and they are interchangeable for constant values anyway.

There might be cases where targets without ISD:ROTL support would benefit from this (expanding to SRL+SHL+OR), which I'll investigate in a future patch.

Also, non-AVX512BW targets fail to concatenate 256-bit rotations back to 512-bits (split during shuffle lowering as they don't have v32i16/v64i8 types).

e82e17d4

[X86] Use MVT::i32 for the type of a MOV32r0 created in X86DAGToDAGISel::Select. · dd262222
Craig Topper authored Feb 09, 2020
```
Not sure if this really matters. The VT isn't really used after
this point. At best it might affect CSE.
```
dd262222

[X86] Remove isel patterns that include a vselect/X86selects and a strict FP node. · dbcc1392

Craig Topper authored Feb 08, 2020

A vselect+strictfp node is not equivalent to a masked operation.
The exceptions of the strictfp node are not masked by a vselect
after it so we can't match it to a masked operation.

We already had a hack in IsLegalToFold to prevent these patterns from
matching. This patch removes that hack and removes the patterns.

dbcc1392

libclc/r600: Use target specific builtins to implement rsqrt and native_rsqrt · 85e2fa44

Jan Vesely authored Feb 04, 2020

Fixes OCL CTS rsqrt and half_rsqrt (1 thread, scalaer) tests on AMD Turks.

Reviewer: awatry
Differential Revision: https://reviews.llvm.org/D74016

85e2fa44

libclc: Move rsqrt implementation to a .cl file · 4b23a2e8
Jan Vesely authored Feb 04, 2020
```
Reviewer: awatry
Differential Revision: https://reviews.llvm.org/D74013
```
4b23a2e8
[X86][XOP] Add XOP target to vXi16/vXi8 shuffle tests · 0ae119f8
Simon Pilgrim authored Feb 09, 2020
```
Helps with bit rotation test coverage for PR44379
```
0ae119f8
[X86][SSE] Add more tests showing failure to lower shuffles as bit rotations · 22780731
Simon Pilgrim authored Feb 09, 2020

22780731
[X86] Rename matchShuffleAsRotate - matchShuffleAsByteRotate. NFCI. · 29621b25
Simon Pilgrim authored Feb 09, 2020
```
A matchShuffleAsBitRotate variant will be added soon and we need to make the difference more obvious.
```
29621b25
[lldb] [doc] Status: Linux: Update the paragraph · 9d223a01
Jan Kratochvil authored Feb 09, 2020

9d223a01
[LLDB] [doc] Document NetBSD status and sort OSs alphabetically · 273f6383
Kamil Rytarowski authored Feb 09, 2020

273f6383
[gn build] Port a17f03bd · 628462e3
LLVM GN Syncbot authored Feb 09, 2020

628462e3