Commits · ead96d73ac1489bb96ed4bab30c8d4b112862251 · Roger Ferrer / llvm-epi

Sep 23, 2019

Revert "Reland "[utils] Implement the llvm-locstats tool"" · ead96d73
Djordje Todorovic authored Sep 23, 2019
```
This reverts commit rL372554.

llvm-svn: 372580
```
ead96d73

[llvm-readobj] - Stop treating ".stack_sizes.*" sections as stack sizes sections. · 753f6cff

George Rimar authored Sep 23, 2019

llvm-readobj currently handles .stack_sizes.* (e.g. .stack_sizes.foo)
as a normal stack sizes section. Though MC does not produce sections with
such names. Also, linkers do not combine .stack_sizes.* into .stack_sizes.

A mini discussion about this correctness issue is here: https://reviews.llvm.org/D67757#inline-609274
This patch changes implementation so that only now only '.stack_sizes' name is
accepted as a real stack sizes section.

Differential revision: https://reviews.llvm.org/D67824

llvm-svn: 372578

753f6cff

CriticalAntiDepBreaker - Assert that we've found the bottom of the critical path. NFCI. · 0b184b85
Simon Pilgrim authored Sep 23, 2019
```
Silences static analyzer null dereference warnings.

llvm-svn: 372577
```
0b184b85

[llvm-readobj] - Implement LLVM-style dumping for .stack_sizes sections. · 4e0faa33

George Rimar authored Sep 23, 2019

D65313 implemented GNU-style dumping (llvm-readelf).
This one implements LLVM-style dumping (llvm-readobj).

Differential revision: https://reviews.llvm.org/D67834

llvm-svn: 372576

4e0faa33

[FunctionAttrs] Enable nonnull arg propagation · d90fd41f

David Bolvansky authored Sep 23, 2019

Enable flag introduced in rL294998. Security concerns are no longer valid, since function signatures for mentioned libc functions has no nonnull attribute (Clang does not generate them? I see no nonnull attr in LLVM IR for these functions) and since rL372091 we carefully annotate the callsites where we know that size is static, non zero. So let's enable this flag again..

llvm-svn: 372573

d90fd41f

[ARM][MVE] Remove old tail predicates · 9feb429a

Sam Parker authored Sep 23, 2019

Remove any predicate that we replace with a vctp intrinsic, and try
to remove their operands too. Also look into the exit block to see if
there's any duplicates of the predicates that we've replaced and
clone the vctp to be used there instead.

Differential Revision: https://reviews.llvm.org/D67709

llvm-svn: 372567

9feb429a

[AArch64] support neon_sshl and neon_ushl in performIntrinsicCombine. · 3e2fdbee

Florian Hahn authored Sep 23, 2019

Try to generate ushll/sshll for aarch64_neon_ushl/aarch64_neon_sshl,
if their first operand is extended and the second operand is a constant

Also adds a few tests marked with FIXME, where we can further increase
codegen.

Reviewers: t.p.northover, samparker, dmgreen, anemet

Reviewed By: anemet

Differential Revision: https://reviews.llvm.org/D62308

llvm-svn: 372565

3e2fdbee

[ARM][LowOverheadLoops] Use subs during revert. · 4ba6d0de

Sam Parker authored Sep 23, 2019

Check whether there are any uses or defs between the LoopDec and
LoopEnd. If there's not, then we can use a subs to set the cpsr and
skip generating a cmp.

Differential Revision: https://reviews.llvm.org/D67801

llvm-svn: 372560

4ba6d0de

[Alignment][NFC] Switch DataLayout private members to llvm::Align · 046a16b8

Guillaume Chatelet authored Sep 23, 2019

Summary:
This is patch is part of a series to introduce an Alignment type.
See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html
See this patch for the introduction of the type: https://reviews.llvm.org/D64790

Reviewers: courbet

Subscribers: hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D67836

llvm-svn: 372558

046a16b8

[ARM][LowOverheadLoops] Use tBcc when reverting · 566127e3

Sam Parker authored Sep 23, 2019

Check the branch target ranges and use a tBcc instead of t2Bcc when
we can.

Differential Revision: https://reviews.llvm.org/D67796

llvm-svn: 372557

566127e3

[MIPS GlobalISel] VarArg argument lowering, select G_VASTART and vacopy · c063b0b0

Petar Avramovic authored Sep 23, 2019

CC_Mips doesn't accept vararg functions for O32, so we have to explicitly
use CC_Mips_FixedArg.
For lowerCall we now properly figure out whether callee function is vararg
or not, this has no effect for O32 since we always use CC_Mips_FixedArg.
For lower formal arguments we need to copy arguments in register to stack
and save pointer to start for argument list into MipsMachineFunction
object so that G_VASTART could use it during instruction select.
For vacopy we need to copy content from one vreg to another,
load and store are used for that purpose.

Differential Revision: https://reviews.llvm.org/D67756

llvm-svn: 372555

c063b0b0

Reland "[utils] Implement the llvm-locstats tool" · 0e490ae0

Djordje Todorovic authored Sep 23, 2019

The tool reports verbose output for the DWARF debug location coverage.
The llvm-locstats for each variable or formal parameter DIE computes what
percentage from the code section bytes, where it is in scope, it has
location description. The line 0 shows the number (and the percentage) of
DIEs with no location information, but the line 100 shows the number (and
the percentage) of DIEs where there is location information in all code
section bytes (where the variable or parameter is in the scope). The line
50..59 shows the number (and the percentage) of DIEs where the location
information is in between 50 and 59 percentage of its scope covered.

Differential Revision: https://reviews.llvm.org/D66526

llvm-svn: 372554

0e490ae0

[X86] Canonicalize all zeroes vector to RHS in X86DAGToDAGISel::tryVPTESTM. · 03b5a13e
Craig Topper authored Sep 23, 2019
```
llvm-svn: 372544
```
03b5a13e

[X86] Remove SETEQ/SETNE canonicalization code from LowerIntVSETCC_AVX512 to... · 5e26064c

Craig Topper authored Sep 23, 2019

[X86] Remove SETEQ/SETNE canonicalization code from LowerIntVSETCC_AVX512 to prevent an infinite loop.

The attached test case would previous infinite loop after
r365711.

I'm going to move this to X86ISelDAGToDAG.cpp to get the setcc
to match VPTEST in 32-bit mode in a follow up commit.

llvm-svn: 372543

5e26064c

[X86] Add 32-bit command line to avx512f-vec-test-testn.ll · 1f058538
Craig Topper authored Sep 23, 2019
```
llvm-svn: 372542
```
1f058538

Prefer AVX512 memcpy when applicable · a7a515cb

David Zarzycki authored Sep 23, 2019

When AVX512 is available and the preferred vector width is 512-bits or
more, we should prefer AVX512 for memcpy().

https://bugs.llvm.org/show_bug.cgi?id=43240

https://reviews.llvm.org/D67874

llvm-svn: 372540

a7a515cb

[X86] Convert to Constant arguments to MMX shift by i32 intrinsics to... · da4a4707

Craig Topper authored Sep 23, 2019

[X86] Convert to Constant arguments to MMX shift by i32 intrinsics to TargetConstant during lowering.

This allows us to use timm in the isel table which is more
consistent with other intrinsics that take an immediate now.

We can't declare the intrinsic as taking an ImmArg because we
need to match non-constants to the shift by MMX register
instruction which we do by mutating the intrinsic id during
lowering.

llvm-svn: 372537

da4a4707

[X86] Remove stale FIXME. · 5efc928a

Craig Topper authored Sep 23, 2019

This goes back to when MMX was migrated to intrinsic only. The
hack referenced here has been gone for quite a while.

llvm-svn: 372536

5efc928a

[X86][SelectionDAGBuilder] Move the hack for handling MMX shift by i32... · a533e877

Craig Topper authored Sep 23, 2019

[X86][SelectionDAGBuilder] Move the hack for handling MMX shift by i32 intrinsics into the X86 backend.

This intrinsics should be shift by immediate, but gcc allows any
i32 scalar and clang needs to match that. So we try to detect the
non-constant case and move the data from an integer register to an
MMX register.

Previously this was done by creating a v2i32 build_vector and
bitcast in SelectionDAGBuilder. This had to be done early since
v2i32 isn't a legal type. The bitcast+build_vector would be DAG
combined to X86ISD::MMX_MOVW2D which isel will turn into a
GPR->MMX MOVD.

This commit just moves the whole thing to lowering and emits
the X86ISD::MMX_MOVW2D directly to avoid the illegal type. The
test changes just seem to be due to nodes being linearized in a
different order.

llvm-svn: 372535

a533e877

[X86] Require last argument to LWPINS/LWPVAL builtins to be an ICE. Add ImmArg... · e4c17651

Craig Topper authored Sep 22, 2019

[X86] Require last argument to LWPINS/LWPVAL builtins to be an ICE. Add ImmArg to the llvm intrinsics.

Update the isel patterns to use timm instead of imm.

llvm-svn: 372534

e4c17651

[X86] X86DAGToDAGISel::matchBEXTRFromAndImm(): if can't use BEXTR, fallback to... · 7c3d6f5a

Roman Lebedev authored Sep 22, 2019

[X86] X86DAGToDAGISel::matchBEXTRFromAndImm(): if can't use BEXTR, fallback to BZHI is profitable (PR43381)

Summary:
PR43381 notes that while we are good at matching `(X >> C1) & C2` as BEXTR/BEXTRI,
we only do that if we either have BEXTRI (TBM),
or if BEXTR is marked as being fast (`-mattr=+fast-bextr`).
In all other cases we don't match.

But that is mainly only true for AMD CPU's.
However, for all the CPU's for which we have sched models,
the BZHI is always fast (or the sched models are all bad.)

So if we decide that it's unprofitable to emit BEXTR/BEXTRI,
we should consider falling-back to BZHI if it is available,
and follow-up with the shift.

While it's really tempting to do something because it's cool
it is wise to first think whether it actually makes sense to do.
We shouldn't just use BZHI because we can, but only it it is beneficial.
In particular, it isn't really worth it if the input is a register,
mask is small, or we can fold a load.
But it is worth it if the mask does not fit into 32-bits.

(careful, i don't know much about intel cpu's, my choice of `-mcpu` may be bad here)
Thus we manage to fold a load:
https://godbolt.org/z/Er0OQz
Or if we'd end up using BZHI anyways because the mask is large:
https://godbolt.org/z/dBJ_5h
But this isn'r actually profitable in general case,
e.g. here we'd increase microop count
(the register renaming is free, mca does not model that there it seems)
https://godbolt.org/z/k6wFoz
Likewise, not worth it if we just get load folding:
https://godbolt.org/z/1M1deG

https://bugs.llvm.org/show_bug.cgi?id=43381

Reviewers: RKSimon, craig.topper, davezarzycki, spatel

Reviewed By: craig.topper, davezarzycki

Subscribers: andreadb, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D67875

llvm-svn: 372532

7c3d6f5a

Sep 22, 2019
- Verifier - silence static analyzer dyn_cast<VectorType> null dereference warnings. NFCI. · 638933ac
  Simon Pilgrim authored Sep 22, 2019
```
The static analyzer is warning about potential null dereferences, but we should be able to use cast<VectorType> directly and if not assert will fire for us.

llvm-svn: 372529
```
  638933ac
- [AMDGPU] isSDNodeAlwaysUniform - silence static analyzer dyn_cast<LoadSDNode>... · 557cee33
  Simon Pilgrim authored Sep 22, 2019
```
[AMDGPU] isSDNodeAlwaysUniform - silence static analyzer dyn_cast<LoadSDNode> null dereference warning. NFCI.

The static analyzer is warning about a potential null dereference, but we should be able to use cast<LoadSDNode> directly and if not assert will fire for us.

llvm-svn: 372528
```
  557cee33
- [X86] Fix some VCVTPS2PH isel patterns where 'i32' was used instead of 'timm' · c4802de3
  Craig Topper authored Sep 22, 2019
```
This seems to have completed omitted any check for the opcode
of the operand in the isel table.

llvm-svn: 372526
```
  c4802de3
- [X86][TableGen] Allow timm to appear in output patterns. Use it to remove... · 80fda375
  Craig Topper authored Sep 22, 2019
```
[X86][TableGen] Allow timm to appear in output patterns. Use it to remove ConvertToTarget opcodes from the X86 isel table.

We're now using a lot more TargetConstant nodes in SelectionDAG.
But we were still telling isel to convert some of them
to TargetConstants even though they already are. This is because
isel emits a conversion anytime the output pattern has a an 'imm'.
I guess for patterns in instructions we take the 'timm' from the
'set' pattern, but for Pat patterns with explcicit output we
previously had to say 'imm' since 'timm' wasn't allowed in outputs.

llvm-svn: 372525
```
  80fda375
- [NFC][X86] Add BEXTR test with load and 33-bit mask (PR43381 / D67875) · 24159592
  Roman Lebedev authored Sep 22, 2019
```
llvm-svn: 372524
```
  24159592
- [X86] Update commutable EVEX vcmp patterns to use timm instead of imm. · a1d86857
  Craig Topper authored Sep 22, 2019
```
We need to match TargetConstant, not Constant. This was broken
in r372338, but we lacked test coverage.

llvm-svn: 372523
```
  a1d86857
- [X86] Add more tests for commuting evex vcmp instructions during isel to fold a load. · ac847712
  Craig Topper authored Sep 22, 2019
```
Some of the isel patterns were not updated to check for
TargetConstant instead of Constant in r372338.

llvm-svn: 372522
```
  ac847712
- [SelectionDAG] computeKnownBits/ComputeNumSignBits - cleanup demanded/unknown paths. NFCI. · c8a9ae4c
  Simon Pilgrim authored Sep 22, 2019
```
Merge the calls, just adjust the demandedelts if we have a valid extract_subvector constant index, else demand all elts.

llvm-svn: 372521
```
  c8a9ae4c
- [XRay] Silence static analyzer dyn_cast<BufferExtents> null dereference warning. NFCI. · 2c558bd8
  Simon Pilgrim authored Sep 22, 2019
```
llvm-svn: 372520
```
  2c558bd8
- [LSR] Silence static analyzer null dereference warnings with assertions. NFCI. · 2441455b
  Simon Pilgrim authored Sep 22, 2019
```
Add assertions to make it clear that GenerateIVChain / NarrowSearchSpaceByPickingWinnerRegs should succeed in finding non-null values

llvm-svn: 372518
```
  2441455b
- ConstantHoisting - Silence static analyzer dyn_cast<PointerType> null dereference warning. NFCI. · db05a482
  Simon Pilgrim authored Sep 22, 2019
```
llvm-svn: 372517
```
  db05a482
- gn build: Friendlier error on invalid entries in llvm_targets_to_build · f7d5f90c
  Nico Weber authored Sep 22, 2019
```
llvm-svn: 372515
```
  f7d5f90c
- [Cost][X86] Add more missing vector truncation costs · 4d486156
  Simon Pilgrim authored Sep 22, 2019
```
The AVX512 cases still need some work to correct recognise the PMOV truncation cases.

llvm-svn: 372514
```
  4d486156
- gn build: consolidate "Nothing to do" branches in targets.gni · 44b6e02f
  Nico Weber authored Sep 22, 2019
```
No behavior change.

llvm-svn: 372512
```
  44b6e02f
- [InstCombine] allow icmp+binop folds before min/max bailout (PR43310) · eb8d39e1
  Sanjay Patel authored Sep 22, 2019
```
This has the potential to uncover missed analysis/folds as shown in the
min/max code comment/test, but fewer restrictions on icmp folds should
be better in general to solve cases like:
https://bugs.llvm.org/show_bug.cgi?id=43310

llvm-svn: 372510
```
  eb8d39e1
- [InstCombine] add tests for icmp fold hindered by min/max; NFC · d2a52428
  Sanjay Patel authored Sep 22, 2019
```
llvm-svn: 372509
```
  d2a52428
- Fix uninitialized variable warning. NFCI. · 8bfea817
  Simon Pilgrim authored Sep 22, 2019
```
llvm-svn: 372508
```
  8bfea817
- [AArch64] AArch64StackTagging - Silence static analyzer dyn_cast<> null dereference warning. NFCI. · 62ed3fea
  Simon Pilgrim authored Sep 22, 2019
```
The static analyzer is warning about potential null dereferences, but we should be able to use cast<> directly and if not assert will fire for us.

llvm-svn: 372507
```
  62ed3fea
- gn build: Add missing RISCV to llvm_targets_to_build="all" · 597ec24c
  Nico Weber authored Sep 22, 2019
```
llvm-svn: 372506
```
  597ec24c