Commits · 6d7f7e6792bb79eeb209c359717862e8da7e4c30 · Roger Ferrer / llvm-epi

Oct 01, 2019

Reland "[utils] Implement the llvm-locstats tool" · 6d7f7e67

Djordje Todorovic authored Oct 01, 2019

The tool reports verbose output for the DWARF debug location coverage.
The llvm-locstats for each variable or formal parameter DIE computes what
percentage from the code section bytes, where it is in scope, it has
location description. The line 0 shows the number (and the percentage) of
DIEs with no location information, but the line 100 shows the number (and
the percentage) of DIEs where there is location information in all code
section bytes (where the variable or parameter is in the scope). The line
50..59 shows the number (and the percentage) of DIEs where the location
information is in between 50 and 59 percentage of its scope covered.

Differential Revision: https://reviews.llvm.org/D66526

llvm-svn: 373317

6d7f7e67

[yaml2obj] - Allow specifying custom Link values for SHT_HASH section. · 0210a1a5

George Rimar authored Oct 01, 2019

This allows setting any sh_link values for SHT_HASH sections.

Differential revision: https://reviews.llvm.org/D68214

llvm-svn: 373316

0210a1a5

[yaml2obj/obj2yaml] - Add support for SHT_HASH sections. · e5163ebf

George Rimar authored Oct 01, 2019

SHT_HASH specification is:
http://www.sco.com/developers/gabi/latest/ch5.dynamic.html#hash

In short the format is the following: it has 2 uint32 fields
in its header: nbucket and nchain followed by (nbucket + nchain)
uint32 values.

This patch allows dumping and parsing such sections.

Differential revision: https://reviews.llvm.org/D68085

llvm-svn: 373315

e5163ebf

Fixup r373278: Move test to X86 directory · c2c377ea
Diana Picus authored Oct 01, 2019
```
...since it's using an x86 triple.

llvm-svn: 373314
```
c2c377ea

Revert "GlobalISel: Handle llvm.read_register" · 827a7fab

Dmitri Gribenko authored Oct 01, 2019

This reverts commit r373294. It broke Clang's
CodeGen/arm64-microsoft-status-reg.cpp:
http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/18483

llvm-svn: 373310

827a7fab

[X86] Consider isCodeGenOnly in the EVEX2VEX pass to make VMAXPD/PS map to the... · 220cf535

Craig Topper authored Oct 01, 2019

[X86] Consider isCodeGenOnly in the EVEX2VEX pass to make VMAXPD/PS map to the non-commutable VEX instruction. Use EVEX2VEX override to fix the scalar instructions.

Previously the match was ambiguous and VMAXPS/PD and VMAXCPS/PD
were mapped to the same VEX instruction. But we should keep
the commutableness when change the opcode.

llvm-svn: 373303

220cf535

[WebAssembly] Make sure EH pads are preferred in sorting · e2bcab61

Heejin Ahn authored Oct 01, 2019

Summary:
In CFGSort, we try to make EH pads have higher priorities as soon as
they are ready to be sorted, to prevent creation of unwind destination
mismatches in CFGStackify. We did that by making priority queues'
comparison function  prefer EH pads, but it was possible for an EH pad
to be popped from `Preferred` queue and then not sorted immediately and
enter `Ready` queue instead in a certain condition. This patch makes
sure that special condition does not consider EH pads as its candidates.

Reviewers: dschuff

Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68229

llvm-svn: 373302

e2bcab61

[WebAssembly] Unstackify regs after fixing unwinding mismatches · 61d5c76a

Heejin Ahn authored Oct 01, 2019

Summary:
Fixing unwind mismatches for exception handling can result in splicing
existing BBs and moving some of instructions to new BBs. In this case
some of stackified def registers in the original BB can be used in the
split BB. For example, we have this BB and suppose %r0 is a stackified
register.
```
bb.1:
  %r0 = call @foo
  ... use %r0 ...
```

After fixing unwind mismatches in CFGStackify, `bb.1` can be split and
some instructions can be moved to a newly created BB:
```
bb.1:
  %r0 = call @foo

bb.split (new):
  ... use %r0 ...
```

In this case we should make %r0 un-stackified, because its use is now in
another BB.

When spliting a BB, this CL unstackifies all def registers that have
uses in the new split BB.

Reviewers: dschuff

Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68218

llvm-svn: 373301

61d5c76a

AMDGPU/GlobalISel: Select s1 src G_SITOFP/G_UITOFP · fdea5e02
Matt Arsenault authored Oct 01, 2019
```
llvm-svn: 373298
```
fdea5e02
AMDGPU/GlobalISel: Add support for init.exec intrinsics · 59b91aa9
Matt Arsenault authored Oct 01, 2019
```
TThe existing wave32 behavior seems broken and incomplete, but this
reproduces it.

llvm-svn: 373296
```
59b91aa9

GlobalISel: Handle llvm.read_register · bdcc6d3d

Matt Arsenault authored Oct 01, 2019

SelectionDAG has a bunch of machinery to defer this to selection time
for some reason. Just directly emit a copy during IRTranslator. The
x86 usage does somewhat questionably check hasFP, which could depend
on the whole function being at minimum translated.

This does lose the convergent bit if the callsite had it, which may be
a problem. We also lose that in general for intrinsics, which may also
be a problem.

llvm-svn: 373294

bdcc6d3d

AMDGPU/GlobalISel: Avoid creating shift of 0 in arg lowering · 8f6bdb76

Matt Arsenault authored Oct 01, 2019

This is sort of papering over the fact that we don't run a combiner
anywhere, but avoiding creating 2 instructions in the first place is
easy.

llvm-svn: 373293

8f6bdb76

[llvm-readobj/llvm-readelf] Delete --arm-attributes (alias for --arch-specific) · 2d92c884

Fangrui Song authored Oct 01, 2019

D68110 added --arch-specific (supported by GNU readelf) and made
--arm-attributes an alias for it. The tests were later migrated to use
--arch-specific.

Note, llvm-readelf --arch-specific currently just uses llvm-readobj
style output for ARM attributes. The readelf-style output is not
implemented.

Reviewed By: compnerd, kongyi, rupprecht

Differential Revision: https://reviews.llvm.org/D68196

llvm-svn: 373291

2d92c884

[X86] Add test case to show missed opportunity to shrink a constant index to a... · 5dc49a83

Craig Topper authored Oct 01, 2019

[X86] Add test case to show missed opportunity to shrink a constant index to a gather in order to avoid splitting.

Also add a test case for an index that could be shrunk, but
would create a narrow type. We can go ahead and do it we just
need to be before type legalization.

Similar test cases for scatter as well.

llvm-svn: 373290

5dc49a83

AMDGPU/GlobalISel: Select G_UADDO/G_USUBO · 54167ea3
Matt Arsenault authored Oct 01, 2019
```
llvm-svn: 373288
```
54167ea3
GlobalISel: Implement widenScalar for G_SITOFP/G_UITOFP sources · ed85b0ce
Matt Arsenault authored Oct 01, 2019
```
Legalize 16-bit G_SITOFP/G_UITOFP for AMDGPU.

llvm-svn: 373287
```
ed85b0ce

AMDGPU/GlobalISel: Legalize G_GLOBAL_VALUE · 77ac4001

Matt Arsenault authored Oct 01, 2019

Handle other cases besides LDS. Mostly a straight port of the existing
handling, without the intermediate custom nodes.

llvm-svn: 373286

77ac4001

DebugInfo: Add parsing support for debug_loc base address specifiers · 5ca30666
David Blaikie authored Oct 01, 2019
```
llvm-svn: 373278
```
5ca30666
Add partial bswap test to the X86 backend. NFC · d60c297d
Amaury Sechet authored Sep 30, 2019
```
llvm-svn: 373271
```
d60c297d

Sep 30, 2019

[ConstantFolding] Fold constant calls to log2() · 22cb3d2e

Evandro Menezes authored Sep 30, 2019

Somehow, folding calls to `log2()` with a constant was missing.

Differential revision: https://reviews.llvm.org/D67300

llvm-svn: 373262

22cb3d2e

[InstCombine] Expand the simplification of log() · 110b1138

Evandro Menezes authored Sep 30, 2019

Expand the simplification of special cases of `log()` to include `log2()`
and `log10()` as well as intrinsics and more types.

Differential revision: https://reviews.llvm.org/D67199

llvm-svn: 373261

110b1138

[FunctionAttrs] Added noalias for memccpy/mempcpy arguments · a05e671c
David Bolvansky authored Sep 30, 2019
```
llvm-svn: 373251
```
a05e671c
[NFC][InstCombine] Redundant-left-shift-input-masking: add some more undef tests · 0205be8f
Roman Lebedev authored Sep 30, 2019
```
llvm-svn: 373248
```
0205be8f

[X86] Mask off upper bits of splat element in LowerBUILD_VECTORvXi1 when forming a SELECT. · 3405237f

Craig Topper authored Sep 30, 2019

The i1 scalar would have been type legalized to i8, but that
doesn't guarantee anything about the upper bits. If we're going
to use it as condition we need to make sure the upper bits are 0.

I've special cased ISD::SETCC conditions since that should
guarantee zero upper bits. We could go further and use
computeKnownBits, but we have no tests that would need that.

Fixes PR43507.

llvm-svn: 373246

3405237f

Revert "[MC] Emit unused undefined symbol even if its binding is not set" · 2331cd69
Nico Weber authored Sep 30, 2019
```
This reverts r373168. It caused PR43511.

llvm-svn: 373242
```
2331cd69

[PGO] Don't group COMDAT variables for compiler generated profile variables in ELF · 36740500

Rong Xu authored Sep 30, 2019

With this patch, compiler generated profile variables will have its own COMDAT
name for ELF format, which syncs the behavior with COFF. Tested with clang
PGO bootstrap. This shows a modest reduction in object sizes in ELF format.

Differential Revision: https://reviews.llvm.org/D68041

llvm-svn: 373241

36740500

[X86] Add ANY_EXTEND to switch in ReplaceNodeResults, but just fall back to default handling. · 299ebacf

Craig Topper authored Sep 30, 2019

ANY_EXTEND of v8i8 is marked Custom on AVX512 for handling extends
from v8i8. But the type legalization infrastructure will call
ReplaceNodeResults for v8i8 results. We should just defer it the
default handling instead of asserting in the default of the switch.

Fixes PR43509.

llvm-svn: 373234

299ebacf

[AArch64][SVE] Implement punpk[hi|lo] intrinsics · 01b84e17

Kerry McLaughlin authored Sep 30, 2019

Summary:
Adds the following two intrinsics:
  - int_aarch64_sve_punpkhi
  - int_aarch64_sve_punpklo

This patch also contains a fix which allows LLVMHalfElementsVectorType
to forward reference overloadable arguments.

Reviewers: sdesmalen, rovka, rengolin

Reviewed By: sdesmalen

Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, greened, cfe-commits, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D67830

llvm-svn: 373232

01b84e17

[InstCombine] fold negate disguised as select+mul · 712b7c24

Sanjay Patel authored Sep 30, 2019

  Name: negate if true
  %sel = select i1 %cond, i32 -1, i32 1
  %r = mul i32 %sel, %x
  =>
  %m = sub i32 0, %x
  %r = select i1 %cond, i32 %m, i32 %x

  Name: negate if false
  %sel = select i1 %cond, i32 1, i32 -1
  %r = mul i32 %sel, %x
  =>
  %m = sub i32 0, %x
  %r = select i1 %cond, i32 %x, i32 %m

https://rise4fun.com/Alive/Nlh

llvm-svn: 373230

712b7c24

[AArch64][GlobalISel] Support lowering variadic musttail calls · b1c1095f

Jessica Paquette authored Sep 30, 2019

This adds support for lowering variadic musttail calls. To do this, we have
to...

- Detect a musttail call in a variadic function before attempting to lower the
  call's formal arguments. This is done in the IRTranslator.
- Compute forwarded registers in `lowerFormalArguments`, and add copies for
  those registers.
- Restore the forwarded registers in `lowerTailCall`.

Because there doesn't seem to be any nice way to wrap these up into the outgoing
argument handler, the restore code in `lowerTailCall` is done separately.

Also, irritatingly, you have to make sure that the registers don't overlap with
any passed parameters. Otherwise, the scheduler doesn't know what to do with the
extra copies and asserts.

Add call-translator-variadic-musttail.ll to test this. This is pretty much the
same as the X86 musttail-varargs.ll test. We didn't have as nice of a test to
base this off of, but the idea is the same.

Differential Revision: https://reviews.llvm.org/D68043

llvm-svn: 373226

b1c1095f

Add tests for rotate with demanded bits. NFC · 09025ca6
Amaury Sechet authored Sep 30, 2019
```
llvm-svn: 373223
```
09025ca6
[InstCombine] add tests for negate disguised as mul; NFC · 8913882f
Sanjay Patel authored Sep 30, 2019
```
llvm-svn: 373222
```
8913882f
[AMDGPU] SIFoldOperands should not fold register acrocc the EXEC definition · 565b1d3d
Alexander Timofeev authored Sep 30, 2019
```
      Reviewers: rampitec

      Differential Revision: https://reviews.llvm.org/D67662

llvm-svn: 373221
```
565b1d3d

[SSP] [3/3] cmpxchg and addrspacecast instructions can now · ed1f3f36

Paul Robinson authored Sep 30, 2019

trigger stack protectors.  Fixes PR42238.

Add test coverage for llvm.memset, as proxy for all llvm.mem*
intrinsics. There are two issues here: (1) they could be lowered to a
libc call, which could be intercepted, and do Bad Stuff; (2) with a
non-constant size, they could overwrite the current stack frame.

The test was mostly written by Matt Arsenault in r363169, which was
later reverted; I tweaked what he had and added the llvm.memset part.

Differential Revision: https://reviews.llvm.org/D67845

llvm-svn: 373220

ed1f3f36

[SSP] [1/3] Revert "StackProtector: Use PointerMayBeCaptured" · 14945186

Paul Robinson authored Sep 30, 2019

"Captured" and "relevant to Stack Protector" are not the same thing.

This reverts commit f29366b1.
aka r363169.

Differential Revision: https://reviews.llvm.org/D67842

llvm-svn: 373216

14945186

Revert "Reland "[utils] Implement the llvm-locstats tool"" · 8180f3b1
Djordje Todorovic authored Sep 30, 2019
```
This reverts commit rL373183.

llvm-svn: 373200
```
8180f3b1

[NFC][ARM][MVE] More tests · e3b4f0ec

Sam Parker authored Sep 30, 2019

Add some loop tests that cover different float operations and types.

llvm-svn: 373192

e3b4f0ec

Pre-commit a test case for PR43129. · 8569c0f1
Hans Wennborg authored Sep 30, 2019
```
llvm-svn: 373190
```
8569c0f1
[llvm-locstats] Fix the test for the Hexagon target · 180f1feb
Djordje Todorovic authored Sep 30, 2019
```
llvm-svn: 373189
```
180f1feb

[ARM][MVE] Change VCTP operand · aac03ae0

Sam Parker authored Sep 30, 2019

The VCTP instruction will calculate the predicate masked based upon
the number of elements that need to be processed. I had inserted the
sub before the vctp intrinsic and supplied it as the operand, but
this is incorrect as the phi should directly feed the vctp. The sub
is calculating the value for the next iteration.

Differential Revision: https://reviews.llvm.org/D67921

llvm-svn: 373188

aac03ae0