- Oct 21, 2019
-
-
David Carlier authored
llvm-svn: 375406
-
George Rimar authored
I forgot to address this nit before committing. llvm-svn: 375405
-
George Rimar authored
We have the following code to find the quote type: `if (isspace(S.front()) || isspace(S.back())) ...` The problem is that "int isspace( int ch ): The behavior is undefined if the value of ch is not representable as unsigned char and is not equal to EOF." (https://en.cppreference.com/w/cpp/string/byte/isspace) This patch shows how this UB can be triggered and fixes the issue. Differential revision: https://reviews.llvm.org/D69160 llvm-svn: 375404
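A minimal sketch of the conventional fix for this class of UB, following the cppreference guidance quoted above (the helper name is made up for illustration):

```cpp
#include <cctype>

// On platforms where char is signed, a byte like 0xE9 becomes a negative
// int, which makes a raw isspace() call undefined behavior. Casting
// through unsigned char first guarantees the value is representable.
bool isSpaceSafe(char C) {
  return std::isspace(static_cast<unsigned char>(C)) != 0;
}
```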
-
Sam Elliott authored
Summary: When MemCpyOpt is handling aggregate type values, if an instruction (let's call it P) between the targeting load (L) and store (S) clobbers the source pointer of L, it will try to hoist S before P. This process will also hoist S's data dependency instructions.

However, the current implementation has a bug: if one of S's dependency instructions is *also* a user of P, MemCpyOpt will not prevent it from being hoisted above P, causing a use-before-define error. For example, in the newly added test file (i.e. `aggregate-type-crash.ll`), it will try to hoist both `store %my_struct %1, %my_struct* %3` and its dependency, `%3 = bitcast i8* %2 to %my_struct*`, above `%2 = call i8* @my_malloc(%my_struct* %0)`, creating the following BB:
```
entry:
  %1 = bitcast i8* %4 to %my_struct*
  %2 = bitcast %my_struct* %1 to i8*
  %3 = bitcast %my_struct* %0 to i8*
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %2, i8* align 4 %3, i64 8, i1 false)
  %4 = call i8* @my_malloc(%my_struct* %0)
  ret void
```
where there is a use-before-define error between `%1` and `%4`.

Update: the compiler for the Pony Programming Language [also encountered the same bug](https://github.com/ponylang/ponyc/issues/3140).

Patch by Min-Yih Hsu (myhsu)

Reviewers: eugenis, pcc, dblaikie, dneilson, t.p.northover, lattner

Reviewed By: eugenis

Subscribers: lenary, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D66060

llvm-svn: 375403
-
David Green authored
Lower the target independent signed saturating intrinsics to qadd8 and qadd16. This custom-lowers them from a sadd_sat, catching the node early before it is promoted. It also adds QADD8b and QADD16b nodes to mean the bottom "lane" of a qadd8/qadd16, so that we can call demanded bits on it to show that it does not use the upper bits. Also handles QSUB8 and QSUB16. Differential Revision: https://reviews.llvm.org/D68974 llvm-svn: 375402
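For reference, a scalar sketch of the saturating-add semantics being lowered here, per 16-bit lane (a stand-in for what qadd16 computes in hardware, not the actual lowering code):

```cpp
#include <cstdint>
#include <limits>

// One 16-bit lane of a signed saturating add: compute the exact sum in a
// wider type, then clamp it to the int16_t range.
int16_t sadd_sat_i16(int16_t A, int16_t B) {
  int32_t Sum = int32_t(A) + int32_t(B); // widen so the sum never wraps
  if (Sum > std::numeric_limits<int16_t>::max())
    return std::numeric_limits<int16_t>::max();
  if (Sum < std::numeric_limits<int16_t>::min())
    return std::numeric_limits<int16_t>::min();
  return int16_t(Sum);
}
```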
-
David Green authored
llvm-svn: 375401
-
Martin Storsjö authored
llvm-svn: 375400
-
Guillaume Chatelet authored
Summary: This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790

Reviewers: courbet

Subscribers: arsenm, dschuff, jyknight, sdardis, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, fedor.sergeev, jrtc27, atanasyan, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69216

llvm-svn: 375398
-
Roman Lebedev authored
As noted in post-commit review of rL375378. llvm-svn: 375397
-
Roman Lebedev authored
Summary: `ConstantRange::makeGuaranteedNoWrapRegion()` knows how to deal with `mul` since rL335646; there is exhaustive test coverage. This is already used by CVP's `processOverflowIntrinsic()` and by SCEV's `StrengthenNoWrapFlags()`. That being said, currently, this doesn't help much in the end:

| statistic | old | new | delta | percentage |
| correlated-value-propagation.NumMulNSW | 4 | 275 | 271 | 6775.00% |
| correlated-value-propagation.NumMulNUW | 4 | 1323 | 1319 | 32975.00% |
| correlated-value-propagation.NumMulNW | 8 | 1598 | 1590 | 19875.00% |
| correlated-value-propagation.NumNSW | 5715 | 5986 | 271 | 4.74% |
| correlated-value-propagation.NumNUW | 9193 | 10512 | 1319 | 14.35% |
| correlated-value-propagation.NumNW | 14908 | 16498 | 1590 | 10.67% |
| instcount.NumAddInst | 275871 | 275869 | -2 | 0.00% |
| instcount.NumBrInst | 708234 | 708232 | -2 | 0.00% |
| instcount.NumMulInst | 43812 | 43810 | -2 | 0.00% |
| instcount.NumPHIInst | 316786 | 316784 | -2 | 0.00% |
| instcount.NumTruncInst | 62165 | 62167 | 2 | 0.00% |
| instcount.NumUDivInst | 2528 | 2526 | -2 | -0.08% |
| instcount.TotalBlocks | 842995 | 842993 | -2 | 0.00% |
| instcount.TotalInsts | 7376486 | 7376478 | -8 | 0.00% |

(^ test-suite plain, tests still pass)

Reviewers: nikic, reames, luqmana, sanjoy, timshen

Reviewed By: reames

Subscribers: hiraditya, javed.absar, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69203

llvm-svn: 375396
-
Piotr Sobczak authored
Summary: Allow ignoring the single-use check in SimplifyDemandedVectorElts so that operands can be simplified when DemandedElts is known to contain the union of elements used by all users. It is the responsibility of the caller of SimplifyDemandedVectorElts to supply correct DemandedElts.

Simplify a series of extractelement instructions if only a subset of elements is used.

Reviewers: reames, arsenm, majnemer, nhaehnle

Reviewed By: nhaehnle

Subscribers: wdng, jvesely, nhaehnle, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D67345

llvm-svn: 375395
-
LLVM GN Syncbot authored
llvm-svn: 375393
-
Martin Storsjö authored
Differential Revision: https://reviews.llvm.org/D69226 llvm-svn: 375392
-
Martin Storsjö authored
As we now have code that parses the dwarf info for variable locations, we can use that instead of relying on the higher level Symbolizer library, reducing the previous two different dwarf codepaths into one. Differential Revision: https://reviews.llvm.org/D69198 llvm-svn: 375391
-
Martin Storsjö authored
Differential Revision: https://reviews.llvm.org/D69197 llvm-svn: 375390
-
Yevgeny Rouban authored
The current implementation of Instruction::mayReadFromMemory() returns !doesNotAccessMemory(), which is !ReadNone. This does not take into account that the writeonly attribute also indicates that the call does not read from memory. The patch changes the predicate to !doesNotReadMemory(), which reflects the intended behavior. Differential Revision: https://reviews.llvm.org/D69086 llvm-svn: 375389
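A hedged sketch of the distinction, using a hypothetical stand-in struct rather than the real LLVM attribute machinery (only the two predicate names come from the message above):

```cpp
// Hypothetical stand-in for a call site's memory attributes; the real
// change lives on llvm::Instruction, this only illustrates the predicates.
struct CallAttrs {
  bool ReadNone;  // accesses no memory at all
  bool WriteOnly; // may write memory, never reads it

  bool doesNotAccessMemory() const { return ReadNone; }
  bool doesNotReadMemory() const { return ReadNone || WriteOnly; }

  // Before: a writeonly call still counted as a potential read.
  bool mayReadFromMemoryOld() const { return !doesNotAccessMemory(); }
  // After: writeonly correctly implies "does not read".
  bool mayReadFromMemoryNew() const { return !doesNotReadMemory(); }
};
```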
-
Yonghong Song authored
Currently, for an indirect call, the assembly code is printed out as `callx <imm>`. This is not right; it should be `callx <reg>`. Fixed the issue with the proper format. Differential Revision: https://reviews.llvm.org/D69229 llvm-svn: 375386
-
Johannes Doerfert authored
llvm-svn: 375384
-
Johannes Doerfert authored
AAReturnedValues, AAMemoryBehavior, and AANoUnwind can provide information that helps during the tracking or even justifies no-capture. We now use this information and enable no-capture in some test cases designed a long while ago for these cases. llvm-svn: 375382
-
Craig Topper authored
[X86] Check Subtarget.hasSSE3() before calling shouldUseHorizontalOp and emitting X86ISD::FHADD in LowerUINT_TO_FP_i64. This was a regression from r375341. Fixes PR43729. llvm-svn: 375381
-
Philip Reames authored
Nikita pointed out an opportunity; might as well document it in the code. llvm-svn: 375380
-
Philip Reames authored
We can end up with two loop exits whose exit counts are equivalent, but whose textual representation is different and non-obvious. For the sub-case where we have a series of exits which dominate one another (common), eliminate any exits which would iterate *after* a previous exit on the exiting iteration. As noted in the TODO being removed, I'd always thought this was a good idea, but I've now seen this in a real workload as well.

Interestingly, in review, Nikita pointed out there's yet another opportunity to leverage SCEV's reasoning. If we kept track of the min of dominating exits so far, we could discharge exits with EC >= MDE. This is less powerful than the existing transform (since later exits aren't considered), but potentially more powerful for any case where SCEV can prove a >= b, but neither a == b nor a > b. I don't have an example to illustrate that opportunity, but won't be surprised if we find one and return to handle that case as well.

Differential Revision: https://reviews.llvm.org/D69009

llvm-svn: 375379
-
- Oct 20, 2019
-
-
Roman Lebedev authored
In this pattern, all the "magic" bits that we'd `add` are high sign bits, and in the value we'd be adding to they are all unset, not unexpectedly, so we can have an `or` there: https://rise4fun.com/Alive/ups It is possible that `haveNoCommonBitsSet()` should be taught about this pattern so that we never have an `add` variant, but the reasoning would need to be recursive (because of that `select`), so I'm not really sure that would be worth it just yet. llvm-svn: 375378
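As a quick illustration of the underlying identity (not the IR transform itself): when two values share no set bits, addition produces no carries, so `add` and `or` coincide.

```cpp
#include <cassert>
#include <cstdint>

// When (a & b) == 0 there are no carries, so a + b == (a | b).
// This is the identity that lets the pattern above use `or`.
uint32_t addViaOr(uint32_t a, uint32_t b) {
  assert((a & b) == 0 && "operands must have no common set bits");
  return a | b; // bit-identical to a + b under the assertion
}
```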
-
Roman Lebedev authored
In this pattern, all the "magic" bits that we'd add are all high sign bits, and in the value we'd be adding to they are all unset, not unexpectedly, so we can have an `or` there: https://rise4fun.com/Alive/ups llvm-svn: 375377
-
LLVM GN Syncbot authored
llvm-svn: 375376
-
Vladimir Vereschaka authored
llvm-svn: 375375
-
Nikita Popov authored
This adds folds for comparing uadd.sat/usub.sat with zero:
* uadd.sat(a, b) == 0 => a == 0 && b == 0 => (a | b) == 0
* usub.sat(a, b) == 0 => a <= b

And inverted forms for !=.

Differential Revision: https://reviews.llvm.org/D69224

llvm-svn: 375374
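A small reference sketch of why both folds hold, at an arbitrary 8-bit width (scalar stand-ins for the intrinsics):

```cpp
#include <cstdint>

// Reference semantics for the two saturating intrinsics.
uint8_t uadd_sat(uint8_t a, uint8_t b) {
  unsigned s = unsigned(a) + unsigned(b);
  return s > 0xFF ? 0xFF : uint8_t(s); // clamp at the top
}
uint8_t usub_sat(uint8_t a, uint8_t b) {
  return a > b ? uint8_t(a - b) : 0; // clamp at zero
}

// uadd_sat can only yield 0 when nothing was added at all, i.e. both
// inputs are zero -- which is exactly (a | b) == 0.
// usub_sat yields 0 precisely when the subtraction clamped or landed on
// zero exactly, i.e. a <= b.
```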
-
Zinovy Nis authored
llvm-svn: 375373
-
Nikita Popov authored
llvm-svn: 375372
-
Roman Lebedev authored
Summary: This problem consists of several parts:
* Basic sign bit extraction - `trunc? (?shr %x, (bitwidth(x)-1))`. This is trivial and easy to do; we have a fold for it.
* Shift amount reassociation - if we have two identical shifts, and we can simplify-add their shift amounts together, then we likely can just perform them as a single shift. But this is finicky, has one-use restrictions, and the shift opcodes must be identical.

But there is a super-pattern where both of these work together to produce a sign bit test from two shifts + comparison. We do indeed already handle this in most cases. But since we get that fold transitively, it has one-use restrictions. And what's worse, in this case the right-shifts aren't required to be identical, and we can't handle that transitively: if the total shift amount is bitwidth-1, only a sign bit will remain in the output value. But if we look at this from the perspective of two shifts, we can't fold - we can't possibly know what bit pattern we'd produce via two shifts; it will be *some* kind of a mask produced from the original sign bit, but we just can't tell its shape: https://rise4fun.com/Alive/cM0 https://rise4fun.com/Alive/9IN

But it will *only* contain the sign bit and zeros. So from the perspective of the sign bit test, we're good: https://rise4fun.com/Alive/FRz https://rise4fun.com/Alive/qBU Superb!

So the simplest solution is to extend `reassociateShiftAmtsOfTwoSameDirectionShifts()` to also have a pseudo-analysis mode that will ignore extra uses, and will only check whether a) those are two right shifts and b) they end up with a bitwidth(x)-1 shift amount, and return either the original value that we are sign-checking, or null. This does not have any functionality change for the existing `reassociateShiftAmtsOfTwoSameDirectionShifts()`.

All that being said, as discussed in the review, this yet again increases usage of InstSimplify in InstCombine as a utility. Some day that may need to be reevaluated.

https://bugs.llvm.org/show_bug.cgi?id=43595

Reviewers: spatel, efriedma, vsk

Reviewed By: spatel

Subscribers: xbolva00, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D68930

llvm-svn: 375371
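A scalar sketch of the "basic sign bit extraction" building block named in the first bullet, assuming a 32-bit width:

```cpp
#include <cstdint>

// lshr by bitwidth-1 leaves only the sign bit (0 or 1), so comparing the
// result against zero is the same as asking "is x negative?".
bool signBitViaShift(int32_t x) {
  return (uint32_t(x) >> 31) != 0; // equivalent to x < 0
}
```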
-
Roman Lebedev authored
Summary: If all the shift amounts are already poison-producing, then we can add more poison-producing flags on top: https://rise4fun.com/Alive/Ocwi

Otherwise, we should only consider the possible range of shift amounts that don't result in poison. For the unsigned range to not overflow, we must not shift out any set bits, and the actual limit for `x` can be computed by backtransforming the maximal value we could ever get out of the `shl` (`-1`) through `lshr`. If `x` is any larger than that, then it will overflow. Likewise for the signed range, but just in the signed domain.

This is based on the general idea outlined by @nikic in https://reviews.llvm.org/D68672#1714990

Reviewers: nikic, sanjoy

Reviewed By: nikic

Subscribers: hiraditya, llvm-commits, nikic

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D69217

llvm-svn: 375370
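A sketch of the unsigned bound described above, as I read it: backtransforming the all-ones value through `lshr` gives, for each shift amount, the largest operand that keeps `shl` free of unsigned overflow (32-bit width assumed, helper name made up):

```cpp
#include <cstdint>

// shl by `amt` shifts no set bit out (no unsigned wrap) exactly when the
// operand fits under all-ones >> amt, i.e. -1 backtransformed via lshr.
bool shlNoUnsignedWrap(uint32_t x, unsigned amt) {
  return amt < 32 && x <= (UINT32_MAX >> amt);
}
```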
-
Nikita Popov authored
Enumerate one less constant range in TestNoWrapRegionExhaustive, which was unnecessary. This allows us to bump the bit count from 3 to 5 while keeping reasonable timing. Drop four tests for multiply nowrap regions, as these cover subsets of the exhaustive test. They do use a wider bitwidth, but I don't think it's worthwhile to have them additionally now. llvm-svn: 375369
-
Matt Arsenault authored
Avoids a test regression in a future patch. Also add debug printing on this case, so I waste less time debugging folds in the future. llvm-svn: 375367
-
Matt Arsenault authored
We handle it this way for some other address spaces. Since r349196, SILoadStoreOptimizer has been trying to do this. This is after SIFoldOperands runs, which can change the addressing patterns. It's simpler to just split this earlier. llvm-svn: 375366
-
Matt Arsenault authored
llvm-svn: 375365
-
Matt Arsenault authored
llvm-svn: 375364
-
Matt Arsenault authored
It's already available in the class. llvm-svn: 375363
-
Yaxun Liu authored
Sometimes a global var is replaced by a different llvm value. clang uses GetAddrOfGlobalVar to get the original llvm global variable. For most targets, GetAddrOfGlobalVar returns either the llvm global variable or a bitcast of it. However, for the AMDGPU target, GetAddrOfGlobalVar returns the addrspace cast, or addrspace cast plus bitcast, of the llvm global variable. To get the llvm global variable, these casts need to be stripped, otherwise there is an assertion. This patch fixes that. Differential Revision: https://reviews.llvm.org/D69129 llvm-svn: 375362
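A hedged sketch of the stripping step described above; the helper name is hypothetical and the real patch lives in clang's CodeGen, but the ConstantExpr walk illustrates the idea:

```cpp
#include "llvm/IR/Constants.h"
#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/Instruction.h"

// Peel ConstantExpr bitcast/addrspacecast layers (as produced for AMDGPU
// globals) until the underlying GlobalVariable is visible. Hypothetical
// helper for illustration, not the actual clang change.
static llvm::GlobalVariable *getUnderlyingGlobal(llvm::Constant *C) {
  while (auto *CE = llvm::dyn_cast<llvm::ConstantExpr>(C)) {
    if (CE->getOpcode() != llvm::Instruction::BitCast &&
        CE->getOpcode() != llvm::Instruction::AddrSpaceCast)
      break;
    C = llvm::cast<llvm::Constant>(CE->getOperand(0));
  }
  return llvm::dyn_cast<llvm::GlobalVariable>(C);
}
```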
-
George Rimar authored
This patch tries to resolve problems faced in D68943 and uses some of the code written by Konrad Wilhelm Kleine in that patch. Previously, the yaml2obj tool always created a .symtab section. This patch changes that: with it, we only create it when there is a "Symbols:" tag in the YAML document or when we need to create it because it is used by other section(s). obj2yaml follows the new behavior and does not print "Symbols:" anymore when there is no symbol table. Differential revision: https://reviews.llvm.org/D69041 llvm-svn: 375361
-
George Rimar authored
yaml2obj doesn't create .symtab by default anymore. llvm-svn: 375360
-