  1. Feb 01, 2018
    • [NFC] 'DWARFv5' -> 'DWARF v5' · 197e47f5
      Jonas Devlieghere authored
      llvm-svn: 323950
    • Test commit: Fix a comment. · 705e26a2
      Yvan Roux authored
      llvm-svn: 323947
    • [LSR] Don't force bases of foldable formulae to the final type. · 6d06976e
      Mikael Holmen authored
      Summary:
      Before emitting code for scaled registers, we prevent
      SCEVExpander from hoisting any scaled addressing mode
      by emitting all the bases first. However, these bases
      are being forced to the final type, resulting in some
      odd code.
      
      For example, if the type of the base is an integer and
      the final type is a pointer, we will emit an inttoptr
      for the base, a ptrtoint for the scale, and then a
      'reverse' GEP where the GEP pointer is actually the base
      integer and the index is the pointer. It's more intuitive
      to use the pointer as a pointer and the integer as index.
      
      Patch by: Bevin Hansson
      
      Reviewers: atrick, qcolombet, sanjoy
      
      Reviewed By: qcolombet
      
      Subscribers: llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D42103
      
      llvm-svn: 323946
    • [XRay][compiler-rt+llvm] Update XRay register stashing semantics · cdca0730
      Dean Michael Berris authored
      Summary:
      This change expands the number of registers stashed by the entry and
      `__xray_CustomEvent` trampolines.
      
      We've found that `__xray_CustomEvent` trampoline calls can show up in
      situations where the scratch registers are in use, and since we don't
      typically want to affect the code-gen around the disabled
      `__xray_customevent(...)` intrinsic calls, we need to save and restore the
      state of even the scratch registers when handling these custom events.
      
      Reviewers: pcc, pelikan, dblaikie, eizan, kpw, echristo, chandlerc
      
      Reviewed By: echristo
      
      Subscribers: chandlerc, echristo, hiraditya, davide, dblaikie, llvm-commits
      
      Differential Revision: https://reviews.llvm.org/D40894
      
      llvm-svn: 323940
    • [MC] Fix assembler infinite loop on EH table using LEB padding. · 45b12f18
      Rafael Espindola authored
      Fix the infinite loop reported in PR35809. It can occur with GCC-style
      EH table assembly, where the compiler relies on the assembler to
      calculate the offsets in the EH table.
      
      Also see https://sourceware.org/bugzilla/show_bug.cgi?id=4029 for the
      equivalent issue in the GNU assembler.
      
      Patch by Ryan Prichard!
      
      llvm-svn: 323934
    • [GlobalOpt] Improve common case efficiency of static global initializer evaluation · 93b0ff20
      Amara Emerson authored
      For very, very large global initializers which can be statically evaluated, the
      code would create vectors of temporary Constants, modifying them in place,
      before committing the resulting Constant aggregate to the global's initializer
      value. This had effectively O(n^2) complexity in the size of the global
      initializer and would cause memory and non-termination issues compiling some
      workloads.
      
      This change performs the static initializer evaluation and creation in batches,
      once for each global in the evaluated IR memory. The existing code is kept
      as a last resort for initializers more complex than simple values in a
      large aggregate. This should theoretically be NFC; there is no test because
      the example case is massive. The existing test cases pass with this change,
      as does the llvm test suite.
      
      To give an example, consider the following C++ code adapted from the clang
      regression tests:
      struct S {
       int n = 10;
       int m = 2 * n;
       S(int a) : n(a) {}
      };
      
      template<typename T>
      struct U {
       T *r = &q;
       T q = 42;
       U *p = this;
      };
      
      U<S> e;
      
      The global static constructor for 'e' will need to initialize 'r' and 'p' of
      the outer struct, while also initializing the inner 'q' struct's 'n' and 'm'
      members. This batch algorithm will simply use the general CommitValueTo()
      method to handle the complex nested S struct initialization of 'q', before
      processing the outermost members in a single batch. Using CommitValueTo() to
      handle each member in the outer struct is inefficient when the struct/array is
      very large, as we end up creating and destroying constant arrays for each
      initialization.
      For the above case, we expect the following IR to be generated:
      
      %struct.U = type { %struct.S*, %struct.S, %struct.U* }
      %struct.S = type { i32, i32 }
      @e = global %struct.U { %struct.S* getelementptr inbounds (%struct.U, %struct.U* @e,
                                                                 i64 0, i32 1),
                              %struct.S { i32 42, i32 84 }, %struct.U* @e }
      The %struct.S { i32 42, i32 84 } inner initializer is treated as a complex
      constant expression, while the other two elements of @e are "simple".
      
      Differential Revision: https://reviews.llvm.org/D42612
      
      llvm-svn: 323933
    • DAG: Fix not truncating when promoting bswap/bitreverse · df0f2507
      Matt Arsenault authored
      These need to convert back to the original type, like any
      other promotion.
      
      llvm-svn: 323932
  2. Jan 31, 2018