Commits · 3963ba3e48a4ac8c681235d6d83b48574e6cff31 · Roger Ferrer / llvm-epi

Feb 16, 2017

Collapse my two entries in CODE_OWNERS.txt · 3963ba3e
Teresa Johnson authored Feb 15, 2017
```
llvm-svn: 295259
```
3963ba3e

[dllimport] Check for dtor references in functions · cac8ce06

Hans Wennborg authored Feb 15, 2017

Destructor references are not modelled explicitly in the AST. This adds
checks for destructor calls due to variable definitions and temporaries.

If a dllimport function references a non-dllimport destructor, it must
not be emitted available_externally, as the referenced destructor might
live across the DLL boundary and isn't exported.

llvm-svn: 295258

cac8ce06

[dllimport] Look through typedefs and arrays in HasNonDllImportDtor · 6c3d625f

Hans Wennborg authored Feb 15, 2017

The function is used to check whether a type is a class with
non-dllimport destructor. It needs to look through typedefs and array
types.

llvm-svn: 295257

6c3d625f

[Docs] Add missing space to an option doc · a2111793
Alex Lorenz authored Feb 15, 2017
```
llvm-svn: 295256
```
a2111793

GlobalISel: legalize va_arg on AArch64. · 9136617a

Tim Northover authored Feb 15, 2017

Uses a Custom implementation because the slot sizes being a multiple of the
pointer size isn't really universal, even for the architectures that do have a
simple "void *" va_list.

llvm-svn: 295255

9136617a

GlobalISel: support translating va_arg · 4a652227

Tim Northover authored Feb 15, 2017

Since (say) i128 and [16 x i8] map to the same type in generic MIR, we also
need to attach the required alignment info.

llvm-svn: 295254

4a652227

Implement intrinsic mangling for literal struct types. · 3c1432fe

Daniel Berlin authored Feb 15, 2017

Fixes PR 31921

Summary:
Predicateinfo requires an ugly workaround to try to avoid literal
struct types due to the intrinsic mangling not being implemented.
This workaround actually does not work in all cases (you can hit the
assert by bootstrapping with -print-predicateinfo), and can't be made
to work without DFS'ing the type (IE copying getMangledStr and using a
version that detects if it would crash).

Rather than do that, i just implemented the mangling.  It seems
simple, since they are unified structurally.

Looking at the overloaded-mangling testcase we have, it actually turns
out the gc intrinsics will *also* crash if you try to use a literal
struct.  Thus, the testcase added fails before this patch, and works
after, without needing to resort to predicateinfo.

Reviewers: chandlerc, davide

Subscribers: llvm-commits, sanjoy

Differential Revision: https://reviews.llvm.org/D29925

llvm-svn: 295253

3c1432fe

Feb 15, 2017

[Modules] Consider enable_if attrs in isSameEntity. · 9584508d

George Burgess IV authored Feb 15, 2017

Two functions that differ only in their enable_if attributes are
considered overloads, so we should check for those when we're trying to
figure out if two functions are mergeable.

We need to do the same thing for pass_object_size, as well. Looks like
that'll be a bit less trivial, since we sometimes do these merging
checks before we have pass_object_size attributes available (see the
merge checks in ASTDeclReader::VisitFunctionDecl that happen before we
read parameters, and merge checks in calls to ReadDeclAs<>()).

llvm-svn: 295252

9584508d

Re-commit r295017, since we have a potential fix of the tests, r295248, landed. · c6ce73b1
Tim Shen authored Feb 15, 2017
```
Differential Revision: https://reviews.llvm.org/D29742

llvm-svn: 295251
```
c6ce73b1
[XRay] Fix gtest error code comparison. NFC. · b3937285
Tim Shen authored Feb 15, 2017
```
llvm-svn: 295248
```
b3937285
AMDGPU: Remove dead node definitions · 824de226
Matt Arsenault authored Feb 15, 2017
```
llvm-svn: 295247
```
824de226
Fix typos · 900b21c3
Matt Arsenault authored Feb 15, 2017
```
llvm-svn: 295246
```
900b21c3
Fix the static build. · 46f68f50
Rafael Espindola authored Feb 15, 2017
```
llvm-svn: 295245
```
46f68f50
AMDGPU: Consolidate sendmsg/sendmsghalt handling and tests · a78ca62c
Matt Arsenault authored Feb 15, 2017
```
llvm-svn: 295244
```
a78ca62c
[Support] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). · 454d0cea
Eugene Zelenko authored Feb 15, 2017
```
llvm-svn: 295243
```
454d0cea
DAG: Do not scalarize fsub if fneg is legal · 5de8dc9c
Matt Arsenault authored Feb 15, 2017
```
Tests will be included with future commit.

llvm-svn: 295242
```
5de8dc9c
Re-apply r295110 and r295144 with a fix for the ASan issue. · 50cbd7cc
Peter Collingbourne authored Feb 15, 2017
```
llvm-svn: 295241
```
50cbd7cc

Make --export-dynamic work on non-PIC/PIE targets. · 3bf71304

Ed Schouten authored Feb 15, 2017

For CloudABI I'm only interested in generating non-PIC/PIE executables
on armv6 and i686, as PIE introduces larger overhead than on aarch64 and
x86_64. Still, I want to be able to instruct the linker to generate a
dynamic symbol table if requested. One example use for this is that
dynamic symbol tables can be used by programs to print nicely formatted
stacktraces, including symbol names.

Right now there seems to be some logic in LLD that it only wants to emit
dynamic symbol tables when either linking against libraries or when
building PIC. Let's extend this to also take --export-dynamic into
account.

Reviewed by:	ruiu
Differential Revision:	https://reviews.llvm.org/D29982

llvm-svn: 295240

3bf71304

AMDGPU: Replace assert with report_fatal_error · d122abea
Matt Arsenault authored Feb 15, 2017
```
Also use a more refined condition.

llvm-svn: 295239
```
d122abea

[GlobalObject] Fix setSection("") · 5e1e5918

Keno Fischer authored Feb 15, 2017

Summary:
In rL291613, the section name was interned in LLVMContext. However,
this broke the ability to remove the section from a GlobalObject,
because it tried to intern empty strings, which is not allowed.
Fix that and add an appropriate regression test.

Reviewed By: rnk
Differential Revision: https://reviews.llvm.org/D29795

llvm-svn: 295238

5e1e5918

[InstCombine] improve formatting; NFC · 845ea963
Sanjay Patel authored Feb 15, 2017
```
llvm-svn: 295237
```
845ea963

AssumptionCache: Disable the verifier by default, move it behind a hidden... · 9421c2dc

Peter Collingbourne authored Feb 15, 2017

AssumptionCache: Disable the verifier by default, move it behind a hidden cl::opt and verify from releaseMemory().

This is a short term solution to the problem that many passes currently fail
to update the assumption cache. In the long term the verifier should not
be controllable with a flag. We should either fix all passes to correctly
update the assumption cache and enable the verifier unconditionally or
somehow arrange for the assumption list to be updated automatically by passes.

Differential Revision: https://reviews.llvm.org/D30003

llvm-svn: 295236

9421c2dc

[X86][SSE] Don't call EltsFromConsecutiveLoads if any element is missing. · 5b4c30fb

Simon Pilgrim authored Feb 15, 2017

Minor performance speedup - if any call to getShuffleScalarElt fails to get a result, don't both calling for the remaining elements as EltsFromConsecutiveLoads will fail anyhow.

llvm-svn: 295235

5b4c30fb

[compiler-rt][asan|win] Fix ASAN exception handler missing import · b69639e2

Etienne Bergeron authored Feb 15, 2017

Summary:
This patch is adding a missing ASAN API redirection from an instrumented DLL.
The bug was introduced here:
  https://reviews.llvm.org/D29463

This is causing this chromium bug:
  https://bugs.chromium.org/p/chromium/issues/detail?id=692580

Reviewers: rnk

Reviewed By: rnk

Subscribers: kubamracek, dberris, llvm-commits, chrisha, thakis

Differential Revision: https://reviews.llvm.org/D30001

llvm-svn: 295232

b69639e2

[OpenMP] libomptarget: Protect parent struct from being deallocated · 15a6e7da

George Rokos authored Feb 15, 2017

Fixed bug due to which a parent struct was deallocated when one of the struct's pointers was being unmapped.

Differential Revision: https://reviews.llvm.org/D29914

llvm-svn: 295231

15a6e7da

AddressSanitizer: don't track swifterror memory addresses · 8d61e003

Arnold Schwaighofer authored Feb 15, 2017

They are register promoted by ISel and so it makes no sense to treat them as
memory.

Inserting calls to the thread sanitizer would also generate invalid IR.

You would hit:

"swifterror value can only be loaded and stored from, or as a swifterror
argument!"

llvm-svn: 295230

8d61e003

[AArch64] Make am_ldrlit an iPTR - not OtherVT - operand. NFC-ish. · f8acf568

Ahmed Bougacha authored Feb 15, 2017

am_ldrlit diverged from am_brcond in r207105, but kept the OtherVT
operand type.  It made sense for branch targets, as those are
represented as MVT::Other in SDAG.  But loads operate on pointers.

This shouldn't have an observable effect on any in-tree code, but helps
make the patterns consistent for external users.

llvm-svn: 295229

f8acf568

[OptDiag] Pass const Values/Types to Argument. NFC. · 36026006
Ahmed Bougacha authored Feb 15, 2017
```
llvm-svn: 295228
```
36026006
[IR] Accept 'const Type &' in the Type operator<<. NFC. · f9e5a1dd
Ahmed Bougacha authored Feb 15, 2017
```
Type::print is const; there's no reason for the operator not to be.

llvm-svn: 295227
```
f9e5a1dd

[LTO] Add ability to emit assembly to new LTO API · f454b9ea

Tobias Edler von Koch authored Feb 15, 2017

Summary:
Add a field to LTO::Config, CGFileType, to select the file type to emit (object
or assembly). This is useful for testing and to implement -save-temps.

Reviewers: tejohnson, mehdi_amini, pcc

Reviewed By: mehdi_amini

Subscribers: davide, llvm-commits

Differential Revision: https://reviews.llvm.org/D29475

llvm-svn: 295226

f454b9ea

Apply different tokenization rules to linker script expressions. · 731a66ae

Rui Ueyama authored Feb 15, 2017

The linker script lexer is context-sensitive. In the regular context,
arithmetic operator characters are regular characters, but in the
expression context, they are independent tokens. This afects how the
lexer tokenizes "3*4", for example. (This kind of expression is real;
the Linux kernel uses it.)

This patch defines function `maybeSplitExpr`. This function splits the
current token into multiple expression tokens if the lexer is in the
expression context.

Differential Revision: https://reviews.llvm.org/D29963

llvm-svn: 295225

731a66ae

PR24440: Do not silently discard a fold-expression appearing as the operand of a cast-expression. · 90e043da
Richard Smith authored Feb 15, 2017
```
llvm-svn: 295224
```
90e043da

Codegen: Make chains from trellis-shaped CFGs · 7fbec9bd

Kyle Butt authored Feb 15, 2017

Lay out trellis-shaped CFGs optimally.
A trellis of the shape below:

  A     B
  |\   /|
  | \ / |
  |  X  |
  | / \ |
  |/   \|
  C     D

would be laid out A; B->C ; D by the current layout algorithm. Now we identify
trellises and lay them out either A->C; B->D or A->D; B->C. This scales with an
increasing number of predecessors. A trellis is a a group of 2 or more
predecessor blocks that all have the same successors.

because of this we can tail duplicate to extend existing trellises.

As an example consider the following CFG:

    B   D   F   H
   / \ / \ / \ / \
  A---C---E---G---Ret

Where A,C,E,G are all small (Currently 2 instructions).

The CFG preserving layout is then A,B,C,D,E,F,G,H,Ret.

The current code will copy C into B, E into D and G into F and yield the layout
A,C,B(C),E,D(E),F(G),G,H,ret

define void @straight_test(i32 %tag) {
entry:
  br label %test1
test1: ; A
  %tagbit1 = and i32 %tag, 1
  %tagbit1eq0 = icmp eq i32 %tagbit1, 0
  br i1 %tagbit1eq0, label %test2, label %optional1
optional1: ; B
  call void @a()
  br label %test2
test2: ; C
  %tagbit2 = and i32 %tag, 2
  %tagbit2eq0 = icmp eq i32 %tagbit2, 0
  br i1 %tagbit2eq0, label %test3, label %optional2
optional2: ; D
  call void @b()
  br label %test3
test3: ; E
  %tagbit3 = and i32 %tag, 4
  %tagbit3eq0 = icmp eq i32 %tagbit3, 0
  br i1 %tagbit3eq0, label %test4, label %optional3
optional3: ; F
  call void @c()
  br label %test4
test4: ; G
  %tagbit4 = and i32 %tag, 8
  %tagbit4eq0 = icmp eq i32 %tagbit4, 0
  br i1 %tagbit4eq0, label %exit, label %optional4
optional4: ; H
  call void @d()
  br label %exit
exit:
  ret void
}

here is the layout after D27742:
straight_test:                          # @straight_test
; ... Prologue elided
; BB#0:                                 # %entry ; A (merged with test1)
; ... More prologue elided
	mr 30, 3
	andi. 3, 30, 1
	bc 12, 1, .LBB0_2
; BB#1:                                 # %test2 ; C
	rlwinm. 3, 30, 0, 30, 30
	beq	 0, .LBB0_3
	b .LBB0_4
.LBB0_2:                                # %optional1 ; B (copy of C)
	bl a
	nop
	rlwinm. 3, 30, 0, 30, 30
	bne	 0, .LBB0_4
.LBB0_3:                                # %test3 ; E
	rlwinm. 3, 30, 0, 29, 29
	beq	 0, .LBB0_5
	b .LBB0_6
.LBB0_4:                                # %optional2 ; D (copy of E)
	bl b
	nop
	rlwinm. 3, 30, 0, 29, 29
	bne	 0, .LBB0_6
.LBB0_5:                                # %test4 ; G
	rlwinm. 3, 30, 0, 28, 28
	beq	 0, .LBB0_8
	b .LBB0_7
.LBB0_6:                                # %optional3 ; F (copy of G)
	bl c
	nop
	rlwinm. 3, 30, 0, 28, 28
	beq	 0, .LBB0_8
.LBB0_7:                                # %optional4 ; H
	bl d
	nop
.LBB0_8:                                # %exit ; Ret
	ld 30, 96(1)                    # 8-byte Folded Reload
	addi 1, 1, 112
	ld 0, 16(1)
	mtlr 0
	blr

The tail-duplication has produced some benefit, but it has also produced a
trellis which is not laid out optimally. With this patch, we improve the layouts
of such trellises, and decrease the cost calculation for tail-duplication
accordingly.

This patch produces the layout A,C,E,G,B,D,F,H,Ret. This layout does have
back edges, which is a negative, but it has a bigger compensating
positive, which is that it handles the case where there are long strings
of skipped blocks much better than the original layout. Both layouts
handle runs of executed blocks equally well. Branch prediction also
improves if there is any correlation between subsequent optional blocks.

Here is the resulting concrete layout:

straight_test:                          # @straight_test
; BB#0:                                 # %entry ; A (merged with test1)
	mr 30, 3
	andi. 3, 30, 1
	bc 12, 1, .LBB0_4
; BB#1:                                 # %test2 ; C
	rlwinm. 3, 30, 0, 30, 30
	bne	 0, .LBB0_5
.LBB0_2:                                # %test3 ; E
	rlwinm. 3, 30, 0, 29, 29
	bne	 0, .LBB0_6
.LBB0_3:                                # %test4 ; G
	rlwinm. 3, 30, 0, 28, 28
	bne	 0, .LBB0_7
	b .LBB0_8
.LBB0_4:                                # %optional1 ; B (Copy of C)
	bl a
	nop
	rlwinm. 3, 30, 0, 30, 30
	beq	 0, .LBB0_2
.LBB0_5:                                # %optional2 ; D (Copy of E)
	bl b
	nop
	rlwinm. 3, 30, 0, 29, 29
	beq	 0, .LBB0_3
.LBB0_6:                                # %optional3 ; F (Copy of G)
	bl c
	nop
	rlwinm. 3, 30, 0, 28, 28
	beq	 0, .LBB0_8
.LBB0_7:                                # %optional4 ; H
	bl d
	nop
.LBB0_8:                                # %exit

Differential Revision: https://reviews.llvm.org/D28522

llvm-svn: 295223

7fbec9bd

Simplify operator tests. · a4601b5d
Rui Ueyama authored Feb 15, 2017
```
llvm-svn: 295222
```
a4601b5d
Rename a test as they are tests for operators. · fd5edff8
Rui Ueyama authored Feb 15, 2017
```
llvm-svn: 295221
```
fd5edff8
include function name in dot filename · 538d6668
Xinliang David Li authored Feb 15, 2017
```
Differential Revision: http://reviews.llvm.org/D29975

llvm-svn: 295220
```
538d6668

ThreadSanitizer: don't track swifterror memory addresses · 8eb1a485

Arnold Schwaighofer authored Feb 15, 2017

They are register promoted by ISel and so it makes no sense to treat them as
memory.

Inserting calls to the thread sanitizer would also generate invalid IR.

You would hit:

"swifterror value can only be loaded and stored from, or as a swifterror
argument!"

llvm-svn: 295215

8eb1a485

[DAG] Don't try to create an INSERT_SUBVECTOR with an illegal source · ba80db39

Michael Kuperstein authored Feb 15, 2017

We currently can't legalize those, but we should really not be creating
them in the first place, since legalization would probably look similar to the
way we legalize CONCAT_VECTORS - basically replace the INSERT with a BUILD.

This fixes PR311956.

Differential Revision: https://reviews.llvm.org/D29961

llvm-svn: 295213

ba80db39

Fix Xcode project. · 4197b7f4
Greg Clayton authored Feb 15, 2017
```
llvm-svn: 295212
```
4197b7f4
Skip TestStepOverBreakpoint on windows · ae11b64d
Pavel Labath authored Feb 15, 2017
```
llvm-svn: 295211
```
ae11b64d