Commits · 7d942d73b868a9ce7e202096778803d57f3ec73c · Roger Ferrer / llvm-epi

Mar 02, 2016

[CMake] Add test-depends target to build dependencies of check-all · 7d942d73

Chris Bieneman authored Mar 02, 2016

This is just another convenience target for bots to use. It enables isolation of building and testing.

llvm-svn: 262494

7d942d73

[cmake] Check the compiler version first · 9fac19f0

Reid Kleckner authored Mar 02, 2016

Otherwise users get messages from CheckAtomic about missing libatomic
instead of a sensible message that says "use GCC 4.7 or newer".

I structured the change along the lines of HandleLLVMStdlib.cmake, so
that the standalone build of Clang still gets the compiler version
check.

Reviewers: beanz

Differential Revision: http://reviews.llvm.org/D17789

llvm-svn: 262491

9fac19f0

[AA] Hoist the logic to reformulate various AA queries in terms of other · 12884f7f

Chandler Carruth authored Mar 02, 2016

parts of the AA interface out of the base class of every single AA
result object.

Because this logic reformulates the query in terms of some other aspect
of the API, it would easily cause O(n^2) query patterns in alias
analysis. These could in turn be magnified further based on the number
of call arguments, and then further based on the number of AA queries
made for a particular call. This ended up causing problems for Rust that
were actually noticable enough to get a bug (PR26564) and probably other
places as well.

When originally re-working the AA infrastructure, the desire was to
regularize the pattern of refinement without losing any generality.
While I think it was successful, that is clearly proving to be too
costly. And the cost is needless: we gain no actual improvement for this
generality of making a direct query to tbaa actually be able to
re-use some other alias analysis's refinement logic for one of the other
APIs, or some such. In short, this is entirely wasted work.

To the extent possible, delegation to other API surfaces should be done
at the aggregation layer so that we can avoid re-walking the
aggregation. In fact, this significantly simplifies the logic as we no
longer need to smuggle the aggregation layer into each alias analysis
(or the TargetLibraryInfo into each alias analysis just so we can form
argument memory locations!).

However, we also have some delegation logic inside of BasicAA and some
of it even makes sense. When the delegation logic is baking in specific
knowledge of aliasing properties of the LLVM IR, as opposed to simply
reformulating the query to utilize a different alias analysis interface
entry point, it makes a lot of sense to restrict that logic to
a different layer such as BasicAA. So one aspect of the delegation that
was in every AA base class is that when we don't have operand bundles,
we re-use function AA results as a fallback for callsite alias results.
This relies on the IR properties of calls and functions w.r.t. aliasing,
and so seems a better fit to BasicAA. I've lifted the logic up to that
point where it seems to be a natural fit. This still does a bit of
redundant work (we query function attributes twice, once via the
callsite and once via the function AA query) but it is *exactly* twice
here, no more.

The end result is that all of the delegation logic is hoisted out of the
base class and into either the aggregation layer when it is a pure
retargeting to a different API surface, or into BasicAA when it relies
on the IR's aliasing properties. This should fix the quadratic query
pattern reported in PR26564, although I don't have a stand-alone test
case to reproduce it.

It also seems general goodness. Now the numerous AAs that don't need
target library info don't carry it around and depend on it. I think
I can even rip out the general access to the aggregation layer and only
expose that in BasicAA as it is the only place where we re-query in that
manner.

However, this is a non-trivial change to the AA infrastructure so I want
to get some additional eyes on this before it lands. Sadly, it can't
wait long because we should really cherry pick this into 3.8 if we're
going to go this route.

Differential Revision: http://reviews.llvm.org/D17329

llvm-svn: 262490

12884f7f

[X86][SSSE3] Added combine test for unary shuffle (pshufb) only referencing... · 537907fd

Simon Pilgrim authored Mar 02, 2016

[X86][SSSE3] Added combine test for unary shuffle (pshufb) only referencing elements from one of the inputs of a binary shuffle (punpcklbw)

llvm-svn: 262486

537907fd

[LLVM][AVX512]PSRAWI Change imm8 to int. · 927fdaee
Michael Zuckerman authored Mar 02, 2016
```
Differential Revision: http://reviews.llvm.org/D17705

llvm-svn: 262480
```
927fdaee

[X86][SSE] Lower 128-bit MOVDDUP with existing VBROADCAST mechanisms · c02b7262

Simon Pilgrim authored Mar 02, 2016

We have a number of useful lowering strategies for VBROADCAST instructions (both from memory and register element 0) which the 128-bit form of the MOVDDUP instruction can make use of.

This patch tweaks lowerVectorShuffleAsBroadcast to enable it to broadcast 2f64 args using MOVDDUP as well.

It does require a slight tweak to the lowerVectorShuffleAsBroadcast mechanism as the existing MOVDDUP lowering uses isShuffleEquivalent which can match binary shuffles that can lower to (unary) broadcasts.

Differential Revision: http://reviews.llvm.org/D17680

llvm-svn: 262478

c02b7262

Revert "[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields" · f2fbabe9
Nikolay Haustov authored Mar 02, 2016
```
Build failure with clang.

llvm-svn: 262477
```
f2fbabe9
Revert "[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler." · f0f24628
Nikolay Haustov authored Mar 02, 2016
```
Build failure with clang.

llvm-svn: 262475
```
f0f24628

[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler. · 73447a97

Nikolay Haustov authored Mar 02, 2016

complementary patch to table-driven amd_kernel_code_t field parser/printer utility. lit tests passed.

Patch by: Valery Pykhtin

Differential Revision: http://reviews.llvm.org/D17151

llvm-svn: 262474

73447a97

[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields · 6c8c7496

Nikolay Haustov authored Mar 02, 2016

This is going to be used in .hsatext disassembler and can be used
in current assembler parser (lit tests passed on parsing).
Code using this helpers isn't included in this patch.

Benefits:

unified approach
fast field name lookup on parsing
Later I would like to enhance some of the field naming/syntax using this code.

Patch by: Valery Pykhtin

Differential Revision: http://reviews.llvm.org/D17150

llvm-svn: 262473

6c8c7496

libfuzzer: fix compiler warnings · 2eed1218

Dmitry Vyukov authored Mar 02, 2016

- unused sigaction/setitimer result (used in assert)
- unchecked fscanf return value
- signed/unsigned comparison

llvm-svn: 262472

2eed1218

[X86] Remove unnecessary call to isReg from emitter's DestMem handling for VEX... · 1d3f4aef

Craig Topper authored Mar 02, 2016

[X86] Remove unnecessary call to isReg from emitter's DestMem handling for VEX prefix. The operand is always a register. NFC

llvm-svn: 262468

1d3f4aef

[X86] Make X86MCCodeEmitter::DetermineREXPrefix locate operands more like how... · 6a7cd422
Craig Topper authored Mar 02, 2016
```
[X86] Make X86MCCodeEmitter::DetermineREXPrefix locate operands more like how VEX prefix handling does.

llvm-svn: 262467
```
6a7cd422

[X86] Permit reading of the FLAGS register without it being previously defined · 5aadde1e

David Majnemer authored Mar 02, 2016

We modeled the RDFLAGS{32,64} operations as "using" {E,R}FLAGS.
While technically correct, this is not be desirable for folks who want
to examine aspects of the FLAGS register which are not related to
computation like whether or not CPUID is a valid instruction.

Differential Revision: http://reviews.llvm.org/D17782

llvm-svn: 262465

5aadde1e

[X86] Remove assertion I accidentally left in. · d4dabb39
Craig Topper authored Mar 02, 2016
```
llvm-svn: 262464
```
d4dabb39

[X86] Be more structured about how we capture the register number when it is... · a267431f

Craig Topper authored Mar 02, 2016

[X86] Be more structured about how we capture the register number when it is encoded in bits 7:4 of the immediate.

For some instructions the register is not the last operand and the immediate handling had to detect this and hardcode the index to find it. It also required CurOp to be pointing at the last operand handled in the Form switch whereas for any instruction it would be pointing at the next operand.

Now we just capture the value in the Form switch when we know exactly where it is and the CurOp pointer can behave normally.

llvm-svn: 262462

a267431f

[SCEV] Minor naming, braces cleanup; NFC · dcd3a88e
Sanjoy Das authored Mar 02, 2016
```
llvm-svn: 262459
```
dcd3a88e

[X86] Use MCPhysReg and uint16_t for static arrays of registers and opcodes... · cf65c627

Craig Topper authored Mar 02, 2016

[X86] Use MCPhysReg and uint16_t for static arrays of registers and opcodes respectively should reduce size tiny bit. NFC

llvm-svn: 262458

cf65c627

AMDGPU: Fix bug 26659. · f2dcb473

Matt Arsenault authored Mar 02, 2016

Fix checking the same instruction twice instead of the
second branch that uses vccz. I don't think this matters
currently because s_branch_vccnz is always used currently.

llvm-svn: 262457

f2dcb473

AMDGPU: Cleanup suggested in bug 23960 · a266bd87
Matt Arsenault authored Mar 02, 2016
```
llvm-svn: 262456
```
a266bd87
Bug 20810: Use report_fatal_error instead of unreachable · 5de68cbc
Matt Arsenault authored Mar 02, 2016
```
llvm-svn: 262455
```
5de68cbc
Add a comment with a rational for the unusual code structure · 6b017a11
Sanjoy Das authored Mar 02, 2016
```
llvm-svn: 262454
```
6b017a11
Qualify getRangeForAffineAR with this-> for MSVC · eca1b53b
Sanjoy Das authored Mar 02, 2016
```
llvm-svn: 262453
```
eca1b53b
Attempt to fix ASAN failure in a MemorySSA test. · e0e6e48b
George Burgess IV authored Mar 02, 2016
```
llvm-svn: 262452
```
e0e6e48b

Perturb code in an attempt to appease MSVC · 1168f93c

Sanjoy Das authored Mar 02, 2016

For some reason MSVC seems to think I'm calling getConstant() from a
static context.  Try to avoid this issue by explicitly specifying
'this->' (though I'm not confident that this will actually work).

llvm-svn: 262451

1168f93c

More code permutation to appease MSVC · 62a1c339
Sanjoy Das authored Mar 02, 2016
```
llvm-svn: 262449
```
62a1c339
Remove "auto" to appease the MSVC bots · 9e5ebf14
Sanjoy Das authored Mar 02, 2016
```
llvm-svn: 262448
```
9e5ebf14
DAGCombiner: Make sure an integer is being truncated · 7d0a77b9
Matt Arsenault authored Mar 02, 2016
```
llvm-svn: 262446
```
7d0a77b9
revert r262424 because there's a *clang test* for AArch64 that checks -O3 asm output · 5e4c46de
Sanjay Patel authored Mar 02, 2016
```
that is broken by this change

llvm-svn: 262440
```
5e4c46de
Fix SHARED_LIBS build · ab3aefea
Daniel Berlin authored Mar 02, 2016
```
llvm-svn: 262439
```
ab3aefea

[SCEV] Make getRange smarter around selects · bf730984

Sanjoy Das authored Mar 02, 2016

Have ScalarEvolution::getRange re-consider cases like "{C?A:B,+,C?P:Q}"
by factoring out "C" and computing RangeOf{A,+,P} union RangeOf({B,+,Q})
instead.

The latter can be easier to compute precisely in cases like
"{C?0:N,+,C?1:-1}" N is the backedge taken count of the loop; since in
such cases the latter form simplifies to [0,N+1) union [0,N+1).

llvm-svn: 262438

bf730984

[SCEV] Extract out a getRangeForAffineAR; NFC · b765b633

Sanjoy Das authored Mar 02, 2016

Pure code-motion change.  Will be used later in making getRange more clever.

llvm-svn: 262437

b765b633

[CMake] Add convenience target llvm-test-depends to build test dependencies. · 2d5e0771
Chris Bieneman authored Mar 02, 2016
```
This is useful when paired with the distribution targets to build prerequisites for running tests.

llvm-svn: 262428
```
2d5e0771
[CMake] Add distribution target that is the "just-build" side of install-distribution · 2b12f676
Chris Bieneman authored Mar 02, 2016
```
This is just a convenience target to allow limiting what you build.

llvm-svn: 262427
```
2b12f676

[InstCombine] convert 'isPositive' and 'isNegative' vector comparisons to shifts (PR26701) · 147e9279

Sanjay Patel authored Mar 01, 2016

As noted in the code comment, I don't think we can do the same transform that we do for
*scalar* integers comparisons to *vector* integers comparisons because it might pessimize
the general case. 

Exhibit A for an incomplete integer comparison ISA remains x86 SSE/AVX: it only has EQ and GT
for integer vectors.

But we should now recognize all the variants of this construct and produce the optimal code
for the cases shown in:
https://llvm.org/bugs/show_bug.cgi?id=26701
 

llvm-svn: 262424

147e9279

Mar 01, 2016

Perform InstructioinCombiningPass before SampleProfile pass. · 1012be12

Dehao Chen authored Mar 01, 2016

Summary: SampleProfile pass needs to be performed after InstructionCombiningPass, which helps eliminate un-inlinable function calls.

Reviewers: davidxl, dnovillo

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D17742

llvm-svn: 262419

1012be12

[libFuzzer] deprecate exit_on_first flag · 3d95dd91
Kostya Serebryany authored Mar 01, 2016
```
llvm-svn: 262417
```
3d95dd91
llvm-dwp: Add missing copyright notice to llvm-dwp.cpp · f72dbc10
David Blaikie authored Mar 01, 2016
```
Addressing feedback on IRC by Sean Silva.

llvm-svn: 262416
```
f72dbc10

[libFuzzer] add generic signal handlers so that libFuzzer can report at least... · 228d5b1c

Kostya Serebryany authored Mar 01, 2016

[libFuzzer] add generic signal handlers so that libFuzzer can report at least something if ASan is not handlig the signals for us. Remove abort_on_timeout flag.

llvm-svn: 262415

228d5b1c

[X86][SSE41] Added missing fast-isel intrinsics tests · a3ad9cd7
Simon Pilgrim authored Mar 01, 2016
```
Match IR generated in clang/test/CodeGen/sse41-builtins.c

llvm-svn: 262412
```
a3ad9cd7