Commits · aaa0191be6643ad815b756877b37af50ee6e39cb · Roger Ferrer / llvm-epi

Jul 05, 2016

Transfer ownership of the gold plugin. · aaa0191b
Rafael Espindola authored Jul 05, 2016
```
llvm-svn: 274574
```
aaa0191b

Revert r259387: "AArch64: Implement missed conditional compare sequences." · d4acd7ed

Balaram Makam authored Jul 05, 2016

    This reverts commit r259387 because it inserts illegal code after legalization
    in some backends where, for example, the i64 OR type is illegal.

llvm-svn: 274573

d4acd7ed

[X86][AVX2] Add support for target shuffle combining to BROADCAST · bec6543d
Simon Pilgrim authored Jul 05, 2016
```
Only support broadcast from vector register so far - memory folding support will have to wait.

llvm-svn: 274572
```
bec6543d

[X86][AVX512] Fixed decoding of permd/permpd variable mask shuffles + enabled... · 48adedff

Simon Pilgrim authored Jul 05, 2016

[X86][AVX512] Fixed decoding of permd/permpd variable mask shuffles + enabled them for target shuffle combining

Corrected element mask masking to extract the bottom index bits (now matches the perm2 implementation but for unary inputs).
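
The masking described above can be illustrated with a small model. This is a sketch only, not LLVM's decoder: `decode_unary_perm_mask` and `apply_shuffle` are hypothetical helpers, and the sketch assumes a power-of-two element count so that the bottom index bits can be extracted with `n - 1`.

```python
def decode_unary_perm_mask(raw_indices):
    """Model of decoding a variable permute mask for a single-input shuffle.

    Hypothetical helper: only the bottom log2(n) bits of each raw index
    select a source element; higher bits are ignored.
    """
    n = len(raw_indices)
    # Assumption: the element count is a power of two (e.g. 8 or 16 lanes).
    assert n > 0 and n & (n - 1) == 0, "element count must be a power of two"
    return [raw & (n - 1) for raw in raw_indices]


def apply_shuffle(src, mask):
    """Apply a decoded shuffle mask to a single source vector."""
    return [src[m] for m in mask]


# Eight lanes: raw indices 9 and 15 select lanes 1 and 7 once the upper bits are dropped.
raw = [0, 9, 2, 15, 4, 5, 6, 7]
mask = decode_unary_perm_mask(raw)
shuffled = apply_shuffle(list("abcdefgh"), mask)
```

Once decoded this way, the mask is an ordinary list of lane indices, which is the form a shuffle combiner can reason about.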

llvm-svn: 274571

48adedff

ARM: fix `-mlong-calls` for WoA · 4d950ef8

Saleem Abdulrasool authored Jul 05, 2016

Not all code-paths set the relocation model to static for Windows.  This
currently breaks on Windows ARM with `-mlong-calls` when built with clang.
Loosen the assertion to what it was previously.  We would ideally ensure that
all the configuration sets Windows to static relocation model.

llvm-svn: 274570

4d950ef8

DAGCombiner: Fold away vector extract of insert with the same index · 2d793895

Matt Arsenault authored Jul 05, 2016

This only really matters when the index is non-constant since the
constant case already gets taken care of by other combines.
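
As a rough illustration of why the fold is valid, the sketch below models vectors as Python lists. The helper names are hypothetical and the index is assumed to be in range; the point is only that extracting at the same index that was just inserted always yields the inserted value, whatever that index is at run time.

```python
def insert_element(vec, value, index):
    """Return a copy of vec with vec[index] replaced by value."""
    out = list(vec)
    out[index] = value
    return out


def extract_element(vec, index):
    """Return the element of vec at index."""
    return vec[index]


def fold_extract_of_insert(vec, value, insert_index, extract_index):
    """Hypothetical model of the fold.

    When both operations use the same index, the result is the inserted
    value and the vector never has to be materialized. Otherwise fall back
    to the unfolded computation.
    """
    if insert_index == extract_index:
        return value
    return extract_element(insert_element(vec, value, insert_index), extract_index)


vec = [10, 20, 30, 40]
# Compare the unfolded and folded results for every in-range index.
results = [
    (extract_element(insert_element(vec, 99, i), i),
     fold_extract_of_insert(vec, 99, i, i))
    for i in range(len(vec))
]
```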

llvm-svn: 274569

2d793895

Fix "lldb.SBProcess.is_stopped" and "lldb.SBProcess.is_running" to do the right thing. · d458c4de
Greg Clayton authored Jul 05, 2016
```
https://llvm.org/bugs/show_bug.cgi?id=28428

llvm-svn: 274568
```
d458c4de

AArch64: use correct SDValue # when looking for bitfield placement. · 01dff9d1

Tim Northover authored Jul 05, 2016

The other use really does only care about the SDNode (it checks the
opcode against a whitelist), but bitFieldPlacement can be misled if
the node produces multiple results.

Patch by Ismail Badawi.

llvm-svn: 274567

01dff9d1

[Sema] Fix a bug where pack expansion was not expanded in type alias · f1bd000f

Erik Pilkington authored Jul 05, 2016

The problem is that the parameter pack in a function type type alias is not
reexpanded after being transformed. Also remove an incorrect comment in a
similar function. Fixes PR26017.

Differential Revision: http://reviews.llvm.org/D21030

llvm-svn: 274566

f1bd000f

Re-apply "test: Use add_lit_testsuites so that subsets of tests can be specified" · 2a15ffa2

Justin Bogner authored Jul 05, 2016

This version should actually remove the empty directories I removed
all of the files from. Thanks to tstellar for pointing out git-svn's
--rmdir flag.

Original message:

This creates make/ninja targets like check-clang-codegen and
check-clang-unit, much like LLVM already has. I had to move some input
files into Input directories so they weren't picked up as test
directories.

llvm-svn: 274565

2a15ffa2

AMDGPU: Fix folding SGPRs into madak/madmk src0 · ffc8275f

Matt Arsenault authored Jul 05, 2016

Because of the special immediate operand, the constant
bus is already used so SGPRs are never useful.

r263212 changed the name of the immediate operand, which
broke the verifier check for the restriction.

llvm-svn: 274564

ffc8275f

[MC/Darwin] Fix a -Wmisleading-indentation warning, reported by GCC 6. · a8d89f35
Davide Italiano authored Jul 05, 2016
```
llvm-svn: 274563
```
a8d89f35

Revert "test: Use add_lit_testsuites so that subsets of tests can be specified" · a73e81c5

Justin Bogner authored Jul 05, 2016

This reverts r274560. It's breaking a bunch of bots due to a directory
with a space in the name. Doesn't repro locally for some reason.

llvm-svn: 274562

a73e81c5

AMDGPU/SI: Remove address space query functions from AMDGPUDAGToDAGISel · a4b746d8

Tom Stellard authored Jul 05, 2016

Summary:
These have been replaced with TableGen code (except for isConstantLoad,
which is still used for R600).  The queries were broken for cases
where MemOperand was a PseudoSourceValue.

Reviewers: arsenm

Subscribers: arsenm, kzhuravl, llvm-commits

Differential Revision: http://reviews.llvm.org/D21684

llvm-svn: 274561

a4b746d8

test: Use add_lit_testsuites so that subsets of tests can be specified · 2976e014

Justin Bogner authored Jul 05, 2016

This creates make/ninja targets like check-clang-codegen and
check-clang-unit, much like LLVM already has. I had to move some input
files into Input directories so they weren't picked up as test
directories.

llvm-svn: 274560

2976e014

[Clang][Feature] Adding CLFLUSHOPT feature and intrinsic to clang · b9206654
Michael Zuckerman authored Jul 05, 2016
```
Differential Revision: http://reviews.llvm.org/D21792

llvm-svn: 274559
```
b9206654

[LV] Refactor integer induction widening (NFC) · 89188729

Matthew Simpson authored Jul 05, 2016

This patch also removes the SCEV variants of getStepVector() since they have no
uses after the refactoring.

Differential Revision: http://reviews.llvm.org/D21903

llvm-svn: 274558

89188729

cmake: do not check-format anything in lib/External · d1e90f59

Tobias Grosser authored Jul 05, 2016

There is no need to specifically match for isl, but we can exclude anything in
lib/External from formatting as we assume that externally contributed code
should always match the upstream code. This simplifies the cmake script and
allows additional external projects to be added without the need to explicitly
exclude them from formatting.

llvm-svn: 274557

d1e90f59

[AMDGPU] rename DS_1A1D_Off8_NORET to DS_1A2D_Off8_NORET as ds_write2xx use 2... · e65b39ec
Valery Pykhtin authored Jul 05, 2016
```
[AMDGPU] rename DS_1A1D_Off8_NORET to DS_1A2D_Off8_NORET as ds_write2xx use 2 source registers. NFC.

llvm-svn: 274556
```
e65b39ec
[X86][AVX512] Remove vector BROADCAST builtins. · 9769428e
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274555
```
9769428e
[X86][AVX512] Remove vector BROADCAST builtins. · 73ac160d
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274554
```
73ac160d
[LLVM][INTRINSICS] adding intrinsics of CLFLUSHOPT · bdc5f40d
Michael Zuckerman authored Jul 05, 2016
```
Differential Revision: http://reviews.llvm.org/D21789

llvm-svn: 274553
```
bdc5f40d

[clang-tidy] UnnecessaryValueParamCheck - only warn for virtual methods · 17934da7

Felix Berger authored Jul 05, 2016

Summary:

As changing virtual methods could break method overrides, disable applying the fix and just warn.

Reviewers: alexfh, sbenza

Subscribers: cfe-commits

Differential Revision: http://reviews.llvm.org/D21936

llvm-svn: 274552

17934da7

[AMDGPU] Assembler: Fix parsing error with floating-point literals passed to integer instructions · a9cd6aa8
Sam Kolton authored Jul 05, 2016
```
Differential Revision: http://reviews.llvm.org/D21972

llvm-svn: 274551
```
a9cd6aa8
[X86][AVX512] Autoupgrade the BROADCAST intrinsics · 4e96fbf3
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274550
```
4e96fbf3

[tsan] Synchronize leaving a GCD group with notifications · c54b108c

Kuba Brecka authored Jul 05, 2016

In the patch that introduced support for GCD barrier blocks, I removed releasing a group when leaving it (in dispatch_group_leave). However, this is necessary to synchronize leaving a group and a notification callback (dispatch_group_notify). Adding this back, simplifying dispatch_group_notify_f and adding a test case.

Differential Revision: http://reviews.llvm.org/D21927

llvm-svn: 274549

c54b108c

[tsan] dispatch_once interceptor will cause a crash/deadlock when the original... · 09d3e53a

Kuba Brecka authored Jul 05, 2016

[tsan] dispatch_once interceptor will cause a crash/deadlock when the original dispatch_once is used

Because we use SCOPED_TSAN_INTERCEPTOR in the dispatch_once interceptor, the original dispatch_once can also sometimes be called (when ignores are enabled or when thr->is_inited is false). However, the original dispatch_once function doesn’t expect to find “2” in the storage and it will spin forever (but we use “2” to indicate that the initialization is already done, so no waiting is necessary). This patch makes sure we never call the original dispatch_once.

Differential Revision: http://reviews.llvm.org/D21976

llvm-svn: 274548

09d3e53a

[mips][ias] Remove k_PhysReg since it's not possible to create an operand of this kind. · 976d938c

Daniel Sanders authored Jul 05, 2016

Reviewers: sdardis

Subscribers: dsanders, sdardis, llvm-commits

Differential Revision: http://reviews.llvm.org/D21986

llvm-svn: 274547

976d938c

[CMake] Adjust export_executable_symbols to cope with non-target link libraries · 24ca18e3

John Brawn authored Jul 05, 2016

export_executable_symbols looks through the link libraries of the executable in
order to figure out transitive dependencies, but in doing so it assumes that
all link libraries are also targets. This is not true as of r273302, so adjust
it to check if they actually are targets.

llvm-svn: 274546

24ca18e3

[X86][AVX512BW] Added BROADCAST intrinsics fast-isel generic IR tests · 1e91654b
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274545
```
1e91654b
[X86][AVX512] Converted the VBROADCAST intrinsics to generic IR · f5a8837e
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274544
```
f5a8837e

[Thumb] Reapply r272251 with a fix for PR28348 (mk 2) · ae5ff990

James Molloy authored Jul 05, 2016

The important thing I was missing was ensuring newly added constants were kept in topological order. Repositioning the node is correct if the constant is newly added (so it has no topological ordering) but wrong if it already existed - positioning it next in the worklist would break the topological ordering.

Original commit message:
  [Thumb] Select a BIC instead of AND if the immediate can be encoded more optimally negated

  If an immediate is only used in an AND node, it is possible that the immediate can be more optimally materialized when negated. If this is the case, we can negate the immediate and use a BIC instead;

    int i(int a) {
      return a & 0xfffffeec;
    }

  Used to produce:
      ldr r1, [CONSTPOOL]
      ands r0, r1
    CONSTPOOL: 0xfffffeec

  And now produces:
      movs    r1, #255
      adds    r1, #20  ; Less costly immediate generation
      bics    r0, r1
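
The arithmetic behind the example can be checked with a short model. This is a sketch under the assumption of 32-bit registers; `and_imm` and `bic` are illustrative helpers, and "negated" is read here as the bitwise complement of the immediate.

```python
MASK32 = 0xFFFFFFFF  # assumption: 32-bit registers


def and_imm(a, imm):
    """AND a register value with an immediate."""
    return a & imm & MASK32


def bic(a, imm):
    """BIC clears the bits that are set in imm, i.e. computes a & ~imm."""
    return a & ~imm & MASK32


imm = 0xFFFFFEEC
complement = ~imm & MASK32  # the value BIC needs in r1
built = 255 + 20            # movs r1, #255 ; adds r1, #20

samples = [0x00000000, 0xFFFFFFFF, 0x12345678, 0x00000113, 0xDEADBEEF]
same = all(and_imm(a, imm) == bic(a, built) for a in samples)
```

The complement is small enough to build with two immediate instructions, which is what makes the BIC form cheaper than loading the original constant from a constant pool.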

llvm-svn: 274543

ae5ff990

[X86][AVX512F] add float/double abs intrinsics · 13633288

Asaf Badouh authored Jul 05, 2016

add abs intrinsics that use native LLVM-IR.
change _mm512_mask[z]_and_epi{32|64} to use select intrinsic
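
A minimal per-lane sketch of the two ideas, under stated assumptions: it assumes the abs is expressed as an AND that clears the float sign bit, and that the masked AND forms select per lane between the computed value and a pass-through (or zero for the `maskz` form). The helper names are hypothetical and do not correspond to the header's actual definitions.

```python
import struct


def float_bits(x):
    """Reinterpret a float32 value as a 32-bit unsigned integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_float(b):
    """Reinterpret a 32-bit unsigned integer as a float32 value."""
    return struct.unpack("<f", struct.pack("<I", b))[0]


def abs_via_and(x):
    """Absolute value by clearing the sign bit (bit 31) of a float32."""
    return bits_float(float_bits(x) & 0x7FFFFFFF)


def mask_and(mask, a, b, src):
    """Per-lane select: lane i gets a[i] & b[i] if mask bit i is set, else src[i]."""
    return [
        (x & y) if (mask >> i) & 1 else s
        for i, (x, y, s) in enumerate(zip(a, b, src))
    ]


def maskz_and(mask, a, b):
    """Zero-masking variant: unselected lanes become 0."""
    return mask_and(mask, a, b, [0] * len(a))
```

For example, `mask_and(0b0101, [0xF] * 4, [1, 2, 4, 8], [7] * 4)` keeps the pass-through value in lanes 1 and 3.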

Differential Revision: http://reviews.llvm.org/D21973

llvm-svn: 274542

13633288

[AVX512] minor fix in sqrt{ss|sd} intrinsics arguments · f9cdb8de
Asaf Badouh authored Jul 05, 2016
```
Differential Revision: http://reviews.llvm.org/D21988

llvm-svn: 274541
```
f9cdb8de

[OpenCL] An implementation of device side enqueue (DSE) from OpenCL v2.0 s6.13.17. · db7a31cc

Anastasia Stulova authored Jul 05, 2016

- Added new Builtins: enqueue_kernel, get_kernel_work_group_size
and get_kernel_preferred_work_group_size_multiple.

These Builtins use a custom check to diagnose parameters of the passed Blocks,
i.e. a variable number of 'local void*' type params, and check the different
overloads specified in Table 6.31 of OpenCL v2.0.

- IR is generated as an internal library call for each OpenCL Builtin,
reusing ObjC Block implementation.

Review: http://reviews.llvm.org/D20249
llvm-svn: 274540

db7a31cc

Intrinsic _mm256_permutexvar_epi64 doesn't accept three parameters as specified below. · a72b49ef

Michael Zuckerman authored Jul 05, 2016

I deleted the extra mask parameter.

__m256i _mm256_permutexvar_epi64 (__m256i idx, __m256i a)
#include "immintrin.h"
Instruction: vpermq
CPUID Flags: AVX512VL + AVX512F
Description
Shuffle 64-bit integers in a across lanes using the corresponding index in idx, and store the results in dst.
Operation
FOR j := 0 to 3
  i := j*64
  id := idx[i+1:i]*64
  dst[i+63:i] := a[id+63:id]
ENDFOR
dst[MAX:256] := 0

(From: Intel intrinsics guide)        
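
The quoted operation can be modelled in a few lines. This is a sketch of the two-operand semantics only, treating each 256-bit vector as a list of four 64-bit lanes; `permutexvar_epi64_256` is a hypothetical name for the model, not the intrinsic itself.

```python
def permutexvar_epi64_256(idx, a):
    """Model of the two-operand form: four 64-bit lanes, no mask operand."""
    assert len(idx) == 4 and len(a) == 4
    # idx[i+1:i] in the pseudocode means only the low two bits of each
    # index lane are used to pick a source lane.
    return [a[lane_idx & 0b11] for lane_idx in idx]


a = [100, 200, 300, 400]
reversed_lanes = permutexvar_epi64_256([3, 2, 1, 0], a)
# Index bits above the low two are ignored, so 4..7 behave like 0..3.
wrapped = permutexvar_epi64_256([4, 5, 6, 7], a)
```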

llvm-svn: 274539

a72b49ef

Revert r274536: [mips][ias] Don't break apart and reconstruct StringRef's for k_Token. NFC. · 7b361a2c
Daniel Sanders authored Jul 05, 2016
```
It turns out that MSVC requires this.

llvm-svn: 274538
```
7b361a2c
[X86][AVX512] Added BROADCAST intrinsics fast-isel generic IR tests · 20ede63a
Simon Pilgrim authored Jul 05, 2016
```
llvm-svn: 274537
```
20ede63a
[mips][ias] Don't break apart and reconstruct StringRef's for k_Token. NFC. · b2e0ca8e
Daniel Sanders authored Jul 05, 2016
```
llvm-svn: 274536
```
b2e0ca8e

[PowerPC] - Legalize vector types by widening instead of integer promotion · 44513e54

Nemanja Ivanovic authored Jul 05, 2016

This patch corresponds to review:
http://reviews.llvm.org/D20443

It changes the legalization strategy for illegal vector types from integer
promotion to widening. This only applies for vectors with elements of width
that is a multiple of a byte since we have hardware support for vectors with
1, 2, 4, 8 and 16 byte elements.
Integer promotion for vectors is quite expensive on PPC due to the sequence
of breaking apart the vector, extending the elements and reconstituting the
vector. Two of these operations are expensive.
This patch causes between minor and major improvements in performance on most
benchmarks. There are very few benchmarks whose performance regresses. These
regressions can be handled in a subsequent patch with a DAG combine (similar
to how this patch handles int -> fp conversions of illegal vector types).
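
The difference between the two legalization strategies can be sketched as a shape computation. This is a simplified model, not the legalizer's logic: it assumes a 128-bit vector register, power-of-two element widths, and hypothetical helper names, and it reduces the applicability condition to "element width is a whole number of bytes".

```python
VECTOR_BITS = 128  # assumption: a 128-bit vector register


def promote(num_elts, elt_bits):
    """Integer promotion: keep the element count, grow each element."""
    return num_elts, VECTOR_BITS // num_elts


def widen(num_elts, elt_bits):
    """Widening: keep the element width, pad with extra (undefined) lanes."""
    return VECTOR_BITS // elt_bits, elt_bits


def can_widen(elt_bits):
    """Simplified condition: widening applies to whole-byte element widths."""
    return elt_bits % 8 == 0


# A vector of two 16-bit elements, written as (element count, element bits).
promoted = promote(2, 16)  # every element must be extended
widened = widen(2, 16)     # elements keep their width
```

In the promoted shape every element changes width, which is where the break-apart, extend and rebuild sequence comes from; in the widened shape the original elements are left untouched.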

llvm-svn: 274535

44513e54