Commits · dd09a8f320d8dc0e5e57e68cc4e3d6dbb15ed4a3 · Roger Ferrer / llvm-epi

Oct 28, 2014

[AVX512] Bring back vector-shuffle lowering support through broadcasts · dd09a8f3

Robert Khasanov authored Oct 28, 2014

Ffter commit at rev219046 512-bit broadcasts lowering become non-optimal. Most of tests on broadcasting and embedded broadcasting were changed and they doesn’t produce efficient code.

Example below is from commit changes (it’s the first test from test/CodeGen/X86/avx512-vbroadcast.ll):

 define   <16 x i32> @_inreg16xi32(i32 %a) {
 ; CHECK-LABEL: _inreg16xi32:
 ; CHECK:       ## BB#0:
-; CHECK-NEXT:    vpbroadcastd %edi, %zmm0
+; CHECK-NEXT:    vmovd %edi, %xmm0
+; CHECK-NEXT:    vpbroadcastd %xmm0, %ymm0
+; CHECK-NEXT:    vinserti64x4 $1, %ymm0, %zmm0, %zmm0
 ; CHECK-NEXT:    retq
 %b = insertelement <16 x i32> undef, i32 %a, i32 0
 %c = shufflevector <16 x i32> %b, <16 x i32> undef, <16 x i32> zeroinitializer
 ret <16 x i32> %c
}

Here, 256-bit broadcast was generated instead of 512-bit one.

In this patch
1) I added vector-shuffle lowering through broadcasts
2) Removed asserts and branches likes because this is incorrect
-  assert(Subtarget->hasDQI() && "We can only lower v8i64 with AVX-512-DQI");
3) Fixed lowering tests

llvm-svn: 220774

dd09a8f3

Reformat partially, where I touched for whitespace changes. · d0e13af2
NAKAMURA Takumi authored Oct 28, 2014
```
llvm-svn: 220773
```
d0e13af2
LoopRerollPass.cpp: Use range-based loop. NFC. · 5af50a54
NAKAMURA Takumi authored Oct 28, 2014
```
llvm-svn: 220772
```
5af50a54
Untabify and whitespace cleanups. · 335a7bcf
NAKAMURA Takumi authored Oct 28, 2014
```
llvm-svn: 220771
```
335a7bcf
clang/test/Modules/explicit-build.cpp: Tweak to meet win32's backslash. · 314df7a5
NAKAMURA Takumi authored Oct 28, 2014
```
llvm-svn: 220770
```
314df7a5

[libcxx] Delay evaluation of __make_tuple_types to prevent blowing the max... · 295bce11

Eric Fiselier authored Oct 28, 2014

[libcxx] Delay evaluation of __make_tuple_types to prevent blowing the max template instantiation depth. Fixes Bug #18345

Summary:
http://llvm.org/bugs/show_bug.cgi?id=18345

Tuple's constructor and assignment operators for "tuple-like" types evaluates __make_tuple_types unnecessarily. In the case of a large array this can blow the template instantiation depth.

Ex:
```
#include <array>
#include <tuple>
#include <memory>
 
typedef std::array<int, 1256> array_t;
typedef std::tuple<array_t> tuple_t;

int main() {
  array_t a;
  tuple_t t(a); // broken
  t = a; // broken

  // make_shared uses tuple behind the scenes. This bug breaks this code.
  std::make_shared<array_t>(a);
}
```

To prevent this from happening we delay the instantiation of `__make_tuple_types` until after we perform the length check. Currently `__make_tuple_types` is instantiated at the same time that the length check .


Test Plan: Two tests have been added. One for the "tuple-like" constructors and another for the "tuple-like" assignment operator. 

Reviewers: mclow.lists, EricWF

Reviewed By: EricWF

Subscribers: K-ballo, cfe-commits

Differential Revision: http://reviews.llvm.org/D4467

llvm-svn: 220769

295bce11

Update compile target dependency. · 44067eea

Rui Ueyama authored Oct 28, 2014

test/elf/Mips/hilo16-*.test depends on llvm-mc, so we need to
make CMake to build that before running the tests.

llvm-svn: 220768

44067eea

[OCaml] Enable -g for debug builds. · 3ebd0bf2
Peter Zotov authored Oct 28, 2014
```
We don't care about pre-3.12.1 anymore.

llvm-svn: 220767
```
3ebd0bf2
[OCaml] Fix whitespace. · 110f6291
Peter Zotov authored Oct 28, 2014
```
llvm-svn: 220766
```
110f6291
Fix warning text: lower -> higher · 9ad40ac7
Richard Trieu authored Oct 28, 2014
```
llvm-svn: 220763
```
9ad40ac7

Add breakpoint instruction byte sequences for arm to · 2586e94b

Jason Molenda authored Oct 28, 2014

PlatformLinux::GetSoftwareBreakpointTrapOpcode.

Patch by Stephane Sezer.
http://reviews.llvm.org/D5923

llvm-svn: 220762

2586e94b

Clarify the launch style for debugserver to use. · 0f7828cf
Jason Molenda authored Oct 28, 2014
```
<rdar://problem/18786645> 

llvm-svn: 220761
```
0f7828cf
Driver: remove a stray s that propagated in cross-windows · 56dd1ac1
Saleem Abdulrasool authored Oct 28, 2014
```
The option is '--allow-multiple-definition' not '--allow-multiple-definitions'.

llvm-svn: 220760
```
56dd1ac1
Minimize the scope of some variables, NFC. · ff468a5e
David Blaikie authored Oct 28, 2014
```
llvm-svn: 220759
```
ff468a5e

X86: Implement the vectorcall calling convention · 9ccce99e

Reid Kleckner authored Oct 28, 2014

This is a Microsoft calling convention that supports both x86 and x86_64
subtargets. It passes vector and floating point arguments in XMM0-XMM5,
and passes them indirectly once they are consumed.

Homogenous vector aggregates of up to four elements can be passed in
sequential vector registers, but this part is not implemented in LLVM
and will be handled in Clang.

On 32-bit x86, it is similar to fastcall in that it uses ecx:edx as
integer register parameters and is callee cleanup. On x86_64, it
delegates to the normal win64 calling convention.

Reviewers: majnemer

Differential Revision: http://reviews.llvm.org/D5943

llvm-svn: 220745

9ccce99e

AArch64: enable Cortex-A57 FP balancing on Cortex-A53. · 00917897

Tim Northover authored Oct 28, 2014

Benchmarks have shown that it's harmless to the performance there, and having a
unified set of passes between the two cores where possible helps big.LITTLE
deployment.

Patch by Z. Zheng.

llvm-svn: 220744

00917897

Add a test for setting and hitting the C++ Exception throw breakpoint. · c891d863
Jim Ingham authored Oct 28, 2014
```
llvm-svn: 220743
```
c891d863
Update for LLVM API change. · 37ad1342
Rafael Espindola authored Oct 28, 2014
```
llvm-svn: 220742
```
37ad1342

Remove the PreserveSource linker mode. · 9f8eff31

Rafael Espindola authored Oct 28, 2014

I noticed that it was untested, and forcing it on caused some tests to fail:

    LLVM :: Linker/metadata-a.ll
    LLVM :: Linker/prefixdata.ll
    LLVM :: Linker/type-unique-odr-a.ll
    LLVM :: Linker/type-unique-simple-a.ll
    LLVM :: Linker/type-unique-simple2-a.ll
    LLVM :: Linker/type-unique-simple2.ll
    LLVM :: Linker/type-unique-type-array-a.ll
    LLVM :: Linker/unnamed-addr1-a.ll
    LLVM :: Linker/visibility1.ll

If it is to be resurrected, it has to be fixed and we should probably have a
-preserve-source command line option in llvm-mc and run tests with and without
it.

llvm-svn: 220741

9f8eff31

Improve on the diagnostic in my last patch and change warning · 294eecf6
Fariborz Jahanian authored Oct 27, 2014
```
to error. rdar://18768214.

llvm-svn: 220740
```
294eecf6
AArch64InstrInfo.h: Fix a warning introduced in clang r220703. [-Winconsistent-missing-override] · 949fb6d2
NAKAMURA Takumi authored Oct 27, 2014
```
llvm-svn: 220739
```
949fb6d2
Remove unused variable. · 79c98cc9
Richard Smith authored Oct 27, 2014
```
llvm-svn: 220738
```
79c98cc9

[AVX512] Add vpermil variable version · cf7a4a26

Adam Nemet authored Oct 27, 2014

This is implemented via a multiclass that derives from the vperm imm
multiclass.

Fixes <rdar://problem/18426089>

llvm-svn: 220737

cf7a4a26

[AVX512] Clean up avx512_perm_imm to use X86VectorVTInfo · 8d85b0cd

Adam Nemet authored Oct 27, 2014

No functionality change.  No change in X86.td.expanded except that we only set
the CD8 attributes for the memory variants.  (This shouldn't be used unless we
have a memory operand.)

llvm-svn: 220736

8d85b0cd

[AVX512] Derive vpermil* from avx512_perm_imm · 9aad1316

Adam Nemet authored Oct 27, 2014

This used to derive from avx512_pshuf_imm which is confusing.

NFC.  Compared X86.td.expanded.

llvm-svn: 220735

9aad1316

[AVX512] Fix copy-and-paste bugs in vpermil · c51cee85

Adam Nemet authored Oct 27, 2014

1) i512mem -> f512mem (this is the packed FP input being permuted)
2) element size is 64 bits in EVEX_CD8 for PD.

(A good illustration why X86VectorVTInfo is useful)

llvm-svn: 220734

c51cee85

Use the newer/simple API for passing a diagnostic handler to the IR linker. · c008c643
Rafael Espindola authored Oct 27, 2014
```
llvm-svn: 220733
```
c008c643
Make it easier to pass a custom diagnostic handler to the IR linker. · 4160f5d3
Rafael Espindola authored Oct 27, 2014
```
llvm-svn: 220732
```
4160f5d3
[modules] Load .pcm files specified by -fmodule-file lazily. · d4b230b3
Richard Smith authored Oct 27, 2014
```
llvm-svn: 220731
```
d4b230b3

Oct 27, 2014
- TMP: fix readN & writeN to not encourage UB · 40d3ad33
  Tim Northover authored Oct 27, 2014
```
llvm-svn: 220730
```
  40d3ad33
- Test that the single-threaded lit feature is available iff the corresponding guard is #defined · 33c2c02e
  Jon Roelofs authored Oct 27, 2014
```
http://reviews.llvm.org/D6006

llvm-svn: 220729
```
  33c2c02e
- Fix a stackmap bug introduced in r220710. · 7c801dc9
  Pete Cooper authored Oct 27, 2014
```
For a call to not return in to the stackmap shadow, the shadow must end with the call.

To do this, we must insert any required nops *before* the call, and not after it.

llvm-svn: 220728
```
  7c801dc9
- Objective-C ARC [qoi]. Issue diagnostic if __bridge casting · 992bdf1b
  Fariborz Jahanian authored Oct 27, 2014
```
to C type a collection literal. rdar://18768214

llvm-svn: 220727
```
  992bdf1b
- Frontend: Don't include stdin in the dependency list for an object file · f0822fb0
  David Majnemer authored Oct 27, 2014
```
GCC doesn't do this and it semes weird to include a file that we can't
open.

This fixes PR21362.

llvm-svn: 220726
```
  f0822fb0
- Try to appease the C++ gods · a41521a8
  Hans Wennborg authored Oct 27, 2014
```
Looks like some builds were not happy with the potentially-throwing move
constructor that was added in r220723, and reached for the implicitly
deleted copy constructor instead.

llvm-svn: 220725
```
  a41521a8
- Add special case handling of linux target triples that do not contain `-gnu`. · bb191417
  Eric Fiselier authored Oct 27, 2014
```
For targets that end it `redhat-linux` and `suse-linux` manually add the `-gnu`
section of the target since `linux-gnu` is needed in the testsuite.

This patch also moves the removal of minor and patchlevel numbers from OSX
triples to be handled when deducing the triple instead of when adding available
features.

llvm-svn: 220724
```
  bb191417
- Give TypoExprState a move constructor and assignment operator to appease MSVC build · 5d838722
  Hans Wennborg authored Oct 27, 2014
```
llvm-svn: 220723
```
  5d838722
- Add test to ensure including <atomic> fails when _LIBCPP_HAS_NO_THREADS is defined. · b2a6048b
  Eric Fiselier authored Oct 27, 2014
```
llvm-svn: 220722
```
  b2a6048b
- [ScalarEvolution] Guard dump() with #if · 53c1612e
  Jingyue Wu authored Oct 27, 2014
```
to be consistent with its definition in ScalarEvolution.cpp

llvm-svn: 220721
```
  53c1612e
- Make sure OTHER_CFLAGS and OTHER_LDFLAGS are inherited from the Xcode project... · dc574df0
  Greg Clayton authored Oct 27, 2014
```
Make sure OTHER_CFLAGS and OTHER_LDFLAGS are inherited from the Xcode project so you can easily add to the flags of all targets.

llvm-svn: 220720
```
  dc574df0