Commits · 46fc00661914fb7cab5fec1200f1e01f48ef2d3c · Roger Ferrer / llvm-epi-0.8

Sep 17, 2013

Use a StreamString to fix the endianness in · 46fc0066

Sean Callanan authored Sep 17, 2013

constants before using them in the IR interpreter.

Patch by Félix Cloutier.

llvm-svn: 190877

46fc0066

Costmodel: Add support for horizontal vector reductions · cae8735a

Arnold Schwaighofer authored Sep 17, 2013

Upcoming SLP vectorization improvements will want to be able to estimate costs
of horizontal reductions. Add infrastructure to support this.

We model reductions as a series of (shufflevector,add) tuples ultimately
followed by an extractelement. For example, for an add-reduction of <4 x float>
we could generate the following sequence:

 (v0, v1, v2, v3)
   \   \  /  /
     \  \  /
       +  +

 (v0+v2, v1+v3, undef, undef)
    \      /
 ((v0+v2) + (v1+v3), undef, undef)

 %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef,
                           <4 x i32> <i32 2, i32 3, i32 undef, i32 undef>
 %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf
 %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef,
                          <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
 %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7
 %r = extractelement <4 x float> %bin.rdx8, i32 0

This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)"
that will allow clients to ask for the cost of such a reduction (as backends
might generate more efficient code than the cost of the individual instructions
summed up). This interface is excercised by the CostModel analysis pass which
looks for reduction patterns like the one above - starting at extractelements -
and if it sees a matching sequence will call the cost model interface.

We will also support a second form of pairwise reduction that is well supported
on common architectures (haddps, vpadd, faddp).

 (v0, v1, v2, v3)
  \   /    \  /
 (v0+v1, v2+v3, undef, undef)
    \     /
 ((v0+v1)+(v2+v3), undef, undef, undef)

  %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef,
        <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef>
  %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef,
        <4 x i32> <i32 1, i32 3, i32 undef, i32 undef>
  %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1
  %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
        <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>
  %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
        <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
  %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1
  %r = extractelement <4 x float> %bin.rdx.1, i32 0

llvm-svn: 190876

cae8735a

Don't output a stray 0x if GetData fails for memory read -f hex · 661e89c1
Ed Maste authored Sep 17, 2013
```
llvm-svn: 190875
```
661e89c1
ObjectiveC modern translator: Provide proper cast of · ff0c4608
Fariborz Jahanian authored Sep 17, 2013
```
the ObjectiveC object of an @synchronized statement.
// rdar://14993814

llvm-svn: 190874
```
ff0c4608

Avoid abort on "memory read -s N" for N=3,5,6,7 · 74a23ea4

Ed Maste authored Sep 17, 2013

We cannot use "GetMaxU64Bitfield" for non-power-of-two sizes, so just use
the same code that handles N > 8 for these.

Review: http://llvm-reviews.chandlerc.com/D1699
llvm-svn: 190873

74a23ea4

Logging enhancements to ConnectionFileDescriptor · bac7af21
Andrew Kaylor authored Sep 17, 2013
```
llvm-svn: 190872
```
bac7af21

SLPVectorizer: Don't vectorize phi nodes that use invoke values · 4a3dcaa1

Arnold Schwaighofer authored Sep 17, 2013

We can't insert an insertelement after an invoke. We would have to split a
critical edge. So when we see a phi node that uses an invoke we just give up.

radar://14990770

llvm-svn: 190871

4a3dcaa1

[InstCombiner] Slice a big load in two loads when the elements are next to each · b8d672ef

Quentin Colombet authored Sep 17, 2013

other in memory.

The motivation was to get rid of truncate and shift right instructions that get
in the way of paired load or floating point load.
E.g.,
Consider the following example:
struct Complex {
  float real;
  float imm;
};

When accessing a complex, llvm was generating a 64-bits load and the imm field
was obtained by a trunc(lshr) sequence, resulting in poor code generation, at
least for x86.

The idea is to declare that two load instructions is the canonical form for
loading two arithmetic type, which are next to each other in memory.

Two scalar loads at a constant offset from each other are pretty
easy to detect for the sorts of passes that like to mess with loads. 

<rdar://problem/14477220>

llvm-svn: 190870

b8d672ef

Remove unused code, which had been commented out. · ba6f9d1b
Preston Gurd authored Sep 17, 2013
```
llvm-svn: 190869
```
ba6f9d1b

Examine more than 1 frame for equivalent contexts in ThreadPlanStepOverRange · 4b86728b

Daniel Malea authored Sep 17, 2013

- searches frames beginning from the current frame, stops when an equivalent context is found
- not using GetStackFrameCount() for performance reasons
- fixes TestInlineStepping (clang/gcc buildbots)

llvm-svn: 190868

4b86728b

Update Linux bug tracker link in TestPrintStackTraces · 7d0d6692

Daniel Malea authored Sep 17, 2013

- now fails due to llvm.org/pr15415 (partial stack trace while stopped inside read() call)

llvm-svn: 190867

7d0d6692

Added documentation to getMemsetStores. · 8ec39992
Serge Pavlov authored Sep 17, 2013
```
llvm-svn: 190866
```
8ec39992
Re-enabling TestStopHookMultipleThreads · f04a22d9
Daniel Malea authored Sep 17, 2013
```
- original bug llvm.org/pr14323 is long closed

llvm-svn: 190865
```
f04a22d9

Add llvm.x86.* intrinsics for Intel SHA Extensions · de39520f

Ben Langmuir authored Sep 17, 2013

Add llvm.x86.* intrinsics for all of the Intel SHA Extensions instructions, as
well as tests. Also remove mayLoad and hasSideEffects, which can be inferred
from the instruction patterns.

llvm-svn: 190864

de39520f

[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another... · bc86efb8
Kostya Serebryany authored Sep 17, 2013
```
[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another 10%-20% speedup for use-after-return

llvm-svn: 190863
```
bc86efb8
[ARM] Fix the deprecation of MCR encodings that map to CP15{ISB,DSB,DMB}. · 830c27ab
Joey Gouly authored Sep 17, 2013
```
llvm-svn: 190862
```
830c27ab

clang-format: Don't accidentally move tokens into preprocessor directive. · fb81b09d

Daniel Jasper authored Sep 17, 2013

This fixes llvm.org/PR17265.

Before:
  Foo::Foo()
  #ifdef BAR
      : baz(0)
  #endif {
  }

After:
  Foo::Foo()
  #ifdef BAR
      : baz(0)
  #endif
  {
  }

llvm-svn: 190861

fb81b09d

[ASan] Don't add SANITIZER_INTERFACE_ATTRIBUTE for internal ASan functions · c947eb08
Alexey Samsonov authored Sep 17, 2013
```
llvm-svn: 190860
```
c947eb08

Bugfix for PR17099: · dc2c4b44

Stepan Dyatkovskiy authored Sep 17, 2013

Wrong cast operation.
MergeFunctions emits Bitcast instead of pointer-to-integer operation.
Patch fixes MergeFunctions::writeThunk function. It replaces
unconditional Bitcast creation with "Value* createCast(...)" method, that
checks operand types and selects proper instruction.
See unit-test as example.

llvm-svn: 190859

dc2c4b44

clang-format: Add comment to tests explaining their grouping. · 545c652d
Daniel Jasper authored Sep 17, 2013
```
llvm-svn: 190858
```
545c652d
Fix typo. · 8092c957
Joerg Sonnenberger authored Sep 17, 2013
```
llvm-svn: 190857
```
8092c957
[ASan] Enable fake stack test on Mac and Android, as no-instrumentation tests are now fixed · a7f35c06
Alexey Samsonov authored Sep 17, 2013
```
llvm-svn: 190856
```
a7f35c06

clang-format: Fix line breaking bug after empty ifs. · 88f9222c

Daniel Jasper authored Sep 17, 2013

Before:
  if () {
  }
    else {
  }

After:
  if () {
  } else {
  }

This fixed llvm.org/PR17262.

llvm-svn: 190855

88f9222c

clang-format: Don't split a >>-operator. · 0de8efa6

Daniel Jasper authored Sep 17, 2013

Before (with column limit 60):
  aaaaaaaaaaaaaaaaaaaaaaaaaaaa(aaaaaaaaaaaaaaaaaaaaaaaaaaaaa >
      > aaaaa);

After:
  aaaaaaaaaaaaaaaaaaaaaaaaaaaa(
      aaaaaaaaaaaaaaaaaaaaaaaaaaaaa >> aaaaa);

(Not sure how that could have stayed in that long without being
detected..)

llvm-svn: 190854

0de8efa6

[ASan] Link tests with -pie if ASan runtime uses zero-base shadow · 676c109c
Alexey Samsonov authored Sep 17, 2013
```
llvm-svn: 190853
```
676c109c
[asan] further speedup use-after-return: simplify deallocation of fake frames. ~ 20% speedup. · 2f5c2be6
Kostya Serebryany authored Sep 17, 2013
```
llvm-svn: 190852
```
2f5c2be6
AVX-512: Converted to Unix style · ac3e8eb9
Elena Demikhovsky authored Sep 17, 2013
```
llvm-svn: 190851
```
ac3e8eb9
Add AES and SHA instructions to the load folding tables. · 514f02cc
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190850
```
514f02cc
Fix column alignment. No functional change. · 684abc82
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190849
```
684abc82

Push contents of X86TargetInfo::setFeatureEnabled down to a static function... · 86d79ef4

Craig Topper authored Sep 17, 2013

Push contents of X86TargetInfo::setFeatureEnabled down to a static function called by the virtual version and all the places in getDefaultFeatures. This way getDefaultFeatures doesn't make so many virtual calls.

llvm-svn: 190847

86d79ef4

Mark setSSELevel/setMMXLevel/setXOPLevel as static since they don't access anything in the class. · 13f61a6c
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190846
```
13f61a6c

Don't build extra init lists. · b2a8d464

Eli Friedman authored Sep 17, 2013

AssignConvertType::IncompatibleVectors means the two types are in fact
compatible. :)

No testcase; I don't think the extra init list has any actual visible effect
other than making the resulting AST dump look a bit strange.

llvm-svn: 190845

b2a8d464

Fix const-eval of vector init-lists of a vector. · 1409e6e7

Eli Friedman authored Sep 17, 2013

Like any other type, an init list for a vector can have the same type as
the vector itself; handle that case.

<rdar://problem/14990460>

llvm-svn: 190844

1409e6e7

Make a more clear AVX-512 section header that matches similar in the file. · 79d1bff2
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190843
```
79d1bff2
clang-format recent change · b276158e
Tobias Grosser authored Sep 17, 2013
```
llvm-svn: 190842
```
b276158e

Move SCEVAffinator member definitions out of class body · 0695ee43

Tobias Grosser authored Sep 17, 2013

Instead of defining the relevant functions inline, we now just keep the
declarations in the class itself. This makes the class declaration a lot
easier to read as all functions can be seen at once. We also use this
opportunity to privatize all functions not used in the public interface of the
class.

llvm-svn: 190841

0695ee43

[lld][ELF] Assign sectionChoice properly to ELF atoms · 063b7501

Shankar Easwaran authored Sep 17, 2013

This sets the sectionChoice property for DefinedAtoms. The output section name
is derived by the property of the atom. This also decreases native file size.

Adds a test.

llvm-svn: 190840

063b7501

Implement 3 AArch64 neon instructions : umov smov ins. · 36399e6b
Kevin Qin authored Sep 17, 2013
```
llvm-svn: 190839
```
36399e6b
Make the docs for the -s -o -S and -O options clearer. · 47ea51f5
Jim Ingham authored Sep 17, 2013
```
llvm-svn: 190838
```
47ea51f5
G M: Restore the ability for libcxx to compile again on mingw 64. · 5f878d4b
Howard Hinnant authored Sep 17, 2013
```
llvm-svn: 190837
```
5f878d4b