Commits · dc073addc5e89601f35be20566a146cb887f2350 · Roger Ferrer / llvm-epi-0.8

Sep 19, 2013

whitespace · dc073add
Andrew Trick authored Sep 18, 2013
```
llvm-svn: 190973
```
dc073add

Fix two issues regarding Got pointer (GP) setup. · d6aadc79

Reed Kotler authored Sep 18, 2013

1) make sure that the first two instructions of the sequence cannot
separate from each other. The linker requires that they be sequential.
If they get separated, it can still work but it will not work in all
cases because the first of the instructions mostly involves the hi part
of the pc relative offset and that part changes slowly. You would have
to be at the right boundary for this to matter.
2) make sure that this sequence begins on a longword boundary.
There appears to be a bug in binutils which makes some of these calculations
get messed up if the instruction sequence does not begin on a longword
boundary. This is being investigated with the appropriate binutils folks.

llvm-svn: 190966

d6aadc79

Debug info: Get rid of the VLA indirection hack in FastISel. · 262bcf45

Adrian Prantl authored Sep 18, 2013

Use the DIVariable::isIndirect() flag set by the frontend instead of
guessing whether to set the machine location's indirection bit.
Paired commit with CFE.

llvm-svn: 190961

262bcf45

Sep 18, 2013

Make DynamicLibrary use ManagedStatic. This is pretty simple and should just work as · 57093e88

Filip Pizlo authored Sep 18, 2013

advertised - but it does have the caveat that calls to DynamicLibrary::AddSymbol will
"reset" if you shutdown llvm and try to come back for seconds. This is a subtle
behavior change, but I'm assuming that nobody is affected by it.

llvm-svn: 190946

57093e88

More XCore TTI cleanup -- remove an unused private field flagged by · dbf6589b
Chandler Carruth authored Sep 18, 2013
```
-Wunused-private-field with Clang.

llvm-svn: 190941
```
dbf6589b
[asan] call __asan_stack_malloc_N only if use-after-return detection is... · f322382e
Kostya Serebryany authored Sep 18, 2013
```
[asan] call __asan_stack_malloc_N only if use-after-return detection is enabled with the run-time option

llvm-svn: 190939
```
f322382e
Target/XCore/CMakeLists.txt: Add XCoreTargetTransformInfo.cpp. · 0b642ec1
NAKAMURA Takumi authored Sep 18, 2013
```
llvm-svn: 190937
```
0b642ec1

Prevent LoopVectorizer and SLPVectorizer running if the target has no vector registers. · f637e2cb

Robert Lytton authored Sep 18, 2013

XCore target: Add XCoreTargetTransformInfo
This is where getNumberOfRegisters() resides, which in turn returns the
number of vector registers (=0).

llvm-svn: 190936

f637e2cb

[SystemZ] Add unsigned compare-and-branch instructions · 93183ee7

Richard Sandiford authored Sep 18, 2013

For some reason I never got around to adding these at the same time as
the signed versions.  No idea why.

I'm not sure whether this SystemZII::BranchC* stuff is useful, or whether
it should just be replaced with an "is normal" flag.  I'll leave that
for later though.

There are some boundary conditions that can be tweaked, such as preferring
unsigned comparisons for equality with [128, 256), and "<= 255" over "< 256",
but again I'll leave those for a separate patch.

llvm-svn: 190930

93183ee7

[ARMv8] Add CRC instructions. · 2f8890ed
Joey Gouly authored Sep 18, 2013
```
Patch by Bradley Smith!

llvm-svn: 190928
```
2f8890ed

Revert r190921. It broke Windows. · 591f1541

Filip Pizlo authored Sep 18, 2013

I'll roll it back in when I have a chance to look at it in detail.

llvm-svn: 190923

591f1541

Make DynamicLibrary use ManagedStatic. This is pretty simple and should just work as · 4389ee38

Filip Pizlo authored Sep 18, 2013

llvm-svn: 190921

4389ee38

Prevent extra calls to ToggleFeature for Feature64Bit and FeatureCMOV if... · 358c7989

Craig Topper authored Sep 18, 2013

Prevent extra calls to ToggleFeature for Feature64Bit and FeatureCMOV if they've already been enabled. The extra call ends up clearing the bit in FeatureBits since its a 'toggle'. Can't prove that anything was broken because of this since I don't think the FeatureBits for these are used.

llvm-svn: 190920

358c7989

Fix X86 subtarget to not overwrite the autodetected features by calling... · a8442344

Craig Topper authored Sep 18, 2013

Fix X86 subtarget to not overwrite the autodetected features by calling InitMCProcessorInfo right after detecting them. Instead add a new function that only updates the scheduling model and call that.

llvm-svn: 190919

a8442344

Revert accidental commit I had to make to get the test case in PR17268 to still work correctly. · be3e01e6
Craig Topper authored Sep 18, 2013
```
llvm-svn: 190917
```
be3e01e6
Lift alignment restrictions for load/store folding on VINSERTF128/VEXTRACTF128. Fixes PR17268. · 98064b9f
Craig Topper authored Sep 18, 2013
```
llvm-svn: 190916
```
98064b9f
ifndef NDEBUG-out an asserts-only constant committed in r190863 · eacc287b
David Blaikie authored Sep 18, 2013
```
llvm-svn: 190905
```
eacc287b

Fix a constant folding address space place I missed. · d12e8020

Matt Arsenault authored Sep 17, 2013

If address space 0 was smaller than the address space
in a constant inttoptr/ptrtoint pair, the wrong mask size
would be used.

llvm-svn: 190899

d12e8020

COFF: Ensure that objects produced by LLVM link with /safeseh · c1e7621e

Reid Kleckner authored Sep 17, 2013

Summary:
We indicate that the object files are safe by emitting a @feat.00
absolute address symbol.  The address is presumably interpreted as a
bitfield of features that the compiler would like to enable.  Bit 0 is
documented in the PE COFF spec to opt in to "registered SEH", which is
what /safeseh enables.

LLVM's object files are safe by default because LLVM doesn't know how to
produce SEH handlers.

Reviewers: Bigcheese

CC: llvm-commits

Differential Revision: http://llvm-reviews.chandlerc.com/D1691

llvm-svn: 190898

c1e7621e

Revert the load slicing done in r190870. · 870b6627

Quentin Colombet authored Sep 17, 2013

To avoid regressions with bitfield optimizations, this slicing should take place
later, like ISel time.

llvm-svn: 190891

870b6627

Sep 17, 2013

COFF: Emit all MCSymbols rather than filtering out some of them · 3ea536fe

Reid Kleckner authored Sep 17, 2013

In particular, this means we emit non-external symbols defined to
variables, such as aliases or absolute addresses.

This is needed to implement /safeseh, and it appears there was some
confusion about what symbols to emit previously.

llvm-svn: 190888

3ea536fe

COFF: Remove ExportSection, which has been dead since r114823 · 50689eb9
Reid Kleckner authored Sep 17, 2013
```
llvm-svn: 190887
```
50689eb9
Move variable into assert to avoid unused variable warning. · e7af7bd8
Eric Christopher authored Sep 17, 2013
```
llvm-svn: 190886
```
e7af7bd8

Cleanup handling of constant function casts. · e6952f28

Matt Arsenault authored Sep 17, 2013

Some of this code is no longer necessary since int<->ptr casts are no
longer occur as of r187444.

This also fixes handling vectors of pointers, and adds a bunch of new
testcases for vectors and address spaces.

llvm-svn: 190885

e6952f28

[PowerPC] Add a FIXME. · bdae03f2

Bill Schmidt authored Sep 17, 2013

Documenting a design choice to generate only medium model sequences for TLS
addresses at this time.  Small and large code models could be supported if
necessary.

llvm-svn: 190883

bdae03f2

[PowerPC] Fix problems with large code model (PR17169). · bb381d70

Bill Schmidt authored Sep 17, 2013

Large code model on PPC64 requires creating and referencing TOC entries when
using the addis/ld form of addressing. This was not being done in all cases.
The changes in this patch to PPCAsmPrinter::EmitInstruction() fix this. Two
test cases are also modified to reflect this requirement.

Fast-isel was not creating correct code for loading floating-point constants
using large code model. This also requires the addis/ld form of addressing.
Previously we were using the addis/lfd shortcut which is only applicable to
medium code model. One test case is modified to reflect this requirement.

llvm-svn: 190882

bb381d70

Costmodel: Add support for horizontal vector reductions · cae8735a

Arnold Schwaighofer authored Sep 17, 2013

Upcoming SLP vectorization improvements will want to be able to estimate costs
of horizontal reductions. Add infrastructure to support this.

We model reductions as a series of (shufflevector,add) tuples ultimately
followed by an extractelement. For example, for an add-reduction of <4 x float>
we could generate the following sequence:

 (v0, v1, v2, v3)
   \   \  /  /
     \  \  /
       +  +

 (v0+v2, v1+v3, undef, undef)
    \      /
 ((v0+v2) + (v1+v3), undef, undef)

 %rdx.shuf = shufflevector <4 x float> %rdx, <4 x float> undef,
                           <4 x i32> <i32 2, i32 3, i32 undef, i32 undef>
 %bin.rdx = fadd <4 x float> %rdx, %rdx.shuf
 %rdx.shuf7 = shufflevector <4 x float> %bin.rdx, <4 x float> undef,
                          <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
 %bin.rdx8 = fadd <4 x float> %bin.rdx, %rdx.shuf7
 %r = extractelement <4 x float> %bin.rdx8, i32 0

This commit adds a cost model interface "getReductionCost(Opcode, Ty, Pairwise)"
that will allow clients to ask for the cost of such a reduction (as backends
might generate more efficient code than the cost of the individual instructions
summed up). This interface is excercised by the CostModel analysis pass which
looks for reduction patterns like the one above - starting at extractelements -
and if it sees a matching sequence will call the cost model interface.

We will also support a second form of pairwise reduction that is well supported
on common architectures (haddps, vpadd, faddp).

 (v0, v1, v2, v3)
  \   /    \  /
 (v0+v1, v2+v3, undef, undef)
    \     /
 ((v0+v1)+(v2+v3), undef, undef, undef)

  %rdx.shuf.0.0 = shufflevector <4 x float> %rdx, <4 x float> undef,
        <4 x i32> <i32 0, i32 2 , i32 undef, i32 undef>
  %rdx.shuf.0.1 = shufflevector <4 x float> %rdx, <4 x float> undef,
        <4 x i32> <i32 1, i32 3, i32 undef, i32 undef>
  %bin.rdx.0 = fadd <4 x float> %rdx.shuf.0.0, %rdx.shuf.0.1
  %rdx.shuf.1.0 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
        <4 x i32> <i32 0, i32 undef, i32 undef, i32 undef>
  %rdx.shuf.1.1 = shufflevector <4 x float> %bin.rdx.0, <4 x float> undef,
        <4 x i32> <i32 1, i32 undef, i32 undef, i32 undef>
  %bin.rdx.1 = fadd <4 x float> %rdx.shuf.1.0, %rdx.shuf.1.1
  %r = extractelement <4 x float> %bin.rdx.1, i32 0

llvm-svn: 190876

cae8735a

SLPVectorizer: Don't vectorize phi nodes that use invoke values · 4a3dcaa1

Arnold Schwaighofer authored Sep 17, 2013

We can't insert an insertelement after an invoke. We would have to split a
critical edge. So when we see a phi node that uses an invoke we just give up.

radar://14990770

llvm-svn: 190871

4a3dcaa1

[InstCombiner] Slice a big load in two loads when the elements are next to each · b8d672ef

Quentin Colombet authored Sep 17, 2013

other in memory.

The motivation was to get rid of truncate and shift right instructions that get
in the way of paired load or floating point load.
E.g.,
Consider the following example:
struct Complex {
  float real;
  float imm;
};

When accessing a complex, llvm was generating a 64-bits load and the imm field
was obtained by a trunc(lshr) sequence, resulting in poor code generation, at
least for x86.

The idea is to declare that two load instructions is the canonical form for
loading two arithmetic type, which are next to each other in memory.

Two scalar loads at a constant offset from each other are pretty
easy to detect for the sorts of passes that like to mess with loads. 

<rdar://problem/14477220>

llvm-svn: 190870

b8d672ef

Remove unused code, which had been commented out. · ba6f9d1b
Preston Gurd authored Sep 17, 2013
```
llvm-svn: 190869
```
ba6f9d1b
Added documentation to getMemsetStores. · 8ec39992
Serge Pavlov authored Sep 17, 2013
```
llvm-svn: 190866
```
8ec39992

Add llvm.x86.* intrinsics for Intel SHA Extensions · de39520f

Ben Langmuir authored Sep 17, 2013

Add llvm.x86.* intrinsics for all of the Intel SHA Extensions instructions, as
well as tests. Also remove mayLoad and hasSideEffects, which can be inferred
from the instruction patterns.

llvm-svn: 190864

de39520f

[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another... · bc86efb8
Kostya Serebryany authored Sep 17, 2013
```
[asan] inline the calls to __asan_stack_free_* with small sizes. Yet another 10%-20% speedup for use-after-return

llvm-svn: 190863
```
bc86efb8
[ARM] Fix the deprecation of MCR encodings that map to CP15{ISB,DSB,DMB}. · 830c27ab
Joey Gouly authored Sep 17, 2013
```
llvm-svn: 190862
```
830c27ab

Bugfix for PR17099: · dc2c4b44

Stepan Dyatkovskiy authored Sep 17, 2013

Wrong cast operation.
MergeFunctions emits Bitcast instead of pointer-to-integer operation.
Patch fixes MergeFunctions::writeThunk function. It replaces
unconditional Bitcast creation with "Value* createCast(...)" method, that
checks operand types and selects proper instruction.
See unit-test as example.

llvm-svn: 190859

dc2c4b44

AVX-512: Converted to Unix style · ac3e8eb9
Elena Demikhovsky authored Sep 17, 2013
```
llvm-svn: 190851
```
ac3e8eb9
Add AES and SHA instructions to the load folding tables. · 514f02cc
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190850
```
514f02cc
Fix column alignment. No functional change. · 684abc82
Craig Topper authored Sep 17, 2013
```
llvm-svn: 190849
```
684abc82
Implement 3 AArch64 neon instructions : umov smov ins. · 36399e6b
Kevin Qin authored Sep 17, 2013
```
llvm-svn: 190839
```
36399e6b

[SelectionDAG] Teach the vector scalarizer about TRUNCATE. · d30a9585

Quentin Colombet authored Sep 17, 2013

When a truncate node defines a legal vector type but uses an illegal
vector type, the legalization process was splitting the vector until
<1 x vector> type, but then it was failing to scalarize the node because
it did not know how to handle TRUNCATE.

<rdar://problem/14989896>

llvm-svn: 190830

d30a9585