Commits · 8e06e2a8c4778420f7ea30593f3b8677835f9286 · Roger Ferrer / llvm-epi-0.8

Apr 10, 2013

R600/SI: adjust writemask to only the used components · 8e06e2a8

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179165

8e06e2a8

R600/SI: remove image sample writemask · 4ace6632

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179164

4ace6632

Cleanup PPCInstrInfo::DefinesPredicate · af822018

Hal Finkel authored Apr 10, 2013

Implement suggestions made by Bill Schmidt in post-commit review. Thanks!

llvm-svn: 179162

af822018

RegionInfo: Add helpers to replace entry/exit recursively · 141cc3e8
Tobias Grosser authored Apr 10, 2013
```
Contributed by: Star Tan <tanmx_star@yeah.net>

llvm-svn: 179157
```
141cc3e8

PPC: Prep for if conversion of bctr[l] · 500b0045

Hal Finkel authored Apr 10, 2013

This adds in-principle support for if-converting the bctr[l] instructions.
These instructions are used for indirect branching. It seems, however, that the
current if converter will never actually predicate these. To do so, it would
need the ability to hoist a few setup insts. out of the conditionally-executed
block. For example, code like this:
  void foo(int a, int (*bar)()) { if (a != 0) bar(); }
becomes:
        ...
        beq 0, .LBB0_2
        std 2, 40(1)
        mr 12, 4
        ld 3, 0(4)
        ld 11, 16(4)
        ld 2, 8(4)
        mtctr 3
        bctrl
        ld 2, 40(1)
.LBB0_2:
        ...
and it would be safe to do all of this unconditionally with a predicated
beqctrl instruction.

llvm-svn: 179156

500b0045

Template the MachO types over endianness. · eaae687d
Rafael Espindola authored Apr 10, 2013
```
For now they are still only used as little endian.

llvm-svn: 179147
```
eaae687d
Include the more specific header. · e9c2407c
Rafael Espindola authored Apr 10, 2013
```
llvm-svn: 179146
```
e9c2407c
__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in · ac0469c5
Evan Cheng authored Apr 10, 2013
```
xmm0 / xmm1.

rdar://13599493

llvm-svn: 179141
```
ac0469c5

Generalize the PassConfig API and remove addFinalizeRegAlloc(). · e220323c

Andrew Trick authored Apr 10, 2013

The target hooks are getting out of hand. What does it mean to run
before or after regalloc anyway? Allowing either Pass* or AnalysisID
pass identification should make it much easier for targets to use the
substitutePass and insertPass APIs, and create less need for badly
named target hooks.

llvm-svn: 179140

e220323c

Mips specific inline asm operand modifier 'D' · b04e357d

Jack Carter authored Apr 09, 2013

Modifier 'D' is to use the second word of a double integer.

We had previously implemented the pure register varient of 
the modifier and this patch implements the memory reference.



#include "stdio.h"

int b[8] = {0,1,2,3,4,5,6,7};
void main()
{
    int i;
    
    // The first word. Notice, no 'D'
    {asm (
    "lw    %0,%1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}
    
    printf("%d\n",i);

    // The second word
    {asm (
    "lw    %0,%D1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}
    
    printf("%d\n",i);
}

llvm-svn: 179135

b04e357d

Allow PPC B and BLR to be if-converted into some predicated forms · 5711eca1

Hal Finkel authored Apr 09, 2013

This enables us to form predicated branches (which are the same conditional
branches we had before) and also a larger set of predicated returns (including
instructions like bdnzlr which is a conditional return and loop-counter
decrement all in one).

At the moment, if conversion does not capture all possible opportunities. A
simple example is provided in early-ret2.ll, where if conversion forms one
predicated return, and then the PPCEarlyReturn pass picks up the other one. So,
at least for now, we'll keep both mechanisms.

llvm-svn: 179134

5711eca1

Fix some comment typos. · 798a7709
Bob Wilson authored Apr 09, 2013
```
llvm-svn: 179132
```
798a7709

Apr 09, 2013

Cleanup. No functional change intended. · 18785857
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179129
```
18785857
Cleanup. No functional change intended. · 10d1d1cc
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179125
```
10d1d1cc
Remove unused method and default values. · 1b276c5c
Rafael Espindola authored Apr 09, 2013
```
llvm-svn: 179124
```
1b276c5c
Update the version of dwarf we say we're emitting to at least 3. · 06c89d65
Eric Christopher authored Apr 09, 2013
```
Deals with a dwarf2 -> dwarf3 DW_FORM_ref_addr change.

llvm-svn: 179122
```
06c89d65
Revert r179115 as it looks to have killed the ASan tests. · e8d8288d
Chad Rosier authored Apr 09, 2013
```
llvm-svn: 179120
```
e8d8288d
Rationalize the formatting of these case labels. Having two sorted · 9f6b59ae
Chandler Carruth authored Apr 09, 2013
```
columns is essentially impossible to edit.

llvm-svn: 179119
```
9f6b59ae

This patch enables llvm to switch between compiling for mips32/mips64 · 1595f36d

Reed Kotler authored Apr 09, 2013

and mips16 on a per function basis.

Because this patch is somewhat involved I have provide an overview of the
key pieces of it.

The patch is written so as to not change the behavior of the non mixed
mode. We have tested this a lot but it is something new to switch subtargets
so we don't want any chance of regression in the mainline compiler until
we have more confidence in this.

Mips32/64 are very different from Mip16 as is the case of ARM vs Thumb1.
For that reason there are derived versions of the register info, frame info, 
instruction info and instruction selection classes.

Now we register three separate passes for instruction selection.
One which is used to switch subtargets (MipsModuleISelDAGToDAG.cpp) and then
one for each of the current subtargets (Mips16ISelDAGToDAG.cpp and
MipsSEISelDAGToDAG.cpp).

When the ModuleISel pass runs, it determines if there is a need to switch
subtargets and if so, the owning pointers in MipsTargetMachine are
appropriately changed.

When 16Isel or SEIsel is run, they will return immediately without doing
any work if the current subtarget mode does not apply to them.

In addition, MipsAsmPrinter needs to be reset on a function basis.

The pass BasicTargetTransformInfo is substituted with a null pass since the
pass is immutable and really needs to be a function pass for it to be
used with changing subtargets. This will be fixed in a follow on patch.

llvm-svn: 179118

1595f36d

Add support for bottom-up SLP vectorization infrastructure. · 2d9dec32

Nadav Rotem authored Apr 09, 2013

This commit adds the infrastructure for performing bottom-up SLP vectorization (and other optimizations) on parallel computations.
The infrastructure has three potential users:

  1. The loop vectorizer needs to be able to vectorize AOS data structures such as (sum += A[i] + A[i+1]).

  2. The BB-vectorizer needs this infrastructure for bottom-up SLP vectorization, because bottom-up vectorization is faster to compute.

  3. A loop-roller needs to be able to analyze consecutive chains and roll them into a loop, in order to reduce code size. A loop roller does not need to create vector instructions, and this infrastructure separates the chain analysis from the vectorization.

This patch also includes a simple (100 LOC) bottom up SLP vectorizer that uses the infrastructure, and can vectorize this code:

void SAXPY(int *x, int *y, int a, int i) {
  x[i]   = a * x[i]   + y[i];
  x[i+1] = a * x[i+1] + y[i+1];
  x[i+2] = a * x[i+2] + y[i+2];
  x[i+3] = a * x[i+3] + y[i+3];
}

llvm-svn: 179117

2d9dec32

Make check depend on all. · caeddf5a
Eric Christopher authored Apr 09, 2013
```
llvm-svn: 179116
```
caeddf5a

[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to · a08f30f0

Chad Rosier authored Apr 09, 2013

parse an identifier.  Otherwise, parseExpression may parse multiple tokens,
which makes it impossible to properly compute an immediate displacement.
An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in
the below example:

 __asm mov eax, [Symbol + ImmDisp]

The existing test cases exercise this patch.
rdar://13611297

llvm-svn: 179115

a08f30f0

The .dwo section shouldn't contain the unrelocated values (and · 52ce7189

Eric Christopher authored Apr 09, 2013

therefore not at all) of the pc or statement list. We also don't
need to emit the compilation dir so save so space and time
and don't bother.

Fix up the testcase accordingly and verify that we don't emit
the attributes or the items that they use.

llvm-svn: 179114

52ce7189

Cleanup PPCEarlyReturn · 21aad9a8

Hal Finkel authored Apr 09, 2013

Some general cleanup and only scan the end of a BB for branches (once we're
done with the terminators and debug values, then there should not be any other
branches). These address post-commit review suggestions by Bill Schmidt.

No functionality change intended.

llvm-svn: 179112

21aad9a8

Revert r176408 and r176407 to address PR15540. · abcc64fd
Nadav Rotem authored Apr 09, 2013
```
llvm-svn: 179111
```
abcc64fd

[ms-inline asm] Maintain a StringRef to reference a symbol in a parsed operand, · e81309b3

Chad Rosier authored Apr 09, 2013

rather than deriving the StringRef from the Start and End SMLocs.

Using the Start and End SMLocs works fine for operands such as [Symbol], but
not for operands such as [Symbol + ImmDisp].  All existing test cases that
reference a variable exercise this patch.
rdar://13602265

llvm-svn: 179109

e81309b3

DAGCombiner: Fold a shuffle on CONCAT_VECTORS into a new CONCAT_VECTORS if possible. · bbae991d

Benjamin Kramer authored Apr 09, 2013

This pattern occurs in SROA output due to the way vector arguments are lowered
on ARM.

The testcase from PR15525 now compiles into this, which is better than the code
we got with the old scalarrepl:
_Store:
	ldr.w	r9, [sp]
	vmov	d17, r3, r9
	vmov	d16, r1, r2
	vst1.8	{d16, d17}, [r0]
	bx	lr

Differential Revision: http://llvm-reviews.chandlerc.com/D647

llvm-svn: 179106

bbae991d

Use virtual base registers on PPC · b5899d57

Hal Finkel authored Apr 09, 2013

On PowerPC, non-vector loads and stores have r+i forms; however, in functions
with large stack frames these were not being used to access slots far from the
stack pointer because such slots were out of range for the signed 16-bit
immediate offset field. This increases register pressure because we need a
separate register for each offset (when the r+r form is used). By enabling
virtual base registers, we can deal with large stack frames without unduly
increasing register pressure.

llvm-svn: 179105

b5899d57

Convert test PowerPC/2007-09-07-LoadStoreIdxForms to FileCheck · 059825b0
Hal Finkel authored Apr 09, 2013
```
llvm-svn: 179104
```
059825b0

Rewrite test/Linker tests to use FileCheck instead of grep. · 1cc814a8

Eli Bendersky authored Apr 09, 2013

Some translations here are not 1x1 because there are grep|grep
chains that are non-trivial to implement in terms of FileCheck features. I
made an effort for the tests to remain as similar as possible; do let me know
if you notice anything fishy. The good news are that some buggy tests were
fixed (grep | not grep - a bug waiting to happen).

llvm-svn: 179102

1cc814a8

Convert MachOObjectFile to a template. · c2413f59

Rafael Espindola authored Apr 09, 2013

For now it is templated only on being 64 or 32 bits. I will add little/big
endian next.

llvm-svn: 179097

c2413f59

DWARF parser: Fix DWARF-2/3 incompatibility: size of DW_FORM_ref_addr is the... · d60859b2

Alexey Samsonov authored Apr 09, 2013

DWARF parser: Fix DWARF-2/3 incompatibility: size of DW_FORM_ref_addr is the same as DW_FORM_addr in DWARF2, and is 4/8 bytes on 32/64-bit DWARF starting from DWARF3. Adding a test for this is a huge pain - generating and uploading pre-built binary with DWARF3 debug info is way too ugly, and writing fine-grained unittests for DebugInfo is impossible, as it doesn't expose any headers in include/llvm. That said, I'm going to choose the second approach and submit the patch exposing DebugInfo headers for review soon enough.

llvm-svn: 179095

d60859b2

Converted 8x tests of SimplifyCFG to use FileCheck instead of grep. · ccc93e72
Michael Gottesman authored Apr 09, 2013
```
llvm-svn: 179087
```
ccc93e72
Extract a function. · c910feb4
Jakob Stoklund Olesen authored Apr 09, 2013
```
llvm-svn: 179086
```
c910feb4
Remove the confusing sentence. · 757aec95
Nadav Rotem authored Apr 09, 2013
```
llvm-svn: 179085
```
757aec95
Revert 179071 because it is not the right way to support non standard new/new[] operators. · 7b7585d1
Nadav Rotem authored Apr 09, 2013
```
llvm-svn: 179084
```
7b7585d1

Compute correct frame sizes for SPARC v9 64-bit frames. · 2cfe46fd

Jakob Stoklund Olesen authored Apr 09, 2013

The save area is twice as big and there is no struct return slot. The
stack pointer is always 16-byte aligned (after adding the bias).

Also eliminate the stack adjustment instructions around calls when the
function has a reserved stack frame.

llvm-svn: 179083

2cfe46fd

More uses for SymbolTableEntryBase. · eb8b211e
Rafael Espindola authored Apr 09, 2013
```
llvm-svn: 179076
```
eb8b211e

Add a SymbolTableEntryBase. · 5d6cec9b

Rafael Espindola authored Apr 09, 2013

Use it when we don't need to know if we have a 32 or 64 bit SymbolTableEntry.

llvm-svn: 179074

5d6cec9b

Fix PointerIntPair to be enum class compatible. · 6cdbe3f6

Joe Groff authored Apr 09, 2013

Some parts of PointerIntPair assumed that the IntType of the pair was implicitly
convertible to intptr_t, which is not the case for enum class values. Add a
static_cast<intptr_t> to make these conversions explicit and allow
PointerIntPair to be used with an enum class IntType. While we're here, rename
some of the argument values so we don't have variables named "Int" floating
around.

llvm-svn: 179073

6cdbe3f6