Commits · c86fdf12e85215777cfdc1d5074e73b0118fa42f · Roger Ferrer / llvm-epi-0.8

Apr 11, 2013

Rename the C function to create a SLPVectorizerPass to something sane and... · c86fdf12
Benjamin Kramer authored Apr 11, 2013
```
Rename the C function to create a SLPVectorizerPass to something sane and expose it in the header file.

llvm-svn: 179272
```
c86fdf12

Optimize vector select from all 0s or all 1s · 55658d42

Michael Liao authored Apr 11, 2013

As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane,
vector select could be simplified to AND/OR or removed if one or both values
being selected are all 0s or all 1s.

llvm-svn: 179267

55658d42
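
A minimal sketch of the identity behind this simplification, assuming each lane of the mask is either all 0s or all 1s. The `Lanes` type and the three function names are illustrative only and are not taken from the patch.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Four 32-bit lanes standing in for one SSE register (illustrative type).
using Lanes = std::array<std::uint32_t, 4>;

// General bitwise select: bits of a where the mask is set, bits of b elsewhere.
Lanes selectGeneral(const Lanes &mask, const Lanes &a, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i) {
    // Precondition: a packed comparison yields all 0s or all 1s per lane.
    assert(mask[i] == 0u || mask[i] == 0xFFFFFFFFu);
    r[i] = (mask[i] & a[i]) | (~mask[i] & b[i]);
  }
  return r;
}

// select(mask, a, all-zeros) needs only an AND.
Lanes selectFalseIsZero(const Lanes &mask, const Lanes &a) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i)
    r[i] = mask[i] & a[i];
  return r;
}

// select(mask, all-ones, b) needs only an OR.
Lanes selectTrueIsOnes(const Lanes &mask, const Lanes &b) {
  Lanes r{};
  for (std::size_t i = 0; i < r.size(); ++i)
    r[i] = mask[i] | b[i];
  return r;
}
```

The two specialised forms agree with the general select whenever the constant operand really is all 0s or all 1s.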

Add CLAC/STAC instruction encoding/decoding support · 95d94403

Michael Liao authored Apr 11, 2013

As these two instructions in the SMAP extension are privileged, special-purpose
instructions, they are only expected to be used in inline assembly.

llvm-svn: 179266

95d94403

Enhance bool simplification in X86 to handle more cases · f7bf8705

Michael Liao authored Apr 11, 2013

This patch is revised based on a patch from Victor Umansky
<victor.umansky@intel.com>. More cases are handled in X86's bool
simplification, i.e.
- SETCC_CARRY
- value is truncated to i1 with AND

As a by-product, PR5443 is also fixed.

llvm-svn: 179265

f7bf8705

R600ControlFlowFinalizer.cpp: Fix a warning. [-Wunused-variable] · 3ee2b1e2
NAKAMURA Takumi authored Apr 11, 2013
```
llvm-svn: 179263
```
3ee2b1e2
Whitespace. · 3b0853be
NAKAMURA Takumi authored Apr 11, 2013
```
llvm-svn: 179262
```
3b0853be

Fix MachO's getRelocationAdditionalInfo. · e410099a

Rafael Espindola authored Apr 11, 2013

It was returning the loaded address of the section containing the relocation,
which really doesn't seem to be the intent of this function.

llvm-svn: 179255

e410099a

Make PPCInstrInfo::isPredicated always return false · f29285a4

Hal Finkel authored Apr 11, 2013

Because of how predication is implemented on PPC (only for branches), I think
that this is the right thing to do.  No functionality change intended.

llvm-svn: 179252

f29285a4

MC: Support COFF image-relative MCSymbolRefs · 1da4529b

Nico Rieck authored Apr 10, 2013

Add support for the COFF relocation types IMAGE_REL_I386_DIR32NB and
IMAGE_REL_AMD64_ADDR32NB for 32- and 64-bit respectively. These are
similar to normal 4-byte relocations except that they do not include
the base address of the image.

Image-relative relocations are used for debug information (32-bit) and
SEH unwind tables (64-bit).

A new MCSymbolRef variant called 'VK_COFF_IMGREL32' is introduced to
specify such relocations. For AT&T assembly, this variant can be accessed
using the symbol suffix '@imgrel'.

llvm-svn: 179240

1da4529b
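
A minimal sketch of what "does not include the base address of the image" means for the stored value, assuming the target address and image base are already known. The helper name `imageRelative32` is hypothetical and is not part of MC or of any linker.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: the 4-byte value stored for an image-relative
// relocation, i.e. the target's address with the image base subtracted.
std::uint32_t imageRelative32(std::uint64_t targetVA, std::uint64_t imageBase) {
  assert(targetVA >= imageBase && "target must lie inside the image");
  std::uint64_t offset = targetVA - imageBase;
  assert(offset <= 0xFFFFFFFFu && "offset must fit in 32 bits");
  return static_cast<std::uint32_t>(offset);
}
```

For instance, a symbol at 0x140001234 in an image based at 0x140000000 would be recorded as 0x1234, whereas a normal 4-byte relocation would carry the base as well.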

Manually remove successors in if conversion when CopyAndPredicateBlock is used · 95081bff

Hal Finkel authored Apr 10, 2013

In the simple and triangle if-conversion cases, when CopyAndPredicateBlock is
used because the to-be-predicated block has other predecessors, we need to
explicitly remove the old copied block from the successors list. Normally if
conversion relies on TII->AnalyzeBranch combined with BB->CorrectExtraCFGEdges
to clean up the successors list, but if the predicated block contained an
un-analyzable branch (such as a now-predicated return), then this will fail.

These extra successors were a problem on PPC because they caused later passes
(such as PPCEarlyReturn) to leave dead return-only basic blocks in the code.

llvm-svn: 179227

95081bff

No need to have this return a bool. · 5be12a14
Bill Wendling authored Apr 10, 2013
```
llvm-svn: 179226
```
5be12a14

Apr 10, 2013

fixed xsave, xsaveopt, xrstor mnemonics with intel syntax; added test cases · 394bf148
Kay Tiong Khoo authored Apr 10, 2013
```
llvm-svn: 179223
```
394bf148

Track the compact unwind encoding for when we are unable to generate compact unwind information. · 2d1df6be

Bill Wendling authored Apr 10, 2013

Compact unwind has an encoding for when we're not able to generate compact
unwind and must generate an EH frame instead. Track that, but still emit that CU
encoding.

llvm-svn: 179220

2d1df6be

fixed to disassemble with tab after mnemonic rather than space · 6f76c210
Kay Tiong Khoo authored Apr 10, 2013
```
llvm-svn: 179215
```
6f76c210

· ddf96b50

Preston Gurd authored Apr 10, 2013

In the X86 back end, getMemoryOperandNo() returns the offset
into the operand array of the start of the memory reference descriptor.

Additional code in EncodeInstruction provides an additional adjustment.

This patch places that additional code in a separate function,
called getOperandBias, so that any caller of getMemoryOperandNo
can also call getOperandBias.

llvm-svn: 179211

ddf96b50

Tidy up, fix and simplify a few of the SMLocs. Prior to r179109 the Start SMLoc · 70f47596

Chad Rosier authored Apr 10, 2013

wasn't always the start of the operand.  If there was a symbol reference, then
Start pointed to that token.  It's very likely there are other places that need
to be updated.

llvm-svn: 179210

70f47596

Make the SLP store-merger less paranoid about function calls. We check for... · 73dffa41

Nadav Rotem authored Apr 10, 2013

Make the SLP store-merger less paranoid about function calls. We check for function calls when we check if it is safe to sink instructions.

llvm-svn: 179207

73dffa41

We require DataLayout for analyzing the size of stores. · 88dd5f7a
Nadav Rotem authored Apr 10, 2013
```
llvm-svn: 179206
```
88dd5f7a
Remove unused variable. · 53eb7d79
Chad Rosier authored Apr 10, 2013
```
llvm-svn: 179205
```
53eb7d79

PPC: Don't predicate a diamond with two counter decrements · 30ae2291

Hal Finkel authored Apr 10, 2013

I've not seen this happen in practice, and probably can't until we start
allowing decrement-counter-based conditional branches to be double predicated,
but just in case, don't allow predication of a diamond in which both sides have
ctr-defining branches. Even though the branching behavior of these can be
predicated, the counter-decrementing behavior cannot be.

llvm-svn: 179199

30ae2291

Reapply r179115, but use parsePrimaryExpression a little more judiciously. · 1863f4f4

Chad Rosier authored Apr 10, 2013

Test cases that regressed due to r179115, plus a few more, were added in
r179182.  Original commit message below:

[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to
parse an identifier.  Otherwise, parseExpression may parse multiple tokens,
which makes it impossible to properly compute an immediate displacement.
An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in
the below example:

 __asm mov eax, [Symbol + ImmDisp]

Part of rdar://13611297

llvm-svn: 179187

1863f4f4
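
A toy illustration of why stopping after a single primary token helps, assuming a plain string operand. The function `parsePrimaryIdentifier` is a hypothetical stand-in and does not reproduce the real asm parser's interface.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Consume exactly one identifier starting at pos and advance pos past it.
// Anything after the identifier (such as "+ ImmDisp") is left unparsed.
std::string parsePrimaryIdentifier(const std::string &s, std::size_t &pos) {
  // Skip leading whitespace.
  while (pos < s.size() && std::isspace(static_cast<unsigned char>(s[pos])))
    ++pos;
  std::size_t start = pos;
  // Identifier characters: letters, digits and underscore.
  while (pos < s.size() &&
         (std::isalnum(static_cast<unsigned char>(s[pos])) || s[pos] == '_'))
    ++pos;
  return s.substr(start, pos - start);
}
```

Because the cursor stops right after `Symbol`, the caller still sees the `+` and the displacement and can compute the immediate separately. A full expression parser would have consumed them as part of one expression.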

R600/SI: Add pattern for AMDGPUurecip · 8caa904b

Michel Danzer authored Apr 10, 2013



21 more little piglits with radeonsi.

Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 179186

8caa904b

This is for an experimental option -mips-os16. The idea is to compile all · fe94cc3e

Reed Kotler authored Apr 10, 2013

Mips32 code as Mips16 unless it can't be compiled as Mips 16. For now this
would happen as long as floating point instructions are not needed.
Probably it would also make sense to compile as mips32 if atomic operations
are needed too. There may be other cases too.

A module pass prescans the IR and adds the mips16 or nomips16 attribute
to functions depending on the function's needs.

Mips 16 mode can result in a 40% code compression by utilizing 16-bit
encoding of many instructions.

The hope is for this to replace the traditional gcc way of dealing with
Mips16 code using floating point which involves essentially using soft float
but with a library implemented using mips32 floating point. This gcc 
method also requires creating stubs so that Mips32 code can interact with
these Mips 16 functions that have floating point needs. My conjecture is
that in reality this traditional gcc method would never win over this
new method.

I will be implementing the traditional gcc method also. Some of it is already
done but I needed to do the stubs to finish the work and those required
this mips16/32 mixed mode capability.

I have more ideas to make this new method much better and I think the old
method will just live in llvm for anyone that needs the backward compatibility
but I don't know for what reason that would be needed.

llvm-svn: 179185

fe94cc3e
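
A rough sketch of the prescan described above, assuming the only criterion is whether a function needs floating point. `ToyFunction` and `assignMips16Attrs` are hypothetical names; the real pass works on LLVM IR, which is not modelled here.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Toy stand-in for an IR function; not LLVM's Function class.
struct ToyFunction {
  std::string name;
  bool needsFloatingPoint;
  std::string modeAttr; // filled in by the prescan below
};

// Tag every function: Mips16 unless floating point forces Mips32.
void assignMips16Attrs(std::vector<ToyFunction> &module) {
  for (ToyFunction &f : module) {
    assert(f.modeAttr.empty() && "function was already tagged");
    f.modeAttr = f.needsFloatingPoint ? "nomips16" : "mips16";
  }
}
```

Further criteria mentioned in the message, such as atomic operations, would become extra conditions on the same per-function decision.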

Use a scheme closer to that of GNU as when deciding the type of a · adac407e

Peter Collingbourne authored Apr 10, 2013

symbol with multiple .type declarations.

Differential Revision: http://llvm-reviews.chandlerc.com/D607

llvm-svn: 179184

adac407e

Template MachOObjectFile over endianness too. · 641c9bcf
Rafael Espindola authored Apr 10, 2013
```
llvm-svn: 179179
```
641c9bcf
R600: Add VTX_READ_* and RAT_WRITE_CACHELESS_* when computing cf addr · 04d9aa48
Vincent Lejeune authored Apr 10, 2013
```
llvm-svn: 179174
```
04d9aa48

ARM: Make "SMC" instructions conditional on new TrustZone architecture feature. · c6047655

Tim Northover authored Apr 10, 2013

These instructions aren't universally available, but depend on a specific
extension to the normal ARM architecture (rather than, say, v6/v7/...) so a new
feature is appropriate.

This also enables the feature by default on A-class cores which usually have
these extensions, to avoid breaking existing code and act as a sensible
default.

llvm-svn: 179171

c6047655

Change CloneFunctionInto to always clone Argument attributes individually, · 81259294

Joey Gouly authored Apr 10, 2013

rather than checking if the source and destination have the same number of
arguments and copying the attributes over directly.

llvm-svn: 179169

81259294

R600/SI: dynamically figure out the reg class of MIMG · 8b1ed28e

Christian Konig authored Apr 10, 2013



Depending on the number of bits set in the writemask.

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179166

8b1ed28e
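
A minimal sketch of the idea, assuming a 4-bit writemask with one bit per component and 32 bits per component. Both function names are illustrative and the mapping to actual register classes is not shown.

```cpp
#include <bitset>
#include <cassert>

// Number of result components: one per set writemask bit (x, y, z, w).
unsigned componentsFromWritemask(unsigned writemask) {
  assert(writemask != 0u && writemask <= 0xFu);
  return static_cast<unsigned>(std::bitset<4>(writemask).count());
}

// Width in bits of the result register, assuming 32 bits per component.
unsigned resultRegisterBits(unsigned writemask) {
  return 32u * componentsFromWritemask(writemask);
}
```

A writemask of 0x5 (x and z) would then need a 64-bit result register instead of the full 128 bits.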

R600/SI: adjust writemask to only the used components · 8e06e2a8

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179165

8e06e2a8

R600/SI: remove image sample writemask · 4ace6632

Christian Konig authored Apr 10, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 179164

4ace6632

Cleanup PPCInstrInfo::DefinesPredicate · af822018

Hal Finkel authored Apr 10, 2013

Implement suggestions made by Bill Schmidt in post-commit review. Thanks!

llvm-svn: 179162

af822018

RegionInfo: Add helpers to replace entry/exit recursively · 141cc3e8
Tobias Grosser authored Apr 10, 2013
```
Contributed by: Star Tan <tanmx_star@yeah.net>

llvm-svn: 179157
```
141cc3e8

PPC: Prep for if conversion of bctr[l] · 500b0045

Hal Finkel authored Apr 10, 2013

This adds in-principle support for if-converting the bctr[l] instructions.
These instructions are used for indirect branching. It seems, however, that the
current if converter will never actually predicate these. To do so, it would
need the ability to hoist a few setup insts. out of the conditionally-executed
block. For example, code like this:
  void foo(int a, int (*bar)()) { if (a != 0) bar(); }
becomes:
        ...
        beq 0, .LBB0_2
        std 2, 40(1)
        mr 12, 4
        ld 3, 0(4)
        ld 11, 16(4)
        ld 2, 8(4)
        mtctr 3
        bctrl
        ld 2, 40(1)
.LBB0_2:
        ...
and it would be safe to do all of this unconditionally with a predicated
beqctrl instruction.

llvm-svn: 179156

500b0045

Template the MachO types over endianness. · eaae687d
Rafael Espindola authored Apr 10, 2013
```
For now they are still only used as little endian.

llvm-svn: 179147
```
eaae687d
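
A small sketch of what templating a file-format type over endianness can look like. `readU32` and `ToyHeader` are hypothetical and do not mirror the actual MachO classes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Read a 32-bit field from raw bytes; byte order is a compile-time parameter.
template <bool IsLittleEndian>
std::uint32_t readU32(const unsigned char *p) {
  assert(p != nullptr);
  std::uint32_t v = 0;
  for (std::size_t i = 0; i < 4; ++i) {
    // Little endian: byte 0 is least significant. Big endian: most significant.
    std::size_t shift = IsLittleEndian ? 8 * i : 8 * (3 - i);
    v |= static_cast<std::uint32_t>(p[i]) << shift;
  }
  return v;
}

// Toy header view templated over endianness (not LLVM's MachO types).
template <bool IsLittleEndian>
struct ToyHeader {
  const unsigned char *data;
  std::uint32_t magic() const { return readU32<IsLittleEndian>(data); }
};
```

The same four bytes decode to different values depending on the template argument, so accessor code can be written once and instantiated per byte order.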
__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in · ac0469c5
Evan Cheng authored Apr 10, 2013
```
xmm0 / xmm1.

rdar://13599493

llvm-svn: 179141
```
ac0469c5
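
A sketch of the layout this message describes, modelling only the low 64 bits of xmm0 as an integer. `packSinCos` and `unpackSinCos` are illustrative helpers, not part of any runtime library.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == 4, "sketch assumes 32-bit float");

struct SinCos {
  float sinValue;
  float cosValue;
};

// Place the sine in bits 0:31 and the cosine in bits 32:63.
std::uint64_t packSinCos(float s, float c) {
  std::uint32_t lo, hi;
  std::memcpy(&lo, &s, sizeof lo);
  std::memcpy(&hi, &c, sizeof hi);
  return static_cast<std::uint64_t>(lo) | (static_cast<std::uint64_t>(hi) << 32);
}

// Recover both results from the single 64-bit value.
SinCos unpackSinCos(std::uint64_t bits) {
  std::uint32_t lo = static_cast<std::uint32_t>(bits);
  std::uint32_t hi = static_cast<std::uint32_t>(bits >> 32);
  SinCos r;
  std::memcpy(&r.sinValue, &lo, sizeof lo);
  std::memcpy(&r.cosValue, &hi, sizeof hi);
  return r;
}
```

Code that expected the cosine in a second register would read the wrong value, which is the mistake this commit corrects.
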

Generalize the PassConfig API and remove addFinalizeRegAlloc(). · e220323c

Andrew Trick authored Apr 10, 2013

The target hooks are getting out of hand. What does it mean to run
before or after regalloc anyway? Allowing either Pass* or AnalysisID
pass identification should make it much easier for targets to use the
substitutePass and insertPass APIs, and create less need for badly
named target hooks.

llvm-svn: 179140

e220323c

Mips specific inline asm operand modifier 'D' · b04e357d

Jack Carter authored Apr 09, 2013

Modifier 'D' is to use the second word of a double integer.

We had previously implemented the pure register variant of
the modifier and this patch implements the memory reference.



#include <stdio.h>

int b[8] = {0,1,2,3,4,5,6,7};

int main(void)
{
    int i;

    // The first word. Notice, no 'D'
    {asm (
    "lw    %0,%1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}

    printf("%d\n",i);

    // The second word
    {asm (
    "lw    %0,%D1;"
    : "=r" (i)
    : "m" (*(b+4))
    );}

    printf("%d\n",i);

    return 0;
}

llvm-svn: 179135

b04e357d

Allow PPC B and BLR to be if-converted into some predicated forms · 5711eca1

Hal Finkel authored Apr 09, 2013

This enables us to form predicated branches (which are the same conditional
branches we had before) and also a larger set of predicated returns (including
instructions like bdnzlr which is a conditional return and loop-counter
decrement all in one).

At the moment, if conversion does not capture all possible opportunities. A
simple example is provided in early-ret2.ll, where if conversion forms one
predicated return, and then the PPCEarlyReturn pass picks up the other one. So,
at least for now, we'll keep both mechanisms.

llvm-svn: 179134

5711eca1
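
A toy model of the combined behaviour of bdnzlr mentioned above: the count register is decremented, and the return is taken only if the result is nonzero. `ToyCpu` and the function below are illustrative and are not backend code.

```cpp
#include <cassert>
#include <cstdint>

// Toy register state; only the count register matters here.
struct ToyCpu {
  std::uint64_t ctr;
};

// Model of bdnzlr: decrement CTR, then return (branch to LR) only if CTR
// is nonzero. Returns true when the return is taken.
bool bdnzlr(ToyCpu &cpu) {
  cpu.ctr -= 1; // the decrement happens whether or not the return is taken
  return cpu.ctr != 0;
}
```

The decrement is a side effect independent of the branch outcome, so it is not covered by predicating the branch.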

Fix some comment typos. · 798a7709
Bob Wilson authored Apr 09, 2013
```
llvm-svn: 179132
```
798a7709