Commits · 20a7e6b76424283ec5a6a05cd47a4e8952d62363 · Roger Ferrer / llvm-epi-0.8

Mar 27, 2013

R600/SI: add srl/sha patterns for SI · 20a7e6b7

Christian Konig authored Mar 27, 2013



Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 178125

20a7e6b7

Allocate r0 on PPC · 0f77861d

Hal Finkel authored Mar 27, 2013

The R0 register can now be allocated because instructions
that cannot use R0 as a GPR have been appropriately marked.

llvm-svn: 178123

0f77861d

Use the PPC no-r0 class on the TOC LD pseudos · 573fc28d

Hal Finkel authored Mar 27, 2013

The register parameter in these instructions becomes the base register in an
r+i ld instruction (and, thus, cannot be r0).

This is not yet testable because we don't yet allocate r0 (and even then any
test would be very fragile).

llvm-svn: 178121

573fc28d

Apply the no-r0 register class to the PPC SELECT_CC_I[4|8] pseudos · 3fa362a5

Hal Finkel authored Mar 27, 2013

Either operand of these pseudo instructions can be transformed into the first
operand of an isel instruction (and this operand cannot be r0).

This is not yet testable because we don't yet allocate r0 (and even when we do,
any test would be very fragile).

llvm-svn: 178119

3fa362a5

Apply the no-r0 class to PPC TOC ADDI[S] pseudo instructions · 42a312b2

Hal Finkel authored Mar 27, 2013

Like the addi/addis instructions themselves, these pseudo instructions also
cannot have r0 as their register parameter (because it will be interpreted as
the value 0).

This is not yet testable because we don't yet allocate r0 (and even when we do,
any regression test would be very fragile because it would depend on the
register allocator heuristics).

llvm-svn: 178118

42a312b2

Remove the link register from the GPR classes on PowerPC. · a1b72d0f

Bill Schmidt authored Mar 27, 2013

Some implementation detail in the forgotten past required the link
register to be placed in the GPRC and G8RC register classes.  This is
just wrong on the face of it, and causes several extra intersection
register classes to be generated.  I found this was having evil
effects on instruction scheduling, by causing the wrong register class
to be consulted for register pressure decisions.

No code generation changes are expected, other than some minor changes
in instruction order.  Seven tests in the test bucket required minor
tweaks to adjust to the new normal.

llvm-svn: 178114

a1b72d0f

Don't spill PPC VRSAVE on non-Darwin (even in SjLj) · a7b0630b

Hal Finkel authored Mar 27, 2013

As Bill Schmidt pointed out to me, only on Darwin do we need to spill/restore
VRSAVE in the SjLj code. For non-Darwin, don't spill/restore VRSAVE (and I've
added some asserts to make sure that we're not).

As it turns out, we're not currently handling the Darwin case correctly (I've
added a FIXME in the test case). I've tried adding various implied register
definitions/uses to force the spill without success, so I'll need to address
this later.

llvm-svn: 178096

a7b0630b

Mar 26, 2013

Add XTEST codegen support · 03f9ad0e
Michael Liao authored Mar 26, 2013
```
llvm-svn: 178083
```
03f9ad0e
Add HLE target feature · e344ec91
Michael Liao authored Mar 26, 2013
```
llvm-svn: 178082
```
e344ec91

Enable SandyBridgeModel for all modern Intel P6 descendants. · 1ac7e662

Jakob Stoklund Olesen authored Mar 26, 2013

All Intel CPUs since Yonah look a lot alike, at least at the granularity
of the scheduling models. We can add more accurate models for
processors that aren't Sandy Bridge if required. Haswell will probably
need its own.

The Atom processor and anything based on NetBurst is completely
different. So are the non-Intel chips.

llvm-svn: 178080

1ac7e662

Restore real bit lengths on PPC register numbers · 567fa62d

Hal Finkel authored Mar 26, 2013

As suggested by Bill Schmidt (in reviewing r178067), use the real register
number bit lengths (which is self-documenting, and prevents using illegal
numbers), and set only the relevant bits in HWEncoding (which defaults to 0).

No functionality change intended.

llvm-svn: 178077

567fa62d

PPC: Use HWEncoding and TRI->getEncodingValue · feea6539

Hal Finkel authored Mar 26, 2013

As pointed out by Jakob, we don't need to maintain a separate
register-numbering table. Instead we should let TableGen generate the table for
us from the information (already present) in PPCRegisterInfo.td.
TRI->getEncodingValue is now used to access register-encoding values.

No functionality change intended.

llvm-svn: 178067

feea6539

R600/SIMCCodeEmitter.cpp: Prune a couple of unused members, STI and Ctx. [-Wunused-private-field] · 3234178b
NAKAMURA Takumi authored Mar 26, 2013
```
llvm-svn: 178065
```
3234178b

Use multiple virtual registers in PPC CR spilling · 0dfbb05a

Hal Finkel authored Mar 26, 2013

Now that the register scavenger can support multiple spill slots, and PEI can
use virtual-register-based scavenging for multiple simultaneous registers, we
can use a virtual register for the transfer register in the CR spilling code.

This should eliminate the last place (outside of the prologue/epilogue) where
we depend on the unconditional availability of the r0 register. We will soon be
able to allocate it (in a somewhat restricted sense) as a GPR.

llvm-svn: 178060

0dfbb05a

Update PPCRegisterInfo's use of virtual registers to be SSA · d8a423cd

Hal Finkel authored Mar 26, 2013

PPC's use of PEI's virtual-register-based scavenging functionality had
redefined the virtual registers (it was non-SSA). Now that PEI supports
dealing with instructions with multiple virtual registers, this can be
cleanup up to use multiple virtual registers and keep SSA form.

No functionality change intended.

llvm-svn: 178059

d8a423cd

Annotate the remaining x86 instructions with SchedRW lists. · e440d476

Jakob Stoklund Olesen authored Mar 26, 2013

Now all x86 instructions that have itinerary classes also have SchedRW
lists. This is required before the new scheduling models can be used.

There are still unannotated instructions remaining, but they don't have
itinerary classes either.

llvm-svn: 178051

e440d476

Annotate x87 and mmx instructions with SchedRW lists. · 267dd946
Jakob Stoklund Olesen authored Mar 26, 2013
```
This only covers the instructions that were given itinerary classes for
the Atom model.

llvm-svn: 178050
```
267dd946
Annotate control instructions with SchedRW lists. · d59419eb
Jakob Stoklund Olesen authored Mar 26, 2013
```
This could definitely be more granular. I am not sure if it makes a
difference.

llvm-svn: 178049
```
d59419eb
Annotate the rest of X86InstrInfo.td with SchedRW lists. · 7c8a760d
Jakob Stoklund Olesen authored Mar 26, 2013
```
llvm-svn: 178048
```
7c8a760d
Add PREFETCHW codegen support · 5173ee03
Michael Liao authored Mar 26, 2013
```
- Add 'PRFCHW' feature defined in AVX2 ISA extension

llvm-svn: 178040
```
5173ee03
Hexagon: Use multiclass for aslh, asrh, sxtb, sxth, zxtb and zxth. · 15957b12
Jyotsna Verma authored Mar 26, 2013
```
llvm-svn: 178032
```
15957b12
Hexagon: Remove HexagonMCInst.h file. It has been replaced with MCTargetDesc/HexagonMCInst.h. · f299668a
Jyotsna Verma authored Mar 26, 2013
```
llvm-svn: 178030
```
f299668a

Revert ARM Scheduler Model: Add resources instructions, map resources · 414ef565

Arnold Schwaighofer authored Mar 26, 2013

This reverts commit r177968. It is causing failures in a local build bot.

"fatal error: error in backend: Expected a variant SchedClass"

Original commit message:
Move the CortexA9 resources into the CortexA9 SchedModel namespace. Define
resource mappings under the CortexA9 SchedModel. Define resources and mappings
for the SwiftModel.

llvm-svn: 178028

414ef565

Remove default case from fully covered switch. · cf3d5aae
Benjamin Kramer authored Mar 26, 2013
```
llvm-svn: 178025
```
cf3d5aae

R600/SI: improve post ISel folding · 8370dbbf

Christian Konig authored Mar 26, 2013



Not only fold immediates, but avoid unnecessary copies as well.

Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178024

8370dbbf

R600/SI: improve vector interpolation · 082c661f

Christian Konig authored Mar 26, 2013



Prevent loading M0 multiple times.

Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178023

082c661f

R600/SI: avoid unecessary subreg extraction in IMAGE_SAMPLE · 25ce3e9f

Christian Konig authored Mar 26, 2013



Just define the address as unknown instead of VReg_32.

Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178022

25ce3e9f

R600/SI: switch back to RegPressure scheduling · eecebd0b
Christian Konig authored Mar 26, 2013
```
Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178021
```
eecebd0b

R600/SI: mark most intrinsics as readnone v2 · 727d06de

Christian Konig authored Mar 26, 2013



They read from constant register space anyway.

v2: fix lit tests

Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178020

727d06de

R600/SI: replace WQM intrinsic · 737d4a16

Christian Konig authored Mar 26, 2013



Just enable WQM when we see an LDS interpolation instruction.

Signed-off-by: Christian König <christian.koenig@amd.com>
llvm-svn: 178019

737d4a16

R600/SI: fix ELSE pseudo op handling · 6a9d390b

Christian Konig authored Mar 26, 2013



Restore the EXEC mask early, otherwise a copy might end up not beeing executed.

Candidate for the mesa stable branch.

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Tested-by: Michel Dänzer <michel.daenzer@amd.com>
llvm-svn: 178018

6a9d390b

Patch by Gordon Keiser! · f686be46

Joe Abbey authored Mar 26, 2013

If PC or SP is the destination, the disassembler erroneously failed with the
invalid encoding, despite the manual saying that both are fine.

This patch addresses failure to decode encoding T4 of LDR (A8.8.62) which is a
postindexed load, where the offset 0xc is applied to SP after the load occurs.

llvm-svn: 178017

f686be46

PowerPC: Mark patterns as isCodeGenOnly. · bbfb0c55

Ulrich Weigand authored Mar 26, 2013

There remain a number of patterns that cannot (and should not)
be handled by the asm parser, in particular all the Pseudo patterns.

This commit marks those patterns as isCodeGenOnly.

No change in generated code.

llvm-svn: 178008

bbfb0c55

PowerPC: Simplify handling of fixups. · 3e186015

Ulrich Weigand authored Mar 26, 2013

MCTargetDesc/PPCMCCodeEmitter.cpp current has code like:

 if (isSVR4ABI() && is64BitMode())
   Fixups.push_back(MCFixup::Create(0, MO.getExpr(),
                                    (MCFixupKind)PPC::fixup_ppc_toc16));
 else
   Fixups.push_back(MCFixup::Create(0, MO.getExpr(),
                                    (MCFixupKind)PPC::fixup_ppc_lo16));

This is a problem for the asm parser, since it requires knowledge of
the ABI / 64-bit mode to be set up.  However, more fundamentally,
at this point we shouldn't make such distinctions anyway; in an assembler
file, it always ought to be possible to e.g. generate TOC relocations even
when the main ABI is one that doesn't use TOC.

Fortunately, this is actually completely unnecessary; that code was added
to decide whether to generate TOC relocations, but that information is in
fact already encoded in the VariantKind of the underlying symbol.

This commit therefore merges those fixup types into one, and then decides
which relocation to use based on the VariantKind.

No changes in generated code.

llvm-svn: 178007

3e186015

PowerPC: Simplify FADD in round-to-zero mode. · 874fc628

Ulrich Weigand authored Mar 26, 2013

As part of the the sequence generated to implement long double -> int
conversions, we need to perform an FADD in round-to-zero mode.  This is
problematical since the FPSCR is not at all modeled at the SelectionDAG
level, and thus there is a risk of getting floating point instructions
generated out of sequence with the instructions to modify FPSCR.

The current code handles this by somewhat "special" patterns that in part
have dummy operands, and/or duplicate existing instructions, making them
awkward to handle in the asm parser.

This commit changes this by leaving the "FADD in round-to-zero mode"
as an atomic operation on the SelectionDAG level, and only split it up into
real instructions at the MI level (via custom inserter).  Since at *this*
level the FPSCR *is* modeled (via the "RM" hard register), much of the
"special" stuff can just go away, and the resulting patterns can be used by
the asm parser.

No significant change in generated code expected.

llvm-svn: 178006

874fc628

PowerPC: Remove LDrs pattern. · 4a083886

Ulrich Weigand authored Mar 26, 2013

The LDrs pattern is a duplicate of LD, except that it accepts memory
addresses where the displacement is a symbolLo64.  An operand type
"memrs" is defined for just that purpose.

However, this wouldn't be necessary if the default "memrix" operand
type were to simply accept 64-bit symbolic addresses directly.
The only problem with that is that it uses "symbolLo", which is
hardcoded to 32-bit.

To fix this, this commit changes "memri" and "memrix" to use new
operand types for the memory displacement, which allow iPTR
instead of i32.  This will also make address parsing easier to
implment in the asm parser.

No change in generated code.

llvm-svn: 178005

4a083886

PowerPC: Remove ADDIL patterns. · 35f9fdfd

Ulrich Weigand authored Mar 26, 2013

The ADDI/ADDI8 patterns are currently duplicated into ADDIL/ADDI8L,
which describe the same instruction, except that they accept a
symbolLo[64] operand instead of a s16imm[64] operand.

This duplication confuses the asm parser, and it actually not really
needed, since symbolLo[64] already accepts immediate operands anyway.
So this commit removes the duplicate patterns.

No change in generated code.

llvm-svn: 178004

35f9fdfd

PowerPC: Use CCBITRC operand for ISEL patterns. · 4749b1ec

Ulrich Weigand authored Mar 26, 2013

This commit changes the ISEL patterns to use a CCBITRC operand
instead of a "pred" operand.  This matches the actual instruction
text more directly, and simplifies use of ISEL with the asm parser.
In addition, this change allows some simplification of handling
the "pred" operand, as this is now only used by BCC.

No change in generated code.

llvm-svn: 178003

4749b1ec

PowerPC: Simplify BLR pattern. · 63aa852a

Ulrich Weigand authored Mar 26, 2013

The BLR pattern cannot be recognized by the asm parser in its current form.
This complexity is due to an apparent attempt to enable conditional BLR
variants.  However, none of those can ever be generated by current code;
the pattern is only ever created using the default "pred" operand.

To simplify the pattern and allow it to be recognized by the parser,
this commit removes those attempts at conditional BLR support.

When we later come back to actually add real conditional BLR, this
should probably be done via a fully generic conditional branch pattern.

No change in generated code.

llvm-svn: 178002

63aa852a

PowerPC: Move some 64-bit branch patterns. · 410a40bb

Ulrich Weigand authored Mar 26, 2013

In PPCInstr64Bit.td, some branch patterns appear in a different sequence
than the corresponding 32-bit patterns in PPCInstrInfo.td.

To simplify future changes that affect both files, this commit moves
those patterns to rearrange them into a similar sequence.

No effect on generated code.

llvm-svn: 178001

410a40bb