Commits · 2734d79d9440498b6bece624711373d8ae1c968c · Roger Ferrer / llvm-epi-0.8

Sep 12, 2013

Mark PPC MFTB and DST (and friends) as deprecated · 0096dbd5

Hal Finkel authored Sep 12, 2013

Use the new instruction deprecation feature to mark mftb (now replaced with
mfspr) and dst (along with the other Altivec cache control instructions) as
deprecated when targeting cores supporting at least ISA v2.03.

llvm-svn: 190605

0096dbd5

Add an instruction deprecation feature to TableGen. · 0e76fa7d

Joey Gouly authored Sep 12, 2013

The 'Deprecated' class allows you to specify a SubtargetFeature that the
instruction is deprecated on.

The 'ComplexDeprecationPredicate' class allows you to define a custom
predicate that is called to check for deprecation.
For example:
  ComplexDeprecationPredicate<"MCR">

would mean you would have to define the following function:
  bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI,
                             std::string &Info)

Which returns 'false' for not deprecated, and 'true' for deprecated
and store the warning message in 'Info'.

The MCTargetAsmParser constructor was chaned to take an extra argument of
the MCInstrInfo class, so out-of-tree targets will need to be changed.

llvm-svn: 190598

0e76fa7d

AVX-512: implemented extractelement with variable index. · 8952974e
Elena Demikhovsky authored Sep 12, 2013
```
Added parsing of mask register and "zeroing" semantic, like {%k1} {z}.

llvm-svn: 190595
```
8952974e

PPC: Enable aggressive anti-dependency breaking · 7fe6a539

Hal Finkel authored Sep 12, 2013

Aggressive anti-dependency breaking is enabled by default for all PPC cores.
This provides a general speedup on the P7 and other platforms (among other
factors, the instruction group formation for the non-embedded PPC cores is done
during post-RA scheduling). In order to do this safely, the incompatibility
between uses of the MFOCRF instruction and anti-dependency breaking are
resolved by marking MFOCRF with hasExtraSrcRegAllocReq. As noted in the removed
FIXME, the problem was that MFOCRF's output is sensitive to the identify of the
source register, and always paired with a shift to undo this effect. Because
anti-dependency breaking is unaware of this hidden dependency of the shift
amount on the source register of the MFOCRF instruction, changing that register
must be inhibited.

Two test cases were adjusted: The SjLj test was made more insensitive to
register choices and scheduling; the saveCR test disabled anti-dependency
breaking because part of what it is testing is proper register reuse.

llvm-svn: 190587

7fe6a539

R600/SI: expose TBUFFER_STORE_FORMAT_* for OpenGL transform feedback · afcf12f3

Tom Stellard authored Sep 12, 2013



For _XYZ, the type of VDATA is v4i32, because v3i32 doesn't exist.

The ADDR64 bit is not exposed. A simpler intrinsic that doesn't take
a resource descriptor might be nicer.

The maximum number of input SGPRs is bumped to 17.

Signed-off-by: Marek Olšák <marek.olsak@amd.com>
Reviewed-by: Tom Stellard <thomas.stellard@amd.com>
llvm-svn: 190575

afcf12f3

R600: Don't use trans slot for instructions that read LDS source registers · 7f6fa4c4

Tom Stellard authored Sep 12, 2013

This fixes some regressions in the piglit local memory store tests
introduced by recent commits which made the scheduler aware of the trans
slot.

It's not possible to test this using lit, because there is no way to
determine from the assembly dumps whether or not an instruction is in
the trans slot.

Even if this were possible, the test would be highly sensitive to
changes in the scheduler and might generate confusing false negatives.

Reviewed-by: Vincent Lejeune<vljn at ovi.com>
llvm-svn: 190574

7f6fa4c4

Greatly simplify the PPC A2 scheduling itinerary · f574c277

Hal Finkel authored Sep 11, 2013

As Andy pointed out to me a long time ago, there are no structural hazards in
the later pipeline stages of the A2, and so modeling them is useless. Also,
modeling the top pre-dispatch stages is deceiving because, when multiple
hardware threads are active, those resources are shared among the threads. The
bypass definitions were mostly wrong, and so those have been removed. The
resulting itinerary is much simpler, and more accurate.

llvm-svn: 190562

f574c277

Enable MI scheduling (and CodeGen AA) by default for embedded PPC cores · 21442b24

Hal Finkel authored Sep 11, 2013

For embedded PPC cores (especially the A2 core), using the MI scheduler with AA
is far superior to the other scheduling options.

llvm-svn: 190558

21442b24

Sep 11, 2013
- Use the appropriate return type for the compact unwind encoding. · 7b650a75
  Bill Wendling authored Sep 11, 2013
```
llvm-svn: 190551
```
  7b650a75
- Implement TTI getUnrollingPreferences for PowerPC · 71780ec4
  Hal Finkel authored Sep 11, 2013
```
The PowerPC A2 core greatly benefits from aggressive concatenation unrolling;
use the new getUnrollingPreferences to enable this by default when targeting
the PPC A2 core.

llvm-svn: 190549
```
  71780ec4
- Move into an anonymous namespace and closer to where it's used. · 184d5d31
  Bill Wendling authored Sep 11, 2013
```
llvm-svn: 190547
```
  184d5d31
- [mips][msa] Added support for matching mulv, nlzc, sll, sra, srl, and subv... · fbcb5829
  Daniel Sanders authored Sep 11, 2013
```
[mips][msa] Added support for matching mulv, nlzc, sll, sra, srl, and subv from normal IR (i.e. not intrinsics)

llvm-svn: 190518
```
  fbcb5829
- [mips][msa] Added support for matching fadd, fdiv, flog2, fmul, frint, fsqrt,... · f5bd937b
  Daniel Sanders authored Sep 11, 2013
```
[mips][msa] Added support for matching fadd, fdiv, flog2, fmul, frint, fsqrt, and fsub from normal IR (i.e. not intrinsics)

llvm-svn: 190512
```
  f5bd937b
- [mips][msa] Added support for matching div_[su] from normal IR (i.e. not intrinsics) · 607952bd
  Daniel Sanders authored Sep 11, 2013
```
llvm-svn: 190509
```
  607952bd
- [mips][msa] Added support for matching addv from normal IR (i.e. not intrinsics) · fa5ab1c8
  Daniel Sanders authored Sep 11, 2013
```
The corresponding intrinsic is now lowered into equivalent IR (ISD::ADD) before instruction selection.

llvm-svn: 190507
```
  fa5ab1c8
- [mips][msa] Separate the configuration of int/float vector types since they will diverge soon · c65f58a9
  Daniel Sanders authored Sep 11, 2013
```
No functional change

llvm-svn: 190506
```
  c65f58a9
- [mips][msa] Corrected the definition of the dotp_[su].[hwd] intrinsics · cb2929c2
  Daniel Sanders authored Sep 11, 2013
```
The elements of the operands should be half the width of the elements of
the result.

llvm-svn: 190505
```
  cb2929c2
- Rename variables for consistency. · 8f06d556
  Eli Friedman authored Sep 11, 2013
```
No functional change.

llvm-svn: 190466
```
  8f06d556
- Fix unused variables. · 78bffa57
  Eli Friedman authored Sep 10, 2013
```
llvm-svn: 190448
```
  78bffa57
- Remove unused functions. · 1891f693
  Eli Friedman authored Sep 10, 2013
```
llvm-svn: 190442
```
  1891f693
Sep 10, 2013

ARM: Use the PICADD opcode calculated. · 19ae779a

Jim Grosbach authored Sep 10, 2013

We were figuring out whether to use tPICADD or PICADD, then just using
tPICADD unconditionally anyway. Oops.

A testcase from someone familiar enough with ELF to produce one would
be appreciated. The existing PIC testcase correctly verifies the .s
generated, but that doesn't catch this bug, which only showed up in
direct-to-object mode.

http://llvm.org/bugs/show_bug.cgi?id=17180

llvm-svn: 190417

19ae779a

Remove unused private member in ARMAsmPrinter.cpp. · d532cb6b
Logan Chien authored Sep 10, 2013
```
This commit removes the unused "AttributeItem" from
ObjectAttributeEmitter.

llvm-svn: 190412
```
d532cb6b
[SystemZ] Update README. · 0e0498b2
Richard Sandiford authored Sep 10, 2013
```
llvm-svn: 190404
```
0e0498b2

[SystemZ] Add TM and TMY · a9eb9972

Richard Sandiford authored Sep 10, 2013

The main complication here is that TM and TMY (the memory forms) set
CC differently from the register forms. When the tested bits contain
some 0s and some 1s, the register forms set CC to 1 or 2 based on the
value the uppermost bit. The memory forms instead set CC to 1
regardless of the uppermost bit.

Until now, I've tried to make it so that a branch never tests for an
impossible CC value. E.g. NR only sets CC to 0 or 1, so branches on the
result will only test for 0 or 1. Originally I'd tried to do the same
thing for TM and TMY by using custom matching code in ISelDAGToDAG.
That ended up being very ugly though, and would have meant duplicating
some of the chain checks that the common isel code does.

I've therefore gone for the simpler alternative of adding an extra
operand to the TM DAG opcode to say whether a memory form would be OK.
This means that the inverse of a "TM;JE" is "TM;JNE" rather than the
more precise "TM;JNLE", just like the inverse of "TMLL;JE" is "TMLL;JNE".
I suppose that's arguably less confusing though...

llvm-svn: 190400

a9eb9972

[mips][msa] Removed unsupported dot product instructions (dotp_[su].b) · f561730a
Daniel Sanders authored Sep 10, 2013
```
The dotp_[su].b instructions never existed in any revision of the MSA spec.

llvm-svn: 190398
```
f561730a
Add test cases for Mips mthc1/mfhc1 instructions. Add check for odd value of... · 65cd5744
Vladimir Medic authored Sep 10, 2013
```
Add test cases for Mips mthc1/mfhc1 instructions. Add check for odd value of register when PFU is 32 bit.

llvm-svn: 190397
```
65cd5744
Remove obsolete code from MipsAsmParser.cpp. · 88269706
Vladimir Medic authored Sep 10, 2013
```
llvm-svn: 190396
```
88269706
Revert r190366. It was breaking build bots. · f27e3315
Bill Wendling authored Sep 10, 2013
```
llvm-svn: 190373
```
f27e3315
Use a default value for the prologue's debug location. · b07305fc
Bill Wendling authored Sep 09, 2013
```
llvm-svn: 190366
```
b07305fc

Sep 09, 2013

Revert patches to add case-range support for PR1255. · e407736a

Bob Wilson authored Sep 09, 2013

The work on this project was left in an unfinished and inconsistent state.
Hopefully someone will eventually get a chance to implement this feature, but
in the meantime, it is better to put things back the way the were. I have
left support in the bitcode reader to handle the case-range bitcode format,
so that we do not lose bitcode compatibility with the llvm 3.3 release.

This reverts the following commits: 155464, 156374, 156377, 156613, 156704,
156757, 156804 156808, 156985, 157046, 157112, 157183, 157315, 157384, 157575,
157576, 157586, 157612, 157810, 157814, 157815, 157880, 157881, 157882, 157884,
157887, 157901, 158979, 157987, 157989, 158986, 158997, 159076, 159101, 159100,
159200, 159201, 159207, 159527, 159532, 159540, 159583, 159618, 159658, 159659,
159660, 159661, 159703, 159704, 160076, 167356, 172025, 186736

llvm-svn: 190328

e407736a

[mips] When double precision loads and stores are split into two i32 loads and · 9cf069f6

Akira Hatanaka authored Sep 09, 2013

stores, make sure the load or store that accesses the higher half does not have
an alignment that is larger than the offset from the original address.

llvm-svn: 190318

9cf069f6

[ARMv8] Prevent generation of deprecated IT blocks on ARMv8 in Thumb mode. · a5153cb0

Joey Gouly authored Sep 09, 2013

IT blocks can only be one instruction lonf, and can only contain a subset of
the 16 instructions.

Patch by Artyom Skrobov!

llvm-svn: 190309

a5153cb0

A better way to silence the warning in MSVC (replaces r190304). · 83d81784
Aaron Ballman authored Sep 09, 2013
```
llvm-svn: 190308
```
83d81784
Silencing a warning about control flow reaching the end of a non-void function. · c4280dd9
Aaron Ballman authored Sep 09, 2013
```
llvm-svn: 190304
```
c4280dd9

XCore handling of thread local lowering · 3d3194bf

Robert Lytton authored Sep 09, 2013

Fix XCoreLowerThreadLocal trying to initialise globals
which have no initializer.

Add handling of const expressions containing thread local variables.
These need to be replaced with instructions, as the thread ID is
used to access the thread local variable.

llvm-svn: 190300

3d3194bf

XCore target: change to Sched::Source · 4809ea41

Robert Lytton authored Sep 09, 2013

This sidesteps a bug in PrescheduleNodesWithMultipleUses() which
does not check if callResources will be affected by the transformation.

llvm-svn: 190299

4809ea41

XCore target: fix weak linkage attribute handling · e4538883
Robert Lytton authored Sep 09, 2013
```
llvm-svn: 190298
```
e4538883

Generate compact unwind encoding from CFI directives. · 58e2d3d8

Bill Wendling authored Sep 09, 2013

We used to generate the compact unwind encoding from the machine
instructions. However, this had the problem that if the user used `-save-temps'
or compiled their hand-written `.s' file (with CFI directives), we wouldn't
generate the compact unwind encoding.

Move the algorithm that generates the compact unwind encoding into the
MCAsmBackend. This way we can generate the encoding whether the code is from a
`.ll' or `.s' file.

<rdar://problem/13623355>

llvm-svn: 190290

58e2d3d8

Implement aarch64 neon instruction set AdvSIMD (3V Diff), covering the following 26 instructions, · 2878dc8f

Jiangning Liu authored Sep 09, 2013

SADDL, UADDL, SADDW, UADDW, SSUBL, USUBL, SSUBW, USUBW, ADDHN, RADDHN, SABAL, UABAL, SUBHN, RSUBHN, SABDL, UABDL, SMLAL, UMLAL, SMLSL, UMLSL, SQDMLAL, SQDMLSL, SMULL, UMULL, SQDMULL, PMULL

llvm-svn: 190288

2878dc8f

Sep 08, 2013
- Add neverHasSideEffects=1 on a couple move instructions. · adbb9a12
  Craig Topper authored Sep 08, 2013
```
llvm-svn: 190259
```
  adbb9a12