Commits · 10367eb42262bdcddf171ab502a547960d509425 · Lorenzo Albano / LLVM bpEVL

Apr 12, 2018

[Power9]Legalize and emit code for converting (Un)Signed DWord to Quad-Precision · 10367eb4

Lei Huang authored Apr 12, 2018

Legalize and emit code for:

  * xscvsdqp
  * xscvudqp

Differential Revision: https://reviews.llvm.org/D45230

llvm-svn: 329931

10367eb4

Apr 04, 2018

[Power9]Legalize and emit code for quad-precision fma instructions · 09fda63a

Lei Huang authored Apr 04, 2018

Legalize and emit code for the following quad-precision fma:

  * xsmaddqp
  * xsnmaddqp
  * xsmsubqp
  * xsnmsubqp

Differential Revision: https://reviews.llvm.org/D44843

llvm-svn: 329206

09fda63a

Mar 26, 2018

[Power9]Legalize and emit code for quad-precision convert from double-precision · be0afb08

Lei Huang authored Mar 26, 2018

Legalize and emit code for quad-precision floating point operation xscvdpqp
and add option to guard the quad precision operation support.

Differential Revision: https://reviews.llvm.org/D44746

llvm-svn: 328558

be0afb08

[PowerPC] Infrastructure work. Implement getting the opcode for a spill in one place. · 26d4f923

Stefan Pintilie authored Mar 26, 2018

A new function getOpcodeForSpill should now be the only place to get
the opcode for a given spilled register.

Differential Revision: https://reviews.llvm.org/D43086

llvm-svn: 328556

26d4f923

Mar 19, 2018

[Power9]Legalize and emit code for quad-precision copySign/abs/nabs/neg/sqrt · ecfede94

Lei Huang authored Mar 19, 2018

Legalize and emit code for quad-precision floating point operations:

  * xscpsgnqp
  * xsabsqp
  * xsnabsqp
  * xsnegqp
  * xssqrtqp

Differential Revision: https://reviews.llvm.org/D44530

llvm-svn: 327889

ecfede94

[PowerPC][Power9]Legalize and emit code for quad-precision add/div/mul/sub · 6d1596a9

Lei Huang authored Mar 19, 2018

Legalize and emit code for quad-precision floating point operations:

  * xsaddqp
  * xssubqp
  * xsdivqp
  * xsmulqp

Differential Revision: https://reviews.llvm.org/D44506

llvm-svn: 327878

6d1596a9

Mar 12, 2018
- [PowerPC][NFC] Explicitly state types on FP SDAG patterns in anticipation of adding the f128 type · cd4f3857
  Lei Huang authored Mar 12, 2018
```
llvm-svn: 327319
```
  cd4f3857
Feb 23, 2018

[PowerPC] Code cleanup. Remove instructions that were withdrawn from Power 9. · 15e6b10e

Stefan Pintilie authored Feb 23, 2018

The following set of instructions was originally planned to be added for Power 9
and so code was added to support them. However, a decision was made later on to
withdraw support for these instructions in the hardware.
xscmpnedp
xvcmpnesp
xvcmpnedp
This patch removes support for the instructions that were not added.

Differential Revision: https://reviews.llvm.org/D43641

llvm-svn: 325918

15e6b10e

Nov 27, 2017

[Power9] Improvements to vector extract with variable index exploitation · 48cb3c15

Zaara Syeda authored Nov 27, 2017

This patch extends on to rL307174 to not use the power9 vector extract with
variable index instructions when extracting word element 1. For such cases,
the existing selection of MFVSRWZ provides a better sequence.

Differential Revision: https://reviews.llvm.org/D38287

llvm-svn: 319049

48cb3c15

Nov 20, 2017

[PPC] Heuristic to choose between a X-Form VSX ld/st vs a X-Form FP ld/st. · 438bf4a6

Tony Jiang authored Nov 20, 2017

The VSX versions have the advantage of a full 64-register target whereas the FP
ones have the advantage of lower latency and higher throughput. So what we’re
after is using the faster instructions in low register pressure situations and
using the larger register file in high register pressure situations.

The heuristic chooses between the following 7 pairs of instructions.
PPC::LXSSPX vs PPC::LFSX
PPC::LXSDX vs PPC::LFDX
PPC::STXSSPX vs PPC::STFSX
PPC::STXSDX vs PPC::STFDX
PPC::LXSIWAX vs PPC::LFIWAX
PPC::LXSIWZX vs PPC::LFIWZX
PPC::STXSIWX vs PPC::STFIWX

Differential Revision: https://reviews.llvm.org/D38486

llvm-svn: 318651

438bf4a6

Nov 07, 2017

Use new vector insert half-word and byte instructions when we see... · 5cd044e8

Graham Yiu authored Nov 07, 2017

Use new vector insert half-word and byte instructions when we see insertelement on '8 x i16' and '16 x i8' types. Also extended existing lit testcase to cover these cases.

Differential Revision: https://reviews.llvm.org/D34630

llvm-svn: 317613

5cd044e8

Sep 21, 2017

[Power9] Spill gprs to vector registers rather than stack · fcd9697d

Zaara Syeda authored Sep 21, 2017

This patch updates register allocation to enable spilling gprs to
volatile vector registers rather than the stack. It can be enabled
 for Power9 with option -ppc-enable-gpr-to-vsr-spills.

Differential Revision: https://reviews.llvm.org/D34815

llvm-svn: 313886

fcd9697d

Sep 05, 2017
- [PPC][NFC] Renaming things with 'xxinsert' moniker to 'vecinsert' to make it more general. · 61ef1c54
  Tony Jiang authored Sep 05, 2017
```
Commit on behalf of Graham Yiu (gyiu@ca.ibm.com)

llvm-svn: 312547
```
  61ef1c54
Aug 14, 2017

[PowerPC] Add codegen for VSX word extract convert to FP · 451ef4ad

Lei Huang authored Aug 14, 2017

Add codegen for VSX word extract conversion from signed/unsigned to single/double
precision.

For UINT_TO_FP:
Extract word unsigned and convert to float was implemented in https://reviews.llvm.org/D20239.
Here we will add the missing extract integer and conversion to double. This
utilizes the new P9 instruction xxextractuw to extracting an integer element
when the result will be converted to double thereby saving 2 direct moves
(VSR <-> GPR).

For SINT_TO_FP:
We will implement the following sequence which will also reduce the number of
instructions by saving 2 direct moves.

v4i32->f32:
        xxspltw
        xvcvsxwsp
        xscvspdpn

v4i32->f64:
        xxspltw
        xvcvsxwdp

Differential Revision: https://reviews.llvm.org/D35859

llvm-svn: 310866

451ef4ad

Jul 13, 2017

[PowerPC] Ensure displacements for DQ-Form instructions are multiples of 16 · 3c7e276d

Nemanja Ivanovic authored Jul 13, 2017

As outlined in the PR, we didn't ensure that displacements for DQ-Form
instructions are multiples of 16. Since the instruction encoding encodes
a quad-word displacement, a sub-16 byte displacement is meaningless and
ends up being encoded incorrectly.

Fixes https://bugs.llvm.org/show_bug.cgi?id=33671.

Differential Revision: https://reviews.llvm.org/D35007

llvm-svn: 307934

3c7e276d

Jul 05, 2017

[Power9] Exploit vector extract with variable index. · aa5a6a1c

Tony Jiang authored Jul 05, 2017

This patch adds the exploitation for new power 9 instructions which extract
variable elements from vectors:
VEXTUBLX
VEXTUBRX
VEXTUHLX
VEXTUHRX
VEXTUWLX
VEXTUWRX

Differential Revision: https://reviews.llvm.org/D34032
Commit on behalf of Zaara Syeda (syzaara@ca.ibm.com)

llvm-svn: 307174

aa5a6a1c

[Power9] Exploit vector integer extend instructions when indices aren't correct. · 9a91a181

Tony Jiang authored Jul 05, 2017

This patch adds on to the exploitation added by https://reviews.llvm.org/D33510.
This now catches build vector nodes where the inputs are coming from sign
extended vector extract elements where the indices used by the vector extract
are not correct. We can still use the new hardware instructions by adding a
shuffle to move the elements to the correct indices. I introduced a new PPCISD
node here because adding a vector_shuffle and changing the elements of the
vector_extracts was getting undone by another DAG combine.

Commit on behalf of Zaara Syeda (syzaara@ca.ibm.com)
Differential Revision: https://reviews.llvm.org/D34009

llvm-svn: 307169

9a91a181

Jun 12, 2017

[PowerPC] Match vec_revb builtins to P9 instructions. · 1a8eec14

Tony Jiang authored Jun 12, 2017

Power9 has instructions that will reverse the bytes within an element for all
sizes (half-word, word, double-word and quad-word). These can be used for the
vec_revb builtins in altivec.h. However, we implement these to match vector
shuffle nodes as that will cover both the builtins and vector shuffles that
occur in the SDAG through other means.

Differential Revision: https://reviews.llvm.org/D33690

llvm-svn: 305214

1a8eec14

Jun 08, 2017

[Power9] Exploit vector integer extend instructions · 79acbbe5

Zaara Syeda authored Jun 08, 2017

This patch adds build vector patterns to exploit the vector integer
extend instructions:
vextsb2w - Vector Extend Sign Byte To Word
vextsb2d - Vector Extend Sign Byte To Doubleword
vextsh2w - Vector Extend Sign Halfword To Word
vextsh2d - Vector Extend Sign Halfword To Doubleword
vextsw2d - Vector Extend Sign Word To Doubleword

Differential Revision: https://reviews.llvm.org/D33510

llvm-svn: 304992

79acbbe5

May 31, 2017

[PowerPC] Fix a performance bug for PPC::XXPERMDI. · 60c247de

Tony Jiang authored May 31, 2017

There are some VectorShuffle Nodes in SDAG which can be selected to XXPERMDI
Instruction, this patch recognizes them and does the selection to improve
the PPC performance.

Differential Revision: https://reviews.llvm.org/D33404

llvm-svn: 304298

60c247de

May 29, 2017

[PPC] Fix assertion failure during binary encoding with -mcpu=pwr9 · e3c14ebb

Hiroshi Inoue authored May 29, 2017

Summary
clang -c -mcpu=pwr9 test/CodeGen/PowerPC/build-vector-tests.ll causes an assertion failure during the binary encoding.
The failure occurs when a D-form load instruction takes two register operands instead of a register + an immediate.

This patch fixes the problem and also adds an assertion to catch this failure earlier before the binary encoding (i.e. during lit test).
The fix is from Nemanja Ivanovic @nemanjai.

Differential Revision: https://reviews.llvm.org/D33482

llvm-svn: 304133

e3c14ebb

May 25, 2017

[PowerPC] Fix a performance bug for PPC::XXSLDWI. · 0a429f04

Tony Jiang authored May 24, 2017

There are some VectorShuffle Nodes in SDAG which can be selected to XXSLDWI
instruction, this patch recognizes them and does the selection to improve the
PPC performance.

llvm-svn: 303822

0a429f04

May 24, 2017
- P9: D-form vector load/store. Differential Revision: https://reviews.llvm.org/D33248 · 93297831
  Zaara Syeda authored May 24, 2017
```
llvm-svn: 303780
```
  93297831
May 12, 2017

[PPC] Change the register constraint of the first source operand of... · 22e7da95

Guozhi Wei authored May 11, 2017

[PPC] Change the register constraint of the first source operand of instruction mtvsrdd to g8rc_nox0

According to Power ISA V3.0 document, the first source operand of mtvsrdd is constant 0 if r0 is specified. So the corresponding register constraint should be g8rc_nox0.

This bug caused wrong output generated by 401.bzip2 when -mcpu=power9 and fdo are specified.

Differential Revision: https://reviews.llvm.org/D32880

llvm-svn: 302834

22e7da95

May 02, 2017

[PowerPC] Emit VMX loads/stores for aligned ops to avoid adding swaps on LE · b89c27f5

Nemanja Ivanovic authored May 02, 2017

Fixes PR30730.
This is a re-commit of a pulled commit. The commit was pulled because some
software projects contained uses of Altivec vectors that violated alignment
requirements. Known issues have now been fixed.

Committing on behalf of Lei Huang.

Differential Revision: https://reviews.llvm.org/D26861

llvm-svn: 301892

b89c27f5

Mar 30, 2017
- Spelling mistakes in comments. NFCI. · 68168d17
  Simon Pilgrim authored Mar 30, 2017
```
Based on corrections mentioned in patch for clang for PR27635

llvm-svn: 299072
```
  68168d17
Mar 15, 2017

[PowerPC][Altivec] Add mfvrd and mffprd extended mnemonic · ffcf0fb1

Nemanja Ivanovic authored Mar 15, 2017

mfvrd and mffprd are both alias to mfvrsd.
This patch enables correct parsing of the aliases, but we still emit a mfvrsd.

Committing on behalf of brunoalr (Bruno Rosa).

Differential Revision: https://reviews.llvm.org/D29177

llvm-svn: 297849

ffcf0fb1

Jan 26, 2017

[PPC] cleanup of mayLoad/mayStore flags and memory operands. · 3c8c385a

Sean Fertile authored Jan 26, 2017

1) Explicitly sets mayLoad/mayStore property in the tablegen files on load/store
   instructions.
2) Updated the flags on a number of intrinsics indicating that they write
    memory.
3) Added SDNPMemOperand flags for some target dependent SDNodes so that they
   propagate their memory operand

Review: https://reviews.llvm.org/D28818
llvm-svn: 293200

3c8c385a

Dec 15, 2016

[Power9] Allow AnyExt immediates for XXSPLTIB · 552c8e96

Nemanja Ivanovic authored Dec 15, 2016

In some situations, the BUILD_VECTOR node that builds a v18i8 vector by
a splat of an i8 constant will end up with signed 8-bit values and other
situations, it'll end up with unsigned ones. Handle both situations.

Fixes PR31340.

llvm-svn: 289804

552c8e96

Dec 09, 2016
- [PPC] Add intrinsics for vector extract word and vector insert word. · 1c4109b4
  Sean Fertile authored Dec 09, 2016
```
Revision: https://reviews.llvm.org/D26547
llvm-svn: 289227
```
  1c4109b4
Dec 06, 2016

[PowerPC] Improvements for BUILD_VECTOR Vol. 4 · 15748f49

Nemanja Ivanovic authored Dec 06, 2016

This is the final patch in the series of patches that improves
BUILD_VECTOR handling on PowerPC. This adds a few peephole optimizations
to remove redundant instructions. It also adds a large test case which
encompasses a large set of code patterns that build vectors - this test
case was the motivator for this series of patches.

Differential Revision: https://reviews.llvm.org/D26066

llvm-svn: 288800

15748f49

Nov 30, 2016

Revert https://reviews.llvm.org/rL287679 · f57f150b

Nemanja Ivanovic authored Nov 29, 2016

This commit caused some miscompiles that did not show up on any of the bots.
Reverting until we can investigate the cause of those failures.

llvm-svn: 288214

f57f150b

Nov 29, 2016

[PowerPC] Improvements for BUILD_VECTOR Vol. 1 · df1cb520

Nemanja Ivanovic authored Nov 29, 2016

This patch corresponds to review:
https://reviews.llvm.org/D25912

This is the first patch in a series of 4 that improve the lowering and combining
for BUILD_VECTOR nodes on PowerPC.

llvm-svn: 288152

df1cb520

Nov 23, 2016

[PowerPC] Remove InstAlias definitions that cause incorrect assembly · 10fc3cfc

Nemanja Ivanovic authored Nov 23, 2016

In rL283190, I added some InstAlias definitions to generate extended mnemonics
for some uses of the XXPERMDI instruction. However, when the assembler matches
these extended mnemonics, it matches the new instruction in situations where it
should match the old one.
This patch removes these definitions and accomplishes that by defining these
mnemonics with additional instructions that are isCodeGenOnly.

Fixes PR31127.

llvm-svn: 287765

10fc3cfc

Nov 22, 2016

[PowerPC] Emit VMX loads/stores for aligned ops to avoid adding swaps on LE · b8e30d6d

Nemanja Ivanovic authored Nov 22, 2016

This patch corresponds to review:
https://reviews.llvm.org/D26861

It also fixes PR30730.

Committing on behalf of Lei Huang.

llvm-svn: 287679

b8e30d6d

Nov 15, 2016

vector load store with length (left justified) llvm portion · a19c9e60
Zaara Syeda authored Nov 15, 2016
```
llvm-svn: 286993
```
a19c9e60

[PowerPC] Implement BE VSX load/store builtins - llvm portion. · 5f850cd1

Tony Jiang authored Nov 15, 2016

This patch implements all the overloads for vec_xl_be and vec_xst_be. On BE,
they behaves exactly the same with vec_xl and vec_xst, therefore they are
simply implemented by defining a matching macro. On LE, they are implemented
by defining new builtins and intrinsics. For int/float/long long/double, it
is just a load (lxvw4x/lxvd2x) or store(stxvw4x/stxvd2x). For char/char/short,
we also need some extra shuffling before or after call the builtins to get the
desired BE order. For int128, simply call vec_xl or vec_xst.

llvm-svn: 286967

5f850cd1

Nov 14, 2016

[PPC] Add intrinsic mapping to the xscvhpsp instruction · a435e07d

Sean Fertile authored Nov 14, 2016

add an intrinsic to expose the 'VSX Scalar Convert Half-Precision to
Single-Precision' instruction.

Differential review: https://reviews.llvm.org/D26536

llvm-svn: 286862

a435e07d

[PPC] add intrinsics for vec extract exp/significand and vec test data class. · adda5b2d
Sean Fertile authored Nov 14, 2016
```
  Differential Revision: https://reviews.llvm.org/D26272

llvm-svn: 286829
```
adda5b2d

Nov 11, 2016

[PowerPC] Add remaining vector permute builtins in altivec.h - LLVM portion · ec4b0c36

Nemanja Ivanovic authored Nov 11, 2016

This patch corresponds to review:
https://reviews.llvm.org/D26480

Adds all the intrinsics used for various permute builtins that will
be added to altivec.h.

llvm-svn: 286638

ec4b0c36