Commits · f4ec67822fb6dd96bb9959d84178d220833325c2 · Lorenzo Albano / LLVM bpEVL

May 24, 2018

[PowerPC] Remove the match pattern in the definition of LXSDX/STXSDX · f4ec6782

Lei Huang authored 6 years ago

The match pattern in the definition of LXSDX is xoaddr, so the Pseudo
instruction XFLOADf64 never gets selected. XFLOADf64 expands to LXSDX/LFDX post
RA based on the register pressure. To avoid ambiguity, we need to remove the
select pattern for LXSDX, same as what was done for LXSD. STXSDX also have
the same issue.

Patch by Qing Shan Zhang (steven.zhang).

Differential Revision: https://reviews.llvm.org/D47178

llvm-svn: 333150

f4ec6782

May 23, 2018

[Power9]Legalize and emit code for W vector extract and convert to QP · 8b0da65b

Lei Huang authored 6 years ago

Implemente patterns to extract [Un]signed Word vector element and convert to
quad-precision.

Differential Revision: https://reviews.llvm.org/D46536

llvm-svn: 333115

8b0da65b

[Power9]Legalize and emit code for DW vector extract and convert to QP · 8990168a

Lei Huang authored 6 years ago

Implemente patterns to extract [Un]signed DWord vector element and convert to
quad-precision.

Differential Revision: https://reviews.llvm.org/D46333

llvm-svn: 333112

8990168a

May 21, 2018

MC: Separate creating a generic object writer from creating a target object writer. NFCI. · dcd7d6c3

Peter Collingbourne authored 6 years ago

With this we gain a little flexibility in how the generic object
writer is created.

Part of PR37466.

Differential Revision: https://reviews.llvm.org/D47045

llvm-svn: 332868

dcd7d6c3

MC: Change MCAsmBackend::writeNopData() to take a raw_ostream instead of an MCObjectWriter. NFCI. · 571a3301

Peter Collingbourne authored 6 years ago

To make this work I needed to add an endianness field to MCAsmBackend
so that writeNopData() implementations know which endianness to use.

Part of PR37466.

Differential Revision: https://reviews.llvm.org/D47035

llvm-svn: 332857

571a3301

May 18, 2018

Support: Simplify endian stream interface. NFCI. · e3f65297

Peter Collingbourne authored 6 years ago

Provide some free functions to reduce verbosity of endian-writing
a single value, and replace the endianness template parameter with
a field.

Part of PR37466.

Differential Revision: https://reviews.llvm.org/D47032

llvm-svn: 332757

e3f65297

May 14, 2018

[NFC] [Power] Fix instruction format for xsrqpi · 421a5960

Zaara Syeda authored 6 years ago

xsrqpi is currently using Z23Form_1.
The instruction format is xsrqpi R,VRT,VRB,RMC.
Rathar than bits 11-15 being used for FRA, it should have
bits 11-14 reserved and bit 15 for R. This patch adds a new
class Z23Form_4 to fix the instruction format.

Differential Revision: https://reviews.llvm.org/D46761

llvm-svn: 332253

421a5960

Rename DEBUG macro to LLVM_DEBUG. · d34e60ca

Nicola Zaghen authored 6 years ago

    
The DEBUG() macro is very generic so it might clash with other projects.
The renaming was done as follows:
- git grep -l 'DEBUG' | xargs sed -i 's/\bDEBUG\s\?(/LLVM_DEBUG(/g'
- git diff -U0 master | ../clang/tools/clang-format/clang-format-diff.py -i -p1 -style LLVM
- Manual change to APInt
- Manually chage DOCS as regex doesn't match it.

In the transition period the DEBUG() macro is still present and aliased
to the LLVM_DEBUG() one.

Differential Revision: https://reviews.llvm.org/D43624

llvm-svn: 332240

d34e60ca

May 11, 2018

[STLExtras] Add distance() for ranges, pred_size(), and succ_size() · e0b5f86b

Vedant Kumar authored 6 years ago

This commit adds a wrapper for std::distance() which works with ranges.
As it would be a common case to write `distance(predecessors(BB))`, this
also introduces `pred_size()` and `succ_size()` helpers to make that
easier to write.

Differential Revision: https://reviews.llvm.org/D46668

llvm-svn: 332057

e0b5f86b

May 09, 2018

[DebugInfo] Examine all uses of isDebugValue() for debug instructions. · 801bf7eb

Shiva Chen authored 6 years ago

Because we create a new kind of debug instruction, DBG_LABEL, we need to
check all passes which use isDebugValue() to check MachineInstr is debug
instruction or not. When expelling debug instructions, we should expel
both DBG_VALUE and DBG_LABEL. So, I create a new function,
isDebugInstr(), in MachineInstr to check whether the MachineInstr is
debug instruction or not.

This patch has no new test case. I have run regression test and there is
no difference in regression test.

Differential Revision: https://reviews.llvm.org/D45342

Patch by Hsiangkai Wang.

llvm-svn: 331844

801bf7eb

May 08, 2018

[Power9]Legalize and emit code for truncate and convert QP to HW and Byte · e41e3d32

Lei Huang authored 6 years ago

Legalize and emit code for truncate and convert float128 to (un)signed short
and (un)signed char.

Differential Revision: https://reviews.llvm.org/D46194

llvm-svn: 331797

e41e3d32

[Power9]Legalize and emit code for truncate and convert Quad-Precision to Word · 6364288d

Lei Huang authored 6 years ago

Legalize and emit code for:

  * xscvqpswz : VSX Scalar truncate & Convert Quad-Precision to Signed Word
  * xscvqpuwz : VSX Scalar truncate & Convert Quad-Precision to Unsigned Word

Differential Revision: https://reviews.llvm.org/D45635

llvm-svn: 331790

6364288d

[Power9]Legalize and emit code for truncate and convert QP to DW · c517e95b

Lei Huang authored 6 years ago

Legalize and emit code for:

  * xscvqpsdz : VSX Scalar truncate & Convert Quad-Precision to Signed Dword
  * xscvqpudz : VSX Scalar truncate & Convert Quad-Precision to Unsigned Dword

Differential Revision: https://reviews.llvm.org/D45553

llvm-svn: 331787

c517e95b

[PowerPC] Unify handling for conversion of FP_TO_INT feeding a store · c29229a6

Lei Huang authored 6 years ago

Existing DAG combine only handles conversions for FP_TO_SINT:
"{f32, f64} x { i32, i16 }"

This patch simplifies the code to handle:
"{ FP_TO_SINT, FP_TO_UINT } x { f64, f32 } x { i64, i32, i16, i8 }"

Differential Revision: https://reviews.llvm.org/D46102

llvm-svn: 331778

c29229a6

May 03, 2018

Commit r331416 breaks the big-endian PPC bot. On the big endian build, we · 61ffbf21
Nemanja Ivanovic authored 6 years ago
```
actually encounter constants wider than 64-bits. Add the guard to prevent
tripping the assert.

llvm-svn: 331420
```
61ffbf21

[PowerPC] Implement isMaskAndCmp0FoldingBeneficial · 01e2e79a

Nemanja Ivanovic authored 6 years ago

Sinking the and closer to a compare against zero is beneficial on PPC as it
allows us to emit record-form instructions. In the future, we may expand this
to a larger set of operations that feed compares against zero since PPC has
lots of record-form instructions.

Differential revision: https://reviews.llvm.org/D46060

llvm-svn: 331416

01e2e79a

[PowerPC] No CTR loop if the candidate exiting block is in a different loop · 2139e99e

Nemanja Ivanovic authored 6 years ago

The CTR loops pass will insert the decrementing branch instruction in an exiting
block for the loop being transformed. However if that block is part of another
loop as well (whether a nested loop or with irreducible CFG), it is not valid
to use that exiting block. In fact, if the loop hass irreducible CFG, we don't
bother analyzing it and we just bail on the transformation. In practice, this
doesn't lead to a noticeable reduction in the number of loops transformed by
this pass.

Fixes https://bugs.llvm.org/show_bug.cgi?id=37229

Differential Revision: https://reviews.llvm.org/D46162

llvm-svn: 331410

2139e99e

May 01, 2018

Remove \brief commands from doxygen comments. · 5f8f34e4

Adrian Prantl authored 6 years ago

We've been running doxygen with the autobrief option for a couple of
years now. This makes the \brief markers into our comments
redundant. Since they are a visual distraction and we don't want to
encourage more \brief markers in new code either, this patch removes
them all.

Patch produced by

  for i in $(git grep -l '\\brief'); do perl -pi -e 's/\\brief //g' $i & done

Differential Revision: https://reviews.llvm.org/D46290

llvm-svn: 331272

5f8f34e4

Apr 30, 2018

IWYU for llvm-config.h in llvm, additions. · 432a3883

Nico Weber authored 6 years ago

See r331124 for how I made a list of files missing the include.
I then ran this Python script:

    for f in open('filelist.txt'):
        f = f.strip()
        fl = open(f).readlines()

        found = False
        for i in xrange(len(fl)):
            p = '#include "llvm/'
            if not fl[i].startswith(p):
                continue
            if fl[i][len(p):] > 'Config':
                fl.insert(i, '#include "llvm/Config/llvm-config.h"\n')
                found = True
                break
        if not found:
            print 'not found', f
        else:
            open(f, 'w').write(''.join(fl))

and then looked through everything with `svn diff | diffstat -l | xargs -n 1000 gvim -p`
and tried to fix include ordering and whatnot.

No intended behavior change.

llvm-svn: 331184

432a3883

Apr 23, 2018
- Consistently sort add_subdirectory calls in lib/Target/*/CMakeLists.txt · 5d53aed4
  Nico Weber authored 6 years ago
```
llvm-svn: 330584
```
  5d53aed4
Apr 21, 2018

[PowerPC] fix incorrect vectorization of abs() on POWER9 · 33486787

Hiroshi Inoue authored 6 years ago

Vectorized loops with abs() returns incorrect results on POWER9. This patch fixes it.
For example the following code returns negative result if input values are negative though it sums up the absolute value of the inputs.

int vpx_satd_c(const int16_t *coeff, int length) {
  int satd = 0;
  for (int i = 0; i < length; ++i) satd += abs(coeff[i]);
  return satd;
}

This problem causes test failures for libvpx.
For vector absolute and vector absolute difference on POWER9, LLVM generates VABSDUW (Vector Absolute Difference Unsigned Word) instruction or variants.
Since these instructions are for unsigned integers, we need adjustment for signed integers.
For abs(sub(a, b)), we generate VABSDUW(a+0x80000000, b+0x80000000). Otherwise, abs(sub(-1, 0)) returns 0xFFFFFFFF(=-1) instead of 1. For abs(a), we generate VABSDUW(a+0x80000000, 0x80000000).

Differential Revision: https://reviews.llvm.org/D45522

llvm-svn: 330497

33486787

Apr 18, 2018

[Power9]Legalize and emit code for converting Unsigned HWord/Char to Quad-Precision · 192c6ccf

Lei Huang authored 6 years ago

Legalize and emit code for converting unsigned HWord/Char to QP:

xscvsdqp
xscvudqp

Only covering patterns for unsigned forms cause we don't have part-word
sign-extending integer loads into VSX registers.

Differential Revision: https://reviews.llvm.org/D45494

llvm-svn: 330278

192c6ccf

[Power9]Legalize and emit code for converting (Un)Signed Word to Quad-Precision · 198e6785

Lei Huang authored 6 years ago

Legalize and emit code for converting (Un)Signed Word to quad-precision via:

xscvsdqp
xscvudqp

Differential Revision: https://reviews.llvm.org/D45389

llvm-svn: 330273

198e6785

Apr 16, 2018

[NFC] Move verificaiton check for f128 conversion into LowerINT_TO_FP() · 42ab1d3d

Lei Huang authored 6 years ago

Move veriication check for legal conversions to f128 into LowerINT_TO_FP()
and fix some indentations to match other sections of the code for readability.

llvm-svn: 330138

42ab1d3d

Apr 13, 2018

[Power9] Add the TLS store instructions to the Power 9 model · 118b8675

Stefan Pintilie authored 6 years ago

The Power 9 scheduler model should now include the TLS instructions.
We can now, once again, mark the model as complete.
From now on, if instructions are added to Power 9 but are not
added to the model the build should produce an error. Hopefully
that will alert the developer who is adding new instructions
that they should also be added to the scheulder model.

llvm-svn: 330060

118b8675

Apr 12, 2018

[Power9]Legalize and emit code for converting (Un)Signed DWord to Quad-Precision · 10367eb4

Lei Huang authored 6 years ago

Legalize and emit code for:

  * xscvsdqp
  * xscvudqp

Differential Revision: https://reviews.llvm.org/D45230

llvm-svn: 329931

10367eb4

Apr 11, 2018

[PowerPC] Fix condition for 64-bit rotate when replacing r+r instr with r+i · c564dc06

Nemanja Ivanovic authored 6 years ago

This patch fixes https://bugs.llvm.org/show_bug.cgi?id=37039
The condition only covers one of the two 64-bit rotate instructions. This just
adds the second (RLDICLo).

Patch by Josh Stone.

llvm-svn: 329852

c564dc06

Apr 08, 2018

[PowerPC] Change std::sort to llvm::sort in response to r327219 · 327fd5e4

Mandeep Singh Grang authored 6 years ago

Summary:
r327219 added wrappers to std::sort which randomly shuffle the container before sorting.
This will help in uncovering non-determinism caused due to undefined sorting
order of objects having the same key.

To make use of that infrastructure we need to invoke llvm::sort instead of std::sort.

Note: This patch is one of a series of patches to replace *all* std::sort to llvm::sort.
Refer the comments section in D44363 for a list of all the required patches.

Reviewers: hfinkel, RKSimon

Reviewed By: RKSimon

Subscribers: nemanjai, kbarton, llvm-commits

Differential Revision: https://reviews.llvm.org/D44870

llvm-svn: 329535

327fd5e4

Apr 06, 2018

[PowerPC] allow D-form VSX load/store when accessing FrameIndex without offset · a2eefb6d

Hiroshi Inoue authored 6 years ago

VSX D-form load/store instructions of POWER9 require the offset be a multiple of 16 and a helper`isOffsetMultipleOf` is used to check this.
So far, the helper handles FrameIndex + offset case, but not handling FrameIndex without offset case. Due to this, we are missing opportunities to exploit D-form instructions when accessing an object or array allocated on stack.
For example, x-form store (stxvx) is used for int a[4] = {0}; instead of d-form store (stxv). For larger arrays, D-form instruction is not used when accessing the first 16-byte. Using D-form instructions reduces register pressure as well as instructions.

Differential Revision: https://reviews.llvm.org/D45079

llvm-svn: 329377

a2eefb6d

Apr 05, 2018

[PowerPC] fix assertion failure due to missing instruction in P9InstrResources.td · bbf98aea

Hiroshi Inoue authored 6 years ago

This patch adds L(W|H|B)ZXTLS_32 instructions introduced by https://reviews.llvm.org/rL327635 in P9InstrResources.td.

llvm-svn: 329299

bbf98aea

[SchedModel] Complete models shouldn't match against itineraries when they don't use them (PR35639) · 1d793b8a

Simon Pilgrim authored 6 years ago

For schedule models that don't use itineraries, checkCompleteness still checks that an instruction has a matching itinerary instead of skipping and going straight to matching the InstRWs. That doesn't seem to match what happens in TargetSchedule.cpp

This patch causes problems for a number of models that had been incorrectly flagged as complete.

Differential Revision: https://reviews.llvm.org/D43235

llvm-svn: 329280

1d793b8a

Apr 04, 2018

[Power9]Legalize and emit code for quad-precision fma instructions · 09fda63a

Lei Huang authored 7 years ago

Legalize and emit code for the following quad-precision fma:

  * xsmaddqp
  * xsnmaddqp
  * xsmsubqp
  * xsnmsubqp

Differential Revision: https://reviews.llvm.org/D44843

llvm-svn: 329206

09fda63a

Sort targetgen calls in lib/Target/*/CMakeLists. · 1cbd0969

Nico Weber authored 7 years ago

Makes it easier to see mistakes such as the one fixed in r329178 and makes
the different target CMakeLists more consistent.

Also remove some stale-looking comments from the Nios2 target cmakefile.

No intended behavior change.

llvm-svn: 329181

1cbd0969

Apr 03, 2018
- [PowerPC] reorder entries in P9InstrResources.td in alphabetical order; NFC · 08a1775f
  Hiroshi Inoue authored 7 years ago
```
Reorder entries added in my previous commit (rL328969) to keep alphabetical order.

llvm-svn: 329064
```
  08a1775f
Apr 02, 2018

[PowerPC] fix assertion failure due to missing instruction in P9InstrResources.td · 6d484938

Hiroshi Inoue authored 7 years ago

This patch adds L(D|W|H|B)XTLS instructions introduced by https://reviews.llvm.org/rL327635 in P9InstrResources.td.

llvm-svn: 328969

6d484938

Mar 29, 2018

[IR][CodeGen] Remove dependency on EVT from IR/Function.cpp. Move EVT to CodeGen layer. · 2fa14362

Craig Topper authored 7 years ago

Currently EVT is in the IR layer only because of Function.cpp needing a very small piece of the functionality of EVT::getEVTString(). The rest of EVT is used in codegen making CodeGen a better place for it.

The previous code converted a Type* to EVT and then called getEVTString. This was only expected to handle the primitive types from Type*. Since there only a few primitive types, we can just print them as strings directly.

Differential Revision: https://reviews.llvm.org/D45017

llvm-svn: 328806

2fa14362

Plumb useAA through TargetTransformInfo to remove Transforms->CodeGen header dependency · 8ad9a973
David Blaikie authored 7 years ago
```
Thanks to echristo for the pointers on direction.

llvm-svn: 328737
```
8ad9a973

Mar 28, 2018

Transforms: Introduce Transforms/Utils.h rather than spreading the... · a373d18e

David Blaikie authored 7 years ago

Transforms: Introduce Transforms/Utils.h rather than spreading the declarations amongst Scalar.h and IPO.h

Fixes layering - Transforms/Utils shouldn't depend on including a Scalar
or IPO header, because Scalar and IPO depend on Utils.

llvm-svn: 328717

a373d18e

Mar 27, 2018
- Initialize variable added in r328617. · 33dc0186
  Sterling Augustine authored 7 years ago
```
llvm-svn: 328667
```
  33dc0186
- [Power9] Fix the resource list for the COPY instruction. · 659f0403
  Stefan Pintilie authored 7 years ago
```
The COPY instruction was listed as a 4 cycle instruction.
It is now listed correctly as a 2 cycle ALU instruction.

llvm-svn: 328647
```
  659f0403