Commits · a780ffaac29e9d38db75ba9ba7f74617a2e59ba4 · Roger Ferrer / llvm-epi

Mar 23, 2017
- [AMDGPU] Emit kernel debug properties as code object metadata · a780ffaa
  Konstantin Zhuravlyov authored Mar 22, 2017
```
Differential Revision: https://reviews.llvm.org/D30969

llvm-svn: 298558
```
  a780ffaa
Mar 22, 2017

[AMDGPU] Emit kernel code properties as code object metadata · ca0e7f64

Konstantin Zhuravlyov authored Mar 22, 2017

  - These are not required for low level runtime

Differential Revision: https://reviews.llvm.org/D29949

llvm-svn: 298556

ca0e7f64

[AMDGPU] Restructure code object metadata creation · 7498cd61

Konstantin Zhuravlyov authored Mar 22, 2017

  - Rename runtime metadata -> code object metadata
  - Make metadata not flow
  - Switch enums to use ScalarEnumerationTraits
  - Cleanup and move AMDGPUCodeObjectMetadata.h to AMDGPU/MCTargetDesc
  - Introduce in-memory representation for attributes
  - Code object metadata streamer
  - Create metadata for isa and printf during EmitStartOfAsmFile
  - Create metadata for kernel during EmitFunctionBodyStart
  - Finalize and emit metadata to .note during EmitEndOfAsmFile
  - Other minor improvements/bug fixes

Differential Revision: https://reviews.llvm.org/D29948

llvm-svn: 298552

7498cd61

[AMDGPU] Fix bug 31610 · eb685e5f
Konstantin Zhuravlyov authored Mar 22, 2017
```
Differential Revision: https://reviews.llvm.org/D31258

llvm-svn: 298551
```
eb685e5f

Mar 10, 2017

Rename PT_NOTE namespace name used in AMDGPUPTNote.h · 874d26a8

Yaxun Liu authored Mar 10, 2017

Patch by Guansong Zhang.

Differential Revision: https://reviews.llvm.org/D30750

llvm-svn: 297498

874d26a8

Mar 07, 2017

Revert "AMDGPU: Set MCAsmInfo::PointerSize" · e8aaab8a

Konstantin Zhuravlyov authored Mar 07, 2017

It breaks line tables because the patch is not complete, working on a complete one at the moment

This reverts commit r294031.

llvm-svn: 297118

e8aaab8a

Feb 27, 2017

AMDGPU: Add VOP3P instruction format · 9be7b0d4

Matt Arsenault authored Feb 27, 2017

Add a few non-VOP3P but instructions related to packed.

Includes hack with dummy operands for the benefit of the assembler

llvm-svn: 296368

9be7b0d4

[AMDGPU] Runtime metadata fixes: · 972948b3

Konstantin Zhuravlyov authored Feb 27, 2017

  - Verify that runtime metadata is actually valid runtime metadata when assembling, otherwise we could accept the following when assembling, but ocl runtime will reject it:
    .amdgpu_runtime_metadata
    { amd.MDVersion: [ 2, 1 ], amd.RandomUnknownKey, amd.IsaInfo: ...
  - Make IsaInfo optional, and always emit it.

Differential Revision: https://reviews.llvm.org/D30349

llvm-svn: 296324

972948b3

Feb 10, 2017
- AMDGPU: Fix trailing whitespace · b4493e90
  Matt Arsenault authored Feb 10, 2017
```
llvm-svn: 294694
```
  b4493e90
Feb 08, 2017
- [AMDGPU][NFC] Assign IsaInfo to reference variable in order to shorten long lines · b5acb8ec
  Konstantin Zhuravlyov authored Feb 08, 2017
```
llvm-svn: 294454
```
  b5acb8ec
- [AMDGPU] Add target information that is required by tools to metadata · 9f89ede1
  Konstantin Zhuravlyov authored Feb 08, 2017
```
Differential Revision: https://reviews.llvm.org/D28760#fb670e28

llvm-svn: 294449
```
  9f89ede1
Feb 04, 2017
- [AMDGPU] Fix some Include What You Use warnings; other minor fixes (NFC). · e894b4dc
  Eugene Zelenko authored Feb 03, 2017
```
This is preparation to reduce MCExpr.h dependencies.

llvm-svn: 294067
```
  e894b4dc
Feb 03, 2017
- AMDGPU: Set MCAsmInfo::PointerSize · 1fa5eacf
  Matt Arsenault authored Feb 03, 2017
```
llvm-svn: 294031
```
  1fa5eacf
Feb 02, 2017

AMDGPU: Use source modifiers with f16->f32 conversions · 9dba9bd4

Matt Arsenault authored Feb 02, 2017

The operand types were defined to fit the fp16_to_fp node, which
has the half as an integer type. v_cvt_f32_f16 does support
source modifiers, so change this to have an FP type and modifiers.

For targets without legal f16, this requires recognizing the
bit operations and trying to produce them.

llvm-svn: 293857

9dba9bd4

Jan 20, 2017
- [AMDGPU] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). · 734bb7bb
  Eugene Zelenko authored Jan 20, 2017
```
llvm-svn: 292623
```
  734bb7bb
Jan 13, 2017

Apply clang-tidy's performance-unnecessary-value-param to LLVM. · 061f4a5f

Benjamin Kramer authored Jan 13, 2017

With some minor manual fixes for using function_ref instead of
std::function. No functional change intended.

llvm-svn: 291904

061f4a5f

Dec 23, 2016
- Enable '-Wstring-conversion' and fix some bad asserts that it helped · ee086761
  Chandler Carruth authored Dec 23, 2016
```
find.

Notable is the assert in NewGVN which had no effect because of the bug.

llvm-svn: 290400
```
  ee086761
Dec 19, 2016

AMDGPU: [AMDGPU] Assembler: add .hsa_code_object_metadata directive for functime metadata V2.0 · 69c8aa26

Sam Kolton authored Dec 19, 2016

Summary:
Added pair of directives .hsa_code_object_metadata/.end_hsa_code_object_metadata.
Between them user can put YAML string that would be directly put to the generated note. E.g.:
'''
.hsa_code_object_metadata
    {
        amd.MDVersion: [ 2, 0 ]
    }
.end_hsa_code_object_metadata
'''
Based on D25046

Reviewers: vpykhtin, nhaustov, yaxunl, tstellarAMD

Subscribers: arsenm, kzhuravl, wdng, nhaehnle, mgorny, tony-tye

Differential Revision: https://reviews.llvm.org/D27619

llvm-svn: 290097

69c8aa26

Dec 14, 2016
- Fix build failure due to r289674 on certain systems · 04334b52
  Yaxun Liu authored Dec 14, 2016
```
Removed a useless include which caused conflict.

llvm-svn: 289700
```
  04334b52
- AMDGPU: Emit runtime metadata version 2 as YAML · 07d659bc
  Yaxun Liu authored Dec 14, 2016
```
Differential Revision: https://reviews.llvm.org/D25046

llvm-svn: 289674
```
  07d659bc
Dec 12, 2016

[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What You... · 6a9226d9

Eugene Zelenko authored Dec 12, 2016

[AMDGPU, PowerPC, TableGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC).

llvm-svn: 289475

6a9226d9

Dec 10, 2016

AMDGPU: Fix handling of 16-bit immediates · 4bd72361

Matt Arsenault authored Dec 10, 2016

Since 32-bit instructions with 32-bit input immediate behavior
are used to materialize 16-bit constants in 32-bit registers
for 16-bit instructions, determining the legality based
on the size is incorrect. Change operands to have the size
specified in the type.

Also adds a workaround for a disassembler bug that
produces an immediate MCOperand for an operand that
is supposed to be OPERAND_REGISTER.

The assembler appears to accept out of bounds immediates and
truncates them, but this seems to be an issue for 32-bit
already.

llvm-svn: 289306

4bd72361

Nov 19, 2016

Check that emitted instructions meet their predicates on all targets except ARM, Mips, and X86. · 72db2a39

Daniel Sanders authored Nov 19, 2016

Summary:
* ARM is omitted from this patch because this check appears to expose bugs in this target.
* Mips is omitted from this patch because this check either detects bugs or deliberate
  emission of instructions that don't satisfy their predicates. One deliberate
  use is the SYNC instruction where the version with an operand is correctly
  defined as requiring MIPS32 while the version without an operand is defined
  as an alias of 'SYNC 0' and requires MIPS2.
* X86 is omitted from this patch because it doesn't use the tablegen-erated
  MCCodeEmitter infrastructure.

Patches for ARM and Mips will follow.

Depends on D25617

Reviewers: tstellarAMD, jmolloy

Subscribers: wdng, jmolloy, aemerson, rengolin, arsenm, jyknight, nemanjai, nhaehnle, tstellarAMD, llvm-commits

Differential Revision: https://reviews.llvm.org/D25618

llvm-svn: 287439

72db2a39

Nov 11, 2016
- [AMDGPU] TargetStreamer: Fix .note section name · ce0aba74
  Sam Kolton authored Nov 11, 2016
```
llvm-svn: 286591
```
  ce0aba74
- Fix requirements. · 618d475c
  Joerg Sonnenberger authored Nov 10, 2016
```
llvm-svn: 286527
```
  618d475c
Nov 10, 2016

AMDGPU: Emit runtime metadata as a note element in .note section · d6fbe650

Yaxun Liu authored Nov 10, 2016

Currently runtime metadata is emitted as an ELF section with name .AMDGPU.runtime_metadata.

However there is a standard way to convey vendor specific information about how to run an ELF binary, which is called vendor-specific note element (http://www.netbsd.org/docs/kernel/elf-notes.html).

This patch lets AMDGPU backend emits runtime metadata as a note element in .note section.

Differential Revision: https://reviews.llvm.org/D25781

llvm-svn: 286502

d6fbe650

Oct 29, 2016
- AMDGPU: Use 1/2pi inline imm on VI · c88ba36e
  Matt Arsenault authored Oct 29, 2016
```
I'm guessing at how it is supposed to be printed

llvm-svn: 285490
```
  c88ba36e
Oct 20, 2016
- [AMDGPU] Make note record name a static const member of target streamer · 521e5ef4
  Konstantin Zhuravlyov authored Oct 20, 2016
```
Differential Revision: https://reviews.llvm.org/D25746

llvm-svn: 284760
```
  521e5ef4
Oct 19, 2016
- [AMDGPU] Stop using MCRegisterClass::getSize() · c8715503
  Krzysztof Parzyszek authored Oct 19, 2016
```
Differential Review: https://reviews.llvm.org/D24675

llvm-svn: 284619
```
  c8715503
Oct 18, 2016
- [AMDGPU] Mark .note section SHF_ALLOC so lld creates a segment for it · 98a3ac71
  Konstantin Zhuravlyov authored Oct 17, 2016
```
Differential Revision: https://reviews.llvm.org/D25694

llvm-svn: 284435
```
  98a3ac71
Oct 14, 2016
- [AMDGPU] Add 32-bit lo/hi got and pc relative variant kinds and emit appropriate relocations · 2a2ac37c
  Konstantin Zhuravlyov authored Oct 14, 2016
```
Differential Revision: https://reviews.llvm.org/D25548

llvm-svn: 284195
```
  2a2ac37c
Oct 10, 2016

Move the global variables representing each Target behind accessor function · f42454b9

Mehdi Amini authored Oct 09, 2016

This avoids "static initialization order fiasco"

Differential Revision: https://reviews.llvm.org/D25412

llvm-svn: 283702

f42454b9

Oct 07, 2016

AMDGPU/SI: Add support for 8-byte relocations · 6982bb8f

Tom Stellard authored Oct 07, 2016

Reviewers: arsenm, kzhuravl

Subscribers: wdng, nhaehnle, yaxunl, llvm-commits, tony-tye

Differential Revision: https://reviews.llvm.org/D25375

llvm-svn: 283593

6982bb8f

AMDGPU/SI: Emit fixups for long branches · ef33c4b3

Tom Stellard authored Oct 07, 2016

Reviewers: arsenm

Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye

Differential Revision: https://reviews.llvm.org/D25366

llvm-svn: 283570

ef33c4b3

Oct 06, 2016
- BranchRelaxation: Support expanding unconditional branches · 6bc43d86
  Matt Arsenault authored Oct 06, 2016
```
AMDGPU needs to expand unconditional branches in a new
block with an indirect branch.

llvm-svn: 283464
```
  6bc43d86
Sep 21, 2016

[AMDGPU] Assembler: remove unused AMDGPUMCObjectWriter. · 12b633be

Sam Kolton authored Sep 21, 2016

Summary: It is replaced by AMDGPUELFObjectWriter

Reviewers: tstellarAMD, vpykhtin, artem.tamazov

Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl

Differential Revision: https://reviews.llvm.org/D24654

llvm-svn: 282065

12b633be

Sep 19, 2016

[AMDGPU] Fix s_branch with -1 offset · be7ffb90

Sam Kolton authored Sep 19, 2016

Summary:
In case s_branch instruction target is itself backend should emit offset -1 but instead it emit 0.
'''
label:
    s_branch label  // should emit [0xff,0xff,0x82,0xbf]
'''

Tom, Matt: why are we adjusting fixup values in applyFixup() method instead of processFixup()? processFixup() is calling adjustFixupValue() but does nothing with its result.

Reviewers: vpykhtin, artem.tamazov, tstellarAMD

Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl

Differential Revision: https://reviews.llvm.org/D24671

llvm-svn: 281896

be7ffb90

Sep 09, 2016

AMDGPU] Assembler: better support for immediate literals in assembler. · 1eeb11bf

Sam Kolton authored Sep 09, 2016

Summary:
Prevously assembler parsed all literals as either 32-bit integers or 32-bit floating-point values. Because of this we couldn't support f64 literals.
E.g. in instruction "v_fract_f64 v[0:1], 0.5", literal 0.5 was encoded as 32-bit literal 0x3f000000, which is incorrect and will be interpreted as 3.0517578125E-5 instead of 0.5. Correct encoding is inline constant 240 (optimal) or 32-bit literal 0x3FE00000 at least.

With this change the way immediate literals are parsed is changed. All literals are always parsed as 64-bit values either integer or floating-point. Then we convert parsed literals to correct form based on information about type of operand parsed (was literal floating or binary) and type of expected instruction operands (is this f32/64 or b32/64 instruction).
Here are rules how we convert literals:
- We parsed fp literal:
- Instruction expects 64-bit operand:
- If parsed literal is inlinable (e.g. v_fract_f64_e32 v[0:1], 0.5)
- then we do nothing this literal
- Else if literal is not-inlinable but instruction requires to inline it (e.g. this is e64 encoding, v_fract_f64_e64 v[0:1], 1.5)
- report error
- Else literal is not-inlinable but we can encode it as additional 32-bit literal constant
- If instruction expect fp operand type (f64)
- Check if low 32 bits of literal are zeroes (e.g. v_fract_f64 v[0:1], 1.5)
- If so then do nothing
- Else (e.g. v_fract_f64 v[0:1], 3.1415)
- report warning that low 32 bits will be set to zeroes and precision will be lost
- set low 32 bits of literal to zeroes
- Instruction expects integer operand type (e.g. s_mov_b64_e32 s[0:1], 1.5)
- report error as it is unclear how to encode this literal
- Instruction expects 32-bit operand:
- Convert parsed 64 bit fp literal to 32 bit fp. Allow lose of precision but not overflow or underflow
- Is this literal inlinable and are we required to inline literal (e.g. v_trunc_f32_e64 v0, 0.5)
- do nothing
- Else report error
- Do nothing. We can encode any other 32-bit fp literal (e.g. v_trunc_f32 v0, 10000000.0)
- Parsed binary literal:
- Is this literal inlinable (e.g. v_trunc_f32_e32 v0, 35)
- do nothing
- Else, are we required to inline this literal (e.g. v_trunc_f32_e64 v0, 35)
- report error
- Else, literal is not-inlinable and we are not required to inline it
- Are high 32 bit of literal zeroes or same as sign bit (32 bit)
- do nothing (e.g. v_trunc_f32 v0, 0xdeadbeef)
- Else
- report error (e.g. v_trunc_f32 v0, 0x123456789abcdef0)

For this change it is required that we know operand types of instruction (are they f32/64 or b32/64). I added several new register operands (they extend previous register operands) and set operand types to corresponding types:
'''
enum OperandType {
OPERAND_REG_IMM32_INT,
OPERAND_REG_IMM32_FP,
OPERAND_REG_INLINE_C_INT,
OPERAND_REG_INLINE_C_FP,
}
'''

This is not working yet:
- Several tests are failing
- Problems with predicate methods for inline immediates
- LLVM generated assembler parts try to select e64 encoding before e32.
More changes are required for several AsmOperands.

Reviewers: vpykhtin, tstellarAMD

Subscribers: arsenm, kzhuravl, artem.tamazov

Differential Revision: https://reviews.llvm.org/D22922

llvm-svn: 281050

1eeb11bf

[AMDGPU] Assembler: rename amd_kernel_code_t asm names according to spec · a2e5c88b

Sam Kolton authored Sep 09, 2016

Summary:
Also removed duplicate code from AMDGPUTargetAsmStreamer.
This change only change how amd_kernel_code_t is parsed and printed. No variable names are changed.

Reviewers: vpykhtin, tstellarAMD

Subscribers: arsenm, wdng, nhaehnle

Differential Revision: https://reviews.llvm.org/D24296

llvm-svn: 281028

a2e5c88b

Aug 29, 2016
- AMDGPU/R600: Fix fixups used for constant arrays · b90fc9b3
  Matt Arsenault authored Aug 29, 2016
```
Fixes bug 29289

llvm-svn: 279986
```
  b90fc9b3