- Mar 17, 2014
Tom Stellard authored
The type of the immediates should not matter as long as the encoding is equivalent to the encoding of one of the legal inline constants. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 204056
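For context, SI's inline constants are the integers -16 through 64 and a small set of floating-point values (±0.5, ±1.0, ±2.0, ±4.0). Below is a minimal standalone sketch of the kind of encoding check the commit describes; it is not the in-tree SIInstrInfo code, and the helper name is invented.

```cpp
// Minimal sketch, not the in-tree implementation: decide whether a 32-bit
// immediate can be encoded as one of SI's inline constants by looking only
// at its bit pattern, not at its nominal type.
#include <cstdint>

static bool isEncodableAsInlineConstant(uint32_t Bits) {
  // Integer inline constants cover -16..64.
  int32_t Signed = static_cast<int32_t>(Bits);
  if (Signed >= -16 && Signed <= 64)
    return true;

  // Floating-point inline constants, matched by bit pattern.
  switch (Bits) {
  case 0x3F000000: // 0.5
  case 0xBF000000: // -0.5
  case 0x3F800000: // 1.0
  case 0xBF800000: // -1.0
  case 0x40000000: // 2.0
  case 0xC0000000: // -2.0
  case 0x40800000: // 4.0
  case 0xC0800000: // -4.0
    return true;
  default:
    return false;
  }
}
```

The point of the commit is that an operand whose bits happen to match one of these encodings is legal regardless of whether it was written as an integer or a float.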
Tom Stellard authored
Added checks for number of operands and operand register classes. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 204054
- Mar 14, 2014
Owen Anderson authored
Changed operator* on the by-operand iterators to return a MachineOperand& rather than a MachineInstr&. At this point they almost behave like normal iterators! Again, this requires making some existing loops more verbose, but it should pave the way for the big range-based for-loop cleanups in the future. llvm-svn: 203865
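A small sketch of what the more verbose loops look like under the new semantics, assuming the post-change MachineRegisterInfo use iterators; the helper and the setIsKill() call are illustrative only.

```cpp
// Illustrative only: with the by-operand use iterators, operator* now
// yields the MachineOperand itself; the owning MachineInstr has to be
// fetched explicitly through getParent().
#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"

using namespace llvm;

static void clearKillFlagsOnUses(MachineRegisterInfo &MRI, unsigned Reg) {
  for (MachineRegisterInfo::use_iterator UI = MRI.use_begin(Reg),
                                         UE = MRI.use_end();
       UI != UE; ++UI) {
    MachineOperand &UseMO = *UI;             // a MachineOperand&, not a MachineInstr&
    MachineInstr *UseMI = UseMO.getParent(); // the instruction that owns it
    (void)UseMI;                             // e.g. inspect or update the user here
    UseMO.setIsKill(false);                  // operate directly on the operand
  }
}
```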
- Mar 11, 2014
Matt Arsenault authored
llvm-svn: 203517
- Feb 10, 2014
Tom Stellard authored
DS instructions that access local memory can only use addresses that are less than or equal to the value of M0. When M0 is uninitialized, we experience undefined behavior. This patch also changes the behavior to emit S_WQM_B64 on pixel shaders no matter what kind of DS instruction is used. llvm-svn: 201097
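A hedged sketch of the implied fix: write an all-ones mask into M0 at function entry so every local-memory address is covered before any DS instruction runs. The insertion point and helper are assumptions, not the patch itself, and the AMDGPU::* enums come from the target's own headers.

```cpp
// Sketch only: initialize M0 to 0xFFFFFFFF at the top of the entry block so
// that DS addresses (which must be <= M0) never hit uninitialized state.
// Requires the target-internal SIInstrInfo/AMDGPU headers in addition to
// the generic CodeGen ones included here.
#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/MachineInstrBuilder.h"

using namespace llvm;

static void initM0ForLDS(MachineFunction &MF, const TargetInstrInfo &TII) {
  MachineBasicBlock &Entry = MF.front();
  MachineBasicBlock::iterator I = Entry.begin();
  DebugLoc DL; // synthesized code, no source location

  BuildMI(Entry, I, DL, TII.get(AMDGPU::S_MOV_B32), AMDGPU::M0)
      .addImm(0xFFFFFFFF); // M0 bounds all DS accesses
}
```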
- Dec 17, 2013
Andrew Trick authored
Allow MachineCSE to coalesce trivial subregister copies the same way that it coalesces normal copies. Without this, MachineCSE is powerless to handle redundant operations with truncated source operands. This required fixing the 2-addr pass to handle tied subregisters. It isn't clear what combinations of subregisters can legally be tied, but the simple case of truncated source operands is now safely handled:
%vreg11<def> = COPY %vreg1:sub_32bit; GR32:%vreg11 GR64:%vreg1
%vreg12<def> = COPY %vreg2:sub_32bit; GR32:%vreg12 GR64:%vreg2
%vreg13<def,tied1> = ADD32rr %vreg11<tied0>, %vreg12<kill>, %EFLAGS<imp-def>
Test case: cse-add-with-overflow.ll. This exposed an existing bug in PPCInstrInfo::commuteInstruction. Thanks to Rafael for the test case: PowerPC/crash.ll. llvm-svn: 197465
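For orientation, the pattern above is roughly what source like the following produces on x86-64, where two 64-bit values are truncated before a 32-bit add; this is only an illustration of "truncated source operands", not the cse-add-with-overflow.ll test case.

```cpp
#include <cstdint>

// Each truncation becomes a COPY from the low sub_32bit sub-register of a
// 64-bit virtual register (GR64 -> GR32), feeding a 32-bit ADD32rr; this is
// the shape of copy that MachineCSE can now coalesce like any other.
int32_t add_truncated(int64_t a, int64_t b) {
  int32_t lo_a = static_cast<int32_t>(a); // COPY %vregA:sub_32bit
  int32_t lo_b = static_cast<int32_t>(b); // COPY %vregB:sub_32bit
  return lo_a + lo_b;                     // ADD32rr
}
```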
- Nov 27, 2013
Tom Stellard authored
SGPRs are spilled into VGPRs using the {READ,WRITE}LANE_B32 instructions.
v2:
- Fix encoding of Lane Mask
- Use correct register flags, so we don't overwrite the low dword when restoring multi-dword registers.
v3:
- Register spilling seems to hang the GPU, so replace all shaders that need spilling with a dummy shader.
v4:
- Fix *LANE definitions
- Change destination reg class for 32-bit SMRD instructions
v5:
- Remove small optimization that was crashing Serious Sam 3.
https://bugs.freedesktop.org/show_bug.cgi?id=68224
https://bugs.freedesktop.org/show_bug.cgi?id=71285
NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195880
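As a rough mental model of the {READ,WRITE}LANE_B32 scheme (a conceptual sketch, not compiler or hardware code; the 64-lane width is SI's wavefront size):

```cpp
// Conceptual model: a VGPR holds one 32-bit value per lane of the 64-wide
// wavefront. V_WRITELANE_B32 parks a scalar (SGPR) value in a chosen lane,
// and V_READLANE_B32 reads it back, which is what lets SGPRs be spilled
// into VGPRs without touching memory.
#include <array>
#include <cstdint>

struct VGPR {
  std::array<uint32_t, 64> lanes{}; // one dword per wavefront lane
};

// Model of V_WRITELANE_B32: save an SGPR value into lane `lane`.
inline void writeLane(VGPR &v, uint32_t sgprValue, unsigned lane) {
  v.lanes[lane] = sgprValue;
}

// Model of V_READLANE_B32: restore the SGPR value from lane `lane`.
inline uint32_t readLane(const VGPR &v, unsigned lane) {
  return v.lanes[lane];
}
```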
- Nov 18, 2013
Matt Arsenault authored
Moving into a VSrc doesn't always work, since it could be replaced with an SGPR later. llvm-svn: 195042
Matt Arsenault authored
No other SGPR operands are allowed, so if VCC is used, move the other to a VGPR. llvm-svn: 195041
Matt Arsenault authored
llvm-svn: 195034
Matt Arsenault authored
When replacing scalar operations with vector ones, the wrong implicit output register was used. llvm-svn: 195033
- Nov 15, 2013
Matt Arsenault authored
llvm-svn: 194858
- Nov 14, 2013
Matt Arsenault authored
llvm-svn: 194688
Matt Arsenault authored
llvm-svn: 194684
Tom Stellard authored
llvm-svn: 194632
Tom Stellard authored
Private address space is emulated using the register file with MOVRELS and MOVRELD instructions. llvm-svn: 194626
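A conceptual model of what "emulated using the register file" means here, assuming SI's M0-indexed MOVREL semantics; the struct and member names are invented for illustration.

```cpp
// Conceptual model, not compiler code: a block of registers stands in for
// private memory, and MOVRELD/MOVRELS access it indirectly, with M0
// supplying the dynamic index, much like a runtime-indexed array.
#include <array>
#include <cstdint>

struct RegisterFilePrivateMemory {
  std::array<uint32_t, 256> regs{}; // register block reserved as "private memory"
  uint32_t m0 = 0;                  // dynamic index register

  // Model of MOVRELD: write the register selected by base + M0.
  void storeIndirect(unsigned base, uint32_t value) { regs[base + m0] = value; }

  // Model of MOVRELS: read the register selected by base + M0.
  uint32_t loadIndirect(unsigned base) const { return regs[base + m0]; }
};
```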
Tom Stellard authored
All shift operations will be selected as SALU instructions and then, if necessary, lowered to VALU instructions in the SIFixSGPRCopies pass. This allows us to do more operations on the SALU, which will improve performance and is also required for implementing private memory using indirect addressing, since the private memory pointers must stay in the scalar registers. This patch includes some fixes from Matt Arsenault. llvm-svn: 194625
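A hedged sketch of the policy this describes: select everything as an SALU instruction and demote to the VALU only when an input actually lives in a vector register, since the scalar ALU cannot read VGPRs. This is an illustration, not the in-tree SIFixSGPRCopies pass; the isVGPRClass callback stands in for the target's register-class query.

```cpp
// Illustration only: decide whether an instruction selected for the SALU
// has to be rewritten as its VALU form because one of its register inputs
// is (or will be) a VGPR.
#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"

using namespace llvm;

static bool mustMoveToVALU(const MachineInstr &MI,
                           const MachineRegisterInfo &MRI,
                           bool (*isVGPRClass)(const TargetRegisterClass *)) {
  for (unsigned i = 0, e = MI.getNumOperands(); i != e; ++i) {
    const MachineOperand &MO = MI.getOperand(i);
    if (!MO.isReg() || !MO.isUse() ||
        !TargetRegisterInfo::isVirtualRegister(MO.getReg()))
      continue;
    if (isVGPRClass(MRI.getRegClass(MO.getReg())))
      return true; // a VGPR input forces the VALU form
  }
  return false;
}
```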
- Oct 28, 2013
NAKAMURA Takumi authored
llvm-svn: 193510
- Oct 22, 2013
Tom Stellard authored
llvm-svn: 193183
Tom Stellard authored
llvm-svn: 193180
Tom Stellard authored
The AMDGPUIndirectAddressing pass was previously responsible for lowering private loads and stores to indirect addressing instructions. However, this pass was buggy and way too complicated. The only advantage it had over the new simplified code was that it saved one instruction per direct write to private memory. This optimization likely has a minimal impact on performance, and we may be able to duplicate it using some other transformation. For the private address space, we now:
1. Lower private loads/stores to Register(Load|Store) instructions.
2. Reserve part of the register file as 'private memory'.
3. After regalloc, lower the Register(Load|Store) instructions to MOV instructions that use indirect addressing.
llvm-svn: 193179
Tom Stellard authored
llvm-svn: 193178
- Oct 16, 2013
Vincent Lejeune authored
llvm-svn: 192743
- Oct 10, 2013
Tom Stellard authored
The function is used by the machine verifier and checks that VOP* instructions have legal operands. llvm-svn: 192367
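A simplified illustration of the two kinds of checks mentioned in the Mar 17, 2014 entry above (operand count and operand register classes); it is written as a free function against the LLVM 3.x-era C++ API and is not the in-tree SIInstrInfo::verifyInstruction.

```cpp
// Sketch of verifier-style operand checks: every declared operand must be
// present, and each virtual-register operand must belong to a register
// class compatible with the one the instruction description expects.
#include "llvm/ADT/StringRef.h"
#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/MC/MCInstrDesc.h"

using namespace llvm;

static bool verifyOperands(const MachineInstr &MI,
                           const TargetRegisterInfo &TRI, StringRef &ErrInfo) {
  const MCInstrDesc &Desc = MI.getDesc();

  // Every operand the instruction description declares must be present.
  if (MI.getNumExplicitOperands() < Desc.getNumOperands()) {
    ErrInfo = "missing explicit operands";
    return false;
  }

  const MachineRegisterInfo &MRI = MI.getParent()->getParent()->getRegInfo();
  for (unsigned i = 0, e = Desc.getNumOperands(); i != e; ++i) {
    const MachineOperand &MO = MI.getOperand(i);
    if (!MO.isReg() || !TargetRegisterInfo::isVirtualRegister(MO.getReg()))
      continue;
    int RCID = Desc.OpInfo[i].RegClass;
    if (RCID < 0)
      continue; // no register-class constraint on this operand
    const TargetRegisterClass *Expected = TRI.getRegClass(RCID);
    if (!Expected->hasSubClassEq(MRI.getRegClass(MO.getReg()))) {
      ErrInfo = "operand register class does not match the instruction";
      return false;
    }
  }
  return true; // instruction looks well-formed
}
```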
- Aug 18, 2013
Dmitri Gribenko authored
llvm-svn: 188626
- Aug 16, 2013
Michel Danzer authored
The logic in SIInsertWaits::getHwCounts() only really made sense for SMRD instructions, and trying to shoehorn it into handling DS_WRITE_B32 caused it to corrupt the encoding of that instruction by clobbering the first operand with the second one. Undo that damage and only apply that logic to SMRD instructions. Fixes some derivatives-related piglit regressions with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188558
- Aug 15, 2013
Tom Stellard authored
The previous code declared the operand as unknown:$vaddr, which made it possible for scalar registers to be used instead of vector registers. llvm-svn: 188425
- Jul 15, 2013
Craig Topper authored
llvm-svn: 186307
- Jun 07, 2013
Bill Wendling authored
The internals of TargetMachine could change. No functionality change intended. llvm-svn: 183561
- Apr 10, 2013
Christian Konig authored
Depending on the number of bits set in the writemask. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179166
- Mar 27, 2013
Christian Konig authored
Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178127
- Mar 26, 2013
Christian Konig authored
Prevent loading M0 multiple times. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178023
- Mar 01, 2013
Christian Konig authored
v2: based on Michel's patch, but now allows copying of all register sizes. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176346
- Feb 26, 2013
Christian Konig authored
Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176102
- Feb 16, 2013
Christian Konig authored
Seems to be a lot simpler, and also paves the way for further improvements. v2: rebased on master, use 0 in BUFFER_LOAD_FORMAT_XYZW, use VGPR0 in dummy EXP, avoid compiler warning, break after encoding the first literal. v3: correctly use V_ADD_F32_e64. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175354
- Feb 07, 2013
Tom Stellard authored
Allows nexuiz to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174655
- Feb 06, 2013
Tom Stellard authored
Only implemented for R600 so far. SI is missing implementations of a few callbacks used by the Indirect Addressing pass and needs code to handle frame indices. At the moment R600 only supports array sizes of 16 dwords or less. Register packing of vector types is currently disabled, which means that a vec4 is stored in T0_X, T1_X, T2_X, T3_X, rather than T0_XYZW. In order to correctly pack registers in all cases, we will need to implement an analysis pass for R600 that determines the correct vector width for each array.
v2:
- Add support for i8 zext load from stack.
- Coding style fixes
v3:
- Don't reserve registers for indirect addressing when it isn't being used.
- Fix bug caused by LLVM limiting the number of SubRegIndex declarations.
v4:
- Fix 64-bit defines
llvm-svn: 174525
- Jan 02, 2013
Chandler Carruth authored
Resorted the #include lines with the utils/sort_includes.py script. Most of these changes are updating the new R600 target and fixing up a few regressions that have crept in since the last time I sorted the includes. llvm-svn: 171362
- Dec 20, 2012
NAKAMURA Takumi authored
llvm-svn: 170620
- Dec 11, 2012
Tom Stellard authored
A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX llvm-svn: 169915