Commit 28b7e498, authored by Mariusz Sikora, committed by GitHub

AMDGPU/GFX12: Add new dot4 fp8/bf8 instructions (#77892)



Encoding is VOP3P. The instructions are tagged as deep/machine learning
instructions. src0 and src1 are i32 (v4fp8 or v4bf8 packed in i32) and
have no src_modifiers. src2 is f32 and has src_modifiers: f32 fneg
(neg_lo[2]) and f32 fabs (neg_hi[2]).
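
The operand layout described above can be illustrated with a small
reference model. This is a hedged sketch, not the hardware definition.
It assumes that fp8 is E4M3 (bias 7) and bf8 is E5M2 (bias 15), that
byte 0 sits in the low bits of the i32, that fabs is applied before
fneg, and that the result is dot(src0, src1) + src2. NaN/Inf encodings
and f32 rounding are ignored. The names decode_fp8, unpack4 and
dot4_f32 are hypothetical helpers, not LLVM or ISA identifiers.

```python
# Assumed formats: (exponent bits, mantissa bits, bias).
FP8 = (4, 3, 7)    # E4M3, assumption
BF8 = (5, 2, 15)   # E5M2, assumption


def decode_fp8(byte, fmt):
    """Decode one 8-bit float; special values (NaN/Inf) are not handled."""
    exp_bits, man_bits, bias = fmt
    sign = -1.0 if (byte >> 7) & 1 else 1.0
    exp = (byte >> man_bits) & ((1 << exp_bits) - 1)
    man = byte & ((1 << man_bits) - 1)
    if exp == 0:  # subnormal: no implicit leading one
        return sign * man * 2.0 ** (1 - bias - man_bits)
    return sign * (1 + man / (1 << man_bits)) * 2.0 ** (exp - bias)


def unpack4(word, fmt):
    """Split an i32 into four 8-bit floats, lowest byte first (assumed)."""
    return [decode_fp8((word >> (8 * i)) & 0xFF, fmt) for i in range(4)]


def dot4_f32(src0, src1, src2, neg=False, abs_=False, fmt0=FP8, fmt1=FP8):
    """Model: src0/src1 take no modifiers; src2 takes fabs, then fneg."""
    if abs_:          # neg_hi[2] acts as f32 fabs on src2
        src2 = abs(src2)
    if neg:           # neg_lo[2] acts as f32 fneg on src2
        src2 = -src2
    a, b = unpack4(src0, fmt0), unpack4(src1, fmt1)
    return sum(x * y for x, y in zip(a, b)) + src2
```

For example, with src0 packing the E4M3 values [1.0, 2.0, 0.5, -1.0]
(0xB8304038) and src1 packing four copies of 2.0 (0x40404040), the dot
product is 5.0, and the src2 modifiers only change how the f32 addend
is folded in.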

---------

Co-authored-by: Petar Avramovic <Petar.Avramovic@amd.com>
parent 18d0a7e4