[MLIR][GPU][NVVM] Add conversion of warp synchronous matrix-multiply accumulate GPU ops (eaaf7a6a)

Commit eaaf7a6a, authored May 21, 2021 by Navdeep Kumar; committed by Uday Bondhugula on May 21, 2021

Add conversion of warp synchronous matrix-multiply accumulate GPU ops to
NVVM ops. The following conversions are added:
  1.) subgroup_mma_load_matrix -> wmma.m16n16k16.load.[a,b,c]..row.stride
  2.) subgroup_mma_store_matrix -> wmma.m16n16k16.store.d.[f16,f32].row.stride
  3.) subgroup_mma_compute -> wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32]
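
The bracketed groups in the names above read as alternatives: each [f16,f32] stands for one element type. As an illustration only, the Python sketch below expands such a pattern into concrete dotted names. The helper expand_pattern is hypothetical and is not part of MLIR or of this commit. The expansion is purely textual, and the commit message does not say whether every resulting combination is actually lowered. The load pattern is left out because one of its components is missing in the text above.

```python
import itertools
import re


def expand_pattern(pattern):
    """Expand every [a,b,...] group in a dotted name into concrete names.

    Hypothetical helper for reading the commit message; not an MLIR API.
    """
    # re.split with a capturing group alternates literal text (even indices)
    # and the contents of each bracketed group (odd indices).
    parts = re.split(r"\[([^\]]*)\]", pattern)
    choices = [
        part.split(",") if i % 2 else [part]
        for i, part in enumerate(parts)
    ]
    # The Cartesian product over all groups yields one name per combination.
    return ["".join(combo) for combo in itertools.product(*choices)]


# Patterns copied from items 2 and 3 of the list above.
STORE = "wmma.m16n16k16.store.d.[f16,f32].row.stride"
MMA = "wmma.m16n16k16.mma.row.row.[f16,f32].[f16,f32]"

for name in expand_pattern(STORE) + expand_pattern(MMA):
    print(name)
```

The store pattern expands to two names and the compute pattern to four, one per pairing of the two bracketed groups.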

Reviewed By: bondhugula, ftynse

Differential Revision: https://reviews.llvm.org/D95331

parent c2d44bd2
