Skip to content
Unverified Commit 41a07e66 authored by Aart Bik's avatar Aart Bik Committed by GitHub
Browse files

[mlir][sparse] recognize NVidia 2:4 type for matmul (#76758)

This removes the temporary DENSE24 attribute and replaces it with proper
recognition of dense to 24 conversion. The compressionh will be
performed on the device prior to performing the matrix mult. Note that
we no longer need to start with the linalg version, we can lift this to
the proper named linalg op. Also renames some files into more consistent
names.
parent 67c2e354
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please to comment