rfcs: add proposal on coo sparse encoding #1941

avmanerikar · 2024-05-31T19:55:25Z

Proposal on extending sparsity support to include Coordinate sparse encoding (COO) format.
Link to rendered document.

spencerpatty · 2024-06-04T18:00:03Z

rfcs/20240531-coo-sparse-encoding/README.md

+## Scope of Extended Functionality
+
+For the extended API, a reference implementation for the matrix multiplication primitive will be introduced that supports input tensors with COO encoding. The coverage of the implementation will be identical to that for the CSR encoding and is listed below:
+* Datatype for COO tensor data: `f32` 


Very cool! Do you plan on adding f16 or bf16 support in the future ? or just sticking to f32 for now ?

And for small matrices, you may find that using even s16 could be of benefit ... If ncols is less than 32k, you could use s16 ... If I recall, many operations from SpMM in AI come from tall skinny matrices, so this might be a possibility for future optimizations ...

Thanks for the comment! We do plan on adding f16 support for the implementation. I have updated the RFC accordingly.

rfcs/20240531-coo-sparse-encoding/README.md

h-sadia · 2024-06-06T04:43:43Z

rfcs/20240531-coo-sparse-encoding/README.md

+        /// @param adims Tensor dimensions.
+        /// @param adata_type Data precision/type.
+        /// @param nnz Number of non-zero entries.
+        /// @param index_dt Data type of indices.


What sort of data types are expected for indices?

Hi! Keeping the same data considerations as the CSR format, the datatype is expected to be s32 for the indices.

densamoilov · 2024-06-21T17:22:52Z

rfcs/20240531-coo-sparse-encoding/README.md

+Because of the compressed row, this format tends to be more efficient for large sparse matrices.
+COO, on the other hand, is simpler in implementation in that the encoding comprises of a list of thruples `(values, row_index, column_index)` corresponding to the non-zero values.
+This makes the COO less efficient than CSR but it has the advantage of a reduced conversion overhead and better interpretability. 
+For practical cases, a sorted variant of COO is used wherein the data is encoded as a set of sorted arrays containing the values, row indices and column indices respectively:


Do we explicitly require users to pass us the memory in the canonical COO format where the entries are sorted by row, then column?

Also, do we allow the values to be zero?

Ideally, the generic COO format does not require the user to explicitly pass the memory in a row-major order. But this will be the case for sorted COO as it determines how the buffers are defined for the data indices.
With the given declaration, the indices for dimension $n$ will be stored in buffer index $[n+1]$, that is, buffer[1] will hold indices for dimension 0, buffer[2] will hold indices for dimension 1 and so on. To do this, a pre-defined order for the sorted entries will be necessary.
The value buffers are expected to only hold non-zero elements.

densamoilov · 2024-06-21T17:31:58Z

rfcs/20240531-coo-sparse-encoding/README.md

+    /// assigned numbers (index):
+    ///  - 0: values
+    ///  - 1: dim0_indices
+    ///  - 2: dim1_indices


We need to be generic here that is, the number of buffers for the indices should be equal to the number of dimensions of the tensor. For example, potentially, we could support 3D sparse tensors in COO format and in that case we would need 3 buffers with the indices.

Thanks for the catch! I have updated the RFC accordingly.

densamoilov · 2024-06-21T17:34:06Z

rfcs/20240531-coo-sparse-encoding/README.md

+char *dim1_indices = malloc(indices_size);
+```
+
+#### Memory Creation Example:


Can you please also add an example for the C++ API?

Fixed. Thanks!

avmanerikar added the RFC A design document label May 31, 2024

spencerpatty reviewed Jun 4, 2024

View reviewed changes

dzarukin reviewed Jun 5, 2024

View reviewed changes

h-sadia reviewed Jun 6, 2024

View reviewed changes

avmanerikar force-pushed the amanerik/rfcs/coo-sparse-encoding branch 2 times, most recently from d7a52de to aa9feb6 Compare June 7, 2024 18:18

densamoilov reviewed Jun 21, 2024

View reviewed changes

vpirogov mentioned this pull request Jun 21, 2024

New/other Matrix multiplication algorithm implementation #1971

Open

avmanerikar force-pushed the amanerik/rfcs/coo-sparse-encoding branch from aa9feb6 to c2fc0e6 Compare June 21, 2024 18:56

vpirogov added this to the v3.6 milestone Jul 16, 2024

avmanerikar force-pushed the amanerik/rfcs/coo-sparse-encoding branch from c2fc0e6 to cd64159 Compare July 31, 2024 18:49

rfcs: add proposal on coo sparse encoding

cd15b92

avmanerikar force-pushed the amanerik/rfcs/coo-sparse-encoding branch from cd64159 to cd15b92 Compare August 21, 2024 18:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rfcs: add proposal on coo sparse encoding #1941

rfcs: add proposal on coo sparse encoding #1941

avmanerikar commented May 31, 2024

spencerpatty Jun 4, 2024

avmanerikar Jun 7, 2024

h-sadia Jun 6, 2024

avmanerikar Jun 7, 2024

densamoilov Jun 21, 2024

avmanerikar Jun 21, 2024

densamoilov Jun 21, 2024

avmanerikar Jun 21, 2024

densamoilov Jun 21, 2024

avmanerikar Jun 21, 2024

rfcs: add proposal on coo sparse encoding #1941

Are you sure you want to change the base?

rfcs: add proposal on coo sparse encoding #1941

Conversation

avmanerikar commented May 31, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment