New Sparse Matrix APIs #1279

cjnolet · 2023-02-14T17:53:06Z

Closes #348.

This design addresses some problems we've had in the past when modeling sparse data where our objects were not flexible nor composable enough, which led to APIs which were hard to maintain and state which was hard to track.

This design starts by decomposing sparse formats into two components which are utlimately combined to compose the full sparse object:

a structural component manages the sparsity of the object , indexing, and data-specific metadata, such as total number of rows and columns.
a valued or matrix component combines with a structure and manages the nonzero elements.

Note that this design also affords the ability to model a sparse tensor in the future, as a new format tensor could allow for composing multiple structural and multiple valued components. This could enable our algorithms to support things decompositions of higher ordered structures and/or associated values.

In addition to being flexible and composable, this design also needs to satisfy a couple different levels of immutability:

Read-only: immutable structure, immutable nonzero elements
Fixed-sparsity and value-mutable: immutable structure, mutable nonzero elements
Mutable-sparsity and value-mutable: mutable structure, mutable nonzero elements

Two concepts introduced in this design are pretty core to the 3 states above:

structure-preserving formats are views and require the sparsity to be known at creation time. The actual structural components may or may not be mutable.
structure-owning formats house owning containers and don't require the sparsity to be known at creation time and provide a way to initialize() the sparsity once it is known. These formats will have mutable structure and nonzero elements.

Both the structure and matrix formats can be structure-preserving or structure-owning. While this PR only includes csr_matrix and coo_matrix (I'm considering dropping the r from csr since it doesn't really matter if it's csr or csc), the design will further allow for other formats, such as dcsr and bcsr, in the future.

csr_matrix_view - this is a structure-preserving matrix view. Sparsity must be known up front and the underlying arrays may or may not be const.
csr_matrix - this can be structure-owning or structure-preserving depending on whether its underlying structural component is structure-preserving (view) or structure-owning. Calling view() on this object produces the csr_matrix_view above.
coo_matrix_view - this is a structure-preserving matrix view. Sparsity must be known up front and the underlying arrays may or may not be const.
coo_matrix - this can be structure-owning or structure-preserving depending on whether its underlying structural component is structure-preserving (view) or structure-owning. Calling view() on this object produces the csr_matrix_view above.

Similar to mdarray and mdspan, a bunch of factory functions are provided in raft/core/device_sparse_matrix.hpp to ease the construction process for users. The owning matrix types can be constructed either to own the underlying structure or a view of the structure.

These new formats will allow us to model our sparse APIs so they are much more concise- a function can explicitly require a structure-owning matrix, which is a signal to the user that the function itself will compute the sparsity and at least fill in the initial structure. This will allow us to continue to provide an API which is easier to use, and ultimately feels more like our dense API, while still considering the design differences in sparse computations.

I also want to thank @divyegala for his help making the template metaprogramming layer flexible, reusable, and generally pleasant to use.

more comprehensive googletests
split into different files (types, compressed, coordinate, etc...)
add host APIs

…arse_matrix

… fea-2304-sparse_matrix

cjnolet · 2023-02-16T13:19:00Z

cc @mhoemmen for thoughts

… to capture the sparsity types

… having to care whether it's sparsity owning or preserving

mhoemmen · 2023-02-17T01:16:34Z

Hi @cjnolet ! I do like the idea of distinguishing between structure-preserving formats and structure-owning formats. The latter show up, for example, when wrapping third-party libraries that ingest 3-array CSR and produce an optimized opaque format.

Sometimes those libraries give users a way to modify values or even structure. Regardless, users still may want to take some structure-preserving or opaque format, and "dissolve" it to get a transparent, modifiable format.

I'll take a look at the PR; thanks for tagging me!

cjnolet · 2023-02-17T02:00:09Z

@mhoemmen, thanks so much! I'm looking forward to your feedback. I tried to find a way to generalize the different states w/ nomenclature that we could apply directly to the different options.

… fea-2304-sparse_matrix

cpp/include/raft/core/coo_matrix.hpp

cpp/include/raft/core/csr_matrix.hpp

Co-authored-by: Divye Gala <[email protected]>

…nd run fine

cjnolet · 2023-03-14T19:50:58Z

Opened up docs issue as follow-on #1342.

…arse_matrix

cjnolet · 2023-03-15T20:16:31Z

/merge

Thinking through initial interfaces

8acda77

cjnolet added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Feb 14, 2023

cjnolet self-assigned this Feb 14, 2023

github-actions bot added the cpp label Feb 14, 2023

cjnolet added 4 commits February 14, 2023 19:17

Continuing to work on sparse designs

c3a701d

I think the design is just aobut there.

684fed1

Adding factories for structure and views

e1baa4f

Getting there...

1b8ef76

github-actions bot added the CMake label Feb 15, 2023

cjnolet and others added 6 commits February 15, 2023 16:27

Test passing. Writing more tests

eacfd0f

Added very basic csr test

b220cce

Merge branch 'branch-23.04' into fea-2304-sparse_matrix

14cfed3

Merge remote-tracking branch 'rapidsai/branch-23.04' into fea-2304-sp…

769d4d4

…arse_matrix

Merge branch 'fea-2304-sparse_matrix' of github.com:cjnolet/raft into…

cc090a0

… fea-2304-sparse_matrix

Adding more docs to the public API for new sparse types

dc519ce

cjnolet marked this pull request as ready for review February 16, 2023 00:23

cjnolet requested review from a team as code owners February 16, 2023 00:23

cjnolet added 4 commits February 16, 2023 15:52

Using a common base for matrix types. need to figure out a clever way…

788113f

… to capture the sparsity types

Correcting template

70edcd4

Divye and I finally got it- we can accept a sparse_matrix now without…

d84851d

… having to care whether it's sparsity owning or preserving

Splitting sparse impls into difference files

f14d679

cjnolet added 2 commits February 16, 2023 20:26

Trying type traits

72be283

Fixing bug

9804e87

cjnolet added 2 commits February 16, 2023 21:01

Adding example of testing if function input is device_csr_matrix

389db97

Almost there....

4168d0b

cjnolet and others added 7 commits March 7, 2023 15:37

Adding to googletests and fixing resulting things

7ccc6a1

Merge branch 'branch-23.04' into fea-2304-sparse_matrix

68ffcb6

Merge branch 'branch-23.04' into fea-2304-sparse_matrix

1544af8

Merge branch 'branch-23.04' into fea-2304-sparse_matrix

6f58873

Merge branch 'branch-23.04' into fea-2304-sparse_matrix

d4f715d

Fixign compile error

182b1d2

Merge branch 'fea-2304-sparse_matrix' of github.com:cjnolet/raft into…

c56a1f2

… fea-2304-sparse_matrix

github-actions bot added the python label Mar 14, 2023

Removing setup.cfg

9573036

github-actions bot removed the python label Mar 14, 2023

cjnolet added 2 commits March 14, 2023 14:46

Fixing temporary device buffer

c5f2283

Fixes

ffd60bb

divyegala reviewed Mar 14, 2023

View reviewed changes

cjnolet and others added 4 commits March 14, 2023 15:25

Update cpp/include/raft/core/coo_matrix.hpp

1a41f98

Co-authored-by: Divye Gala <[email protected]>

Update cpp/include/raft/core/coo_matrix.hpp

438f0c6

Co-authored-by: Divye Gala <[email protected]>

Update cpp/include/raft/core/coo_matrix.hpp

f0a32a9

Co-authored-by: Divye Gala <[email protected]>

Reverting the std::enable_if_t -> std::enable_if seems to compile a…

fffbcb2

…nd run fine

Implementing review feedback

350719b

divyegala approved these changes Mar 14, 2023

View reviewed changes

cjnolet added 4 commits March 14, 2023 19:30

Adding device resources

0e644b6

Adding device resources to ivf pq types

8b1d580

Merge remote-tracking branch 'rapidsai/branch-23.04' into fea-2304-sp…

581b29b

…arse_matrix

Updates

4c00fe5

tfeher mentioned this pull request Mar 15, 2023

Gram matrix support for sparse input #1296

Merged

Fixing ivf-flat serialize

d478ac4

rapids-bot bot merged commit fb84190 into rapidsai:branch-23.04 Mar 15, 2023

divyegala mentioned this pull request Apr 11, 2023

[FEA] Support Buffer Object in Raft #1408

Open

lowener mentioned this pull request Nov 21, 2024

[FEA] Add new sparse matrix API to Python #2504

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Sparse Matrix APIs #1279

New Sparse Matrix APIs #1279

cjnolet commented Feb 14, 2023 •

edited

Loading

cjnolet commented Feb 16, 2023

mhoemmen commented Feb 17, 2023

cjnolet commented Feb 17, 2023

cjnolet commented Mar 14, 2023

cjnolet commented Mar 15, 2023

New Sparse Matrix APIs #1279

New Sparse Matrix APIs #1279

Conversation

cjnolet commented Feb 14, 2023 • edited Loading

cjnolet commented Feb 16, 2023

mhoemmen commented Feb 17, 2023

cjnolet commented Feb 17, 2023

cjnolet commented Mar 14, 2023

cjnolet commented Mar 15, 2023

cjnolet commented Feb 14, 2023 •

edited

Loading