-
-
Notifications
You must be signed in to change notification settings - Fork 26.5k
Open
Labels
ModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practicesNew Featurecythonmodule:cluster
Description
Introduction
This is a (hopefully) exhaustive list of ongoing/future work for HDBSCAN. These have all been discussed and are considered wanted, but some still require thorough investigation (especially heuristic evaluations).
Priority List
The higher priority items appear earlier in this list.
- ENH Add
np.float32data support forHDBSCAN#26888- ENH Add
float32implementations forBallTreeandKDTree#25914 - Finalize
{KD, Ball}TreeAPI to avoid writing custom dispatcher
- ENH Add
- Support
np.nanin Cython implementation for sparse matrices - Reintroduce
Boruvkaalgorithm (removed in b7736ef) - Implement PWD backend for weighted
argkminin medoid calculation - Investigate PWD backend for
mst_from_*functions in_linkage.pyx - Investigate PWD backend for
_reachability.pyx - Benchmark KD vs Ball Tree efficiency
- Add consistent threading semantics to enable
prange, e.g. in_reachability.pyx - Improve partition strategy in
_reachability.pyx(cf. CLN Cleanedcluster/_hdbscan/_reachability.pyx#24701 (comment)) - Add support for
np.infvalues whenmetric=='precomputed'andXis sparse.
jjerphan
Metadata
Metadata
Assignees
Labels
ModerateAnything that requires some knowledge of conventions and best practicesAnything that requires some knowledge of conventions and best practicesNew Featurecythonmodule:cluster