Skip to content

Benchmark using CountMinSketch on character n-grams for text indexing #1783

@rfecher

Description

@rfecher

It should be an improvement to use CountMinSketch as an index stat for text indicies and then for terms that are longer than the "n" for the n-gram we can choose the n-gram within the term that has the smallest estimated cardinality.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions