Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Updated Jun 1, 2022 - Java
🪼 A Python library for doing approximate and phonetic matching of strings.
🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
📚 String comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
A .NET port of java-string-similarity
🔤 Rust implementations of string similarity metrics
Set of functions and operators for executing similarity queries
Go metrics for calculating string similarity and other string utility functions
🦀📏 Rust library to compare strings (or any sequences). 25+ algorithms, pure Rust, common interface, Unicode support.
Ruby & C implementation of the Jaro-Winkler distance algorithm with support for UTF-8 strings.
Ruby gem (native extension in Rust) providing implementations of various string metrics
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
String similarity metrics for Elixir
String similarity and string distance functions: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...
Spark functions to run popular phonetic and string matching algorithms
A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.
PySpark phonetic and string matching algorithms
Fast batch Jaro-Winkler distance implementation in C99 with Ruby, OCaml, and Python bindings.
Distance-related functions (Damerau-Levenshtein, Jaro-Winkler, longest common substring & subsequence) implemented as an SQLite run-time loadable extension. Any UTF-8 string is supported.