A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
-
Updated
Jun 6, 2022 - Python
A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, specifically used to estimate the cardinality of unique traffic violations in NYC in the 2019 fiscal year
Comparative Analysis of Unsupervised Learning Methods for Real-time Anomaly Detection in Industrial Control Systems (ICS)
USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
Basic implementation of Bloom filter and Flajolet-Martin algorithms in python with hashes and test files
Bloom filtering, Flajolet-Martin algorithm, and reservoir sampling
Streaming data in Spark and doing data analytics
This repository contains the assignments and project codes created during the Big data coursework
Simple Projects in Data Mining
Add a description, image, and links to the flajolet-martin topic page so that developers can more easily learn about it.
To associate your repository with the flajolet-martin topic, visit your repo's landing page and select "manage topics."