data-diff
Data-diff is a command-line tool and Python library to efficiently diff rows across two different databases.
⇄ Verifies data across many different databases (e.g. PostgreSQL -> Snowflake)!
🔍 Outputs diff of rows in detail
🚨 Simple CLI/API to create monitoring and alerts
🔥 Verify 25M+ rows in <10s, and 1B+ rows in ~5min.
♾️ Works for tables with tens of billions of rows
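To make "diff of rows" concrete, here is a minimal sketch using only Python's standard library. It does not use data-diff's API or reflect how data-diff works internally: the helpers `make_db`, `fetch_rows`, and `diff_rows`, the `users` table, and the `+`/`-` output convention are all hypothetical names chosen for this illustration. It also loads every row into memory, so it shows only the shape of a row-level diff, not how to compute one efficiently on large tables.

```python
import sqlite3


def make_db(rows):
    """Create an in-memory SQLite database with a small demo table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany("INSERT INTO users VALUES (?, ?)", rows)
    return conn


def fetch_rows(conn, table):
    """Return all rows of the table as a set of tuples.

    The table name is interpolated directly, which is acceptable for a fixed
    demo but unsafe for untrusted input.
    """
    return set(conn.execute(f"SELECT id, name FROM {table}"))


def diff_rows(rows_a, rows_b):
    """Yield ('-', row) for rows only in A and ('+', row) for rows only in B."""
    for row in sorted(rows_a - rows_b):
        yield ("-", row)
    for row in sorted(rows_b - rows_a):
        yield ("+", row)


# Two stand-in "databases" whose tables have drifted apart.
source = make_db([(1, "ada"), (2, "bob"), (3, "cy")])
target = make_db([(1, "ada"), (2, "rob"), (4, "dee")])

result = list(diff_rows(fetch_rows(source, "users"), fetch_rows(target, "users")))
for sign, row in result:
    print(sign, row)
```

A changed row (id 2 here) appears twice, once as removed and once as added, while rows present on only one side appear once.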
For more information, see our README.
Resources
Source code (git): https://github.com/datafold/data-diff
The rest of the documentation