Stars
Scrapy, a fast high-level web crawling & scraping framework for Python.
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
DuckDB is an analytical in-process SQL database management system
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
Stream events from companies house in real time
A Database Change Management tool for Snowflake
A simple, easy to use PowerShell script to remove pre-installed apps from Windows, disable telemetry, remove Bing from Windows search as well as perform various other changes to declutter and impro…
Repo that contains parser to download, parse and load companies house bulk product into a postgres db.
Python package to parse Companies House accounts data in a streaming way
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Opensource IDE For Exploring and Testing Api's (lightweight alternative to postman/insomnia)
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
Asynchronous, non-blocking SAP NW RFC SDK bindings for Python
A selection of snippets to make working with FastAPI a dream
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
A terminal workspace with batteries included
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
A simple, fast, and fun package for building command line apps in Go
Tutorial Project for a NextJS 13 Static Blog with TailwindCSS Styling
Source code for Twitter's Recommendation Algorithm