Starred repositories
Extract 500+ technologies from any repository. Detect Languages, SaaS, Cloud, Infrastructure, Dependencies and Services
This repository explores the concepts of Jaccard distance, min hashing, and LSH (Locality Sensitive Hashing) in the context of user similarity in a movie rating dataset. To be more precise, we will…
A Jekyll theme for the responsive theme for GitHub Pages http://jasonlong.github.io/cayman-theme/
A simple, easy to use PowerShell script to remove pre-installed apps from Windows, disable telemetry, remove Bing from Windows search as well as perform various other changes to declutter and impro…
Concatenate a directory full of files into a single prompt for use with LLMs
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Scraper for all the laptops on Lenovo outlet
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Download market data from Yahoo! Finance's API
Copy to/from Parquet in S3 from within PostgreSQL
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python
High-performance In-browser LLM Inference Engine
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
A system for agentic LLM-powered data processing and ETL
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
🛡️ ⚛️ A simple, scalable, and powerful architecture for building production ready React applications.
A high-throughput and memory-efficient inference and serving engine for LLMs
A server-side bookmarklet compiler with greasemonkey userscript-like metadata options and the power of babel and uglify
The official Python library for the Google Gemini API
Agentic components of the Llama Stack APIs
The fastest way to create an HTML app
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
DuckDB-powered Postgres for high performance apps & analytics.