Stars
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Source code for the LabelMe annotation tool.
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
COCO API - Dataset @ http://cocodataset.org/
A simple screen parsing tool towards pure vision based GUI agent
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Effortless data labeling with AI support from Segment Anything and other awesome models.
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
A free and strong UCI chess engine
Cross-platform automation framework for all kinds of apps, built on top of the W3C WebDriver protocol
Socket.IO integration for Flask applications.
An open-source cross-platform alternative to AirDrop
📊 Blazing fast Python framework for web crawling, scraping, testing, and reporting. Supports pytest. Stealth abilities: UC Mode and CDP Mode.
JavaScript API for Chrome and Firefox
Python library and shell utilities to monitor filesystem events.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…
OCR, layout analysis, reading order, table recognition in 90+ languages
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A curated list of awesome Machine Learning frameworks, libraries and software.
Python based web automation tool. Powerful and elegant.
We write your reusable computer vision tools. 💜
heshameraqi / labelImg_OBB
Forked from HumanSignal/labelImg🤘 LabelImg is a graphical image annotation tool and label object bounding boxes in images. This fork updates to tool to support "oriented" bounding boxes (OBB).