Stars
An Open Source Machine Learning Framework for Everyone
A feature-rich command-line audio/video downloader
Tesseract Open Source OCR Engine (main repository)
A list of cool features of Git and GitHub.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
A collection of modern/faster/saner alternatives to common unix commands.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A high-performance, zero-overhead, extensible Python compiler using LLVM
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
CLI tool and Python library that converts the output of popular command-line tools, file types, and common strings to JSON, YAML, or dictionaries. This allows piping of output to tools like jq and …
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
A Unified Toolkit for Deep Learning Based Document Image Analysis
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
A synthetic data generator for text recognition
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
A curated list of image inpainting and video inpainting papers and resources
🎡 Build Python wheels for all the platforms with minimal configuration.
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.