Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
Updated
Dec 5, 2025 - Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
TextBoxes++: A Single-Shot Oriented Scene Text Detector
OpenOCR: An Open-Source Toolkit for General OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
A scene text recognition toolbox based on PyTorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.
Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.
A collection of OCR-related datasets
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
Scene Text Recognition (STR) methods trained with fewer real labels (CVPR 2021)
text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.
Scene text detection and recognition based on Extremal Region(ER)
Add a description, image, and links to the scene-text-recognition topic page so that developers can more easily learn about it.
To associate your repository with the scene-text-recognition topic, visit your repo's landing page and select "manage topics."