0AlphaZero0
🏊‍♂️
Swimming in life
  • Paris, France

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,018 1,430 Updated Aug 9, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,511 176 Updated Nov 22, 2024

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Python 922 92 Updated May 28, 2024

A context-based spellchecker for correcting OCR output.

Python 18 4 Updated Feb 3, 2023

A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.

Python 1,403 364 Updated Aug 1, 2024

📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.

Python 3,096 368 Updated Nov 25, 2024

A Repo For Document AI

Python 2,596 141 Updated Nov 25, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,911 446 Updated Nov 25, 2024

Official implementation of Character Region Awareness for Text Detection (CRAFT)

Python 3,129 887 Updated Jul 16, 2024

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Python 3,389 652 Updated Aug 23, 2024

Text detection based mainly on the CTPN (Connectionist Text Proposal Network) model in TensorFlow; includes ID card detection.

Python 3,434 1,334 Updated Oct 3, 2023

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

Jupyter Notebook 3,761 1,105 Updated Mar 4, 2024

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,365 754 Updated Jul 15, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,873 477 Updated Jul 11, 2024

Fast and simple OCR library written in Swift

Swift 4,623 482 Updated Dec 13, 2020

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 24,621 3,170 Updated Sep 24, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 62,616 9,530 Updated Nov 23, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,502 7,846 Updated Nov 21, 2024

Extract text from a pdf

PHP 859 126 Updated Oct 18, 2024

Simple PDF text extraction

Python 872 99 Updated Oct 20, 2024

PDFium - Project to compile PDFium library to multiple platforms.

Python 931 89 Updated Sep 13, 2024

Convert a pdf to an image

PHP 1,329 229 Updated Oct 16, 2024

A python module that wraps the pdftoppm utility to convert PDF to PIL Image object

Python 1,646 195 Updated Jul 23, 2024

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class…

Java 1,575 208 Updated Dec 17, 2023

A Python library for reading and writing PDF, powered by QPDF

Python 2,187 191 Updated Nov 23, 2024

Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame

Python 2,196 300 Updated Oct 17, 2024

Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms

CSS 129 16 Updated Mar 22, 2016

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Python 2,221 369 Updated Jun 24, 2022

A machine learning software for extracting information from scholarly documents

Java 3,599 460 Updated Nov 25, 2024