View bertsky's full-sized avatar

Organizations

@slub


An Open Source Machine Learning Framework for Everyone

C++ 186,379 74,309 Updated Nov 15, 2024

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Python 21,911 1,191 Updated Nov 15, 2024

Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!

JavaScript 15 3 Updated Nov 15, 2024

LLM inference in C/C++

C++ 67,832 9,730 Updated Nov 15, 2024

Parallel computing with task scheduling

Python 12,593 1,708 Updated Nov 14, 2024

Document Layout Analysis

Python 348 29 Updated Nov 14, 2024

🎡 Build Python wheels for all the platforms with minimal configuration.

Python 1,870 239 Updated Nov 14, 2024

Python 48 19 Updated Nov 14, 2024

Deep Learning for humans

Python 62,036 19,482 Updated Nov 14, 2024

Project repository for the backend module of OCR-D Implementation Project OLA-HD

Java 6 2 Updated Nov 14, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,243 7,820 Updated Nov 14, 2024

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,805 2,886 Updated Nov 14, 2024

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 8,112 1,114 Updated Nov 13, 2024

METS 1.x and (draft) METS 2 schemas

Shell 21 2 Updated Nov 13, 2024

Read and extract text and other content from PDFs in C# (port of PDFBox)

C# 1,729 241 Updated Nov 13, 2024

A high-performance, zero-overhead, extensible Python compiler using LLVM

C++ 15,148 520 Updated Nov 13, 2024

Layout analysis to find layout elements in documents (similar to P2PaLA)

Python 17 6 Updated Nov 13, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,853 443 Updated Nov 13, 2024

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 127 11 Updated Nov 13, 2024

A feature-rich command-line audio/video downloader

Python 89,858 6,966 Updated Nov 12, 2024

Line based ATR Engine based on OCRopy

Python 1,048 209 Updated Nov 12, 2024

Custom tooling for pylint and other repo management tools

Python 51 26 Updated Nov 12, 2024

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

Python 22,629 2,001 Updated Nov 11, 2024

Collection of OCR-related python tools and wrappers from @OCR-D

Python 119 31 Updated Nov 11, 2024

Website for OCR-D specs, formats, requirements

HTML 5 2 Updated Nov 11, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 62,341 9,520 Updated Nov 11, 2024

A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

Java 180 33 Updated Nov 11, 2024

A Repo For Document AI

Python 2,585 140 Updated Nov 10, 2024

An in-browser Python profile viewer

Python 2,354 139 Updated Nov 9, 2024