bertsky

Follow

Robert Sachunsky bertsky

Follow

54 followers · 26 following

Achievements

Achievements

Organizations

Stars

tensorflow / tensorflow

An Open Source Machine Learning Framework for Everyone

C++ 186,379 74,309 Updated Nov 15, 2024

paperless-ngx / paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Python 21,911 1,191 Updated Nov 15, 2024

philipallfrey / teihub

Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!

JavaScript 15 3 Updated Nov 15, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 67,832 9,730 Updated Nov 15, 2024

dask / dask

Parallel computing with task scheduling

Python 12,593 1,708 Updated Nov 14, 2024

qurator-spk / eynollah

Document Layout Analysis

Python 348 29 Updated Nov 14, 2024

pypa / cibuildwheel

🎡 Build Python wheels for all the platforms with minimal configuration.

Python 1,870 239 Updated Nov 14, 2024

DCGM / pero-ocr

Python 48 19 Updated Nov 14, 2024

keras-team / keras

Deep Learning for humans

Python 62,036 19,482 Updated Nov 14, 2024

subugoe / olahd_backend

Project repository for the backend module of OCR-D Implementation Project OLA-HD

Java 6 2 Updated Nov 14, 2024

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,243 7,820 Updated Nov 14, 2024

PaddlePaddle / PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,805 2,886 Updated Nov 14, 2024

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 8,112 1,114 Updated Nov 13, 2024

mets / METS-schema

METS 1.x and (draft) METS 2 schemas

Shell 21 2 Updated Nov 13, 2024

UglyToad / PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

C# 1,729 241 Updated Nov 13, 2024

exaloop / codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

C++ 15,148 520 Updated Nov 13, 2024

stefanklut / laypa

Layout analysis to find layout elements in documents (similar to P2PaLA)

Python 17 6 Updated Nov 13, 2024

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,853 443 Updated Nov 13, 2024

phamquiluan / jdeskew

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 127 11 Updated Nov 13, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 89,858 6,966 Updated Nov 12, 2024

Calamari-OCR / calamari

Line based ATR Engine based on OCRopy

Python 1,048 209 Updated Nov 12, 2024

openedx / edx-lint

Custom tooling for pylint and other repo management tools

Python 51 26 Updated Nov 12, 2024

cookiecutter / cookiecutter

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

Python 22,629 2,001 Updated Nov 11, 2024

OCR-D / core

Collection of OCR-related python tools and wrappers from @OCR-D

Python 119 31 Updated Nov 11, 2024

OCR-D / ocr-d.github.io

Website for OCR-D specs, formats, requirements

HTML 5 2 Updated Nov 11, 2024

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 62,341 9,520 Updated Nov 11, 2024

OCR4all / LAREX

A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

Java 180 33 Updated Nov 11, 2024

linuxserver / docker-openssh-server

Dockerfile 533 182 Updated Nov 10, 2024

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,585 140 Updated Nov 10, 2024

jiffyclub / snakeviz

An in-browser Python profile viewer

Python 2,354 139 Updated Nov 9, 2024