ocr2pdf

Merge images into actual PDFs with AI

Merge images and scans into searchable and selectable PDFs! The core logic resides in a Python script that transforms the files with Tesseract via OCRmyPDF. For information about available options, see the OCRmyPDF documentation.

A Bash script is provided to automate the installation of dependencies and the execution of the Python script. The Docker image provides a self-contained virtual environment that runs the Bash script in a container. The Google Colab notebook and GitHub Actions workflow both run the container in the cloud.

Note

Files in subfolders will be merged in alphabetical order, but will still be available individually.

Fast Start

Get up and going in no time with these options:

Cloud: Google Colab Notebook

Are you on mobile or simply want an easy and seamless experience?

Open Colab in Chrome
Run the cell and follow the prompts
Find the PDFs in your Drive/ocr-pdf

To add OCRmyPDF options, append them to the run command.

Self-hosted

Do you want to run it on your own machine, but don't want to clone the repo?

Ensure you have Docker, or Bash and cURL, installed
Make two new nested folders and put your files in them: pdf/todo/*
Run one of the following from the outer pdf folder:

Docker Container

If you want to skip building an image, just use mine:

docker run --rm -v .:/app/pdf ghcr.io/ipitio/ocr-pdf \
bash predict.sh pdf [OCRmyPDF options]

Bash Script

Don't want to install Docker? No problem!

curl -sSLNZ https://ipitio.github.io/ocr-pdf/src/predict.sh |\
bash -s -- . [OCRmyPDF options]

Quick Start

It's still as easy as 1, 2, 3!

Fork and clone this repo
Put your files in pdf/todo/
Complete one of the following from the root of the repo:

Cloud: GitHub Actions Workflow

Enable Actions on GitHub, then push your files:

git add .
git commit -m "add files"
git push
# wait for the magic to happen
git pull

To add OCRmyPDF options, edit the command in the predict.yml file before committing.

Self-hosted

Docker Container

To avoid polluting your system, use Docker Compose (which is included with Docker Desktop):

docker compose up

To add OCRmyPDF options, edit the command in the compose.yml file.

Bash Script

Do you want to make the most out of your hardware?

bash src/predict.sh pdf [OCRmyPDF options]

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
.github/workflows		.github/workflows
pdf/todo		pdf/todo
public		public
src		src
LICENSE		LICENSE
README.md		README.md
colab.ipynb		colab.ipynb
compose.yml		compose.yml
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ocr2pdf

Fast Start

Cloud: Google Colab Notebook

Self-hosted

Docker Container

Bash Script

Quick Start

Cloud: GitHub Actions Workflow

Self-hosted

Docker Container

Bash Script

About

Packages

Contributors 2

Languages

License

ipitio/ocr-pdf

Folders and files

Latest commit

History

Repository files navigation

ocr2pdf

Fast Start

Cloud: Google Colab Notebook

Self-hosted

Docker Container

Bash Script

Quick Start

Cloud: GitHub Actions Workflow

Self-hosted

Docker Container

Bash Script

About

Topics

Resources

License

Stars

Watchers

Forks

Packages 0

Contributors 2

Languages

Packages