OCR Service

This is a standalone optical character recognition (OCR) microservice that is accessed through a REST API.

The service supports two OCR engines:

- Tesseract: Fast and efficient for standard document OCR
- PaddleOCR: Better for multi-directional and rotated text (default)

Documentation

https://gunthercox.com/ocr-service/

Docker Hub Image

https://hub.docker.com/r/gunthercox/ocr-service

Available Image Variants

The service provides three Docker image variants to suit different deployment needs:

Combined (default) - Both engines available
```
docker pull gunthercox/ocr-service:latest
docker pull gunthercox/ocr-service:1.0.0
```
- Python 3.12
- Includes both Tesseract and PaddleOCR
- Largest image size (includes 140+ Tesseract language packs)
- Use when you need flexibility to choose engines per request

Tesseract only - Smaller image for document OCR
```
docker pull gunthercox/ocr-service:latest-tesseract
docker pull gunthercox/ocr-service:1.0.0-tesseract
```
- Python 3.12
- Includes only Tesseract with all language packs
- Use for standard document OCR workloads

PaddleOCR only - Smallest image, newest Python
```
docker pull gunthercox/ocr-service:latest-paddleocr
docker pull gunthercox/ocr-service:1.0.0-paddleocr
```
- Python 3.13
- Includes only PaddleOCR (no Tesseract language packs)
- Smallest image size
- Use for rotated/multi-directional text or when image size matters

Note: Engine-specific images will return an error if you request an unavailable engine. For example, the Tesseract-only image will reject requests when engine=paddleocr is specified in the request body.
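
A client can avoid that error by checking the requested engine against the image tag before sending anything. The sketch below is a minimal illustration that assumes only the tag naming shown in the variant list (`-tesseract` and `-paddleocr` suffixes, no suffix for the combined image). The helper names `engines_for_tag` and `check_engine` are hypothetical and are not part of the service.

```python
# Hypothetical client-side helpers; not part of the OCR service itself.
ALL_ENGINES = frozenset({"tesseract", "paddleocr"})


def engines_for_tag(tag):
    """Return the engines bundled in an image tag, based on its suffix."""
    for engine in ALL_ENGINES:
        if tag.endswith("-" + engine):
            # Engine-specific image: only that one engine is available.
            return frozenset({engine})
    # No engine suffix: the combined image with both engines.
    return ALL_ENGINES


def check_engine(tag, engine):
    """Raise ValueError if the image tag does not bundle the requested engine."""
    available = engines_for_tag(tag)
    if engine not in available:
        raise ValueError(
            f"engine {engine!r} is not available in image tag {tag!r}; "
            f"available: {sorted(available)}"
        )


check_engine("latest", "paddleocr")            # combined image, passes
check_engine("1.0.0-tesseract", "tesseract")   # Tesseract-only image, passes
```

Calling `check_engine("latest-tesseract", "paddleocr")` raises `ValueError`, mirroring the case described in the note.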

API Usage

The service expects a multipart/form-data request with the following fields:

- image (required): The image file to process
- engine (optional): 'tesseract' or 'paddleocr' (default: 'paddleocr')
- lang (optional): Language code; the format depends on the engine (for example, 'eng' for Tesseract or 'ch' for PaddleOCR)

Example Request

```
# Using default engine (PaddleOCR)
curl -X POST -F "image=@example.png" http://localhost:5000/

# Using Tesseract engine
curl -X POST -F "image=@example.png" -F "engine=tesseract" -F "lang=eng" http://localhost:5000/

# Using PaddleOCR with Chinese
curl -X POST -F "image=@example.png" -F "engine=paddleocr" -F "lang=ch" http://localhost:5000/
```
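
The same requests can be made from Python. The following is a minimal sketch using only the standard library, assuming the service is reachable at `http://localhost:5000/` as in the curl examples. The function names `encode_multipart` and `ocr_image` are hypothetical, and the sketch assumes the service accepts a generic `application/octet-stream` content type for the uploaded file.

```python
import json
import os
import urllib.request
import uuid


def encode_multipart(fields, filename, file_bytes):
    """Encode text fields plus one file (field name "image") as multipart/form-data.

    Returns the request body and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="image"; filename="{filename}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def ocr_image(path, engine=None, lang=None, url="http://localhost:5000/"):
    """POST an image to the OCR service and return the decoded JSON response."""
    fields = {}
    if engine is not None:
        fields["engine"] = engine  # omitted: the service falls back to its default
    if lang is not None:
        fields["lang"] = lang
    with open(path, "rb") as image_file:
        body, content_type = encode_multipart(
            fields, os.path.basename(path), image_file.read()
        )
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a running container, `ocr_image("example.png", engine="tesseract", lang="eng")` corresponds to the second curl command above.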

Response Format

The service returns JSON in the following format:

```
{
    "text": "Extracted text from the image",
    "regions": [
        {
            "bbox": [[10, 20], [100, 20], [100, 40], [10, 40]],
            "text": "Detected text",
            "confidence": 0.95
        }
    ]
}
```
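
As an illustration of consuming this response, the sketch below filters regions by confidence and reduces each four-point `bbox` to an axis-aligned rectangle. It assumes each `bbox` entry is an `[x, y]` corner point, which the example suggests but the text above does not state. The helper names and the `0.5` threshold are hypothetical.

```python
import json

# The example response shown above, used here as sample input.
SAMPLE_RESPONSE = """
{
    "text": "Extracted text from the image",
    "regions": [
        {
            "bbox": [[10, 20], [100, 20], [100, 40], [10, 40]],
            "text": "Detected text",
            "confidence": 0.95
        }
    ]
}
"""


def bounding_rect(bbox):
    """Collapse corner points into (x_min, y_min, x_max, y_max).

    Assumes each point is [x, y]; for rotated text the rectangle is the
    smallest axis-aligned box that contains all corners.
    """
    xs = [point[0] for point in bbox]
    ys = [point[1] for point in bbox]
    return min(xs), min(ys), max(xs), max(ys)


def confident_regions(result, min_confidence=0.5):
    """Return (text, rect) pairs for regions at or above the threshold."""
    return [
        (region["text"], bounding_rect(region["bbox"]))
        for region in result.get("regions", [])
        if region["confidence"] >= min_confidence
    ]


result = json.loads(SAMPLE_RESPONSE)
print(confident_regions(result))
# prints [('Detected text', (10, 20, 100, 40))]
```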
