POSS: PDF OCR Stream Server

POSS is a server for real-time OCR processing of PDF files. It streams results page by page, so text can be extracted and processed while the rest of the document is still being handled. POSS relies on existing OCR tools to handle both text-based and image-based PDFs, and returns structured text for each page as it is processed. It is lightweight, flexible, and built with scalability in mind, which makes it suited to integration into document workflows.
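The page-by-page streaming idea can be sketched as a generator that emits one result per page as soon as that page is done, rather than waiting for the whole document. This is only an illustration of the concept, not the code in this repository: `stream_ocr`, `fake_ocr`, and the JSON-lines output shape are assumptions made for the example, and `fake_ocr` stands in for a real OCR engine.

```python
import json
from typing import Callable, Iterable, Iterator


def stream_ocr(pages: Iterable[bytes], ocr_page: Callable[[bytes], str]) -> Iterator[str]:
    """Yield one JSON line per page as soon as that page has been processed."""
    for number, page in enumerate(pages, start=1):
        # ocr_page is a hypothetical callable standing in for the real OCR engine.
        text = ocr_page(page)
        yield json.dumps({"page": number, "text": text}) + "\n"


def fake_ocr(page: bytes) -> str:
    """Placeholder OCR: decodes and upper-cases the bytes so the sketch runs anywhere."""
    return page.decode("utf-8").upper()


# Each line becomes available before the next page is touched.
lines = list(stream_ocr([b"first page", b"second page"], fake_ocr))
```

Because the function is a generator, a consumer can start working on page 1 while later pages are still being recognized, which is the behavior described above.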

Install

POSS uses Poetry to manage its dependencies. Install Poetry first, then install the project dependencies:

curl -sSL https://install.python-poetry.org | python3 -

poetry install
poetry run pip install "magic-pdf[full]" --extra-index-url https://wheels.myhloli.com

Run server

uvicorn server:app --host 0.0.0.0 --reload

Use client

Refer to client.py for an example of how to send PDF files to the server.
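For orientation, a client along these lines might look like the sketch below, which uses only the Python standard library. It is not a copy of client.py: the `/ocr` path, the `file` form field, and the assumption that the server answers with one JSON object per line are all hypothetical, so check client.py and server.py for the real values. Port 8000 is uvicorn's default.

```python
import json
import urllib.request
import uuid
from pathlib import Path
from typing import Iterator, Tuple

# Hypothetical endpoint: the real route is defined in server.py.
SERVER_URL = "http://localhost:8000/ocr"


def build_multipart(field: str, filename: str, data: bytes) -> Tuple[bytes, str]:
    """Encode one file as a multipart/form-data body; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def stream_pages(pdf_path: str, url: str = SERVER_URL) -> Iterator[dict]:
    """Upload a PDF and yield one decoded JSON object per response line."""
    path = Path(pdf_path)
    # "file" is an assumed form field name, not taken from the project.
    body, content_type = build_multipart("file", path.name, path.read_bytes())
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        # Iterating the response reads it line by line as data arrives.
        for line in response:
            if line.strip():
                yield json.loads(line)
```

With the server running, iterating over `stream_pages("example.pdf")` would yield each page result as the server sends it, provided the assumptions above match the actual server.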

Todo

Use Docker to deploy
LocalImageWriter
Interactive web page
Window OCR and chunking

License

This project is licensed under the MIT License - see the LICENSE file for details.
