System for converting pdf documents of any format/model/layout into a new and unified pdf automatically. The main use case is to create a neat and clean compilation of practice exercises from textbooks and other exams, so I can use and solve daily.
Some similar projects for future reference:
Before downloading, certify that you have the following requirements:
- You have the most recent version of
python
- Your operating system is
Windows/Linux
based.
To run PDF2PDF: PDF layout converter, run the following commands:
pip install -r requirements.txt
Lauching the application:
python3 main.py
This software is a work in progress. For now there are only test codes. Implementation/Tech recommendations are more than welcomed! The full version will be capable of/have implemented:
Features
- Execution by command line (see Installation).
- Web Crawler to collect desired pdfs automatically.
- Multimodel approach to retrieve the important information from the collected pdfs. See here for more info.
- Database to save information.
- Final document layout definition option and configuration.
- Python-based script to parse collect info by the trained ML model.
- Automatic feedback and update system.
See License for more information.
⬆ Back to the top