This is a PDF to Text translator that is close to conformant to LexMed's formatting requirements. This repo holds the sample PDF and the output my script gives
Dependencies/libraries used:
- pytesseract
- pdf2Image
- re (regular expression)
- 55% Error rate
- Due to typos from OCR nuances