0.3.1
Changes
- Support for `page_range` argument (#16, #18).

    import spacy
    from spacypdfreader import pdf_reader
    from spacypdfreader.parsers import pytesseract

    nlp = spacy.load("en_core_web_sm")
    doc = pdf_reader(
        "tests/data/test_pdf_01.pdf",
        nlp,
        pytesseract.parser,
        n_processes=4,
        page_range=(2, 3),
    )

Fixes
- Remove `shed` as a dependency. It was removing imports that looked unused but were required (#17).
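
The changelog does not say which imports were affected, so the following is only a minimal sketch of how an import can look unused and still be required. It assumes a hypothetical module that imports a name purely so other modules can import it from there (a re-export). The `naive_unused_imports` helper is made up for illustration and is not how `shed` works internally.

```python
import ast

# Hypothetical module source: OrderedDict is imported only to be re-exported,
# so nothing in this module references it directly.
SOURCE = '''
from collections import OrderedDict
import json

def dump(data):
    return json.dumps(data)
'''


def naive_unused_imports(source: str) -> set:
    """Return imported names that are never referenced in the same module."""
    tree = ast.parse(source)
    imported = set()
    used = set()
    for node in ast.walk(tree):
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            for alias in node.names:
                # "import a.b" binds the top-level name "a"
                imported.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, ast.Name):
            used.add(node.id)
    return imported - used


# A purely local check flags the re-exported name as removable.
print(naive_unused_imports(SOURCE))
```

A check like this only sees one file at a time, so it cannot tell that another module depends on the import. Deleting the flagged line would break those callers.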