Skip to content

0.3.1

Compare
Choose a tag to compare
@SamEdwardes SamEdwardes released this 17 Oct 16:16
· 10 commits to main since this release
802ec31

Changes

  • Support for page_range argument (#16, #18).

    import spacy
    from spacypdfreader import pdf_reader
    from spacypdfreader.parsers import pytesseract
    
    nlp = spacy.load("en_core_web_sm")
    doc = pdf_reader("tests/data/test_pdf_01.pdf", nlp, pytesseract.parser, n_processes=4, page_range=(2, 3))

Fixes

  • Remove shed as a dependency. It was removing unused imports that were required (#17).