v2.2.7

@NastyBoget NastyBoget released this 16 Aug 13:35
· 2 commits to master since this release
765aae2
  • Fix bugs with the start and end values of BBoxAnnotation in PdfTabbyReader.
  • Improve column classification and orientation detection for PDF and images (is_one_column_document and document_orientation parameters).
  • Upgrade docker: docker-compose is no longer supported; use docker compose instead.
  • Fix bug of tables parsing in DocxReader (see issue).
  • Add simple textual layer detection in PdfAutoReader (fast_textual_layer_detection parameter).
  • Improve paragraph extraction from PDF documents and images.
  • Retrain a classifier for diplomas (document_type="diploma") on a new dataset.
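
The parameters named in the notes above could be collected into one dictionary before a document is sent for parsing. The following is a minimal sketch, not the library's own code: the parameter names come from this release, while the helper build_parameters, the string values "auto" and "true"/"false", and the defaults are assumptions made for illustration. Check the project documentation for the values that are actually accepted.

```python
# Hypothetical helper (not part of the library): gathers the parameters
# mentioned in these release notes into a single dictionary.
# The value formats ("auto", "true"/"false") are assumptions.
def build_parameters(one_column="auto", orientation="auto",
                     fast_text_layer=False, document_type=None):
    params = {
        # column classification for PDF and images
        "is_one_column_document": one_column,
        # orientation detection for PDF and images
        "document_orientation": orientation,
        # simple textual layer detection in PdfAutoReader
        "fast_textual_layer_detection": str(fast_text_layer).lower(),
    }
    # document_type is only added when requested, e.g. "diploma"
    if document_type is not None:
        params["document_type"] = document_type
    return params


params = build_parameters(fast_text_layer=True, document_type="diploma")
print(params)
```

The resulting dictionary would then be passed to whichever parsing entry point is in use; how it is passed is left out here because it depends on the installation.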