@datalab-to

Datalab

Developing state-of-the-art document intelligence models.

Pinned

  1. marker Public

    Convert PDF to markdown + JSON quickly with high accuracy

    Python, 29.7k stars, 2k forks

  2. surya Public

    OCR, layout analysis, reading order, table recognition in 90+ languages

    Python, 18.9k stars, 1.3k forks

  3. pdftext Public

    Extract structured text from PDFs quickly

    Python, 620 stars, 60 forks

  4. chandra Public

    OCR model that handles complex tables, forms, and handwriting with full layout.

    Python, 2k stars, 222 forks

Repositories

Showing 9 of 9 repositories

Top languages

Python, Shell
