[Enhancement] End-to-end support for images (as well as PDFs) #5

athewsey · 2021-11-17T09:10:56Z

While this sample was originally created for multi-page documents in PDF, other related use-cases (such as ID document or receipt extraction) may operate on single-page images/photographs/scans instead.

Today there's support for images in some aspects of the pipeline, but others assume PDF. It would be great to round out support for images as source documents - particularly for common JPEG+PNG formats which have good native support in e.g. Amazon Textract, SageMaker Ground Truth, and web browsers.

1. (Believe so but need to double-check) Core Textract state machine component supports OCRing image files
2. Notebook entity recognition data prep flow supports image files
3. (Need to check) OCR pipeline trigger and Textract orchestration supports image files
4. (Known gap) A2I human review UI supports image files

Fix thumbnailing endpoint and model inference wrapper's logic to correctly process single image files (as well as PDFs). Fixes #18. Relates to #5. Co-authored-by: David <[email protected]>

athewsey added enhancement New feature or request good first issue Good for newcomers labels Nov 17, 2021

athewsey mentioned this issue Feb 7, 2022

Support image files in notebook 1 data prep #9

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement] End-to-end support for images (as well as PDFs) #5

[Enhancement] End-to-end support for images (as well as PDFs) #5

athewsey commented Nov 17, 2021 •

edited

Loading

[Enhancement] End-to-end support for images (as well as PDFs) #5

[Enhancement] End-to-end support for images (as well as PDFs) #5

Comments

athewsey commented Nov 17, 2021 • edited Loading

athewsey commented Nov 17, 2021 •

edited

Loading