Consider using Document Intelligence method for extracting figure images #2311

pamelafox · 2025-01-27T17:16:58Z

We currently use Python Pillow plus pymupdf, but apparently DI has a method as well:
https://learn.microsoft.com/en-us/python/api/overview/azure/ai-documentintelligence-readme?view=azure-python-preview#extract-figures-from-documents

We should try that and see if the results are the same (quality/latency/cost).
We could also look at the code to see if they basically do the same thing.

pamelafox added the vision label Jan 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider using Document Intelligence method for extracting figure images #2311

Consider using Document Intelligence method for extracting figure images #2311

pamelafox commented Jan 27, 2025

Consider using Document Intelligence method for extracting figure images #2311

Consider using Document Intelligence method for extracting figure images #2311

Comments

pamelafox commented Jan 27, 2025