Image OCR index using vision model #5642

vccler · 2024-06-26T12:06:41Z

vccler
Jun 26, 2024

some pptx,docx,pdf documents might have images embeded inside. Could we discuss about the implementation of parsing the image via vision model to extract to text format and index into vector database. In this way, it will provide better answer when user ask question about the images.

guchenhe · 2024-07-26T06:08:18Z

guchenhe
Jul 26, 2024
Maintainer

working on this as part of our RAG pipeline restructure!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image OCR index using vision model #5642

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Image OCR index using vision model #5642

vccler Jun 26, 2024

Replies: 1 comment

guchenhe Jul 26, 2024 Maintainer

vccler
Jun 26, 2024

guchenhe
Jul 26, 2024
Maintainer