Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parse structured data from an Image #301

Open
devtanna opened this issue Jun 28, 2024 · 0 comments
Open

Parse structured data from an Image #301

devtanna opened this issue Jun 28, 2024 · 0 comments

Comments

@devtanna
Copy link

Hello 馃憢

Now that many models support image input as part of the prompt, what do you think of kor having support for parsing data from images? I would love to try and put up a draft PR :)

The typical use case would be, user inputs a pdf invoice, it's converted to an image, image is input to kor for data extraction.
Currently, the pdf is converted to text and then input to kor for data extraction.
The image flow is really advantageous when the document has handwritten parts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant