-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add image extraction to PDF. Polish code #37
base: main
Are you sure you want to change the base?
Add image extraction to PDF. Polish code #37
Conversation
Hey @dSupertramp this is awesome! Could you try to use something else than FITZ ? Its license is AGPL, meaning it can't be used in production if you don't have a commercial license with them. |
Done! I used pypdf |
@chloedia @AmineDiro can you test ? :) |
@dSupertramp it seems you forgot to add it to the dependencies ;) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the PR ! Nice work, small remarks on some minor changes.
@@ -19,6 +19,7 @@ poppler-utils = "*" | |||
langchain-openai = "*" | |||
langchain-core = "*" | |||
python-dotenv = "*" | |||
pypdf = "*" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should probably fix the version
Co-authored-by: AmineDiro <[email protected]>
Co-authored-by: AmineDiro <[email protected]>
Everything solved! |
Hi everyone! Danilo's here
I added the extraction of the images from PDF (images are saved in a local folder)
I also cleaned the code a little bit
Thanks for all!