Skip to content

Conversation

@NebulaMao
Copy link

This pull request introduces a new Gradio-based web interface for the DeepSeek-OCR model, making it easier for users to interact with the model through a user-friendly UI. The main changes include the addition of a comprehensive application script, a detailed README for setup and usage, and a requirements file specifying dependencies.

New Gradio Web Interface and Supporting Files:

  • Application Implementation: Added a new app.py script that implements the Gradio UI for DeepSeek-OCR, including model loading, inference logic, bounding box visualization, and support for multiple OCR tasks such as free text extraction, document conversion, chart parsing, and object localization.
  • Setup and Usage Documentation: Added a detailed README.md with installation instructions, dependency list, startup steps, and usage guide for the Gradio interface.

Environment and Dependency Management:

  • Requirements Specification: Added a new requirements.txt listing all necessary Python packages (PyTorch, transformers, gradio, etc.) to run the Gradio interface and DeepSeek-OCR model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant