Image Description with Ollama

This project is a simple Streamlit-based web application that allows users to upload an image and get a description of it using the Ollama llava model. The app provides an intuitive interface for interacting with the model, making it easy to describe uploaded images.

Features

Upload images in .jpg, .jpeg, or .png formats.
Display the uploaded image for user confirmation.
Use the ollama model to generate a description of the image.
Streamlit interface for simplicity and accessibility.

Requirements

To run this project, you need the following:

Python 3.7 or higher
Required Python libraries:
- streamlit
- ollama

Installation

Clone this repository:

git clone https://github.com/your-username/your-repo-name.git
cd your-repo-name

Install the required dependencies:
```
pip install streamlit ollama
```
Run the Streamlit app:
```
streamlit run test.py
```

Usage

Launch the app locally by running the above command.
Upload an image using the file uploader.
Click the "Describe Image" button to get the model's description of the image.

Code Overview

The main functionality is implemented in the test.py file:

Image Upload: Allows users to upload an image via Streamlit's file_uploader widget.
Image Display: The uploaded image is displayed using st.image().
Model Interaction: The uploaded image is saved temporarily and passed to the ollama.chat function to get a description.
Error Handling: Handles exceptions to ensure a smooth user experience.

Screenshot

Folder Structure

.
├── test.py               # Main application file
├── images/              # Folder for storing example images or screenshots
└── README.md            # Project documentation

Example Output

Input: An uploaded image of a cat.
Output: "This is an image of a cat sitting on a windowsill, looking outside."

Future Improvements

Add support for multiple image uploads.
Provide additional model configuration options.
Enhance the UI with more styling and interactivity.

Contributing

Contributions are welcome! Please fork this repository and submit a pull request for any enhancements or bug fixes.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

Ollama for the llava model.
Streamlit for making web app development so accessible.

Feel free to reach out if you have any questions or suggestions!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
2 cat.jpg		2 cat.jpg
README.md		README.md
brdge and rainbow.jpg		brdge and rainbow.jpg
preview.png		preview.png
temp_image.jpg		temp_image.jpg
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Description with Ollama

Features

Requirements

Installation

Usage

Code Overview

Screenshot

Folder Structure

Example Output

Future Improvements

Contributing

License

Acknowledgments

About

Releases

Packages

Languages

asshejan/Image-Description-Locally-using-LLava

Folders and files

Latest commit

History

Repository files navigation

Image Description with Ollama

Features

Requirements

Installation

Usage

Code Overview

Screenshot

Folder Structure

Example Output

Future Improvements

Contributing

License

Acknowledgments

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages