I put together this flexible microservices setup to run any Ollama-compatible model locally. This stack combines Ollama with Open WebUI to create a complete local AI solution. While it's pre-configured for DeepSeek r1, you can easily swap in any model you prefer!
This repo gives you a reusable template with two separate microservices:
- Ollama container: Handles all the AI model stuff (runs whatever model you choose)
- Open WebUI container: Gives you a clean chat interface to talk to the model
They communicate with each other but run independently - proper microservice architecture!
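For reference, a two-service setup like this usually looks something like the sketch below. Treat it as illustrative only: the service names, volume name, image tag, and environment variable values here are assumptions, and the repo's actual docker-compose.yml may differ.

```yaml
# Illustrative sketch only - not necessarily the repo's exact docker-compose.yml
services:
  ollama:
    build: ./ollama                     # assumed: custom image that pre-pulls the chosen model
    volumes:
      - ollama-data:/root/.ollama       # persists downloaded models between restarts
    # no ports: the API (11434) stays on Docker's internal network

  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    environment:
      - OLLAMA_BASE_URL=http://ollama:11434   # reach Ollama by its compose service name
    ports:
      - "8080:8080"                     # the chat UI you open in your browser
    depends_on:
      - ollama

volumes:
  ollama-data:
```

The key detail is `OLLAMA_BASE_URL`: inside the compose network, containers reach each other by service name, so the UI talks to `http://ollama:11434` without that port ever being published to your host.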
Get Docker running
You'll need Docker installed → Get Docker here
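A quick way to confirm that both Docker and the Compose plugin are installed and on your PATH:

```bash
docker --version
docker compose version
```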
Grab this repo
```bash
git clone <repository-url>
cd local-llm-stack
```
Choose your model (optional)
The default is set to DeepSeek r1, but you can easily change it:
- Edit the Dockerfile in the ollama directory to replace "deepseek-r1" with any model from Ollama's library
- Examples: llama3, mistral, phi3, codellama, etc. (you can also pull extra models into the running stack; see the commands below)
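If you'd rather not rebuild the image, you can pull models straight into the running Ollama container instead. This assumes the service is named `ollama` in the compose file; adjust if yours differs.

```bash
# Pull an extra model into the running Ollama container
docker compose exec ollama ollama pull llama3

# List the models Ollama currently has available
docker compose exec ollama ollama list
```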
Fire it up!
```bash
docker compose up -d
```
The first run will take a while (maybe grab a coffee?) as it downloads the model.
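If you want to watch that download happen, follow the Ollama container's logs (again assuming the service is named `ollama`):

```bash
docker compose logs -f ollama
```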
Start chatting
Just open http://localhost:8080 in your browser
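If the page doesn't load straight away, the containers may still be starting up. A quick reachability check from a terminal:

```bash
# Prints an HTTP status code (e.g. 200) once Open WebUI is serving
curl -s -o /dev/null -w '%{http_code}\n' http://localhost:8080
```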
What's actually happening:
- The Ollama container boots up and automatically downloads your chosen model if needed
- The Open WebUI connects to Ollama's API (they talk to each other through Docker's internal network)
- Everything runs locally - your data stays on your machine
```
┌────────────────────────────────────────────────────────────┐
│                        Your Machine                        │
│                                                            │
│  ┌──────────────────────────────────────────────────────┐  │
│  │                  Docker Environment                  │  │
│  │                                                      │  │
│  │  ┌─────────────────┐         ┌─────────────────┐     │  │
│  │  │                 │         │                 │     │  │
│  │  │  Ollama Engine  │◄────────┤   Open WebUI    │     │  │
│  │  │    Container    │         │    Container    │     │  │
│  │  │                 │         │                 │     │  │
│  │  └────────┬────────┘         └────────▲────────┘     │  │
│  │           │                           │              │  │
│  │  ┌────────▼────────┐                  │              │  │
│  │  │                 │                  │              │  │
│  │  │   Ollama Data   │                  │              │  │
│  │  │     Volume      │                  │              │  │
│  │  │                 │                  │              │  │
│  │  └─────────────────┘                  │              │  │
│  │                                       │              │  │
│  └───────────────────────────────────────┼──────────────┘  │
│                                          │                 │
│  ┌─────────────────┐                     │                 │
│  │                 │                     │                 │
│  │   Web Browser   │◄────────────────────┘                 │
│  │                 │                                       │
│  └─────────────────┘                                       │
│                                                            │
└────────────────────────────────────────────────────────────┘
               │                           │
               ▼                           ▼
       Port 11434 (API)             Port 8080 (UI)
   (Not exposed externally)         (User access)
```
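As the diagram shows, only the UI port is published to your host. If you ever want to call Ollama's API directly from your machine (for scripts or other tools), you could add a port mapping to the Ollama service in the compose file, for example (service name assumed):

```yaml
services:
  ollama:
    ports:
      - "11434:11434"   # optional: publish Ollama's API on the host
```

After restarting with `docker compose up -d`, `curl http://localhost:11434/api/tags` should list the installed models.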
When you need to check on things:
See what's running
```bash
docker compose ps
```

Check the logs (helpful for debugging)

```bash
docker compose logs -f
```

Shut it down

```bash
docker compose down
```

Remove all data (this deletes the model volume too)

```bash
docker compose down -v
```

A few things to keep in mind:
- Larger models like DeepSeek r1 require decent hardware
- If you're on a laptop, expect your fans to spin up
- First responses might be slow as the model warms up
- Try smaller models like Phi-3 mini if you need faster responses on modest hardware
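To see how hard the containers are actually working while a model is loaded, Docker's built-in stats view helps:

```bash
# Live CPU/memory usage per container (add --no-stream for a one-shot snapshot)
docker stats
```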
This project builds upon these amazing open source projects:
- Ollama - For running LLMs locally
- Open WebUI - For the intuitive chat interface