
🧪 Project Spec – Simulate a Production-Grade ML System with Docker Compose

🎯 Project Goal

As an MLOps or AI/ML Engineer on the AI Platform Engineering team, your mission is to simulate a real-world ML application environment by automating its deployment using Docker Compose.

You will create a local dev/test setup that combines:

  • ML model training and tracking (with MLflow)

  • Model serving (with FastAPI)

  • User interaction (with Streamlit)

This kind of environment is used by real teams to test integration, debug workflows, and enable reproducibility — before handing it off to production infrastructure like Kubernetes.


⚙️ What We’re Automating

In a non-Compose setup, you’d be spinning up each component manually using long docker run commands. Here's what that looks like:

🔹 MLflow Tracking Server

docker run -d --name mlflow -p 5555:5000 \
  ghcr.io/mlflow/mlflow:latest \
  mlflow server --host 0.0.0.0

🔹 FastAPI Inference Server

docker build -t fastapi-app .
docker run -d --name fastapi -p 8000:8000 fastapi-app

🔹 Streamlit Frontend

docker build -t streamlit-app ./streamlit_app
docker run -d --name streamlit -p 8501:8501 \
  --env API_URL=http://fastapi:8000 \
  --link fastapi streamlit-app

Running and linking these services manually is tedious, error-prone, and non-reproducible.


🐳 What You’ll Do Instead

You’ll use Docker Compose to automate all of the above by writing a single docker-compose.yml file. This will:

  • Build and launch all three services

  • Set up internal networking so services can find each other by name (e.g., http://fastapi:8000; see the example after this list)

  • Automatically manage service dependencies

  • Enable consistent and reproducible environments across your team
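For example, on the network Compose creates for the project, each service name doubles as a hostname. A quick way to see this once the stack is up (a hypothetical check, assuming curl is installed in the Streamlit image) is to run a request from inside the streamlit container:

docker-compose exec streamlit curl -s http://fastapi:8000/docs   # "fastapi" resolves via Compose's internal DNS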


🧱 Stack Overview

Service     Build Context / Image            Port   Depends On
mlflow      ghcr.io/mlflow/mlflow:latest     5555   -
fastapi     . (uses root Dockerfile)         8000   mlflow
streamlit   ./streamlit_app (Dockerfile)     8501   fastapi

📂 Project Structure

house-price-predictor/
├── run_pipeline.sh                # Generates model artifacts
├── Dockerfile                     # FastAPI app
├── streamlit_app/
│   ├── Dockerfile                 # Streamlit app
│   └── app.py
└── docker-compose.yml             # You will create this

🔄 Workflow

Step 1: Generate Model Artifacts

Run the provided training script:

./run_pipeline.sh

This will:

  • Clean raw data

  • Engineer features

  • Train a model and preprocessor

  • Log the run to MLflow (if it's running)

  • Save model files under models/
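To confirm this step worked, list the output directory (a minimal check, assuming the pipeline writes its artifacts under models/ as described above; exact file names may vary):

ls -l models/    # expect the trained model and preprocessor files here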


Step 2: Create Docker Compose File

Write a docker-compose.yml that does the following (a sketch follows this list):

  • Builds the FastAPI and Streamlit images from the existing Dockerfiles

  • Uses the public MLflow image

  • Maps the appropriate ports

  • Connects the services using internal DNS (fastapi, mlflow)

  • Passes API_URL=http://fastapi:8000 as an environment variable to the Streamlit app
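Here is one possible sketch that satisfies those points. Treat it as a starting point, not a finished answer: anything the spec above does not pin down (for example, how the FastAPI image gets access to the models/ artifacts) is left out and may need to be added for your setup.

# docker-compose.yml (sketch; adjust to match your Dockerfiles and artifact paths)
services:
  mlflow:
    image: ghcr.io/mlflow/mlflow:latest
    command: mlflow server --host 0.0.0.0
    ports:
      - "5555:5000"                     # UI at http://localhost:5555

  fastapi:
    build: .                            # root Dockerfile
    ports:
      - "8000:8000"
    depends_on:
      - mlflow

  streamlit:
    build: ./streamlit_app
    environment:
      - API_URL=http://fastapi:8000     # FastAPI reachable by service name
    ports:
      - "8501:8501"
    depends_on:
      - fastapi

With this in place, a single docker-compose up --build builds the two custom images, pulls the MLflow image, and starts all three services on a shared network where they can reach each other by service name.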


✅ Validation Checklist

  • [ ] run_pipeline.sh runs successfully and generates artifacts
  • [ ] MLflow UI accessible at http://localhost:5555
  • [ ] FastAPI docs available at http://localhost:8000/docs
  • [ ] Streamlit UI loads at http://localhost:8501
  • [ ] Streamlit connects to FastAPI and returns predictions
  • [ ] All services run together via docker-compose up
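From a terminal, most of the checklist can be exercised like this (a sketch using the default ports above; the prediction check is best done through the Streamlit UI, since the request body depends on your API's schema):

docker-compose up --build -d        # build images and start all three services
docker-compose ps                   # each service should show as running

curl -s http://localhost:5555 > /dev/null && echo "MLflow UI reachable"
curl -s http://localhost:8000/docs > /dev/null && echo "FastAPI docs reachable"
curl -s http://localhost:8501 > /dev/null && echo "Streamlit UI reachable"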

🚀 Why This Matters

Using Docker Compose this way lets you:

  • Create consistent dev/test environments

  • Share portable ML apps with teammates

  • Validate service integration before scaling to Kubernetes

  • Work more like a real AI/ML Platform Engineering team

You’re not just containerizing — you’re simulating production architecture in a controlled, local setup.

About

Project-ready code for the House Price Predictor model, along with the Streamlit and FastAPI apps.
