ProxiMeter — RTSP Object Detection Scoring for Home Automation

A FastAPI + React TypeScript application for real-time object detection scoring on RTSP camera streams. Create polygon zones, define scoring criteria (distance, coordinates, size), and stream scores to home automation systems via SSE or MQTT. NOT a video recorder or NVR.

Backend: Python 3.12, FastAPI, Uvicorn, Pydantic v2, PyYAML, FFmpeg (RTSP processing), Shapely (polygon geometry)
Frontend: React 19.2, TypeScript 5+, Vite, Tailwind CSS, shadcn/ui component system (optional animation: framer-motion, react-bits, aceternity UI, motion-bits)
Features:
- RTSP stream management (add/edit/delete)
- YOLO object detection (YOLOv11/v9) with GPU acceleration (CUDA/ROCm/OpenVINO)
- Per-stream detection configuration: enable/disable, label filtering, confidence thresholds
- Model management UI: view cached models, delete unused models
- Polygon zone editor with visual overlays on live stream preview
- Real-time object detection scoring: distance from target, camera coordinates, bounding box size
- SSE score streaming (mandatory) + optional MQTT publishing
- NO video recording or storage (live frames only for inference)
Endpoints: / (React SPA), /api/streams (REST), /api/zones (REST), /api/scores/stream (SSE), /health, /metrics
Security Warning: LAN-only deployment; no authentication. RTSP credentials stored in plaintext. Do NOT expose to the internet without proper hardening.

Quick Start

Run with Docker (recommended)

# From repo root
docker compose up --build

Open http://localhost:8000 to view the landing page.

Add Your First Stream

Click "Add stream" on the landing page
Enter a name (e.g., "Front Door Camera")
Enter the RTSP URL (e.g., rtsp://username:[email protected]:554/stream)
Click "Add Stream"
Click the stream button to start playback

Note: RTSP URLs with credentials are stored in plaintext in config/config.yml. This is a LAN-only tool.

Health Check

curl http://localhost:8000/health
# Returns: {"status":"ok"}

Metrics

curl http://localhost:8000/metrics
# Returns Prometheus-format metrics

Stop the Application

docker compose down

FFmpeg Configuration

ProxiMeter uses FFmpeg for RTSP stream processing with hardware acceleration support.

GPU Acceleration:

NVIDIA: Requires NVIDIA drivers + CUDA. Docker: --gpus all. Env: GPU_BACKEND=nvidia.
AMD: Requires ROCm. Env: GPU_BACKEND=amd.
Intel: Requires oneAPI. Env: GPU_BACKEND=intel.
Detection: entrypoint.sh sets GPU_BACKEND_DETECTED (nvidia/amd/intel/none).

Custom FFmpeg Params:

In UI: Textarea for flags (e.g., -rtsp_transport tcp -timeout 10000000).
Defaults: -hide_banner -loglevel warning -threads 2 -rtsp_transport tcp -timeout 10000000 + GPU flags.
Validation: Whitelists safe flags; rejects shell metachars (; & | > <); probes with ffprobe on save.

Troubleshooting FFmpeg:

Logs: Check container logs for FFmpeg stderr (docker logs proximeter).
Test: docker exec -it proximeter ffmpeg -version.
GPU: docker exec -it proximeter nvidia-smi (NVIDIA).

YOLO Object Detection

ProxiMeter includes real-time YOLO object detection powered by Ultralytics YOLO11/YOLOv9 models and ONNX Runtime.

Features:

GPU-accelerated inference: Supports NVIDIA CUDA, AMD ROCm, Intel OpenVINO
80 COCO classes: person, car, bicycle, dog, cat, etc.
Per-stream configuration: Enable/disable detection, select labels, adjust confidence threshold
Model caching: Models are downloaded once and persisted in Docker volume
Real-time rendering: Bounding boxes drawn on video frames with class labels and confidence scores

Quick Start:

# Configure YOLO model and image size (optional, defaults shown)
export YOLO_MODEL=yolo11n    # Options: yolo11n, yolo11s, yolo11m, yolo11l, yolo11x
export YOLO_IMAGE_SIZE=640   # Options: 320, 640, 1280

# Start with GPU acceleration (NVIDIA example)
docker compose up --build

Model Selection:

Model	Size	Speed (GPU)	Speed (CPU)	mAP	Use Case
yolo11n	6 MB	~150ms	~250ms	39.5	Real-time (default)
yolo11s	22 MB	~200ms	~400ms	47.0	Balanced accuracy/speed
yolo11m	50 MB	~300ms	~800ms	51.5	Higher accuracy
yolo11l	63 MB	~400ms	~1200ms	53.4	Maximum accuracy
yolo11x	138MB	~600ms	~2000ms	54.7	Research/offline analysis

Configuration via UI:

Enable Detection: Navigate to a stream's detection settings page
Select Labels: Choose which object classes to detect (e.g., "person", "car")
Set Confidence: Adjust minimum confidence threshold (0-100%)
View Live Preview: See detections rendered on video feed in real-time

Configuration via API:

# Get current YOLO configuration
curl http://localhost:8000/api/yolo/config

# Configure detection for a stream
curl -X PUT http://localhost:8000/api/streams/your-stream-id/detection \
  -H "Content-Type: application/json" \
  -d '{
    "enabled": true,
    "enabled_labels": ["person", "car", "dog"],
    "min_confidence": 0.7
  }'

# List cached models
curl http://localhost:8000/api/models

# Delete unused model (frees disk space)
curl -X DELETE http://localhost:8000/api/models/yolo11s_640

Model Management UI:

View all cached models with file sizes and download dates
Delete unused models to free disk space
Active models (currently in use) cannot be deleted

Environment Variables:

Variable	Default	Description
`YOLO_MODEL`	`yolo11n`	YOLO model name (yolo11n/s/m/l/x)
`YOLO_IMAGE_SIZE`	`640`	Input image size for inference (320/640/1280)
`GPU_BACKEND`	Auto	Force GPU backend (nvidia/amd/intel/none)

Storage:

Models are cached in /app/models Docker volume
First startup downloads and exports model to ONNX format (~30s)
Subsequent startups use cached ONNX model (instant)
Volume persists between container restarts

Performance:

GPU (NVIDIA RTX 3060): ~5 FPS per stream with yolo11n @ 640x640
CPU (8-core): ~3 FPS per stream with yolo11n @ 640x640
Multiple streams: Tested with 4 concurrent streams at 5 FPS
Frame drop rate: <10% under normal load

Troubleshooting:

# Verify GPU access
docker exec -it proximeter nvidia-smi

# Check YOLO model initialization logs
docker logs proximeter | grep -i yolo

# Test inference manually
docker exec -it proximeter python -c "
from app.services.yolo import create_onnx_session
session = create_onnx_session('/app/models/yolo11n_640.onnx', 'nvidia')
print('Inference session created successfully')
"

# View Prometheus metrics
curl http://localhost:8000/metrics | grep yolo

Advanced Configuration:

For custom model paths or advanced ONNX Runtime settings, see specs/005-yolo-object-detection/requirements.md.

Custom Port

If port 8000 is in use, set the APP_PORT environment variable:

APP_PORT=8080 docker compose up --build

Or edit docker-compose.yml and change the ports mapping.

Frontend Development Workflow

Use the Vite development server for rapid UI iteration. The frontend uses Tailwind CSS and the shadcn/ui component library.

cd frontend
npm install
npx shadcn@latest init # idempotent: ensures shadcn config is up to date
npm run dev

The dev server runs on http://localhost:5173 and proxies API calls to the backend (configured in vite.config.ts).

Tailwind CSS & shadcn/ui Guidelines

Design Tokens:

Tailwind tokens are defined in tailwind.config.ts; extend this file instead of writing ad-hoc CSS.
Use Tailwind utilities for spacing, colors, typography, and responsive breakpoints.
Breakpoints: sm (640px), md (768px), lg (1024px), xl (1280px), 2xl (1536px).
Minimum touch target size: 44x44px (use h-11 w-11 or equivalent Tailwind spacing).

Component Architecture:

UI primitives live under src/components/ui/ and are generated via npx shadcn@latest add <component>.
Custom components SHOULD compose shadcn/ui exports and utilities such as cn for class merging.
Use class-variance-authority (CVA) for component variants; see shadcn/ui examples.
Global theming (light/dark) is managed through the ThemeProvider established in main.tsx.

Adding New Components:

cd frontend
npx shadcn@latest add button  # Adds Button component to src/components/ui/button.tsx
npx shadcn@latest add card    # Adds Card component
npx shadcn@latest add dialog  # Adds Dialog component

Component Documentation:

Document prop types and shadcn/ui primitives used in JSDoc comments.

Example:

/**
 * StreamCard - Displays a single RTSP stream with status and actions.
 * Composes shadcn/ui Card, Badge, Button, and DropdownMenu primitives.
 * @param stream - Stream object with id, name, url, status
 * @param onEdit - Callback when edit button is clicked
 * @param onDelete - Callback when delete button is clicked
 */
export function StreamCard({ stream, onEdit, onDelete }: StreamCardProps) {
  // ...
}

TypeScript & Code Quality

Strict Mode: TypeScript strict mode is enabled in tsconfig.json. All types must be explicit.
Linting: ESLint with Tailwind CSS and accessibility plugins. Run npm run lint to check.
Formatting: Prettier is configured. Run npm run format to auto-format code.
Testing: Vitest + React Testing Library. Run npm run test to execute tests.

Build & Optimization

Production Build: npm run build creates an optimized bundle in dist/.
Bundle Size: Target <500KB gzipped. Tree-shake unused shadcn/ui components by importing only what you use.
Environment Variables: Frontend uses hardcoded API base URL /api (relative path). No build-time configuration needed.

Common Tasks

cd frontend

# Development
npm run dev              # Start Vite dev server (http://localhost:5173)
npm run build           # Build production bundle
npm run preview         # Preview production build locally

# Code Quality
npm run lint            # Run ESLint
npm run format          # Format code with Prettier
npm run test            # Run Vitest tests
npm run test:ui         # Run tests with UI

# shadcn/ui
npx shadcn@latest add <component>  # Add a new component
npx shadcn@latest list             # List available components

Project Structure

backend/
  src/app/
    main.py                    # FastAPI ASGI application entry point
    config_io.py               # YAML persistence (atomic writes)
    logging_config.py          # JSON logging configuration
    metrics.py                 # Prometheus metrics
    api/
      health.py                # Health endpoint
      streams.py               # REST API for streams + playback
      detection.py             # REST API for YOLO detection configuration
      errors.py                # Error schemas and handlers
    models/
      stream.py                # Pydantic models (Stream, NewStream, EditStream)
      detection.py             # Pydantic models (YOLOConfig, Detection, StreamDetectionConfig)
    services/
      streams_service.py       # Business logic for stream management
      yolo.py                  # YOLO model loading, ONNX export, session creation
      detection.py             # Detection pipeline (preprocess, inference, postprocess)
    utils/
      rtsp.py                  # FFmpeg-based RTSP/MJPEG playback utilities
      validation.py            # RTSP URL validation with FFmpeg probe
      strings.py               # Credential masking helpers
    middleware/
      rate_limit.py            # Rate limiting middleware
      request_id.py            # Request ID middleware
  tests/
    unit/
      test_yolo.py             # Unit tests for YOLO service
      test_detection.py        # Unit tests for detection pipeline
      test_detection_api.py    # Unit tests for detection API endpoints
    integration/
      test_detection_e2e.py    # End-to-end integration tests for detection

frontend/
  src/
    components/
      LabelSelector.tsx        # Multi-select for COCO classes
      ConfidenceSlider.tsx     # Confidence threshold slider
      ModelManagement.tsx      # Model cache management component
      ui/                      # shadcn/ui primitives
    pages/
      StreamDetection.tsx      # Stream detection configuration page
      ModelManagement.tsx      # Model management page
    hooks/                     # Custom React hooks
    services/
      detection.ts             # Detection API client
    lib/                       # Utility functions
  tests/                       # Frontend tests
  package.json
  tsconfig.json
  vite.config.ts
  index.html

config/
  config.yml                   # Stream persistence (mounted volume)

models/
  *.onnx                       # Cached YOLO models (Docker volume)

API Endpoints

REST API

GET /api/streams - List all streams (credentials masked)
POST /api/streams - Create a new stream
GET /api/streams/{id} - Get stream details
PATCH /api/streams/{id} - Update stream (partial)
DELETE /api/streams/{id} - Delete stream
POST /api/streams/reorder - Reorder streams (drag-drop persistence)

Object Detection

GET /api/yolo/config - Get current YOLO model configuration
GET /api/models - List all cached YOLO models with metadata
DELETE /api/models/{model_name} - Delete cached model (409 if active)
GET /api/streams/{id}/detection - Get detection config for stream
PUT /api/streams/{id}/detection - Update detection config (validates labels, applies immediately)

Playback

GET /play/{id}.mjpg - MJPEG stream (multipart/x-mixed-replace, ≤5 FPS, no audio, with detection if enabled)

Monitoring

GET /health - Health check (returns {"status":"ok"})
GET /metrics - Prometheus metrics (includes YOLO inference metrics)

Features

Stream Management

Add streams: Name + RTSP URL with validation (2s connectivity probe)
Edit streams: Update name or URL; re-validates on save
Delete streams: Confirmation dialog before deletion
Reorder streams: Drag-and-drop with visual handles; persists immediately
Status tracking: Streams marked Inactive if RTSP unreachable

Playback

MJPEG streaming: Server-side RTSP decode via OpenCV/FFmpeg
Frame rate cap: ≤5 FPS to reduce bandwidth and CPU
No audio: Video only
Error handling: Banner with "Back to streams" link on failure

UI/UX

Design system: shadcn/ui primitives on Tailwind CSS ensure consistent spacing, typography, and theming.
Header animation: Centered on landing, animates to top-left on playback (400-700ms)
Equal-width buttons: Stream buttons in responsive grid (same width per row)
Mobile-friendly: Responsive layout
Accessibility: Keyboard navigation, ARIA labels, focus management baked into shadcn/ui components

CI/CD

GitHub Actions builds the linux/amd64 Docker image, performs a smoke test hitting /health until it returns 200 ok (30s timeout), and publishes the image to GitHub Container Registry (ghcr.io).

Pull the latest published image:

docker pull ghcr.io/clsferguson/proximeter:latest

Security & Safety

⚠️ LAN-ONLY DEPLOYMENT ⚠️

No authentication: Anyone on your network can view/manage streams
No TLS: All traffic is unencrypted
Plaintext credentials: RTSP URLs with passwords stored in config/config.yml
File writes: Only writes to /app/config/config.yml (mounted volume)
Rate limiting: Basic protection (5 req/s, burst 10) on mutating endpoints

DO NOT expose this application to the internet without proper hardening:

Add authentication (e.g., OAuth, basic auth with TLS)
Enable TLS/HTTPS
Encrypt credentials at rest
Implement proper access controls
Add CSRF protection for production use

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
.github/workflows		.github/workflows
config		config
frontend		frontend
src/app		src/app
tests		tests
.dockerignore		.dockerignore
.eslintignore		.eslintignore
.gitignore		.gitignore
DEVELOPMENT.md		DEVELOPMENT.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
QUICK_START.md		QUICK_START.md
README.md		README.md
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
requirements.txt		requirements.txt

License

clsferguson/ProxiMeter

Folders and files

Latest commit

History

Repository files navigation

ProxiMeter — RTSP Object Detection Scoring for Home Automation

Quick Start

Run with Docker (recommended)

Add Your First Stream

Health Check

Metrics

Stop the Application

FFmpeg Configuration

YOLO Object Detection

Custom Port

Frontend Development Workflow

Tailwind CSS & shadcn/ui Guidelines

TypeScript & Code Quality

Build & Optimization

Common Tasks

Project Structure

API Endpoints

REST API

Object Detection

Playback

Monitoring

Features

Stream Management

Playback

UI/UX

CI/CD

Security & Safety

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages