A comprehensive performance analysis dashboard for RHAIIS (Red Hat AI Inference Server) benchmarks. The dashboard provides interactive visualizations and analysis of AI model performance across different accelerators, versions, and configurations.
- Interactive Performance Plots: Compare throughput, latency, and efficiency metrics
- Cost Analysis: Calculate cost per million tokens with cloud provider pricing
- Performance Rankings: Identify top performers by throughput and latency
- Regression Analysis: Track performance changes between versions
- Runtime Configuration Tracking: View inference server arguments used
- Multi-Accelerator Support: Compare H200, MI300X, and TPU performance
- Throughput: Output tokens per second
- Latency: Time to First Token (TTFT) and Inter-Token Latency (ITL)
- Efficiency: Throughput per tensor parallelism unit
- Cost Efficiency: Cost per million tokens across cloud providers
- Error Rates: Request success/failure analysis
- Concurrency Performance: Performance at different load levels
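As a sketch of how the cost-efficiency metric above can be derived, the helper below combines an instance's hourly price with benchmark throughput. The function name and the formula are illustrative assumptions, not the dashboard's actual implementation:

```python
def cost_per_million_tokens(hourly_price_usd: float, output_tokens_per_sec: float) -> float:
    """Cost to generate one million output tokens at a sustained throughput.

    hourly_price_usd: on-demand price of the accelerator instance (assumed input).
    output_tokens_per_sec: measured benchmark throughput for the model/config.
    """
    tokens_per_hour = output_tokens_per_sec * 3600
    return hourly_price_usd / tokens_per_hour * 1_000_000

# Illustrative numbers: a $10/hour instance sustaining 2,000 tok/s
# comes out to roughly $1.39 per million output tokens.
```

The same calculation can be vectorized over the benchmark CSV to rank configurations by cost efficiency.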
```
performance-dashboard/
├── dashboard.py                    # Main dashboard application
├── dashboard_styles.py             # CSS styling file
├── pyproject.toml                  # Project metadata and dependencies
├── requirements.txt                # Python dependencies
├── Dockerfile.openshift            # Container build configuration
├── .pre-commit-config.yaml         # Pre-commit hooks configuration
├── Makefile                        # Development commands
├── manual_runs/scripts/            # Data processing scripts
│   └── import_manual_run_jsons.py  # Import manual benchmark results
├── deploy/                         # OpenShift deployment files
│   ├── openshift-deployment.yaml   # Application deployment
│   ├── openshift-service.yaml      # Service configuration
│   └── openshift-route.yaml        # Route/ingress configuration
├── tests/                          # Test suite
│   ├── test_data_processing.py     # Data processing unit tests
│   ├── test_import_script.py       # Import script tests
│   ├── test_integration.py         # Integration tests
│   ├── conftest.py                 # Shared fixtures
│   └── README.md                   # Test documentation
├── docs/                           # Documentation
│   └── CODE_QUALITY.md             # Code quality guidelines
└── data/                           # Data files (excluded from git)
    └── consolidated_dashboard.csv  # Benchmark data (download the latest CSV from the AWS S3 bucket)
```
1. Clone the repository:

   ```bash
   git clone https://github.com/openshift-psap/performance-dashboard.git
   cd performance-dashboard
   ```

2. Set up a Python environment:

   ```bash
   python -m venv venv
   source venv/bin/activate
   pip install -r requirements.txt
   ```

3. Add your data:

   - Place your `consolidated_dashboard.csv` in the root directory
   - Use the utilities in `manual_runs/scripts/` to process new benchmark data

4. Run the dashboard:

   ```bash
   streamlit run dashboard.py
   ```

5. Access: open http://localhost:8501 in your browser
For a complete development environment with linting, formatting, and code quality tools:
```bash
# Create and activate a virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install development dependencies from pyproject.toml
pip install -e ".[dev]"

# Install pre-commit hooks
pre-commit install
```

Available development commands:

- `make format` - Auto-format code (Black, Ruff)
- `make lint` - Run linting checks
- `make type-check` - Run static type checking
- `make test` - Run tests with coverage
- `make ci-local` - Run all CI checks locally
- `make clean` - Clean temporary files
Code Quality:
- All code is checked with `ruff`, `black`, and `mypy`
- Pre-commit hooks enforce code standards
- Tests must pass before merging
- Documentation required for public functions
See `docs/CODE_QUALITY.md` for detailed information.
1. Build the container:

   ```bash
   podman build -f Dockerfile.openshift -t performance-dashboard .
   ```

2. Run locally:

   ```bash
   podman run -p 8501:8501 performance-dashboard
   ```
- OpenShift CLI (`oc`) installed and configured
- Access to an OpenShift cluster with permissions to create projects
- Container registry access (quay.io or internal registry)
- Latest CSV data file in the project directory
1. Create the namespace/project:

   ```bash
   oc new-project rhaiis-dashboard --display-name="RHAIIS Performance Dashboard"
   ```

2. Prepare your data:

   ```bash
   # Ensure you have the latest consolidated_dashboard.csv in the root directory
   # You can download it from the AWS S3 bucket or generate it using the scripts
   ```

3. Build and push the container image:

   ```bash
   # Build the container image with your data
   podman build -f Dockerfile.openshift -t quay.io/your-username/rhaiis-dashboard:latest .

   # Push to your container registry
   podman push quay.io/your-username/rhaiis-dashboard:latest
   ```

4. Update the image reference in `deploy/openshift-deployment.yaml` to point at the image you pushed.

5. Deploy all components:

   ```bash
   # Deploy the application, service, and route
   oc apply -f deploy/openshift-deployment.yaml
   oc apply -f deploy/openshift-service.yaml
   oc apply -f deploy/openshift-route.yaml
   ```

6. Access the dashboard:

   ```bash
   # Get the dashboard URL
   echo "Dashboard URL: http://$(oc get route rhaiis-dashboard -n rhaiis-dashboard -o jsonpath='{.spec.host}')"
   ```
When you have new data or code changes:
1. Rebuild the image with updated data:

   ```bash
   podman build -f Dockerfile.openshift -t quay.io/your-username/rhaiis-dashboard:latest .
   podman push quay.io/your-username/rhaiis-dashboard:latest
   ```

2. Restart the deployment to use the new image:

   ```bash
   oc rollout restart deployment/rhaiis-dashboard -n rhaiis-dashboard
   ```
1. From manual JSON results:

   ```bash
   python scripts/import_manual_run_jsons.py benchmark.json \
     --model "RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic" \
     --version "vLLM-0.10.1" \
     --tp 8 \
     --accelerator "H200" \
     --runtime-args "tensor-parallel-size: 8; max-model-len: 8192"
   ```

2. Consolidate data: merge the new results with the existing CSV file
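The consolidation step can be sketched as follows. This is a minimal, standard-library example under assumed column names (`model`, `version`, `accelerator`, `tp`); the real script's schema and deduplication rules may differ:

```python
def consolidate(existing_rows: list, new_rows: list,
                key_fields=("model", "version", "accelerator", "tp")) -> list:
    """Merge new benchmark rows (dicts) into existing ones.

    Rows sharing the same key fields are treated as re-runs of the same
    configuration; the newer row wins. key_fields is an assumption, not
    the import script's actual schema.
    """
    merged = {}
    for row in existing_rows + new_rows:
        merged[tuple(row[k] for k in key_fields)] = row
    return list(merged.values())
```

In practice the rows would be read from and written back to `consolidated_dashboard.csv` with `csv.DictReader`/`csv.DictWriter`.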
The project includes a comprehensive test suite with unit and integration tests.
Run all tests:

```bash
pytest tests/
```

Run with coverage:

```bash
pytest tests/ --cov=. --cov-report=html
```

Quick test command:

```bash
make test
```

Test Categories:
- Data Processing Tests - Core data manipulation functions
- Import Script Tests - JSON import and parsing
- Integration Tests - End-to-end workflows
See `tests/README.md` for detailed test documentation.
- `STREAMLIT_SERVER_HEADLESS=true`: Headless mode for production
- `STREAMLIT_SERVER_PORT=8501`: Server port
- `STREAMLIT_SERVER_ADDRESS=0.0.0.0`: Listen address
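For example, a production-style launch setting all three variables inline might look like this (the values shown are the defaults listed above):

```bash
STREAMLIT_SERVER_HEADLESS=true \
STREAMLIT_SERVER_PORT=8501 \
STREAMLIT_SERVER_ADDRESS=0.0.0.0 \
streamlit run dashboard.py
```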
- CSV Format: Must include columns for model, version, accelerator, TP, metrics
- Runtime Args: Semicolon-separated key-value pairs
- Benchmark Profiles: Support for different prompt/output token configurations
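The semicolon-separated runtime-args format can be parsed into a dictionary along these lines. This is a sketch of the format described above, not the dashboard's actual parser:

```python
def parse_runtime_args(raw: str) -> dict:
    """Split a semicolon-separated 'key: value' string into a dict.

    Example input (matching the import script's --runtime-args flag):
    "tensor-parallel-size: 8; max-model-len: 8192"
    """
    args = {}
    for pair in raw.split(";"):
        if ":" in pair:
            key, value = pair.split(":", 1)  # split on first colon only
            args[key.strip()] = value.strip()
    return args
```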
- Fork the repository
- Create a feature branch: `git checkout -b feature-name`
- Set up the development environment: `pip install -e ".[dev]"`
- Install pre-commit hooks: `pre-commit install`
- Make changes and test locally: `pytest tests/`
- Run code quality checks: `make ci-local`
- Update documentation as needed
- Submit a pull request
Development Workflow:

```bash
# 1. Create a feature branch
git checkout -b feature/my-feature

# 2. Make changes
# ... edit code ...

# 3. Run tests
pytest tests/

# 4. Format and lint
make format
make lint

# 5. Commit (pre-commit hooks will run)
git add .
git commit -m "Add feature"

# 6. Push and open a pull request against main
git push origin feature/my-feature
```