This repository contains code to optimize the YOLOv12n-seg-residual model using Pruna AI's optimization tools. The optimization process significantly improves inference speed while maintaining model accuracy.
YOLOv12 is a state-of-the-art "Attention-Centric Real-Time Object Detector" developed by Yunjie Tian, Qixiang Ye, and David Doermann. The YOLOv12n-seg-residual variant adds instance segmentation capabilities to the base detection model. This repository specifically focuses on optimizing this model using Pruna AI's optimization tools, which leverage PyTorch's compilation capabilities to achieve significant performance improvements.
The optimization is performed using Pruna's `smash` functionality, which applies various optimization techniques, including graph transformations and compilation optimizations, to make the model run faster without compromising accuracy.
The YOLOv12n-seg-residual model used in this project was sourced from Weights & Biases: YOLOv12n-seg-residual Model
Note: YOLOv12 is relatively new, having been published in February 2025 on arXiv as "YOLOv12: Attention-Centric Real-Time Object Detectors."
```bash
git clone https://github.com/YOUR_USERNAME/yolov12n-seg-optimization-pruna-ai.git
cd yolov12n-seg-optimization-pruna-ai
```
```bash
# Create a virtual environment
python -m venv venv

# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt
```
- Download the YOLOv12n-seg-residual model from Weights & Biases
- Place the downloaded `.pt` file in the `models/` directory:

```bash
# Create models directory if it doesn't exist
mkdir -p models

# Move the downloaded model to the models directory
mv path/to/downloaded/yolov12n-seg-residual.pt models/
```
```bash
python optimize_yolo.py
```
This will:
- Load the YOLOv12n-seg-residual model
- Benchmark the original model's performance
- Apply Pruna's optimization techniques
- Benchmark the optimized model's performance
- Save the optimized model to `models/yolov12n-seg-residual_smashed_tc_gpu.pt`
You can modify the following parameters in `optimize_yolo.py`:

- `MODEL_PATH`: Path to the original YOLOv12n-seg-residual model
- `SMASHED_MODEL_PATH`: Path to save the optimized model
- `NUM_WARMUP_RUNS`: Number of warm-up inference runs before benchmarking
- `NUM_TIMED_RUNS`: Number of inference runs for benchmarking
- `SAMPLE_INPUT_SHAPE`: Input shape for the model (default: `[1, 3, 640, 640]`)
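For reference, these parameters might sit at the top of `optimize_yolo.py` roughly as follows. The names come from this README; all values except the documented default input shape are illustrative, not the script's actual settings.

```python
# Illustrative configuration block; values other than SAMPLE_INPUT_SHAPE's
# documented default are examples only.
MODEL_PATH = "models/yolov12n-seg-residual.pt"
SMASHED_MODEL_PATH = "models/yolov12n-seg-residual_smashed_tc_gpu.pt"
NUM_WARMUP_RUNS = 10   # warm-up inferences, excluded from timing
NUM_TIMED_RUNS = 100   # timed inferences averaged for the benchmark
SAMPLE_INPUT_SHAPE = [1, 3, 640, 640]  # [batch, channels, height, width]
```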
The optimization process uses Pruna AI's `smash` functionality with PyTorch's compilation backend. The script:
- Loads the YOLOv12n-seg-residual model
- Configures Pruna's SmashConfig with appropriate compiler settings
- Applies PyTorch's inductor backend optimization for NVIDIA GPUs
- Benchmarks the original and optimized models to measure performance improvement
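The warm-up-then-time benchmarking pattern described above can be sketched with the standard library alone. The function names here are illustrative, not taken from `optimize_yolo.py`:

```python
import time

def benchmark(run_inference, num_warmup=10, num_timed=100):
    """Return mean latency in milliseconds for a zero-argument callable."""
    # Warm-up runs let caches, allocators, and (for compiled models)
    # the compiler settle before any timing begins.
    for _ in range(num_warmup):
        run_inference()
    start = time.perf_counter()
    for _ in range(num_timed):
        run_inference()
    elapsed = time.perf_counter() - start
    return elapsed / num_timed * 1000.0

def speedup(original_ms, optimized_ms):
    """Speedup factor: ratio of original latency to optimized latency."""
    return original_ms / optimized_ms
```

The same helper times both the original and the smashed model; the reported improvement is then `speedup(original_ms, optimized_ms)`.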
Typical performance improvements vary depending on hardware, but you can expect:
- 1.5-3x speedup on NVIDIA GPUs
- Improved throughput for real-time applications
YOLOv12 is an attention-centric YOLO framework that matches the speed of CNN-based models while harnessing the performance benefits of attention mechanisms. According to the authors:
- YOLOv12-N achieves 40.6% mAP with an inference latency of 1.64 ms on a T4 GPU
- It outperforms advanced YOLOv10-N / YOLOv11-N by 2.1%/1.2% mAP with comparable speed
- The model also surpasses end-to-end real-time detectors like RT-DETR and RT-DETRv2
The main dependencies are:
- PyTorch (>= 2.0.0)
- Ultralytics (provides the `YOLO` class used to load and run the model)
- Pruna (for optimization)
See `requirements.txt` for the complete list of dependencies.
MIT License