Copilot AI commented Jul 30, 2025

This PR introduces an alternate implementation of the MNIST neural network training problem that leverages Intel's OneDNN (oneAPI Deep Neural Network Library) for optimized CPU performance.

Overview

The new implementation provides the same OptimizationProblem interface as the existing Candle-based MNIST implementation but uses Intel's OneDNN library for highly optimized matrix operations and neural network primitives.

Key Features

Performance Optimizations

  • Optimized GEMM operations: Uses OneDNN's highly tuned general matrix multiplication routines
  • Hardware-aware activation functions: Leverages CPU-specific optimizations for ReLU, Tanh, and Logistic functions
  • Memory layout optimization: OneDNN automatically selects optimal memory formats for the target CPU
  • Architecture awareness: Automatically detects and utilizes CPU features like AVX, AVX2, and AVX-512
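The architecture-awareness bullet can be illustrated with a small sketch. This is hypothetical dispatch code, not part of the PR: it mimics how OneDNN picks a kernel at runtime from detected x86 SIMD features, using Rust's standard `std::arch::is_x86_feature_detected!` macro with a portable scalar fallback.

```rust
// Hypothetical sketch (not from mnist_onednn.rs): runtime dispatch over
// detected CPU features, mirroring how OneDNN selects optimized kernels.
fn select_gemm_kernel() -> &'static str {
    #[cfg(target_arch = "x86_64")]
    {
        // Checks run in order from widest to narrowest SIMD width.
        if std::arch::is_x86_feature_detected!("avx512f") {
            return "avx512";
        }
        if std::arch::is_x86_feature_detected!("avx2") {
            return "avx2";
        }
        if std::arch::is_x86_feature_detected!("avx") {
            return "avx";
        }
    }
    "scalar" // portable fallback when no SIMD path applies
}

fn main() {
    println!("selected kernel: {}", select_gemm_kernel());
}
```

OneDNN performs this detection internally, so user code never branches on CPU features itself; the sketch only shows the idea behind the bullet above.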

Implementation Details

  • Same Interface: Implements the OptimizationProblem trait, making it a drop-in replacement for benchmarking
  • Feature Gated: Conditionally compiled with the onednn feature flag to avoid requiring OneDNN installation
  • Thread Safe: Uses interior mutability patterns for safe concurrent access
  • Configurable: Supports multiple activation functions and network architectures
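Two of the patterns above, feature gating and thread-safe interior mutability, can be sketched together. This is an illustrative stand-in, not the actual `mnist_onednn.rs` source; the `Network` struct and placeholder loss are invented for the example.

```rust
use std::sync::Mutex;

// Compiled only when the `onednn` feature is enabled, so builds without
// OneDNN installed still succeed.
#[cfg(feature = "onednn")]
mod onednn_backend {
    // Real OneDNN bindings would live behind this feature gate.
}

struct Network {
    // Interior mutability: evaluate() takes &self yet can reuse this
    // scratch buffer, and the Mutex keeps concurrent callers safe.
    scratch: Mutex<Vec<f64>>,
}

impl Network {
    fn evaluate(&self, point: &[f64]) -> f64 {
        let mut buf = self.scratch.lock().unwrap();
        buf.clear();
        buf.extend_from_slice(point);
        buf.iter().map(|x| x * x).sum() // placeholder quadratic loss
    }
}

fn main() {
    let net = Network { scratch: Mutex::new(Vec::new()) };
    println!("loss = {}", net.evaluate(&[1.0, 2.0])); // prints "loss = 5"
}
```

The `Mutex` is one common choice here; a real implementation might instead use `RefCell` if concurrent access were not required.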

Usage Example

```rust
use qqn_optimizer::{MnistOneDnnNeuralNetwork, benchmarks::mnist_onednn::ActivationType};
use rand::{rngs::StdRng, SeedableRng};

// Inside a function that returns Result, so the `?` operator applies.
let mut rng = StdRng::seed_from_u64(42);
let network = MnistOneDnnNeuralNetwork::create(
    Some(1000),                    // 1000 samples
    &[64, 32],                     // Hidden layers: 64 and 32 neurons
    Some(64),                      // Batch size
    &mut rng,
    Some(ActivationType::ReLU),    // ReLU activation
)?;

// Use with any optimizer
let loss = network.evaluate_f64(&network.initial_point())?;
```

Benchmarking Support

The implementation includes comprehensive benchmarking tools:

```sh
# Compare OneDNN vs Candle performance (requires OneDNN installation)
cargo run --example benchmark_comparison --features onednn --release

# Run OneDNN-specific examples
cargo run --example onednn_mnist --features onednn
```

Installation

OneDNN must be installed separately. The PR includes:

  • Automated installation script (install_onednn.py) for Ubuntu/Debian systems
  • Comprehensive documentation in docs/onednn_mnist.md
  • Installation instructions for multiple platforms

Files Added

  • src/benchmarks/mnist_onednn.rs - Core OneDNN implementation
  • docs/onednn_mnist.md - Comprehensive documentation and usage guide
  • examples/onednn_mnist.rs - Basic usage example
  • examples/benchmark_comparison.rs - Performance comparison tool
  • install_onednn.py - Automated OneDNN installation script

Compatibility

  • Backward Compatible: Existing code continues to work unchanged
  • Optional Dependency: OneDNN is only required when using the onednn feature
  • Cross-Platform: Works on any system where OneDNN can be installed
  • API Consistent: Maintains the same interface patterns as existing implementations

This implementation enables researchers and practitioners to leverage Intel's highly optimized neural network primitives while maintaining full compatibility with the existing QQN optimization framework.



Copilot AI changed the title [WIP] Create an alternate version of the mnist problem suite that leverages intel's onednn Add OneDNN-based MNIST neural network implementation for optimized performance Jul 30, 2025
Copilot AI requested a review from acharneski July 30, 2025 22:59
@acharneski acharneski marked this pull request as ready for review July 30, 2025 23:40
@acharneski acharneski merged commit 91666b5 into master Jul 30, 2025
12 of 16 checks passed
@acharneski acharneski deleted the copilot/fix-d3736f03-bf86-4ac3-9968-cfb797e43c6f branch July 30, 2025 23:41