BlockZoo

BlockZoo provides a standardized framework to benchmark and profile convolutional blocks in isolation. By embedding blocks into a fixed scaffold architecture at different positions (early, mid, late), we can measure block specialization and performance.

This project was motivated by the demise of the Papers with Code benchmarks and by a desire for a fairer comparison of convolutional blocks.

Results

Each data point is the mean of 5 training runs for one block architecture. Every block was tested at all three scaffold positions (early/mid/late) on CIFAR-100 for 50 epochs.

Repeated benchmarking reveals distinct performance characteristics across block architectures. Some designs, such as RepMixer and DYMicroBlock, show clear limitations for image classification, but most blocks form a Pareto frontier that trades accuracy against inference latency. MobileOneBlock stands out for high-throughput applications: its reparameterization strategy delivers competitive accuracy at remarkably low latency. Inverted residual architectures (InvertedResidual, UniversalInvertedBottleneck, EdgeResidual) consistently achieve high accuracy, which is consistent with their widespread adoption in mobile-optimized networks.

The comparison is hard to make strictly fair because FLOPs and parameter counts differ between blocks. These results use the "default" configuration of each block as defined in its original paper.
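The accuracy-versus-latency Pareto frontier mentioned above can be extracted from any set of (latency, accuracy) pairs. The following is a minimal sketch, not BlockZoo's plotting code: the `Point` type, the `pareto_frontier` function, and the demo values are all hypothetical and exist only for illustration.

```python
from typing import NamedTuple


class Point(NamedTuple):
    name: str
    latency_ms: float  # lower is better
    accuracy: float    # higher is better


def pareto_frontier(points: list[Point]) -> list[Point]:
    """Return the points that no other point beats on both latency and accuracy."""
    # Sort by latency ascending; break ties by accuracy descending.
    ordered = sorted(points, key=lambda p: (p.latency_ms, -p.accuracy))
    frontier: list[Point] = []
    best_acc = float("-inf")
    for p in ordered:
        # Keep a point only if it is more accurate than every faster point.
        if p.accuracy > best_acc:
            frontier.append(p)
            best_acc = p.accuracy
    return frontier


# Placeholder values for illustration only; these are NOT BlockZoo measurements.
demo = [
    Point("block_a", 1.0, 0.60),
    Point("block_b", 2.0, 0.70),
    Point("block_c", 2.5, 0.65),  # slower and less accurate than block_b
    Point("block_d", 4.0, 0.72),
]
frontier_names = [p.name for p in pareto_frontier(demo)]
print(frontier_names)
```

A block that is off the frontier is not necessarily a poor design; it only means that, in this particular two-metric view, some other block is both faster and more accurate.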

Features

Scaffold Architecture: Fixed stem → StageA → StageB → StageC → head for consistent evaluation
Positional Analysis: Test blocks in early/mid/late positions to measure specialization
Comprehensive Profiling: FLOPs, parameters, memory usage, and runtime benchmarking
Lightning Integration: Robust training pipeline with PyTorch Lightning
CSV Export: Structured results for analysis and visualization

Positional Specialization Protocol

The core innovation of BlockZoo is measuring how blocks perform across different network positions. This protocol helps identify whether blocks are specialized for local features (early layers) or global features (late layers).

Protocol Overview

Fixed Scaffold: Use ScaffoldNet with identical stem/head across all experiments
Three Canonical Positions:
- Early (Stage A): High resolution, small receptive field, local context
- Mid (Stage B): Medium resolution with 2x downsampling, balances local and global context
- Late (Stage C): Low resolution with 4x downsampling, global context
Isolation Testing: Only one stage active per experiment
Consistent Training: Same optimizer, schedule, and data across positions
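The protocol above amounts to a grid of experiments: every block is trained at every position under the same settings. The sketch below enumerates such a grid. The `Experiment` dataclass and `build_grid` helper are hypothetical, and treating the 5 repeated runs as 5 seeds is an assumption; the README does not say how runs are distinguished.

```python
from dataclasses import dataclass
from itertools import product

POSITIONS = ("early", "mid", "late")  # Stage A, Stage B, Stage C


@dataclass(frozen=True)
class Experiment:
    block: str
    position: str
    seed: int      # assumption: one seed per repeated run
    epochs: int


def build_grid(blocks, seeds, epochs):
    """One experiment per (block, position, seed); all share the same epoch budget."""
    return [Experiment(b, p, s, epochs) for b, p, s in product(blocks, POSITIONS, seeds)]


grid = build_grid(["ResNetBasicBlock", "MobileOneBlock"], seeds=range(5), epochs=50)
print(len(grid))  # 2 blocks x 3 positions x 5 seeds
for exp in grid[:3]:
    # The command form follows the Usage section. No seed flag is documented
    # there, so the seed is tracked here but not passed on the command line.
    print(f"python -m blockzoo.train {exp.block} --position {exp.position} --epochs {exp.epochs}")
```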

Architecture Details

ScaffoldNet Structure

Input (3×32×32)
    ↓
Stem: Conv3x3(3→64) + BN + ReLU
    ↓
Stage A (Early): 64→64, stride=1, high-res, 3x blocks
    ↓
Stage B (Mid): 64→128, stride=2, medium-res, 3x blocks
    ↓
Stage C (Late): 128→256, stride=2, low-res, 3x blocks
    ↓
Head: AdaptiveAvgPool + Linear(256→classes)
    ↓
Output (classes,)

Key Design Choices:

Fixed Stem/Head: Ensures consistent feature extraction/classification
Single Active Stage: Isolates block performance
Progressive Channels: 64 → 128 → 256 following common practices
Controlled Downsampling: 2× at each transition
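To make the channel and resolution progression concrete, the sketch below traces tensor shapes through the scaffold using only the numbers in the diagram. It assumes 3x3 kernels with padding 1 and a stride-1 stem, neither of which is stated explicitly, and it does not model which stage is active. `conv_out` and `trace_shapes` are hypothetical helpers, not part of BlockZoo.

```python
def conv_out(size: int, kernel: int, stride: int, padding: int) -> int:
    """Spatial output size of a convolution (standard floor formula)."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(num_classes: int = 100, input_hw: int = 32):
    """Return (layer, shape) pairs for the scaffold described above."""
    # (name, out_channels, stride) taken from the ScaffoldNet diagram.
    # Assumption: the stem uses stride 1, since Stage A is described as high-res.
    layers = [("Stem", 64, 1), ("Stage A", 64, 1), ("Stage B", 128, 2), ("Stage C", 256, 2)]
    hw = input_hw
    shapes = [("Input", (3, hw, hw))]
    for name, channels, stride in layers:
        # Assumption: 3x3 kernels with padding 1, so only the stride changes resolution.
        hw = conv_out(hw, kernel=3, stride=stride, padding=1)
        shapes.append((name, (channels, hw, hw)))
    shapes.append(("Pool", (256,)))          # AdaptiveAvgPool collapses H and W
    shapes.append(("Head", (num_classes,)))  # Linear(256 -> classes)
    return shapes


for name, shape in trace_shapes():
    print(f"{name:8s} {shape}")
```

With a 32×32 input this gives 32×32 feature maps in Stage A, 16×16 in Stage B, and 8×8 in Stage C, matching the 2× downsampling at each transition.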

Available Block Types:

All blocks are defined in the BlockZoo registry and follow the unified (in_channels, out_channels, stride, position) interface:

ResNet blocks: ResNetBasicBlock, ResNetBottleneck
MobileNet/EfficientNet blocks: InvertedResidual, UniversalInvertedBottleneck, EdgeResidual
Advanced mobile blocks: MobileOneBlock, ReparamLargeKernelConv, GhostBottleneckV3, MBConvLNBlock
Vision transformer blocks: RepMixer

To see currently available blocks:

from blockzoo import list_available_blocks
print("Available blocks:", list_available_blocks())
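A unified constructor signature is what lets the scaffold swap blocks freely. The sketch below shows one common way to build such a registry. It is not BlockZoo's actual implementation: `register_block`, `list_blocks`, `build_block`, and `ToyBlock` are hypothetical names, and passing `position` as a string is an assumption.

```python
from typing import Callable, Dict, List

_REGISTRY: Dict[str, Callable] = {}  # hypothetical; not BlockZoo's actual registry


def register_block(cls):
    """Class decorator that records a block under its class name."""
    _REGISTRY[cls.__name__] = cls
    return cls


def list_blocks() -> List[str]:
    """Names of all registered blocks, sorted alphabetically."""
    return sorted(_REGISTRY)


def build_block(name: str, in_channels: int, out_channels: int, stride: int, position: str):
    """Construct any registered block through the same four-argument call."""
    if position not in ("early", "mid", "late"):
        raise ValueError(f"unknown position: {position!r}")
    return _REGISTRY[name](in_channels, out_channels, stride, position)


@register_block
class ToyBlock:
    """Stand-in for a real block; a real one would subclass torch.nn.Module."""

    def __init__(self, in_channels, out_channels, stride, position):
        self.in_channels = in_channels
        self.out_channels = out_channels
        self.stride = stride
        self.position = position


block = build_block("ToyBlock", in_channels=64, out_channels=128, stride=2, position="mid")
print(list_blocks(), block.out_channels)
```

Because every block accepts the same arguments, the code that assembles a stage does not need to know which block family it is instantiating.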

Getting Started

Installation

git clone git@github.com:Teque5/blockzoo.git
cd blockzoo
pip install --editable .

Usage

Profile a Block

python -m blockzoo.profile ResNetBasicBlock --position mid

Benchmark Runtime Performance

python -m blockzoo.benchmark ResNetBasicBlock --position mid --device cuda

Full Training & Evaluation

python -m blockzoo.train ResNetBasicBlock --position mid --epochs 5
# Results saved to results/results.csv

Run All Benchmarks

python3 scripts/bench_all.py

Plot Results

python3 -m blockzoo.visualize results/results.csv

Results Analysis

Results are saved to CSV with columns for accuracy, parameters, FLOPs, latency, and more:

# Load and analyze results
import pandas as pd
df = pd.read_csv('results/results.csv')
position_analysis = df.groupby(['block', 'position']).agg({
    'val_acc': 'mean',
    'params_total': 'mean',
    'latency_ms': 'mean'
}).round(4)

print(position_analysis)

Position Comparison Example

# Shell: train ResNetBasicBlock at all three positions
python -m blockzoo.train ResNetBasicBlock --position early --epochs 10
python -m blockzoo.train ResNetBasicBlock --position mid --epochs 10
python -m blockzoo.train ResNetBasicBlock --position late --epochs 10

# Python: analyze the results
import pandas as pd
df = pd.read_csv('results/results.csv')
print(df.groupby('position')[['val_acc', 'params_total', 'latency_ms']].mean())
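One possible way to summarize positional specialization from such results is to report, per block, the position with the highest mean accuracy and the gap between its best and worst positions. This is only a sketch of one candidate summary, not a metric defined by BlockZoo. The `position_profile` helper is hypothetical, and the sample rows are placeholders that reuse the `block`, `position`, and `val_acc` column names.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Placeholder rows for illustration only; these are NOT measured results.
SAMPLE_CSV = """block,position,val_acc
block_a,early,0.50
block_a,early,0.52
block_a,mid,0.60
block_a,late,0.55
block_b,early,0.70
block_b,mid,0.70
block_b,late,0.71
"""


def position_profile(rows):
    """Map block -> (best position, accuracy gap between best and worst position)."""
    acc = defaultdict(lambda: defaultdict(list))
    for row in rows:
        acc[row["block"]][row["position"]].append(float(row["val_acc"]))
    profile = {}
    for block, by_pos in acc.items():
        # Average repeated runs at each position before comparing positions.
        means = {pos: mean(vals) for pos, vals in by_pos.items()}
        best = max(means, key=means.get)
        gap = round(max(means.values()) - min(means.values()), 4)
        profile[block] = (best, gap)
    return profile


profile = position_profile(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(profile)
```

A large gap suggests the block is sensitive to where it is placed, while a small gap suggests it behaves similarly at every position. Whether a given gap is meaningful depends on the run-to-run variance, which this sketch does not estimate.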

Testing

# Run all tests
coverage run
