This is a Python repository for downloading pretrained weights of, or re-training, a multimodal masked autoencoder on anatomical brain MRIs. It naturally handles missing modalities and processes any combination of them. The model supports fine-tuning for classification, segmentation, and survival analysis, and enables 3D MRI reconstruction from any subset of available input modalities.
Supporting code for the corresponding paper
BM-MAE relies solely on transformer blocks to extract multimodal anatomical MRI features that can later be used to fine-tune models. The considered anatomical modalities are: T1, T1c, FLAIR, T2.
We recommend using conda and installing the required libraries from the CLI as follows:
$ conda create -n bmmae python=3.10.10
$ conda activate bmmae
$ pip install -r requirements.txt
and you're good to go!
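For a quick sanity check of the environment (PyTorch is assumed to be pulled in by requirements.txt), you can print the installed version and GPU visibility:
import torch
print(torch.__version__, torch.cuda.is_available())  # e.g. '2.x.x True' on a GPU machine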
The weights of all models are available for download from Google Drive.
You can also simply download them from the HuggingFace Hub.
Once downloaded, place all models in pretrained_models.
There are two available models:
1️⃣ bmmae.pth – The original base model pretrained on BraTS2021.
2️⃣ bmmae_tcga.pth – A specialized model pre-trained without the TCGA subset.
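A quick way to confirm the checkpoints ended up in the expected location (paths assume the default pretrained_models directory at the repository root):
from pathlib import Path
# Expect bmmae.pth and bmmae_tcga.pth to be listed.
print(sorted(Path('pretrained_models').glob('*.pth')))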
You can also load them directly from HuggingFace with the following code
from bmmae.model import BMMAE, ViTEncoder
model = BMMAE.from_pretrained()
encoder_only = ViTEncoder.from_pretrained()

To train the model or replicate the fine-tuning results presented in the paper, you will need to download the BraTS2021 dataset.
The simplest way to use BM-MAE is to extract a relevant representation through the ViT encoder. Suppose that, for a given patient, you only have two modalities, T1 and T2:
import torch
from bmmae.model import ViTEncoder
from bmmae.tokenizers import MRITokenizer
modalities = ['t1', 't2']
# One tokenizer per available modality: 16³-voxel patches over 128³ volumes.
tokenizers = {
    modality: MRITokenizer(
        patch_size=(16, 16, 16),
        img_size=(128, 128, 128),
        hidden_size=768,
    )
    for modality in modalities
}
# Multimodal ViT encoder with a global class token.
encoder = ViTEncoder(
    modalities=modalities,
    tokenizers=tokenizers,
    cls_token=True,
)
state_dict = torch.load('pretrained_models/bmmae.pth')
encoder.load_state_dict(state_dict, strict=False)  # strict=False: checkpoint keys not belonging to the encoder are ignored
inputs = {'t1': torch.randn(1, 1, 128, 128, 128), 't2': torch.randn(1, 1, 128, 128, 128)}
outputs = encoder(inputs)  # shape [1, 1025, 768]: 1 class token + 2 × 512 patch tokens
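From there, a common pattern is to pool the token sequence into a patient-level feature vector for a downstream head. The sketch below builds on the snippet above and assumes the class token is prepended at index 0 (an assumption about the token ordering); the linear head is purely illustrative:
import torch.nn as nn

cls_embedding = outputs[:, 0]        # (1, 768) global representation (assumed CLS position)
patch_tokens = outputs[:, 1:]        # (1, 1024, 768): 512 patch tokens per available modality
pooled = patch_tokens.mean(dim=1)    # (1, 768) alternative: mean-pool the patch tokens

head = nn.Linear(768, 2)             # hypothetical head, e.g. LGG vs GBM
logits = head(cls_embedding)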
BM-MAE allows simple reconstruction of any combination of modalities. Concrete examples are given in the notebook reconstructions.ipynb.
Although weights are supplied, it is still possible to train the model from scratch using multimodal MRI data:
$ python pretrain_bmmae.py --data_dir PATH/TO/DATASET
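For intuition, pretraining follows the usual masked-autoencoder recipe: a random subset of 3D patches from the available modalities is hidden from the encoder, and a lightweight decoder reconstructs the missing voxels. The loss below is a conceptual sketch of that objective (masking ratio, reduction, and names are illustrative, not the exact implementation in pretrain_bmmae.py):
import torch

def masked_reconstruction_loss(pred, target, mask):
    """Mean squared error computed only on masked patches.

    pred, target: (B, num_patches, patch_dim) flattened voxel patches
    mask: (B, num_patches), 1 where the patch was hidden from the encoder
    """
    per_patch = ((pred - target) ** 2).mean(dim=-1)
    return (per_patch * mask).sum() / mask.sum()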
Fine-tuning BM-MAE for segmentation, subtyping or survival analysis is straightforward. Follow the steps below based on your task:
To adapt a UNETR model using the pre-trained ViT encoder for segmentation, for instance on the T1c and FLAIR modalities, run:
$ python finetune_seg.py --modalities t1ce flair
To fine-tune a ViT for subtyping classification (e.g., LGG vs. GBM) using, for instance, only the T1 modality, run the following command:
$ python finetune_cls.py --task project_id --modalities t1
Note that this setup uses the dataset located at data/data.csv.
The --task argument corresponds to any column from this file.
Currently available tasks include project_id (subtyping), IDH, MGMT, and TERT labels.
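To see which label columns are available (and thus which values of --task are valid in your copy of the file), you can inspect data/data.csv directly; the snippet assumes pandas is installed:
import pandas as pd

df = pd.read_csv('data/data.csv')
print(df.columns.tolist())              # candidate values for --task
print(df['project_id'].value_counts())  # e.g. LGG vs GBM class balance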
Finally, for survival analysis, use the same modality nomenclature as before (listed after --modalities) and run, for example, on all four modalities:
$ python finetune_cls.py --task project_id --modalities t1 t1ce t2 flair
Two notebooks are included in this repository in addition to reconstructions.ipynb.
eval_seg.ipynb allows you to evaluate fine-tuned segmentation models and perform Wilcoxon tests.
clustering.ipynb groups TCGA patients by predicted scores across the 4 MRI modalities and plots Kaplan-Meier curves to assess the stratification power of the models.
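As a pointer, the Kaplan-Meier part of clustering.ipynb can be reproduced with the lifelines package along the following lines; the column names (time, event, group) are illustrative assumptions, not the notebook's actual schema:
import pandas as pd
from lifelines import KaplanMeierFitter
import matplotlib.pyplot as plt

df = pd.read_csv('data/data.csv')          # assumes survival time/event columns are present
kmf = KaplanMeierFitter()
ax = plt.subplot(111)
for name, grp in df.groupby('group'):      # hypothetical risk-group column
    kmf.fit(grp['time'], event_observed=grp['event'], label=str(name))
    kmf.plot_survival_function(ax=ax)
plt.show()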
💡 Found a bug? Want to add features? Feel free to submit a pull request!
📩 Questions? Open an issue.
@misc{robinet2025multimodalmaskedautoencoderpretraining,
title={Multimodal Masked Autoencoder Pre-training for 3D MRI-Based Brain Tumor Analysis with Missing Modalities},
author={Lucas Robinet and Ahmad Berjaoui and Elizabeth Cohen-Jonathan Moyal},
year={2025},
archivePrefix={arXiv},
primaryClass={cs.CV},
}

