Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation

📖 Project Overview

This repository contains the academic website for our CVPR 2025 paper on uncertainty quantification in text-to-image (T2I) generative models. We introduce PUNC (Prompt-based UNCertainty Estimation), a novel method that leverages Large Vision-Language Models (LVLMs) to better address uncertainties arising from the semantics of prompts and generated images.

🎯 Key Contributions

First work to quantify and evaluate uncertainty of T2I models with respect to the prompt
Novel PUNC method using LVLMs for semantic uncertainty estimation in text space
Uncertainty disentanglement of aleatoric and epistemic uncertainties via precision and recall
Comprehensive dataset of text prompts and generation pairs for further research
Practical applications in bias detection, copyright protection, and OOD detection

🚀 Setup & Usage

Local Development

Clone the repository

git clone https://github.com/ENSTA-U2IS-AI/Uncertainty_diffusion.git
cd Uncertainty_diffusion

📚 Research Details

Authors

Gianni Franchi - ENSTA Paris
Nacim Belkhir - mirai
Dat Trong NGUYEN - ENSTA Paris
Guoxuan Xia - Imperial College London
Andrea Pilzer - NVIDIA

Citation

@inproceedings{franchi2024uncertainty,
  title={Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation},
  author={Franchi, Gianni and Belkhir, Nacim and Nguyen, Dat Trong and Xia, Guoxuan and Pilzer, Andrea},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2025}
}

🙏 Acknowledgements

This work was performed using HPC resources from:

GENCI-IDRIS (Grant 2023-[AD011011970R3])
EuroHPC Development access to LEONARDO, hosted by CINECA

Website Credits

Design & Development: Nacim Belkhir

📄 License

This project is open source. Please cite our work if you use this code or website template.

🔗 Links

📄 Paper: ArXiv:2412.03178
💻 Code: GitHub Repository
🎓 Conference: CVPR 2025

For questions about the research, please contact the authors. For website technical issues, contact Nacim Belkhir.

Name		Name	Last commit message	Last commit date
Latest commit History 97 Commits
.gitignore		.gitignore
resources		resources
README.md		README.md
index.html		index.html
script.js		script.js
styles.css		styles.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation

📖 Project Overview

🎯 Key Contributions

🚀 Setup & Usage

Local Development

📚 Research Details

Authors

Citation

🙏 Acknowledgements

Website Credits

📄 License

🔗 Links

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

ENSTA-U2IS-AI/diffusion-uncertainty

Folders and files

Latest commit

History

Repository files navigation

Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation

📖 Project Overview

🎯 Key Contributions

🚀 Setup & Usage

Local Development

📚 Research Details

Authors

Citation

🙏 Acknowledgements

Website Credits

📄 License

🔗 Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages