Skip to content

TsinghuaC3I/Awesome_Image_Generation_with_Thinking

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 

Repository files navigation

πŸ’‘πŸŽžοΈ Awesome_Image_Generation_with_Thinking

Logo

Image Generation with Thinking.

Awesome License: MIT

Welcome to the Awesome-Image-Generation-with-Thinking repository! This repository represents a comprehensive collection of research focused on empowering models to think during image generation. We explore current works and summarize them into three approaches: explicit reflection, reinforcement learning, and unified multimodal models.


πŸ”” News

  • [2025-06] We created this repository to maintain a paper list on Awesome-Image-Generation-With-Thinking. Contributions are welcome!

πŸ“œ Table of Contents


πŸ“– Survey

🧠 Reinforcement Learning

Reinforcement learning has been proven to be a crucial step in enhancing reasoning capabilities. Here, we summarize methods that utilize reinforcement learning, such as GRPO, into image generation process.

πŸ—’οΈ Explicit Reflection

Reflection is an essantial step in thinking processes. Explicit reflection, which leverages modalities such as text, object coordinates, and image with editing instructions, is a typical approach.


πŸš€ Unified LMMs

Unified LMMs inherently excel at text-to-image controllability, hence we collect a list of relevant works.


πŸ“š Benchmarks

Essential resources for understanding the broader landscape and evaluating progress in visual reasoning.

Star History

Star History Chart

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published