GitHub - NVIDIA/flashdreams: high-performance inference and serving library for interactive autoregressive video and world models

FlashDreams is a high-performance inference and serving library for interactive autoregressive video and world models. It began as the optimized runtime behind the NVIDIA OmniDreams closed-loop demo for GTC 2026 and has grown into a general platform for real-time world-model applications across gaming, autonomous vehicles, robotics, simulated or virtual environments, and more.

flashdreams-promo-hq-0601-720P.mp4

System Requirements

NVIDIA GPU with 80 GB VRAM or more (e.g. H100 80GB), see notes below.
NVIDIA driver from the R580 series or newer (compatible with CUDA 13.x)
CUDA 13.x (PyTorch 2.11.0+cu130 and the nvidia-*-cu13 libraries are resolved by uv sync. A system CUDA toolkit is needed only for the developer extras and is included in nvidia/cuda:13.2.1-cudnn-devel-ubuntu24.04)
Python >= 3.10
PyTorch >= 2.11.0+cu130 (>= 2.9 for bare PyPI library install)
Linux x86-64 or arm64
100 GB+ free storage space recommended for environment and model checkpoints.
Docker with the NVIDIA Container Toolkit (optional, only for the container workflow)

Development and testing were performed on GPUs with 80 GB of VRAM or more. Inference can fail (out-of-memory) on consumer and even enthusiast GPUs. Per-model GPU and VRAM requirements are listed on each model page in the model gallery.

Quickstart

The complete setup is in the installation guide. Assuming uv is installed, the shortest viable path is:

git clone https://github.com/NVIDIA/flashdreams.git
cd flashdreams
uv sync --extra runners
export HF_TOKEN=<your-hf-token>
uv run flashdreams-run --help

Note for developers/maintainers you would want to run uv sync --extra dev --extra runners instead.

Then launch your first model by following the quickstart guide. For example, the offline Self-Forcing T2V quickstart command is:

uv run --project integrations/self_forcing \
    flashdreams-run self-forcing-wan2.1-t2v-1.3b \
    --total-blocks 7

You can also install FlashDreams as a library from PyPI:

pip install flashdreams

Try the interactive driving demo

Drive a world model in real time with the OmniDreams interactive-drive demo. See the interactive demo guide.

Supported models

FlashDreams ships first-party integrations under integrations/. Each model has a dedicated docs page with runner slugs, multi-GPU commands, and (where available) profiling benchmarks.

Model	Family
Self-Forcing	Streaming Wan2.1 T2V
OmniDreams	HDMap-conditioned driving world model
LingBot-World	Camera-controllable I2V world model
Wan2.1	Bidirectional T2V / I2V
Causal-Forcing	Streaming Wan2.1 T2V / I2V
Causal Wan2.2	FastVideo Causal Wan 2.2 14B MoE T2V
FlashVSR	Streaming video super-resolution
Cosmos-Predict2.5	Bidirectional T2V / I2V

See the model gallery and the new method guide to add your own.

Developer guides

For day-to-day development:

uv sync --extra dev --extra runners
uv run pre-commit run -a
uv run pytest -m "not manual"

See DEV.md for repository-specific workflow notes.

Contributing

For how to contribute, see CONTRIBUTING.md. New integrations, bug reports, feature requests, performance tuning, and documentation edits are all welcome.

Use GitHub Issues to report defects or request improvements.

Join us on the NVIDIA Omniverse Discord to share your results and take part in technical discussion! Channel: #flashdreams

Security

To report a potential security vulnerability, follow the coordinated disclosure process in SECURITY.md.

License

FlashDreams is released under the Apache License 2.0. Third-party components and their licenses are listed in THIRD-PARTY-NOTICES and NOTICE. The repository is REUSE-compliant; see REUSE.toml and LICENSES/.

Citation

If FlashDreams is useful in your research or product, please cite the project:

@misc{flashdreams2026,
  title        = {FlashDreams: High-performance inference and serving for
                  interactive autoregressive video and world models},
  author       = {{FlashDreams Contributors}},
  year         = {2026},
  howpublished = {\url{https://github.com/NVIDIA/flashdreams}},
}

@misc{nvidia2026omnidreams,
  title={OmniDreams: Real-Time Generative Closed-Loop Autonomous Vehicle Simulation Built on NVIDIA Cosmos},
  author={Basant, Aarti and Kar, Amlan and Paschalidou, Despoina and Garcia Cobo, Guillermo and Turki, Haithem and Ling, Huan and Seo, Jaewoo and Wang, Jialiang and Lucas, James and Wu, Jay and Lorraine, Jonathan and Gao, Jun and He, Kai and Tothova, Katarina and Xie, Kevin and Tyszkiewicz, Michal and Wu, Qi and de Lutio, Riccardo and Li, Ruilong and Fidler, Sanja and Kim, Seung Wook and Shen, Tianchang and Cao, Tianshi and Pfaff, Tobias and Lew, William and Ren, Xuanchi and Lu, Yifan and Gojcic, Zan and Wang, Zian},
  year={2026},
  note={Technical report},
  howpublished={\url{https://research.nvidia.com/labs/sil/projects/omnidreams-blog/paper.pdf}}
}

Name		Name	Last commit message	Last commit date
Latest commit History 535 Commits
.github		.github
LICENSES		LICENSES
agentic/skills		agentic/skills
assets		assets
docker		docker
docs		docs
flashdreams		flashdreams
integrations		integrations
tests		tests
.gitignore		.gitignore
.pre-commit-config-base.yaml		.pre-commit-config-base.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DEV.md		DEV.md
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
REUSE.toml		REUSE.toml
SECURITY.md		SECURITY.md
THIRD-PARTY-NOTICES		THIRD-PARTY-NOTICES
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

System Requirements

Quickstart

Try the interactive driving demo

Supported models

Developer guides

Contributing

Security

License

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

System Requirements

Quickstart

Try the interactive driving demo

Supported models

Developer guides

Contributing

Security

License

Citation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages