multi-modal-transformers

Here are 2 public repositories matching this topic...

Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"

Multi modal BiTransformer [ Reimplementation ] in Pytorch That Acutally Works !

Add a description, image, and links to the multi-modal-transformers topic page so that developers can more easily learn about it.

To associate your repository with the multi-modal-transformers topic, visit your repo's landing page and select "manage topics."