Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"
-
Updated
Nov 11, 2024 - Python
Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"
Multi modal BiTransformer [ Reimplementation ] in Pytorch That Acutally Works !
Add a description, image, and links to the multi-modal-transformers topic page so that developers can more easily learn about it.
To associate your repository with the multi-modal-transformers topic, visit your repo's landing page and select "manage topics."