A Large Short-video Recommendation Dataset with Raw Text/Audio/Image/Videos (Talk Invited by DeepMind).
Updated Jan 27, 2025 - Python
🔥 Omni large models and datasets for understanding and generating multi-modalities.
A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs.