-
Notifications
You must be signed in to change notification settings - Fork 126
Open
Description
Hi, huge thanks to the authors for releasing this amazing project!
I'm looking forward to using your model and data in my own research.
Regarding video-text data curation, it seems you re-captioned the original data using VideoChat2 and Gemini and re-annotated the videos for instruction fine-tuning.
To construct a same video-text dataset with yours, could you please share the detailed recaptioning & annotating pipeline and code, demonstrated in the figure below?
It must be helpful for my research. Thank you so much!

Metadata
Metadata
Assignees
Labels
No labels