Questions about downstream tasks #15

Yipinggggg · 2024-04-05T13:49:23Z

Hi, great work! But I have a question I don't understand.

The backbone you used for training is a timesformer which takes a sequence of frames as input, but for all the downstream tasks the input is a single frame. Maybe I haven't fully understood the code, but what does the time dimension do in downstream tasks?

Thank you very much!

Kyfafyd · 2024-04-05T15:30:59Z

Hi @Yipinggggg
Thanks for your interest!
All of our downstream tasks take video sequences as the model input to model the temporal information.
May I learn which part of code is confusing?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about downstream tasks #15

Questions about downstream tasks #15

Yipinggggg commented Apr 5, 2024

Kyfafyd commented Apr 5, 2024

Questions about downstream tasks #15

Questions about downstream tasks #15

Comments

Yipinggggg commented Apr 5, 2024

Kyfafyd commented Apr 5, 2024