[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example #359

weikaih04 · 2024-08-05T19:57:26Z

Hi,

For this PR

I added TaskMeAnything-v1-video-random video benchmark which includes 2700 video mcq questions.
Along with the benchmark, I found that unlike image mcq, video datasets don’t have a video_mcq.py file, which might hard for adding other mcq video benchmark. Therefore, I implemented video_mcq.py following the logic of image_mcq.py.

The usage of video_mcq.py is the same as image_mcq.py:
Just convert the benchmark to a TSV file, and encode the MP4 video to base64. I have provided the function named mp4_to_base64 in vlmeval/dataset/utils/video_mcq_utils.py.

I added the TaskMeAnything-v1-video-random video benchmark as an example for video_mcq.py and tested it on Paligemma (ImageQA model) and Video-LLaVA (VideoQA model), and it works well.

…dd_tma_video

…LMEvalKit into add_tma_video

junming-yang · 2024-08-06T03:25:08Z

Hi, @weikaih04. We found that TaskMeAnything_v1_imageqa_random has a minor typo. The field of tsv file spelled category incorrectly, resulting in the final calculation result can only output the overall score, but not the category.

weikaih04 · 2024-08-06T04:52:44Z

@junming-yang Hi, thank you for pointing this out! I’ve uploaded new versions of ImageQA and VideoQA and changed the md5, it should work now.

JieyuZ2 · 2024-08-06T09:11:26Z

hey @junming-yang do you think it's ready to merge?

kennymckormick · 2024-08-07T13:48:33Z

Hi, @JieyuZ2 ,
I may need to take a look at this PR and run the evaluation. I will try to complete this in this week.

JieyuZ2 · 2024-08-09T22:38:08Z

Hi, @JieyuZ2 , I may need to take a look at this PR and run the evaluation. I will try to complete this in this week.

Thanks!

weikaih04 and others added 8 commits August 4, 2024 15:38

add tma_image_random

933ad54

fix_name

7ee78c8

fix bug

1bde603

new_md5

beb19d7

fix md5

a4f0b30

support for video mcq and with TaskMeAnything-v1-videoqa-random

db9e3c2

Merge branch 'main' of https://github.com/weikaih04/VLMEvalKit into a…

bff75d1

…dd_tma_video

Merge branch 'add_tma_image_random' of https://github.com/weikaih04/V…

cdda54f

…LMEvalKit into add_tma_video

fix md5 bugs

a9d279f

junming-yang and others added 2 commits August 6, 2024 13:24

Merge branch 'main' into add_tma_video

35617a3

format the code with flake8 formatter instead of rufff

45e35cc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example #359

[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example #359

weikaih04 commented Aug 5, 2024

junming-yang commented Aug 6, 2024

weikaih04 commented Aug 6, 2024

JieyuZ2 commented Aug 6, 2024

kennymckormick commented Aug 7, 2024

JieyuZ2 commented Aug 9, 2024

[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example #359

Are you sure you want to change the base?

[Benchmark] Support Video MCQ with TaskMeAnything-v1-video-random as an example #359

Conversation

weikaih04 commented Aug 5, 2024

junming-yang commented Aug 6, 2024

weikaih04 commented Aug 6, 2024

JieyuZ2 commented Aug 6, 2024

kennymckormick commented Aug 7, 2024

JieyuZ2 commented Aug 9, 2024