
Correct the configuration of LLaVA-CoT #705

Merged 1 commit on Dec 31, 2024

Conversation

XuGW-Kevin
Contributor

@XuGW-Kevin XuGW-Kevin commented Dec 31, 2024

First of all, we sincerely appreciate VLMEvalKit and the tremendous contributions it has made to the entire VLM community! However, the current configuration of LLaVA-CoT (e.g., max_new_tokens) is incorrect, leading to significant deviations in its benchmark results. This PR corrects the configuration of LLaVA-CoT.
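For context, a configuration fix of this kind usually means correcting the generation defaults passed to the model. A minimal sketch follows; only the parameter name `max_new_tokens` comes from the PR description, while the values, the helper function, and the old/new defaults are hypothetical assumptions for illustration:

```python
# Hypothetical sketch of the kind of fix described above.
# Only `max_new_tokens` is named in the PR; all values here are assumptions.

# A token budget that is too small truncates LLaVA-CoT's multi-stage reasoning
# output before the final answer, which skews benchmark scores.
OLD_GENERATION_KWARGS = {"max_new_tokens": 128}   # assumed too-small default
NEW_GENERATION_KWARGS = {"max_new_tokens": 2048}  # assumed corrected default

def build_generation_kwargs(overrides=None):
    """Merge caller overrides onto the corrected generation defaults."""
    kwargs = dict(NEW_GENERATION_KWARGS)
    if overrides:
        kwargs.update(overrides)
    return kwargs
```

Keeping the corrected value as the default while still honoring explicit overrides lets downstream evaluation scripts opt out without editing the model wrapper.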

@kennymckormick kennymckormick merged commit 6e1a59a into open-compass:main Dec 31, 2024
1 check failed
kennymckormick pushed a commit to TobiasLee/VLMEvalKit that referenced this pull request Jan 1, 2025
kennymckormick added a commit that referenced this pull request Jan 1, 2025
* update vlrewardbench

* pre-commit fix

* formatter

* [Improvement] Better `AUTO_SPLIT` and model split for InternVL2

* [Minor] Improve CC-OCR Import

* [Model] Support QVQ

* [Model] Update Molmo Eval to Match Official Implementation (#648)

* add molmo prompts

* fix lint format

* [Fix] Refine Qwen-VL2 device assignment

* [Fix] Fix RealWorldQA md5

* update MMMU_DEV_VAL tsv

* [Fix] Fix confusing image width&height (#704)

Co-authored-by: Yuan Ye <[email protected]>

* Update llama_vision.py (#705)

* [Fix] Fix Lint

* Fix Lint

* Fix Lint

---------

Co-authored-by: kennymckormick <[email protected]>
Co-authored-by: jamespark3922 <[email protected]>
Co-authored-by: CMeteor <[email protected]>
Co-authored-by: Yuan Ye <[email protected]>
Co-authored-by: Guowei Xu <[email protected]>
2 participants