Local judge LLM? #664

lyzhongcrd · 2024-12-12T17:17:32Z

Is it possible to use locally deployed LLM like LLaVa-Critic as judge LLM instead of calling GPT4 API?

kennymckormick · 2024-12-17T09:24:46Z

Hi, @lyzhongcrd ,
Yeah. However, we recommend you use the same LLM as the judger for all LMMs to make it comparable.
For MCQ or Y/N benchmarks, when LLMs are only used as choice extractor for more accurate evaluation, using different LLMs will not lead to significantly different results.

lyzhongcrd · 2024-12-20T08:59:56Z

@kennymckormick Could you tell me how to use locally deployed LLMs as judge LLM in the VLM eval kit? Thanks.

Leke-G · 2024-12-24T02:16:11Z

@kennymckormick您能告诉我如何使用本地部署的LLM作为VLM评估套件中的判断LLM吗？谢谢。

请问一下贴主，如果我要评估的模型是VLM，本地部署的模型应该是LLM还是VLM呢

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local judge LLM? #664

Local judge LLM? #664

lyzhongcrd commented Dec 12, 2024

kennymckormick commented Dec 17, 2024

lyzhongcrd commented Dec 20, 2024

Leke-G commented Dec 24, 2024

Local judge LLM? #664

Local judge LLM? #664

Comments

lyzhongcrd commented Dec 12, 2024

kennymckormick commented Dec 17, 2024

lyzhongcrd commented Dec 20, 2024

Leke-G commented Dec 24, 2024