Run benchmarking with the supported models on BEIR MSMARCO #177

Open
HAKSOAT opened this issue Jan 12, 2025 · 1 comment
Labels: evaluation, help wanted

Comments

@HAKSOAT (Collaborator) commented Jan 12, 2025

We need to run benchmarking on the BEIR MSMARCO dataset to better understand how the supported models perform on retrieval tasks.

We can use the test split available on the Hugging Face Hub:

QRels
Corpus

Proposed metrics:

  • NDCG@10
  • Precision@10
  • Recall@100

Non-judged documents should be treated as non-relevant; a loading sketch is below.
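
A rough loading sketch for this setup. The Hub repo IDs (`BeIR/msmarco`, `BeIR/msmarco-qrels`), config/split names, and column names are assumptions based on the usual BEIR layout on the Hub, so adjust them to whatever the links above actually point at:

```python
from collections import defaultdict
from datasets import load_dataset

# NOTE: repo IDs, config/split names, and column names are assumptions based
# on the typical BeIR layout on the Hub; adjust to the linked files above.
corpus_ds = load_dataset("BeIR/msmarco", "corpus", split="corpus")
queries_ds = load_dataset("BeIR/msmarco", "queries", split="queries")
qrels_ds = load_dataset("BeIR/msmarco-qrels", split="test")

# {doc_id: text} and {query_id: text}
corpus = {str(r["_id"]): r["text"] for r in corpus_ds}
queries = {str(r["_id"]): r["text"] for r in queries_ds}

# Only judged pairs are stored in the qrels, so any (query, doc) pair that is
# absent from this mapping is implicitly scored as non-relevant.
relevant_docs = defaultdict(set)
for r in qrels_ds:
    if r["score"] > 0:
        relevant_docs[str(r["query-id"])].add(str(r["corpus-id"]))

# Evaluate only queries that have at least one judged relevant document.
queries = {qid: text for qid, text in queries.items() if qid in relevant_docs}
```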

HAKSOAT added the evaluation and help wanted labels on Jan 12, 2025
@HAKSOAT (Collaborator, Author) commented Jan 17, 2025

The InformationRetrievalEvaluator from SentenceTransformers can be helpful for this.
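
A minimal sketch of how it might be wired up, with toy data standing in for the MSMARCO test split (the model name is just a placeholder, and the queries/corpus/relevant_docs dicts would come from the loading step described above):

```python
from sentence_transformers import SentenceTransformer
from sentence_transformers.evaluation import InformationRetrievalEvaluator

# Toy data for illustration; in practice use the queries/corpus/relevant_docs
# dicts built from the BEIR MSMARCO test split as sketched in the issue body.
queries = {"q1": "what is the capital of france"}
corpus = {"d1": "Paris is the capital of France.", "d2": "Berlin is in Germany."}
relevant_docs = {"q1": {"d1"}}  # unjudged docs are simply absent, i.e. non-relevant

evaluator = InformationRetrievalEvaluator(
    queries=queries,
    corpus=corpus,
    relevant_docs=relevant_docs,
    ndcg_at_k=[10],                   # NDCG@10
    precision_recall_at_k=[10, 100],  # Precision@10/100 and Recall@10/100
    name="beir-msmarco-test",
    show_progress_bar=True,
)

# Placeholder model; rerun this once per supported model being benchmarked.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
scores = evaluator(model)  # recent sentence-transformers releases return a dict of metrics
print(scores)
```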
