Evaluator name mismatch

Thank you for your great work.

I am currently working on this dataset and trying to build something on it, but I found a major problem. The names of the evaluators defined in vbvrevalkit/eval/vbvr_bench/evaluators/__init__.py is different from the name of the tasks in VBVR-Dataset/tars. Specifically, the names of 50 tasks match perfectly. However, for the remaining 50 tasks, both the IDs and the names are different. When looking at it manually, I can identify which evaluator corresponds to which task based on the names (e.g., _'O-60_symbol_substitute_data-generator': SymbolSubstituteEvaluator_ is actually the evaluator of _'O-4_symbol_substitution_data-generator.tar'_). However, this creates significant difficulties when trying to call the evaluator within the code to calculate scores.

I would like to ask: Is this a mistake with how I am using it? Or is this an actual problem within the code itself?
Could you please look into this? Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluator name mismatch #238

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Evaluator name mismatch #238

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions