Skip to content

Evaluator name mismatch #238

@Zane-ZYQiu

Description

@Zane-ZYQiu

Thank you for your great work.

I am currently working on this dataset and trying to build something on it, but I found a major problem. The names of the evaluators defined in vbvrevalkit/eval/vbvr_bench/evaluators/init.py is different from the name of the tasks in VBVR-Dataset/tars. Specifically, the names of 50 tasks match perfectly. However, for the remaining 50 tasks, both the IDs and the names are different. When looking at it manually, I can identify which evaluator corresponds to which task based on the names (e.g., 'O-60_symbol_substitute_data-generator': SymbolSubstituteEvaluator is actually the evaluator of 'O-4_symbol_substitution_data-generator.tar'). However, this creates significant difficulties when trying to call the evaluator within the code to calculate scores.

I would like to ask: Is this a mistake with how I am using it? Or is this an actual problem within the code itself?
Could you please look into this? Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions