QA evaluation #233

Open
lizhen-lizhen opened this issue Dec 9, 2024 · 1 comment

Comments

@lizhen-lizhen

When I evaluate with my own QA dataset, why does it raise an error once the dataset has more than 121 entries?
Error message:
Traceback (most recent call last):
  File "/home/hy/lizhen/evalscope-main/test.py", line 25, in <module>
    run_task(task_cfg=task_cfg)
  File "/home/hy/lizhen/evalscope-main/evalscope/run.py", line 367, in run_task
    res_dict: dict = evaluator.eval(infer_cfg=infer_cfg, debug=debug)
  File "/home/hy/lizhen/evalscope-main/evalscope/evaluator/evaluator.py", line 485, in eval
    reviews_list: list = self.get_reviews(subset_name=subset_name,
  File "/home/hy/lizhen/evalscope-main/evalscope/evaluator/evaluator.py", line 350, in get_reviews
    review_d = self._get_review(answer_d=answer_d, review_id=review_id, reviewer_spec=reviewer_spec)
  File "/home/hy/lizhen/evalscope-main/evalscope/evaluator/evaluator.py", line 294, in _get_review
    review_result = self.data_adapter.match(gold_content, answer_content)
  File "/home/hy/lizhen/evalscope-main/evalscope/benchmarks/general_qa/general_qa_adapter.py", line 115, in match
    rouge_dict = compute_rouge_score_one_sample_zh([pred], [gold])
  File "/home/hy/lizhen/evalscope-main/evalscope/metrics/rouge_metric.py", line 67, in compute_rouge_score_one_sample_zh
    r = ' '.join(jieba.cut(r)) if is_contains_chinese(r) else r
  File "/home/hy/lizhen/evalscope-main/evalscope/metrics/rouge_metric.py", line 34, in is_contains_chinese
    for _char in strs:
TypeError: 'NoneType' object is not iterable

@Yunnglin
Collaborator

This looks similar to #228. Did the model return an empty output?
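
One way to check for this, as a minimal sketch: the traceback shows `is_contains_chinese` iterating over a value that is `None`, so either a prediction or a reference string reaching the ROUGE step is missing. The helpers below (`find_empty_predictions`, `contains_chinese`) are hypothetical names for illustration and are not part of evalscope. They assume the model outputs can be collected into a plain Python list.

```python
import re
from typing import List, Optional, Sequence

# Hypothetical helpers for illustration only; not evalscope APIs.
# Matches characters in the CJK Unified Ideographs block.
_CJK = re.compile(r"[\u4e00-\u9fff]")


def contains_chinese(text: Optional[str]) -> bool:
    """Return True if text holds at least one CJK ideograph.

    None and empty strings are treated as "no Chinese" instead of
    raising TypeError when iterated.
    """
    if not text:
        return False
    return _CJK.search(text) is not None


def find_empty_predictions(predictions: Sequence[Optional[str]]) -> List[int]:
    """Return the indices of predictions that are None or whitespace-only."""
    return [
        i for i, p in enumerate(predictions)
        if p is None or not str(p).strip()
    ]


if __name__ == "__main__":
    preds = ["北京是中国的首都", None, "  ", "Paris"]
    print(find_empty_predictions(preds))  # [1, 2]
    print(contains_chinese(preds[0]))     # True
    print(contains_chinese(None))         # False
```

If the reported indices cluster around the entry where the run starts failing, that would support the empty-output explanation. Skipping or replacing those entries with an empty string before scoring is one possible workaround, though whether that is appropriate depends on how the evaluation should treat missing answers.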

Yunnglin self-assigned this Dec 16, 2024