Skip to content

ydai-hub/deep_research_bench_results

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Deep Research Benchmark Results

iFlow Team, Alibaba

These are the evaluation results for iFlow on the Deep Research Bench Leaderboard.

Performance Metrics

Metric Score
Overall 51.62
Comprehensiveness 52.75
Insight 51.77
Instruction Following 50.91
Readability 50.01

About Deep Research Bench

Deep Research Bench is a benchmark for evaluating deep research capabilities, focusing on:

  • Comprehensiveness: Breadth and depth of research content
  • Insight: Quality of analysis and unique perspectives
  • Instruction Following: Understanding and execution of user requirements
  • Readability: Clarity and organization of output

More Information

Visit the Deep Research Bench Leaderboard for the complete rankings and detailed evaluation criteria.


Deep Research Benchmark 评测结果

iFlow 团队,阿里巴巴

这是 iFlow 在 Deep Research Bench Leaderboard 上的评测结果。

性能指标

指标 得分
总体得分 51.62
全面性 (Comprehensiveness) 52.75
洞察力 (Insight) 51.77
指令遵循 (Instruction Following) 50.91
可读性 (Readability) 50.01

关于 Deep Research Bench

Deep Research Bench 是一个用于评估深度研究能力的基准测试,主要考察 AI 系统在以下方面的表现:

  • 全面性:研究内容的广度和深度
  • 洞察力:分析的质量和独特见解
  • 指令遵循:对用户需求的理解和执行
  • 可读性:输出内容的清晰度和组织结构

更多信息

访问 Deep Research Bench Leaderboard 查看完整排行榜和详细评测标准。

About

deep_research_bench_results

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published