Deep Research Benchmark Results

iFlow Team, Alibaba

These are the evaluation results for iFlow on the Deep Research Bench Leaderboard.

Performance Metrics

Metric	Score
Overall	51.62
Comprehensiveness	52.75
Insight	51.77
Instruction Following	50.91
Readability	50.01

About Deep Research Bench

Deep Research Bench is a benchmark for evaluating deep research capabilities, focusing on:

Comprehensiveness: Breadth and depth of research content
Insight: Quality of analysis and unique perspectives
Instruction Following: Understanding and execution of user requirements
Readability: Clarity and organization of output

More Information

Visit the Deep Research Bench Leaderboard for the complete rankings and detailed evaluation criteria.

Deep Research Benchmark 评测结果

iFlow 团队，阿里巴巴

这是 iFlow 在 Deep Research Bench Leaderboard 上的评测结果。

性能指标

指标	得分
总体得分	51.62
全面性 (Comprehensiveness)	52.75
洞察力 (Insight)	51.77
指令遵循 (Instruction Following)	50.91
可读性 (Readability)	50.01

关于 Deep Research Bench

Deep Research Bench 是一个用于评估深度研究能力的基准测试，主要考察 AI 系统在以下方面的表现：

全面性：研究内容的广度和深度
洞察力：分析的质量和独特见解
指令遵循：对用户需求的理解和执行
可读性：输出内容的清晰度和组织结构

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
iFlow_deep_research.jsonl		iFlow_deep_research.jsonl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Deep Research Benchmark Results

Performance Metrics

About Deep Research Bench

More Information

Deep Research Benchmark 评测结果

性能指标

关于 Deep Research Bench

更多信息

About

Uh oh!

Releases

Packages

ydai-hub/deep_research_bench_results

Folders and files

Latest commit

History

Repository files navigation

Deep Research Benchmark Results

Performance Metrics

About Deep Research Bench

More Information

Deep Research Benchmark 评测结果

性能指标

关于 Deep Research Bench

更多信息

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages