Commit c891d05

add dreamgym to index
1 parent 31721a8 commit c891d05

1 file changed

Lines changed: 8 additions & 0 deletions

research-papers/index.html

@@ -70,6 +70,14 @@ <h1 class="heading-2">Research Papers</h1>
 </td>
 <td class="description">Synthesis of Toby Ord's half-life framework with METR's exponential growth analysis. Reveals AI agents fail at constant rate per minute (half-life model) while capabilities double every 7 months. Projects specific reliability thresholds: 90% reliability requires 1/7 task duration reduction, current models achieve 50-minute tasks at 50% success. Predicts month-long task automation by 2030, with practical architecture patterns for current reliability levels.</td>
 </tr>
+<tr>
+<td>
+<a href="dreamgym_report.html" target="_blank" class="paper-link">
+<strong>DreamGym: Scaling Agent Learning via Experience Synthesis</strong>
+</a>
+</td>
+<td class="description">Breakthrough framework for training AI agents through synthetic experience synthesis. Introduces reasoning-based experience model that simulates environment dynamics, enabling scalable reinforcement learning without costly real-world interactions. Achieves 30%+ improvement on non-RL-ready tasks like WebArena using zero real environment interactions, while matching state-of-the-art on traditional benchmarks. Addresses four critical challenges: costly rollouts, limited task diversity, unreliable rewards, and infrastructure complexity.</td>
+</tr>
 <tr>
 <td>
 <a href="deepseek_ocr_report.html" target="_blank" class="paper-link">
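
The unchanged context row in this hunk quotes a half-life model with a 50-minute horizon at 50% success and a "1/7" duration reduction for 90% reliability. As a rough sanity check of that arithmetic, here is a minimal sketch that assumes the usual constant-failure-rate form, S(t) = 0.5^(t / T50). The function names are hypothetical and nothing below comes from the commit itself.

```python
import math


def success_probability(task_minutes: float, half_life_minutes: float) -> float:
    """Constant-hazard (half-life) model: success odds halve every half-life."""
    return 0.5 ** (task_minutes / half_life_minutes)


def horizon_for_reliability(reliability: float, half_life_minutes: float) -> float:
    """Longest task duration (minutes) that still succeeds with this probability."""
    return half_life_minutes * math.log(reliability) / math.log(0.5)


# Assumption: 50 minutes is the 50%-success horizon quoted in the description.
HALF_LIFE = 50.0

h90 = horizon_for_reliability(0.90, HALF_LIFE)
print(round(h90, 1))              # 7.6 (minutes at 90% reliability)
print(round(HALF_LIFE / h90, 2))  # 6.58 (shrink factor, roughly the quoted 1/7)
```

Under this assumption the 90% horizon is the 50% horizon divided by about 6.6, which is consistent with the rounded "1/7" figure in the description text.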
