You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: index.md
+2-2
Original file line number
Diff line number
Diff line change
@@ -67,8 +67,8 @@ Large language model (LLM) agents have been an important frontier in AI, however
67
67
| Feb 17th |*No Class - Presidents' Day*||
68
68
| Feb 24th‡ |**Open Training Recipes for Reasoning in Language Models** <br> Hanna Hajishirzi, University of Washington <br> <ahref="https://rdi.berkeley.edu/adv-llm-agents/slides/OLMo-Tulu-Reasoning-Hanna.pdf">Slides</a> | - [Tulu 3: Pushing Frontiers in Open Language Model Post-Training](https://arxiv.org/abs/2411.15124) <br> - [Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback](https://arxiv.org/abs/2406.09279) <br> - [OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs](https://arxiv.org/abs/2411.14199)|
69
69
| Mar 3rd |**Coding Agents and AI for Vulnerability Detection** <br> Charles Sutton, Google DeepMind <br> <ahref="https://rdi.berkeley.edu/adv-llm-agents/slides/Code Agents and AI for Vulnerability Detection.pdf">Slides</a> | - [Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities](https://arxiv.org/abs/2409.16165) <br> - [From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code](https://googleprojectzero.blogspot.com/2024/10/from-naptime-to-big-sleep.html)|
70
-
| Mar 10th‡ |**Web Agents** <br> Ruslan Salakhutdinov, CMU/Meta <br> <ahref="https://rdi.berkeley.edu/adv-llm-agents/slides/ruslan-multimodal.pdf">Slides</a> | - [Mind2Web: Towards a Generalist Agent for the Web](https://arxiv.org/abs/2306.06070) <br> - [WebArena: A Realistic Web Environment for Building Autonomous Agents](https://arxiv.org/abs/2307.13854) <br> - [VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks](https://jykoh.com/vwa) <br> - [Tree Search for Language Model Agents](https://jykoh.com/search-agents)|
71
-
| Mar 17th |**Multimodal Agents** <br> Caiming Xiong, Salesforce AI Research ||
70
+
| Mar 10th‡ |**Multimodal Autonomous AI Agents** <br> Ruslan Salakhutdinov, CMU/Meta <br> <ahref="https://rdi.berkeley.edu/adv-llm-agents/slides/ruslan-multimodal.pdf">Slides</a> | - [Mind2Web: Towards a Generalist Agent for the Web](https://arxiv.org/abs/2306.06070) <br> - [WebArena: A Realistic Web Environment for Building Autonomous Agents](https://arxiv.org/abs/2307.13854) <br> - [VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks](https://jykoh.com/vwa) <br> - [Tree Search for Language Model Agents](https://jykoh.com/search-agents)|
71
+
| Mar 17th |**Multimodal Agents – From Perception to Action** <br> Caiming Xiong, Salesforce AI Research | - [OSWORLD: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments](https://arxiv.org/pdf/2404.07972) <br> - [AGUVIS: Unified Pure Vision Agents For Autonomous GUI Interaction](https://arxiv.org/pdf/2412.04454)|
72
72
| Mar 24th |*No Class - Spring Recess*||
73
73
| Mar 31st‡ |**AlphaProof** <br> Thomas Hubert, Google DeepMind <br> <spanstyle="color:red">10am-noon PST</span> ||
74
74
| Apr 7th |**Language models for autoformalization and theorem proving** <br> Kaiyu Yang, Meta FAIR ||
0 commit comments