Skip to content

Commit b763e93

Browse files
authored
Update index.md
1 parent 798a46b commit b763e93

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

Diff for: index.md

+2-2
Original file line numberDiff line numberDiff line change
@@ -67,8 +67,8 @@ Large language model (LLM) agents have been an important frontier in AI, however
6767
| Feb 17th | *No Class - Presidents' Day* | |
6868
| Feb 24th‡ | **Open Training Recipes for Reasoning in Language Models** <br> Hanna Hajishirzi, University of Washington <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/OLMo-Tulu-Reasoning-Hanna.pdf">Slides</a> | - [Tulu 3: Pushing Frontiers in Open Language Model Post-Training](https://arxiv.org/abs/2411.15124) <br> - [Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback](https://arxiv.org/abs/2406.09279) <br> - [OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs](https://arxiv.org/abs/2411.14199) |
6969
| Mar 3rd | **Coding Agents and AI for Vulnerability Detection** <br> Charles Sutton, Google DeepMind <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/Code Agents and AI for Vulnerability Detection.pdf">Slides</a> | - [Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities](https://arxiv.org/abs/2409.16165) <br> - [From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code](https://googleprojectzero.blogspot.com/2024/10/from-naptime-to-big-sleep.html) |
70-
| Mar 10th‡ | **Web Agents** <br> Ruslan Salakhutdinov, CMU/Meta <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/ruslan-multimodal.pdf">Slides</a> | - [Mind2Web: Towards a Generalist Agent for the Web](https://arxiv.org/abs/2306.06070) <br> - [WebArena: A Realistic Web Environment for Building Autonomous Agents](https://arxiv.org/abs/2307.13854) <br> - [VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks](https://jykoh.com/vwa) <br> - [Tree Search for Language Model Agents](https://jykoh.com/search-agents) |
71-
| Mar 17th | **Multimodal Agents** <br> Caiming Xiong, Salesforce AI Research | |
70+
| Mar 10th‡ | **Multimodal Autonomous AI Agents** <br> Ruslan Salakhutdinov, CMU/Meta <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/ruslan-multimodal.pdf">Slides</a> | - [Mind2Web: Towards a Generalist Agent for the Web](https://arxiv.org/abs/2306.06070) <br> - [WebArena: A Realistic Web Environment for Building Autonomous Agents](https://arxiv.org/abs/2307.13854) <br> - [VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks](https://jykoh.com/vwa) <br> - [Tree Search for Language Model Agents](https://jykoh.com/search-agents) |
71+
| Mar 17th | **Multimodal Agents – From Perception to Action** <br> Caiming Xiong, Salesforce AI Research | - [OSWORLD: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments](https://arxiv.org/pdf/2404.07972) <br> - [AGUVIS: Unified Pure Vision Agents For Autonomous GUI Interaction](https://arxiv.org/pdf/2412.04454) |
7272
| Mar 24th | *No Class - Spring Recess* | |
7373
| Mar 31st‡ | **AlphaProof** <br> Thomas Hubert, Google DeepMind <br> <span style="color:red">10am-noon PST</span> | |
7474
| Apr 7th | **Language models for autoformalization and theorem proving** <br> Kaiyu Yang, Meta FAIR | |

0 commit comments

Comments
 (0)