Skip to content

Commit 6b8a2c4

Browse files
authored
Update index.md
1 parent 4962689 commit 6b8a2c4

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

index.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -65,7 +65,7 @@ Large language model (LLM) agents have been an important frontier in AI, however
6565
| Feb 3rd‡ | **Learning to reason with LLMs** <br> Jason Weston, Meta <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/Jason-Weston-Reasoning-Alignment-Berkeley-Talk.pdf">Slides</a> | - [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://arxiv.org/abs/2305.18290) <br> - [Iterative Reasoning Preference Optimization](https://arxiv.org/abs/2404.19733) <br> - [Chain-of-Verification Reduces Hallucination in Large Language Models](https://arxiv.org/abs/2309.11495) |
6666
| Feb 10th‡ | **On Reasoning, Memory, and Planning of Language Agents** <br> Yu Su, Ohio State University <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/language_agents_YuSu_Berkeley.pdf">Slides</a> | - [Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization](https://arxiv.org/abs/2405.15071) <br> - [HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models](https://arxiv.org/abs/2405.14831) <br> - [Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents](https://arxiv.org/abs/2411.06559) |
6767
| Feb 17th | *No Class - Presidents' Day* | |
68-
| Feb 24th‡ | **Open Training Recipes for Reasoning and Agents in Language Models** <br> Hanna Hajishirzi, University of Washington <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/OLMo-Tulu-Reasoning-Hanna.pdf">Slides</a> | - [Tulu 3: Pushing Frontiers in Open Language Model Post-Training](https://arxiv.org/abs/2411.15124) <br> - [Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback](https://arxiv.org/abs/2406.09279) <br> - [OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs](https://arxiv.org/abs/2411.14199) |
68+
| Feb 24th‡ | **Open Training Recipes for Reasoning in Language Models** <br> Hanna Hajishirzi, University of Washington <br> <a href="https://rdi.berkeley.edu/adv-llm-agents/slides/OLMo-Tulu-Reasoning-Hanna.pdf">Slides</a> | - [Tulu 3: Pushing Frontiers in Open Language Model Post-Training](https://arxiv.org/abs/2411.15124) <br> - [Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback](https://arxiv.org/abs/2406.09279) <br> - [OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs](https://arxiv.org/abs/2411.14199) |
6969
| Mar 3rd | **Coding Agents and AI for Vulnerability Detection** <br> Charles Sutton, Google DeepMind | |
7070
| Mar 10th‡ | **Coding agents/web agents** <br> Ruslan Salakhutdinov, CMU/Meta | |
7171
| Mar 17th | **Multimodal Agents** <br> Caiming Xiong, Salesforce AI Research | |

0 commit comments

Comments
 (0)