Learning in LLMs and MLsys, recently focused on RL training
- 💬 Personal Website: https://yushengsu-thu.github.io/
- Google Scholar: https://scholar.google.com/citations?user=xwy6Va4AAAAJ
- 📫 E-mail: [email protected]
Learning in LLMs and MLsys, recently focused on RL training
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
On Transferability of Prompt Tuning for Natural Language Processing
Forked from THUDM/slime
slime is a LLM post-training framework aiming at scaling RL.
Python 1
Forked from academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
My learning notes/codes for ML SYS.
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python 1