rlhf
Here are 130 public repositories matching this topic...
Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.
Updated Jan 5, 2024 - Python
Survey of preference alignment algorithms
Updated Feb 25, 2024
Intelligent AI chatbot that can learn from the user
Updated Mar 22, 2024 - Python
Some experiments with activation steering in LLMs
Updated Jan 21, 2024 - Python
Researching the reinforcement learning algorithm of ChatGPT
Updated Apr 7, 2023 - Jupyter Notebook
Reinforcement Learning Tutorial (强化学习教程)
Updated Sep 10, 2023
This repository implements several key tasks that underlie modern generative AI concepts. In particular, we focus on three coding tasks for large language models. Further details are given in the README.md file.
Updated Mar 28, 2024 - Jupyter Notebook
Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.
Updated May 7, 2024 - Jupyter Notebook
This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLMs in a practical and efficient way.
Updated Jun 14, 2024 - Jupyter Notebook
Improving LLM truthfulness via reporting confidence
Updated Jun 9, 2024 - Python
Large Language Model for Competitive Programming
Updated Apr 28, 2023 - Python
JavaScript client library for managing your LLM data in one place
Updated May 3, 2023 - JavaScript
Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
Updated Apr 16, 2023 - Python