🤖
Highlights
- Pro
Pinned Loading
-
test-time-rl-discover-autoresearch
test-time-rl-discover-autoresearch PublicForked from karpathy/autoresearch
Test Time RL Discover + Auto Research
-
Hermes-Agent-Online-RL
Hermes-Agent-Online-RL PublicOnline RL for Hermes Agent — self-improving LoRA adapters from human feedback using MIS-PO
-
hermes-research-agent
hermes-research-agent PublicResearch-focused fork of Hermes Agent for autonomous end-to-end LLM research loops.
-
nanochat-attenresiduals
nanochat-attenresiduals PublicForked from karpathy/nanochat
The best ChatGPT that $100 can buy.
Python 1
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

