ar0cket1

Follow

🤖

Node ar0cket1

🤖

Follow

Post-training

4 followers · 6 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

test-time-rl-discover-autoresearch test-time-rl-discover-autoresearch Public

Forked from karpathy/autoresearch

Test Time RL Discover + Auto Research

Python 17 1
Hermes-Agent-Online-RL Hermes-Agent-Online-RL Public

Online RL for Hermes Agent — self-improving LoRA adapters from human feedback using MIS-PO

Python 13 1
hermes-research-agent hermes-research-agent Public

Research-focused fork of Hermes Agent for autonomous end-to-end LLM research loops.

Python 6 1
nanochat-attenresiduals nanochat-attenresiduals Public

Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python 1
codex-onlinerl codex-onlinerl Public

Forked from openai/codex

online rl integration into codex

Rust