Skip to content
View ar0cket1's full-sized avatar
🤖
🤖

Highlights

  • Pro

Block or report ar0cket1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. test-time-rl-discover-autoresearch test-time-rl-discover-autoresearch Public

    Forked from karpathy/autoresearch

    Test Time RL Discover + Auto Research

    Python 17 1

  2. Hermes-Agent-Online-RL Hermes-Agent-Online-RL Public

    Online RL for Hermes Agent — self-improving LoRA adapters from human feedback using MIS-PO

    Python 13 1

  3. hermes-research-agent hermes-research-agent Public

    Research-focused fork of Hermes Agent for autonomous end-to-end LLM research loops.

    Python 6 1

  4. nanochat-attenresiduals nanochat-attenresiduals Public

    Forked from karpathy/nanochat

    The best ChatGPT that $100 can buy.

    Python 1

  5. codex-onlinerl codex-onlinerl Public

    Forked from openai/codex

    online rl integration into codex

    Rust