Skip to content
Change the repository type filter

All

    Repositories list

    • A framework for few-shot evaluation of language models.
      Python
      2.7k100Updated Sep 14, 2025Sep 14, 2025
    • s1

      Public
      s1: Simple test-time scaling
      Python
      7636.6k653Updated Jun 25, 2025Jun 25, 2025
    • JavaScript
      0200Updated Feb 11, 2025Feb 11, 2025