Skip to content
Change the repository type filter

All

    Repositories list

    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      Apache License 2.0
      2516942Updated Apr 23, 2025Apr 23, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      Apache License 2.0
      2.2k100Updated Apr 22, 2025Apr 22, 2025
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      MIT License
      7052000Updated Apr 22, 2025Apr 22, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331991510Updated Apr 21, 2025Apr 21, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1k7.2k6426Updated Apr 20, 2025Apr 20, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.3k8.7k379116Updated Apr 18, 2025Apr 18, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      1100Updated Apr 18, 2025Apr 18, 2025
    • rllm

      Public
      Democratizing Reinforcement Learning for LLMs
      Jupyter Notebook
      MIT License
      285000Updated Apr 16, 2025Apr 16, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      2.7k000Updated Apr 15, 2025Apr 15, 2025
    • tyche

      Public
      Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0600Updated Apr 9, 2025Apr 9, 2025
    • rtopk

      Public
      Cuda
      MIT License
      0100Updated Apr 5, 2025Apr 5, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      Apache License 2.0
      4.3k16601Updated Apr 1, 2025Apr 1, 2025
    • ccs

      Public
      Python
      MIT License
      6614Updated Mar 21, 2025Mar 21, 2025
    • MIT License
      0000Updated Mar 17, 2025Mar 17, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1832.5k265Updated Mar 13, 2025Mar 13, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      4078690Updated Mar 3, 2025Mar 3, 2025
    • cupbearer

      Public
      A library for mechanistic anomaly detection
      Jupyter Notebook
      MIT License
      10600Updated Feb 26, 2025Feb 26, 2025
    • clearnets

      Public
      Python
      MIT License
      0400Updated Feb 18, 2025Feb 18, 2025
    • Closed-form polynomial approximations to neural networks
      Python
      MIT License
      01200Updated Jan 31, 2025Jan 31, 2025
    • Experiments in transformer knowledge and reasoning
      Jupyter Notebook
      MIT License
      191000Updated Jan 30, 2025Jan 30, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      Apache License 2.0
      407000Updated Jan 29, 2025Jan 29, 2025
    • Acompanying code for our research on SAE feature overlap when trained on different seeds.
      Jupyter Notebook
      Apache License 2.0
      1300Updated Jan 28, 2025Jan 28, 2025
    • mdl

      Public
      Minimum Description Length probing for neural network representations
      Python
      MIT License
      21902Updated Jan 28, 2025Jan 28, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      Apache License 2.0
      1100Updated Jan 27, 2025Jan 27, 2025
    • Erasing concepts from neural representations with provable guarantees
      Python
      MIT License
      1522732Updated Jan 27, 2025Jan 27, 2025
    • aria

      Public
      Python
      Apache License 2.0
      114600Updated Dec 24, 2024Dec 24, 2024
    • Jupyter Notebook
      MIT License
      0400Updated Dec 14, 2024Dec 14, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      5402Updated Dec 12, 2024Dec 12, 2024
    • Jupyter Notebook
      Apache License 2.0
      22100Updated Dec 11, 2024Dec 11, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      93600Updated Dec 2, 2024Dec 2, 2024