Skip to content
Change the repository type filter

All

    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      7.2k001Updated May 1, 2025May 1, 2025
    • DataSc Lab Spring 2025: LLM Programs Blueprints
      Jupyter Notebook
      MIT License
      00284Updated May 1, 2025May 1, 2025
    • 0000Updated May 1, 2025May 1, 2025
    • Python
      0000Updated Apr 30, 2025Apr 30, 2025
    • mmore

      Public
      Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!
      Python
      Apache License 2.0
      1037243Updated Apr 30, 2025Apr 30, 2025
    • 0000Updated Apr 30, 2025Apr 30, 2025
    • llm-proxy

      Public
      LLM Serving and User Control
      JavaScript
      2000Updated Apr 29, 2025Apr 29, 2025
    • olmes

      Public
      Reproducible, flexible LLM evaluations
      Python
      Apache License 2.0
      25000Updated Apr 28, 2025Apr 28, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      Apache License 2.0
      1.6k000Updated Apr 28, 2025Apr 28, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      2.7k4611Updated Apr 24, 2025Apr 24, 2025
    • Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      185000Updated Apr 22, 2025Apr 22, 2025
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      2.3k003Updated Apr 22, 2025Apr 22, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      29k002Updated Apr 10, 2025Apr 10, 2025
    • PDF pipeline for creating training corpora (mainly for llm, multimodal and alignment horizontals)
      Python
      Apache License 2.0
      0400Updated Apr 3, 2025Apr 3, 2025
    • Shell
      0000Updated Mar 7, 2025Mar 7, 2025
    • LLM Serving Engine
      Python
      2000Updated Feb 21, 2025Feb 21, 2025
    • A suite of image and video neural tokenizers
      Jupyter Notebook
      Apache License 2.0
      78000Updated Feb 11, 2025Feb 11, 2025
    • A copy of nanotron for multilingual training
      Python
      Apache License 2.0
      185002Updated Jan 22, 2025Jan 22, 2025
    • lighteval

      Public
      Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
      Python
      MIT License
      234000Updated Jan 13, 2025Jan 13, 2025
    • Jupyter Notebook
      Apache License 2.0
      0000Updated Dec 19, 2024Dec 19, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      185619Updated Dec 2, 2024Dec 2, 2024
    • Containers for multimodal initiative (and maybe more across Swiss AI?)
      Dockerfile
      0000Updated Nov 29, 2024Nov 29, 2024
    • ml-4m

      Public
      4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)
      Python
      Apache License 2.0
      1030134Updated Nov 29, 2024Nov 29, 2024
    • Tool set for data preparation and selection in the context of Swiss-AI (forked from DataTrove)
      Python
      Apache License 2.0
      178001Updated Nov 21, 2024Nov 21, 2024
    • Python
      Apache License 2.0
      0000Updated Nov 6, 2024Nov 6, 2024
    • Easily create large video dataset from video urls
      Python
      MIT License
      70101Updated Oct 14, 2024Oct 14, 2024
    • ml-4m-v2

      Public
      0000Updated Aug 5, 2024Aug 5, 2024
    • MoE

      Public
      some mixture of experts architecture implementations
      Python
      Apache License 2.0
      21310Updated Mar 22, 2024Mar 22, 2024
    • distributed trainer for LLMs
      Python
      Other
      81000Updated Feb 8, 2024Feb 8, 2024