Skip to content
Change the repository type filter

All

    Repositories list

    • Build datasets using natural language
      Python
      Apache License 2.0
      1417462Updated Dec 24, 2024Dec 24, 2024
    • Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
      Python
      Apache License 2.0
      1441.8k7718Updated Dec 24, 2024Dec 24, 2024
    • argilla

      Public
      Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
      Python
      Apache License 2.0
      3884.1k10221Updated Dec 24, 2024Dec 24, 2024
    • Python
      Apache License 2.0
      0000Updated Dec 10, 2024Dec 10, 2024
    • Simple examples using Argilla tools to build AI
      Jupyter Notebook
      Apache License 2.0
      64802Updated Nov 18, 2024Nov 18, 2024
    • A public repo that contains integrations for Argilla and LlamaIndex.
      Python
      Apache License 2.0
      01300Updated Oct 10, 2024Oct 10, 2024
    • argilla-python

      Public archive
      The Argilla API python SDK
      Python
      Apache License 2.0
      1901Updated Sep 30, 2024Sep 30, 2024
    • spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
      Python
      MIT License
      1925242Updated Sep 3, 2024Sep 3, 2024
    • Apache License 2.0
      0100Updated Aug 1, 2024Aug 1, 2024
    • Building a chatbot for Argilla SDK Step by step
      Jupyter Notebook
      Apache License 2.0
      3301Updated Jul 25, 2024Jul 25, 2024
    • Python
      0300Updated Jul 9, 2024Jul 9, 2024
    • Let's build better datasets, together!
      Jupyter Notebook
      29009Updated Jul 6, 2024Jul 6, 2024
    • A working repository for experimental pipelines in distilabel
      Jupyter Notebook
      17014Updated Jul 6, 2024Jul 6, 2024
    • [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
      Python
      Apache License 2.0
      800014Updated Jul 5, 2024Jul 5, 2024
    • Python
      0200Updated Jun 21, 2024Jun 21, 2024
    • A proof of concept for integration with datasets repos
      Python
      0000Updated Jun 14, 2024Jun 14, 2024
    • argilla-server

      Public archive
      A Python native FastAPI server for the Argilla backend.
      Python
      Apache License 2.0
      9902Updated Jun 14, 2024Jun 14, 2024
    • A Gradio app to monitor a collective effort from the Open Source AI Community to understand and collect good quality and diverse prompts.
      Python
      Apache License 2.0
      1007Updated May 21, 2024May 21, 2024
    • orpo

      Public
      Official repository for ORPO
      Python
      40002Updated May 6, 2024May 6, 2024
    • cookbook

      Public
      Jupyter Notebook
      MIT License
      301000Updated Apr 14, 2024Apr 14, 2024
    • A public repo that contains integrations for Argilla and Haystack.
      Python
      Apache License 2.0
      0420Updated Apr 9, 2024Apr 9, 2024
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      Apache License 2.0
      1.4k000Updated Apr 9, 2024Apr 9, 2024
    • A repo that implements Stanford CRFM their HELM Instruct with adaptable evaluation criteria
      Jupyter Notebook
      Apache License 2.0
      1100Updated Mar 24, 2024Mar 24, 2024
    • Repository containing the SPIN experiments on the DIBT 10k ranked prompts
      Python
      Apache License 2.0
      02400Updated Mar 12, 2024Mar 12, 2024
    • .github

      Public
      ✨ Argilla: the open-source feedback platform for LLMs
      Apache License 2.0
      0000Updated Feb 27, 2024Feb 27, 2024
    • dill

      Public
      serialize all of Python
      Python
      Other
      181000Updated Feb 24, 2024Feb 24, 2024
    • notus

      Public
      Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
      Python
      MIT License
      1416311Updated Jan 15, 2024Jan 15, 2024
    • Shell
      0100Updated Jan 9, 2024Jan 9, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      Apache License 2.0
      4.6k000Updated Jan 7, 2024Jan 7, 2024
    • chat-ui

      Public
      Open source codebase powering the HuggingChat app
      TypeScript
      Apache License 2.0
      1.2k100Updated Jan 3, 2024Jan 3, 2024