Skip to content
@FMInference

Foundation Model Inference

Inference Systems for Foundation Models

Pinned Loading

  1. FlexGen FlexGen Public

    Running large language models on a single GPU for throughput-oriented scenarios.

    Python 9.1k 528

Repositories

Showing 3 of 3 repositories
  • H2O Public

    [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

    FMInference/H2O’s past year of commit activity
    Python 308 27 24 0 Updated Jun 18, 2024
  • FlexGen Public

    Running large language models on a single GPU for throughput-oriented scenarios.

    FMInference/FlexGen’s past year of commit activity
    Python 9,064 Apache-2.0 528 49 (3 issues need help) 7 Updated Apr 19, 2024
  • DejaVu Public
    FMInference/DejaVu’s past year of commit activity
    Python 240 31 20 1 Updated Apr 2, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…