FellowTraveler

Organizations

@Open-Transactions

Stars

inference engine

12 repositories

FastMLX is a high performance production ready API to host MLX models.

Python · 265 stars · 32 forks · Updated Nov 29, 2024

a self-hosted webui for 30+ generative ai

Python · 559 stars · 67 forks · Updated Mar 5, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript · 40,099 stars · 3,845 forks · Updated Mar 4, 2025

LLM inference in C/C++

C++ · 75,916 stars · 10,978 forks · Updated Mar 6, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go · 131,241 stars · 10,768 forks · Updated Mar 6, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python · 18,424 stars · 2,280 forks · Updated Mar 6, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python · 7,484 stars · 768 forks · Updated Mar 5, 2025

automatically quant GGUF models

Python · 157 stars · 13 forks · Updated Mar 5, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 40,372 stars · 6,059 forks · Updated Mar 6, 2025

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python · 331 stars · 48 forks · Updated Mar 6, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig · 2,117 stars · 78 forks · Updated Mar 5, 2025

Distribute and run LLMs with a single file.

C++ · 21,885 stars · 1,150 forks · Updated Mar 4, 2025