Skip to content

feat: add Qwen2.5-0.5B-Instruct#38

Merged
RK181 merged 1 commit into
develfrom
feat-add-kserve-qwen2-5-cpu
Jun 10, 2026
Merged

feat: add Qwen2.5-0.5B-Instruct#38
RK181 merged 1 commit into
develfrom
feat-add-kserve-qwen2-5-cpu

Conversation

@RK181

@RK181 RK181 commented Jun 10, 2026

Copy link
Copy Markdown
Contributor

This pull request introduces a new example service for deploying the Qwen2.5-0.5B-Instruct large language model on OSCAR using KServe and vLLM (CPU-only). It provides all the necessary configuration, Dockerfiles, and metadata to build, deploy, and document the service, enabling efficient LLM inference with OpenAI-compatible endpoints.

@RK181 RK181 merged commit 206ae71 into devel Jun 10, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants