Add metadata for deepseek-ai/DeepSeek-R1-0528-Qwen3-8B #34
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,50 @@ | ||
| apiVersion: model.hydra.io/v1alpha1 | ||
| kind: ModelSpec | ||
| metadata: | ||
| name: deepseek-r1-0528-qwen3-8b | ||
| spec: | ||
| deployments: | ||
| - customRuntimeArgs: [] | ||
| resourceRequirements: | ||
| cpu: 8 | ||
| gpuCount: 8 | ||
| gpuType: nvidia-vgpu | ||
| memory: 640 | ||
| perGPUMemoryGB: 80 | ||
| runtime: vllm | ||
| versionRequired: '>=0.8.5' | ||
| descriptor: | ||
| description: | ||
| enUS: The DeepSeek R1 model has undergone a minor version upgrade, with the | ||
| current version being DeepSeek-R1-0528. In the latest update, DeepSeek R1 | ||
| has significantly improved its depth of reasoning and inference capabilities | ||
| by leveraging increased computational resources and introducing algorithmic | ||
| optimization mechanisms during post-training. The model has demonstrated outstanding | ||
| performance across various benchmark evaluations, including mathematics, programming, | ||
| and general logic. | ||
| zhCN: The DeepSeek R1 model has undergone a minor version upgrade, with the | ||
| current version being DeepSeek-R1-0528. In the latest update, DeepSeek R1 | ||
| has significantly improved its depth of reasoning and inference capabilities | ||
| by leveraging increased computational resources and introducing algorithmic | ||
| optimization mechanisms during post-training. The model has demonstrated outstanding | ||
| performance across various benchmark evaluations, including mathematics, programming, | ||
| and general logic. | ||
|
Comment on lines +25 to +31: The Chinese description provided under `zhCN` appears to still be the English text. Could you please replace this with the correct Chinese translation for the model description? I've added a placeholder suggestion. |
||
| display: DeepSeek-R1-0528-Qwen3-8B | ||
| icon: | ||
| src: https://public-resources.d.run/models/logos/deepseek-model-logo.svg | ||
| type: image/svg | ||
| links: | ||
| - description: About | ||
| url: https://www.deepseek.com/ | ||
| provider: | ||
| id: deepseek | ||
| name: | ||
| enUS: DeepSeek | ||
| zhCN: 深度求索 | ||
| tags: | ||
| - TEXT_GENERATION | ||
| source: | ||
| huggingface: | ||
| name: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | ||
| modelscope: | ||
| name: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | ||
The `gpuType` value `nvidia-vgpu` appears to be incorrect. Based on the `ModelSpecGPUType` enum defined in `api/v1alpha1/modelspec_types.go` (lines 27-28), the valid string values are `"gpu"` or `"vgpu"`. Could you please update this to the appropriate value? Assuming this model is intended for virtual GPUs, `"vgpu"` would be the correct choice.
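
For reference, the change requested in this comment might look like the fragment below. This is only a sketch: it assumes virtual GPUs are the intended target, as the comment does, and it assumes `resourceRequirements` sits under the deployment entry with the indentation shown, since the flattened diff does not preserve nesting. All other values are copied unchanged from the diff.

```yaml
resourceRequirements:
  cpu: 8
  gpuCount: 8
  gpuType: vgpu  # was nvidia-vgpu; the review cites "gpu" and "vgpu" as the only valid values
  memory: 640
  perGPUMemoryGB: 80
```

If physical GPUs are intended instead, `gpu` would be the other value the review comment lists as valid.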