Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
50 changes: 50 additions & 0 deletions models/deepseek-ai/deepseek-r1-0528-qwen3-8b-/metadata.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
apiVersion: model.hydra.io/v1alpha1
kind: ModelSpec
metadata:
name: deepseek-r1-0528-qwen3-8b
spec:
deployments:
- customRuntimeArgs: []
resourceRequirements:
cpu: 8
gpuCount: 8
gpuType: nvidia-vgpu

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

critical

The gpuType value nvidia-vgpu appears to be incorrect. Based on the ModelSpecGPUType enum defined in api/v1alpha1/modelspec_types.go (lines 27-28), the valid string values are "gpu" or "vgpu".

Could you please update this to the appropriate value? Assuming this model is intended for virtual GPUs, "vgpu" would be the correct choice.

      gpuType: vgpu

memory: 640
perGPUMemoryGB: 80
runtime: vllm
versionRequired: '>=0.8.5'
descriptor:
description:
enUS: The DeepSeek R1 model has undergone a minor version upgrade, with the
current version being DeepSeek-R1-0528. In the latest update, DeepSeek R1
has significantly improved its depth of reasoning and inference capabilities
by leveraging increased computational resources and introducing algorithmic
optimization mechanisms during post-training. The model has demonstrated outstanding
performance across various benchmark evaluations, including mathematics, programming,
and general logic.
zhCN: The DeepSeek R1 model has undergone a minor version upgrade, with the
current version being DeepSeek-R1-0528. In the latest update, DeepSeek R1
has significantly improved its depth of reasoning and inference capabilities
by leveraging increased computational resources and introducing algorithmic
optimization mechanisms during post-training. The model has demonstrated outstanding
performance across various benchmark evaluations, including mathematics, programming,
and general logic.
Comment on lines +25 to +31

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The Chinese description provided under zhCN seems to be an exact duplicate of the English description (enUS). It's important to provide an accurate translation for users in the respective language.

Could you please replace this with the correct Chinese translation for the model description? I've added a placeholder suggestion.

      zhCN: "请在此处填写模型的中文描述."

display: DeepSeek-R1-0528-Qwen3-8B
icon:
src: https://public-resources.d.run/models/logos/deepseek-model-logo.svg
type: image/svg
links:
- description: About
url: https://www.deepseek.com/
provider:
id: deepseek
name:
enUS: DeepSeek
zhCN: 深度求索
tags:
- TEXT_GENERATION
source:
huggingface:
name: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
modelscope:
name: deepseek-ai/DeepSeek-R1-0528-Qwen3-8B