Problem converting pth to HF model #958

Open
no-execution opened this issue Oct 30, 2024 · 0 comments

Comments

@no-execution

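I completed training by following the workflow in the README.
I trained the Qwen2.5 72B model on 64 GPUs.
This produced a .pth folder containing 64 .pt files in total.
While converting to an HF model, the process suddenly stopped without any error message.

I checked RAM, GPU memory, and CPU usage, and nothing looked abnormal.

The 7B model converts successfully.

From what I can tell, it stops while reading the DeepSpeed .pt files.

Is there any way to fix this?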
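
In case it helps narrow this down: a process that dies with no Python traceback has often been killed by a signal or has crashed in native code, and the exit status usually shows which. Below is a minimal diagnostic sketch, not part of this repo's conversion script. It loads each .pt file in its own child process and reports how that child exited. The names `probe_shards`, `describe_exit`, and `LOAD_SNIPPET` are made up for illustration, and it assumes each file can be opened with a plain `torch.load` on the CPU and that the files sit directly in the checkpoint folder.

```python
import signal
import subprocess
import sys
from pathlib import Path

# Assumption: each shard can be opened with plain torch.load on the CPU.
# weights_only=False allows non-tensor objects on recent PyTorch versions;
# only use it on checkpoints you trust.
LOAD_SNIPPET = (
    "import sys, torch; "
    "torch.load(sys.argv[1], map_location='cpu', weights_only=False)"
)


def describe_exit(returncode):
    """Turn a child process return code into a readable status string."""
    if returncode == 0:
        return "ok"
    if returncode < 0:
        # On POSIX, a negative return code means the child died from a signal.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = f"signal {-returncode}"
        return f"killed by {name}"
    return f"exited with code {returncode}"


def probe_shards(ckpt_dir, load_snippet=LOAD_SNIPPET):
    """Run load_snippet on every *.pt file in ckpt_dir, one child process each."""
    results = {}
    for shard in sorted(Path(ckpt_dir).glob("*.pt")):
        proc = subprocess.run(
            # -X faulthandler makes the child dump a traceback on fatal
            # signals it can still handle, such as a segmentation fault.
            [sys.executable, "-X", "faulthandler", "-c", load_snippet, str(shard)],
            capture_output=True,
            text=True,
        )
        status = describe_exit(proc.returncode)
        results[shard.name] = status
        print(f"{shard.name}: {status}")
        if proc.returncode != 0 and proc.stderr:
            print(proc.stderr.strip())
    return results
```

Calling `probe_shards` on the checkpoint folder prints one line per file. A status such as `killed by SIGKILL` would suggest the process was terminated from outside (for example by a memory limit), while a positive exit code with a traceback on stderr would point at the file itself.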
