We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tensors[key].detach().cpu()
tensors[key].detach().cpu() KeyErrorKeyErrorKeyErrorKeyError: : : : tensors[key].detach().cpu()'exp_avg' 'exp_avg''exp_avg''exp_avg'tensors[key].detach().cpu()
The text was updated successfully, but these errors were encountered:
您好,这个问题我遇到过,貌似就是第二阶段加载的时候不加载优化器参数就可以了
Sorry, something went wrong.
多谢~~我已经加了no-load-optim参数,不起作用。。应该咋操作呢
断点需要不需要优化器状态吗?
您好,这个问题我遇到过,貌似就是第二阶段加载的时候不加载优化器参数就可以了 多谢~~我已经加了no-load-optim参数,不起作用。。应该咋操作呢
请问最后怎么解决的呢?
No branches or pull requests
py", line 757, in get_parameter_state_dp_zero
state_dict = optimizer.get_parameter_state_dp_zero()
File "/nas-wulanchabu/tanfan.zjh/Pai-Megatron-Patch/Megatron-LM-240405/megatron/core/optimizer/distrib_optimizer.py", line 757, in get_parameter_state_dp_zero
tensors[key].detach().cpu()
tensors[key].detach().cpu()
KeyError: 'exp_avg'
KeyError: 'exp_avg'tensors[key].detach().cpu() tensors[key].detach().cpu()
tensors[key].detach().cpu()
tensors[key].detach().cpu()
KeyErrorKeyErrorKeyErrorKeyError: : : : tensors[key].detach().cpu()'exp_avg' 'exp_avg''exp_avg''exp_avg'tensors[key].detach().cpu()
The text was updated successfully, but these errors were encountered: