-
Notifications
You must be signed in to change notification settings - Fork 379
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ziya2预训练的语料拼接是如何通过attention mask规避的 #444
Comments
同问+1 |
同问+1 |
只需要把当前token前面不属于同一个doc的token对应的attention_mask设置成0即可,不同doc通过eos即可区分。 |
|
可以用flash attention triton |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The text was updated successfully, but these errors were encountered: