Skip to content

Conversation

@acse-yz4421
Copy link

全文的代码都有dropout层,但是没有阐述,结构示意图也没有。因为dropout层只在训练时防止过拟合才加入,原本的transformer结构中并没有。对于初学者需要解释下。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant