We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
请问二值化的linear和matmul操作是否支持?
The text was updated successfully, but these errors were encountered:
目前不支持,这个是cnn网络吗?可以换用等价写法,用conv 1x1,注意输出通道数是32倍数
Sorry, something went wrong.
是linear层。主要是想了解一下transformer二值化的可能性,当然全都换成conv1*1也不是不行。
实现:bolt目前没写1bit linear,可以开issue加,或者临时用替代方案 算法:目前我们这边只有cnn做到1bit,NLP的bert 2w8a
No branches or pull requests
请问二值化的linear和matmul操作是否支持?
The text was updated successfully, but these errors were encountered: