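Following your issue #55, I implemented an OpenCL version of the code that runs on a mobile GPU. I tried changing `m_tile` and `n_tile` to 4 and 8, which performed better than the previous `m_tiles=8`, `n_tiles=4`. Are there any other optimization techniques you could suggest?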
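You can refer to the other tuning-related issues, but first it is worth checking whether the current kernel already reaches the peak compute throughput and memory bandwidth you expect. For those numbers you need to consult the hardware documentation.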
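As a rough illustration of that check, the sketch below compares a measured GEMM time against peak figures. It assumes an M×N×K single-precision matrix multiply; the function name `gemm_efficiency`, the problem size, the timing, and both peak values are hypothetical placeholders, and the real peaks have to come from the vendor's documentation for your GPU.

```python
def gemm_efficiency(m, n, k, seconds, peak_gflops, peak_gbps, bytes_per_elem=4):
    """Compare a measured GEMM run against assumed hardware peaks.

    All peak values are supplied by the caller; nothing here is
    specific to any particular GPU.
    """
    # A GEMM performs one multiply and one add per (m, n, k) triple.
    flops = 2.0 * m * n * k
    # Lower bound on traffic: read A and B once, write C once.
    # A tiled kernel reloads data, so real traffic is usually higher.
    min_bytes = bytes_per_elem * (m * k + k * n + m * n)

    achieved_gflops = flops / seconds / 1e9
    achieved_gbps = min_bytes / seconds / 1e9
    return {
        "achieved_gflops": achieved_gflops,
        "compute_fraction": achieved_gflops / peak_gflops,
        "achieved_gbps": achieved_gbps,
        "bandwidth_fraction": achieved_gbps / peak_gbps,
    }


if __name__ == "__main__":
    # Hypothetical numbers: a 1024^3 GEMM measured at 20 ms on a GPU
    # with an assumed 500 GFLOP/s peak and 30 GB/s memory bandwidth.
    report = gemm_efficiency(1024, 1024, 1024, seconds=0.02,
                             peak_gflops=500.0, peak_gbps=30.0)
    for name, value in report.items():
        print(f"{name}: {value:.4f}")
```

If `compute_fraction` is already close to 1, further tile tuning is unlikely to help much. If it is low, the kernel still has headroom. Because `achieved_gbps` only counts the minimum traffic, a low `bandwidth_fraction` does not by itself rule out a memory bottleneck.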