Releases
v6.1.138
Algorithms
New features
Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetConvolution16bNhwcDirect.
SimdCpuInfoCurrentFrequency in SimdCpuInfoType enumeration.
API of SynetMergedConvolution16b framework.
Base implementation of class SynetMergedConvolution16b.
Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bDc.
Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bCd.
Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bCdc.
Support of YUV420P format to Simd::Frame.
Improving
AVX-512BF16 optimizations of class SynetConvolution16bNhwcGemm.
Bug fixing
Errors in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetConvolution16bNhwcGemm.
Error in Base implementation of class SynetMergedConvolution8i.
Test framework
New features
-wu command line option to set CPU warm up time in milliseconds.
Tests for verifying functionality of SynetMergedConvolution16b framework.
Infrastructure
Bug fixing
Errors in build_and_test_gcc section in Github actions script for CMake.
You can’t perform that action at this time.