Simd v6.1.138

@ermig1979 ermig1979 released this 03 Jun 06:53
· 241 commits to master since this release

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetConvolution16bNhwcDirect.
  • SimdCpuInfoCurrentFrequency in SimdCpuInfoType enumeration.
  • API of SynetMergedConvolution16b framework.
  • Base implementation of class SynetMergedConvolution16b.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bDc.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bCd.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetMergedConvolution16bCdc.
  • Support of YUV420P format in Simd::Frame.
Improving
  • AVX-512BF16 optimizations of class SynetConvolution16bNhwcGemm.
Bug fixing
  • Errors in Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16 optimizations of class SynetConvolution16bNhwcGemm.
  • Error in Base implementation of class SynetMergedConvolution8i.
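
For readers unfamiliar with the YUV420P format mentioned above, the following is a minimal sketch of how its three planes are sized. It assumes a tightly packed buffer with no row padding; Yuv420pLayout is a hypothetical helper written for illustration and is not part of the Simd API, and the actual strides and alignment used by Simd::Frame may differ.

```cpp
#include <cstddef>

// Hypothetical helper (not part of Simd): plane sizes of a tightly packed
// YUV420P image, i.e. one full-resolution Y plane followed by U and V planes
// subsampled by 2 horizontally and vertically. Assumes 8 bits per sample.
struct Yuv420pLayout
{
    std::size_t width;
    std::size_t height;

    // Chroma dimensions round up so odd-sized images keep their last column/row.
    constexpr std::size_t chromaWidth() const { return (width + 1) / 2; }
    constexpr std::size_t chromaHeight() const { return (height + 1) / 2; }

    // Size in bytes of the Y plane.
    constexpr std::size_t lumaBytes() const { return width * height; }

    // Size in bytes of one chroma plane (U or V).
    constexpr std::size_t chromaBytes() const { return chromaWidth() * chromaHeight(); }

    // Total size in bytes: Y plane plus the two chroma planes.
    constexpr std::size_t totalBytes() const { return lumaBytes() + 2 * chromaBytes(); }
};
```

Under these assumptions a 640x480 image needs 307200 bytes for Y and 76800 bytes for each of U and V, which is 1.5 bytes per pixel in total.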

Test framework

New features
  • -wu command-line option to set CPU warm-up time in milliseconds.
  • Tests for verifying functionality of SynetMergedConvolution16b framework.

Infrastructure

Bug fixing
  • Errors in build_and_test_gcc section of the GitHub Actions script for CMake.