Releases: k2-fsa/sherpa-onnx
Releases · k2-fsa/sherpa-onnx
v1.9.18
What's Changed
- Fix building OpenFst on Windows. by @csukuangfj in #744
- Fix go API examples with portaudio on Windows. by @csukuangfj in #746
- Support audio tagging using zipformer by @csukuangfj in #747
- Add C++ microphone examples for audio tagging by @csukuangfj in #749
- Add SHERPA_ONNX_GITHUB by @bubao in #750
- Fix a bug in mean calculation of 'ys_probs' by @aask1357 in #748
- Add Python API and Python examples for audio tagging by @csukuangfj in #753
- Add C API for audio tagging by @csukuangfj in #754
- [feature] Configurable padding length by @manickavela29 in #755
- Use batch size 1 in generating subtitles. by @csukuangfj in #756
- Fix WebAssembly for kws by @csukuangfj in #758
- Support adding punctuations to the speech recogntion result by @csukuangfj in #761
- Add Python API for punctuation models. by @csukuangfj in #762
- Release v1.9.18 by @csukuangfj in #763
New Contributors
- @bubao made their first contribution in #750
- @aask1357 made their first contribution in #748
- @manickavela29 made their first contribution in #755
Full Changelog: v1.9.17...v1.9.18
punctuation-models
Use batch size 1 in generating subtitles. (#756)
audio-tagging-models
v1.9.17
What's Changed
- Support heteronyms in Chinese TTS by @csukuangfj in #738
- Add VAD examples using ALSA for recording by @csukuangfj in #739
- Fix releasing GIL by @csukuangfj in #741
- Support Chinese heteronyms on Android for TTS. by @csukuangfj in #742
Full Changelog: v1.9.16...v1.9.17
v1.9.16
What's Changed
- Fix building wasm in CI by @csukuangfj in #720
- Add more piper models for text-to-speech by @csukuangfj in #725
- Fix microphone privacy config by @yujinqiu in #727
- Add language identification swiftui demo by @yujinqiu in #729
- Add HLG decoding for streaming CTC models by @csukuangfj in #731
- Add C API for streaming HLG decoding by @csukuangfj in #734
- return timestamps for WebAssembly by @csukuangfj in #737
Full Changelog: v1.9.15...v1.9.16
v1.9.15
What's Changed
- Add C API for spoken language identification. by @csukuangfj in #695
- Add Swift API for spoken language identification. by @csukuangfj in #696
- Add C# API for spoken language identification by @csukuangfj in #697
- Use a single thread when loading models by @csukuangfj in #703
- Build Android TTS APKs for coqui-ai/TTS models by @csukuangfj in #704
- Support including TTS conditionally. by @csukuangfj in #699
- add openfst.cmake file by @hantengc in #707
- Add Golang API for VAD by @csukuangfj in #708
- Sign released TTS APKs by @csukuangfj in #710
- Add Golang API for spoken language identification. by @csukuangfj in #709
- Fix ios-swift to remove invalid references by @csukuangfj in #713
- delete incorrect logs by @longshiming in #714
- Added progress for callback of tts generator by @leohuang2013 in #712
- Add C API for speaker embedding extractor. by @csukuangfj in #711
- Add Go API for speaker identification by @csukuangfj in #718
- Release v1.9.15 by @csukuangfj in #719
New Contributors
- @hantengc made their first contribution in #707
- @leohuang2013 made their first contribution in #712
Full Changelog: v1.9.14...v1.9.15
v1.9.14
What's Changed
- Support spoken language identification with whisper by @csukuangfj in #694
Full Changelog: v1.9.13...v1.9.14
v1.9.13
What's Changed
- add WebAssembly for Kws by @lovemefan in #648
- small fixes to wasm kws. by @csukuangfj in #672
- Fix argument for beam search Decoding by @bhaswa in #673
- Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API by @foreversimon in #675
- Add tts play example for .Net. by @csukuangfj in #676
- support whisper language by @wxbool in #678
- Support whisper language/task in various language bindings. by @csukuangfj in #679
- Add nuget package for Windows x86 by @csukuangfj in #683
- Update MainActivity.kt by @mablue in #687
- Refactor github actions tests by @csukuangfj in #688
- Add hotwords support to .Net by @csukuangfj in #689
- Configurable low_freq high_freq, dithering by @KarelVesely84 in #664
- Fix building c api examples with alsa by @csukuangfj in #691
- Add timestamps and tokens for .Net's online models. by @csukuangfj in #690
- Update MainActivity.kt by @mablue in #693
- Use manylinux in CI test by @csukuangfj in #692
New Contributors
- @lovemefan made their first contribution in #648
- @bhaswa made their first contribution in #673
- @foreversimon made their first contribution in #675
- @wxbool made their first contribution in #678
- @mablue made their first contribution in #687
Full Changelog: v1.9.12...v1.9.13
v1.9.12
What's Changed
- Add Python API for keyword spotting by @pkufool in #576
- Fix building wheels by @csukuangfj in #620
- Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) by @csukuangfj in #618
- Support Android NNAPI. by @csukuangfj in #622
- fixed variable's spell num_trailing_blanks by @lonngxiang in #623
- Support VITS models from icefall. by @csukuangfj in #625
- support onnxruntime v1.17.1 by @csukuangfj in #624
- Refactor WebAssembly for nodejs by @csukuangfj in #626
- Add WebAssembly for NodeJS. by @csukuangfj in #628
- Fix building wheels from source. by @csukuangfj in #632
- Support Ubuntu 18.04 and CentOS 7 by @csukuangfj in #635
- Allow to not use pre-installed onnxruntime libs. by @csukuangfj in #636
- Support using T-head-Semi/csi-nn2 for RISC-V by @csukuangfj in #637
- Fix deletions at the tail when using hotwords in streaming model by @pkufool in #638
- Add Python ASR examples with alsa by @csukuangfj in #646
- Add microphone streaming ASR example for C API by @csukuangfj in #650
- Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig by @GaryLaurenceauAva in #643
- Support user provided data in tts callback. by @csukuangfj in #653
- Use onnxruntime 1.17.1 for iOS. by @csukuangfj in #654
- c++ api for keyword spotter by @xinhecuican in #642
- Print the time about the first message in tts. by @csukuangfj in #655
- Release v1.9.12 by @csukuangfj in #661
New Contributors
- @lonngxiang made their first contribution in #623
- @GaryLaurenceauAva made their first contribution in #643
- @xinhecuican made their first contribution in #642
Full Changelog: v1.9.11...v1.9.12
Release v1.9.11
What's Changed
- Use curl to replace wget for Windows. by @csukuangfj in #558
- Update sherpa-onnx /scripts/whisper/test.py by @20246688 in #560
- C++ API demo for speaker identification with portaudio. by @csukuangfj in #561
- Add fine-tuned whisper model on aishell by @csukuangfj in #565
- Use sequential layout for OfflineTtsConfig in C# by @csukuangfj in #567
- Add context biasing for mobile by @ductranminh in #568
- Update README by @csukuangfj in #572
- Support WebAssembly by @csukuangfj in #577
- Update onnxruntime from 1.16.3 to 1.17.0 by @csukuangfj in #581
- Support building debug version on Windows by @csukuangfj in #583
- updated the icon on TTS engine for android by @paolo-caroni in #579
- Download android onnxruntime libs from github. by @csukuangfj in #584
- Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 by @csukuangfj in #587
- Increase the right chunk size of streaming paraformer to 3 by @csukuangfj in #588
- Support MinSizeRel and RelWithDebInfo build on Windows. by @csukuangfj in #586
- Add more Chinese TTS models (Mandarin and Cantonese) by @csukuangfj in #589
- Fix CI warnings by @csukuangfj in #590
- Add missing "start_time" to Python API by @vsd-vector in #591
- Download ios-onnxruntime from github instead of huggingface. by @csukuangfj in #593
- Link the math lib. by @csukuangfj in #592
- Fix CI tests. by @csukuangfj in #596
- Refactor WebAssembly build script. by @csukuangfj in #598
- Publish wasm tts to model scope. by @csukuangfj in #599
- Install wasm tts in a separate directory by @csukuangfj in #600
- Add WebAssembly for ASR by @csukuangfj in #604
- Use alsa to read microphone in speaker identification demo. by @csukuangfj in #605
- Support RISC-V by @csukuangfj in #609
- Fix #608 by @csukuangfj in #610
- Support using alsa to access the microphone with non-streaming ASR models by @csukuangfj in #517
- support using xnnpack as execution provider by @csukuangfj in #612
- Use hub.nuaa.cf to replace huggingface URL to download dependencies. by @csukuangfj in #614
- Track token scores by @KarelVesely84 in #571
- Fix computing features for whisper. by @csukuangfj in #617
New Contributors
- @20246688 made their first contribution in #560
- @ductranminh made their first contribution in #568
- @paolo-caroni made their first contribution in #579
- @vsd-vector made their first contribution in #591
Full Changelog: v1.9.10...v1.9.11