Releases · k2-fsa/sherpa-onnx

13 Apr 08:35

csukuangfj

v1.9.18

b6ad043

v1.9.18

What's Changed

Fix building OpenFst on Windows. by @csukuangfj in #744
Fix go API examples with portaudio on Windows. by @csukuangfj in #746
Support audio tagging using zipformer by @csukuangfj in #747
Add C++ microphone examples for audio tagging by @csukuangfj in #749
Add SHERPA_ONNX_GITHUB by @bubao in #750
Fix a bug in mean calculation of 'ys_probs' by @aask1357 in #748
Add Python API and Python examples for audio tagging by @csukuangfj in #753
Add C API for audio tagging by @csukuangfj in #754
[feature] Configurable padding length by @manickavela29 in #755
Use batch size 1 in generating subtitles. by @csukuangfj in #756
Fix WebAssembly for kws by @csukuangfj in #758
Support adding punctuation to the speech recognition result by @csukuangfj in #761
Add Python API for punctuation models. by @csukuangfj in #762
Release v1.9.18 by @csukuangfj in #763

New Contributors

@bubao made their first contribution in #750
@aask1357 made their first contribution in #748
@manickavela29 made their first contribution in #755

Full Changelog: v1.9.17...v1.9.18

Contributors

csukuangfj, bubao, and 2 other contributors

Assets 32

12 Apr 10:46

csukuangfj

punctuation-models

be4a248

punctuation-models

Use batch size 1 in generating subtitles. (#756)

Assets 5

09 Apr 08:04

csukuangfj

audio-tagging-models

db1b3ab

audio-tagging-models

Assets 9

08 Apr 13:38

csukuangfj

v1.9.17

0d90b34

v1.9.17

What's Changed

Support heteronyms in Chinese TTS by @csukuangfj in #738
Add VAD examples using ALSA for recording by @csukuangfj in #739
Fix releasing GIL by @csukuangfj in #741
Support Chinese heteronyms on Android for TTS. by @csukuangfj in #742

Full Changelog: v1.9.16...v1.9.17

Contributors

csukuangfj

Assets 51

05 Apr 12:25

csukuangfj

v1.9.16

c1c0f5b

v1.9.16

What's Changed

Fix building wasm in CI by @csukuangfj in #720
Add more piper models for text-to-speech by @csukuangfj in #725
Fix microphone privacy config by @yujinqiu in #727
Add language identification swiftui demo by @yujinqiu in #729
Add HLG decoding for streaming CTC models by @csukuangfj in #731
Add C API for streaming HLG decoding by @csukuangfj in #734
return timestamps for WebAssembly by @csukuangfj in #737

Full Changelog: v1.9.15...v1.9.16

Contributors

yujinqiu and csukuangfj

Assets 62

29 Mar 12:08

csukuangfj

v1.9.15

43af1e6

v1.9.15

What's Changed

Add C API for spoken language identification. by @csukuangfj in #695
Add Swift API for spoken language identification. by @csukuangfj in #696
Add C# API for spoken language identification by @csukuangfj in #697
Use a single thread when loading models by @csukuangfj in #703
Build Android TTS APKs for coqui-ai/TTS models by @csukuangfj in #704
Support including TTS conditionally. by @csukuangfj in #699
add openfst.cmake file by @hantengc in #707
Add Golang API for VAD by @csukuangfj in #708
Sign released TTS APKs by @csukuangfj in #710
Add Golang API for spoken language identification. by @csukuangfj in #709
Fix ios-swift to remove invalid references by @csukuangfj in #713
delete incorrect logs by @longshiming in #714
Added progress for callback of tts generator by @leohuang2013 in #712
Add C API for speaker embedding extractor. by @csukuangfj in #711
Add Go API for speaker identification by @csukuangfj in #718
Release v1.9.15 by @csukuangfj in #719

New Contributors

@hantengc made their first contribution in #707
@leohuang2013 made their first contribution in #712

Full Changelog: v1.9.14...v1.9.15

Contributors

leohuang2013, longshiming, and 2 other contributors

Assets 62

24 Mar 14:59

csukuangfj

v1.9.14

0d258dd

v1.9.14

What's Changed

Support spoken language identification with whisper by @csukuangfj in #694

Full Changelog: v1.9.13...v1.9.14

Contributors

csukuangfj

Assets 60

24 Mar 01:44

github-actions

v1.9.13

3cdad9b

v1.9.13

What's Changed

add WebAssembly for Kws by @lovemefan in #648
small fixes to wasm kws. by @csukuangfj in #672
Fix argument for beam search Decoding by @bhaswa in #673
Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API by @foreversimon in #675
Add tts play example for .Net. by @csukuangfj in #676
support whisper language by @wxbool in #678
Support whisper language/task in various language bindings. by @csukuangfj in #679
Add nuget package for Windows x86 by @csukuangfj in #683
Update MainActivity.kt by @mablue in #687
Refactor github actions tests by @csukuangfj in #688
Add hotwords support to .Net by @csukuangfj in #689
Configurable low_freq high_freq, dithering by @KarelVesely84 in #664
Fix building c api examples with alsa by @csukuangfj in #691
Add timestamps and tokens for .Net's online models. by @csukuangfj in #690
Update MainActivity.kt by @mablue in #693
Use manylinux in CI test by @csukuangfj in #692

New Contributors

@lovemefan made their first contribution in #648
@bhaswa made their first contribution in #673
@foreversimon made their first contribution in #675
@wxbool made their first contribution in #678
@mablue made their first contribution in #687

Full Changelog: v1.9.12...v1.9.13

Contributors

mablue, csukuangfj, and 5 other contributors

Assets 61

11 Mar 10:53

csukuangfj

v1.9.12

a628002

v1.9.12

What's Changed

Add Python API for keyword spotting by @pkufool in #576
Fix building wheels by @csukuangfj in #620
Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) by @csukuangfj in #618
Support Android NNAPI. by @csukuangfj in #622
Fix spelling of variable num_trailing_blanks by @lonngxiang in #623
Support VITS models from icefall. by @csukuangfj in #625
support onnxruntime v1.17.1 by @csukuangfj in #624
Refactor WebAssembly for nodejs by @csukuangfj in #626
Add WebAssembly for NodeJS. by @csukuangfj in #628
Fix building wheels from source. by @csukuangfj in #632
Support Ubuntu 18.04 and CentOS 7 by @csukuangfj in #635
Allow to not use pre-installed onnxruntime libs. by @csukuangfj in #636
Support using T-head-Semi/csi-nn2 for RISC-V by @csukuangfj in #637
Fix deletions at the tail when using hotwords in streaming model by @pkufool in #638
Add Python ASR examples with alsa by @csukuangfj in #646
Add microphone streaming ASR example for C API by @csukuangfj in #650
Expose 'language' 'task' 'tailPaddings' in OfflineWhisperModelConfig by @GaryLaurenceauAva in #643
Support user provided data in tts callback. by @csukuangfj in #653
Use onnxruntime 1.17.1 for iOS. by @csukuangfj in #654
c++ api for keyword spotter by @xinhecuican in #642
Print the time about the first message in tts. by @csukuangfj in #655
Release v1.9.12 by @csukuangfj in #661

New Contributors

@lonngxiang made their first contribution in #623
@GaryLaurenceauAva made their first contribution in #643
@xinhecuican made their first contribution in #642

Full Changelog: v1.9.11...v1.9.12

Contributors

csukuangfj, pkufool, and 3 other contributors

Assets 58

29 Feb 08:57

csukuangfj

v1.9.11

8b7928e

Release v1.9.11

What's Changed

Use curl to replace wget for Windows. by @csukuangfj in #558
Update sherpa-onnx /scripts/whisper/test.py by @20246688 in #560
C++ API demo for speaker identification with portaudio. by @csukuangfj in #561
Add fine-tuned whisper model on aishell by @csukuangfj in #565
Use sequential layout for OfflineTtsConfig in C# by @csukuangfj in #567
Add context biasing for mobile by @ductranminh in #568
Update README by @csukuangfj in #572
Support WebAssembly by @csukuangfj in #577
Update onnxruntime from 1.16.3 to 1.17.0 by @csukuangfj in #581
Support building debug version on Windows by @csukuangfj in #583
updated the icon on TTS engine for android by @paolo-caroni in #579
Download android onnxruntime libs from github. by @csukuangfj in #584
Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 by @csukuangfj in #587
Increase the right chunk size of streaming paraformer to 3 by @csukuangfj in #588
Support MinSizeRel and RelWithDebInfo build on Windows. by @csukuangfj in #586
Add more Chinese TTS models (Mandarin and Cantonese) by @csukuangfj in #589
Fix CI warnings by @csukuangfj in #590
Add missing "start_time" to Python API by @vsd-vector in #591
Download ios-onnxruntime from github instead of huggingface. by @csukuangfj in #593
Link the math lib. by @csukuangfj in #592
Fix CI tests. by @csukuangfj in #596
Refactor WebAssembly build script. by @csukuangfj in #598
Publish wasm tts to model scope. by @csukuangfj in #599
Install wasm tts in a separate directory by @csukuangfj in #600
Add WebAssembly for ASR by @csukuangfj in #604
Use alsa to read microphone in speaker identification demo. by @csukuangfj in #605
Support RISC-V by @csukuangfj in #609
Fix #608 by @csukuangfj in #610
Support using alsa to access the microphone with non-streaming ASR models by @csukuangfj in #517
support using xnnpack as execution provider by @csukuangfj in #612
Use hub.nuaa.cf to replace huggingface URL to download dependencies. by @csukuangfj in #614
Track token scores by @KarelVesely84 in #571
Fix computing features for whisper. by @csukuangfj in #617

New Contributors

@20246688 made their first contribution in #560
@ductranminh made their first contribution in #568
@paolo-caroni made their first contribution in #579
@vsd-vector made their first contribution in #591

Full Changelog: v1.9.10...v1.9.11

Contributors

paolo-caroni, csukuangfj, and 4 other contributors

Assets 59
