Releases · k2-fsa/sherpa-onnx

14 Sep 06:38

csukuangfj

v1.10.26

9dade25

v1.10.26

What's Changed

Add links to projects using sherpa-onnx. by @csukuangfj in #1345
Support lang/emotion/event results from SenseVoice in Swift API. by @csukuangfj in #1346
Support specifying max speech duration for VAD. by @csukuangfj in #1348
Add APIs about max speech duration in VAD for various programming languages by @csukuangfj in #1349
Release v1.10.26 by @csukuangfj in #1350

Full Changelog: v1.10.25...v1.10.26

Contributors

csukuangfj

Assets 50

13 Sep 07:00

csukuangfj

v1.10.25

211786e

v1.10.25

What's Changed

Fix releasing dart packages. by @csukuangfj in #1317
Throw error instead exit on fail to read wav in java by @RGdevz in #1323
Re-implement LM rescore for online transducer by @SilverSulfide in #1231
Fixed the C api calls and created the TTS project file by @twodawg in #1324
Build websocket related binaries for embedded systems. by @csukuangfj in #1327
fix wasm app for streaming paraformer by @csukuangfj in #1328
Fix vad.Flush() by @csukuangfj in #1329
Fix typos by @csukuangfj in #1330
Add Python binding for online punctuation models by @yaochie in #1312
Fix building by @csukuangfj in #1331
Preserve previous result as context for next segment by @vsd-vector in #1335
Fix computing features for CED audio tagging models. by @csukuangfj in #1341
re-pull-request allow tokens and hotwords be loaded from buffered string driectly by @shawl336 in #1339
Fix building by @csukuangfj in #1343
Release v1.10.25 by @csukuangfj in #1344

New Contributors

@RGdevz made their first contribution in #1323
@twodawg made their first contribution in #1324
@yaochie made their first contribution in #1312
@shawl336 made their first contribution in #1339

Full Changelog: v1.10.24...v1.10.25

Contributors

twodawg, csukuangfj, and 5 other contributors

Assets 48

30 Aug 09:52

github-actions

v1.10.24

cc46231

v1.10.24

Release v1.10.24 (#1309)

Assets 49

24 Aug 15:48

github-actions

v1.10.23

5ed8e31

v1.10.23

What's Changed

flutter: add lang, emotion, event to OfflineRecognizerResult by @eschmidbauer in #1268
Use a separate thread to initialize models for lazarus examples. by @csukuangfj in #1270
Object pascal examples for recording and playing audio with portaudio. by @csukuangfj in #1271
Text to speech API for Object Pascal. by @csukuangfj in #1273
update kotlin api for better release native object and add user-frien… by @fbzhong in #1275
Provide models for mobile-only platforms by fixing batch size to 1 by @csukuangfj in #1276
Update wave-reader.cc by @diyism in #1278
Set batch size to 1 for more streaming ASR models by @csukuangfj in #1280
Add WebAssembly for VAD by @csukuangfj in #1281
WebAssembly example for VAD + Non-streaming ASR by @csukuangfj in #1284
Add VAD and keyword spotting for the Node package with WebAssembly by @csukuangfj in #1286

New Contributors

@eschmidbauer made their first contribution in #1268
@diyism made their first contribution in #1278

Full Changelog: v1.10.22...v1.10.23

Contributors

fbzhong, diyism, and 2 other contributors

Assets 49

16 Aug 14:42

csukuangfj

v1.10.22

8880975

v1.10.22

What's Changed

Exclude .DS_Store files from flutter tts assets by @csukuangfj in #1238
Add Pascal API for reading wave files by @csukuangfj in #1243
Pascal API for streaming ASR by @csukuangfj in #1246
Pascal API for non-streaming ASR by @csukuangfj in #1247
Pascal API for VAD by @csukuangfj in #1249
Update offline-recognizer.cc by @iprovalo in #1253
Add more C API examples by @zhu-han in #1255
Add emotion, event of SenseVoice. by @fbzhong in #1257
Support reading multi-channel wave files with 8/16/32-bit encoded samples by @csukuangfj in #1258
Enable IPO only for Release build. by @csukuangfj in #1261
Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR by @csukuangfj in #1251
chore: update online-stream.h by @eltociear in #1264
Build generating subtitles APPs for more models by @csukuangfj in #1265
Fix looking up OOVs in lexicon.txt for MeloTTS models. by @csukuangfj in #1266
Release v1.10.22 by @csukuangfj in #1267

New Contributors

@zhu-han made their first contribution in #1255
@fbzhong made their first contribution in #1257

Full Changelog: v1.10.21...v1.10.22

Contributors

fbzhong, iprovalo, and 3 other contributors

Assets 28

08 Aug 03:00

github-actions

v1.10.21

94e2562

v1.10.21

What's Changed

Fix ffmpeg c api example by @csukuangfj in #1185
Fix splitting sentences for MeloTTS by @csukuangfj in #1186
Add test about whisper large-v3 for .Net by @csukuangfj in #1187
Non-streaming WebSocket client for Java. by @csukuangfj in #1190
Fix copying asset files for flutter examples. by @csukuangfj in #1191
Add Chinese+English tts example for flutter by @csukuangfj in #1192
Add speaker identification and verification exmaple for Dart API by @csukuangfj in #1194
Fix reading non-standard wav files. by @csukuangfj in #1199
Add ReazonSpeech Japanese pre-trained model by @csukuangfj in #1203
describe how to add new words for MeloTTS models by @csukuangfj in #1209
Remove libonnxruntime_providers_cuda.so as a dependency. by @csukuangfj in #1210
Update README to include Rust. by @csukuangfj in #1212
Fix setting SenseVoice language. by @csukuangfj in #1214
Support passing TTS callback from Swift to C by @csukuangfj in #1218
Add MeloTTS example for ios by @csukuangfj in #1223
Add online punctuation and casing prediction model for English language by @frankyoujian in #1224
Fix style issues for online punctuation source files by @csukuangfj in #1225
Add CI tests for online punctuation models by @csukuangfj in #1226
Fix typo in offline-lm-config.cc by @xsjk in #1229
Fix Python two pass ASR examples by @csukuangfj in #1230
addition of blank_penalty config in online_recognizer by @parth-khiera in #1232
Add blank penalty for various language bindings. by @csukuangfj in #1234