Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper technology, via whisper.cpp.
Updated Jun 30, 2024 - C++
VadRecorder, based on WebRTC's VAD engine and the vo-aac encoder, records valid speech and discards silence/noise data.
Voice activity detection (VAD) library for speech-end detection, based on WebRTC's VAD engine
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Automagically synchronize subtitles with video.
EduSense: Practical Classroom Sensing at Scale
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Synchronize your subtitles using machine learning
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Voice Activity Detection based on Deep Learning & TensorFlow
PocketPiglet for Android
PocketPiglet for iOS
This repository contains scripts of activities performed on various deep learning concepts
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Speech Detection 💬