Add VAD support #10

satra · 2024-03-15T16:15:18Z

Audio files can have a lot of silence before or after the speech. Add voice activity detection (VAD) functionality and make it an optional step in the b2ai feature pipeline.

GasserElbanna · 2024-03-16T19:15:42Z

Some options to consider:

fabiocat93 · 2024-03-21T20:24:47Z

A simpler implementation for removing silence from the start and end of audio files is here:

https://pytorch.org/audio/main/generated/torchaudio.transforms.Vad.html
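One thing to keep in mind: as far as I know, this transform (like the SoX effect it mirrors) trims only from the front of the audio, so trimming the end means reversing the signal, trimming again, and reversing back. Below is a minimal sketch of that pattern in pure Python. It does not use torchaudio: `trim_front` is a hypothetical energy-threshold stand-in for a front-only VAD, and `frame_len`, `threshold`, and `trim_both_ends` are illustrative names and values, not anything from the library or from this repo.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame (0.0 for an empty frame)."""
    return sum(x * x for x in frame) / len(frame) if frame else 0.0


def trim_front(
    samples: Sequence[float], frame_len: int = 4, threshold: float = 0.01
) -> List[float]:
    """Drop leading frames whose energy is below `threshold`.

    Hypothetical stand-in for a front-only VAD. This is a plain energy
    threshold, NOT the algorithm used by torchaudio or SoX.
    """
    for start in range(0, len(samples), frame_len):
        if frame_energy(samples[start:start + frame_len]) >= threshold:
            return list(samples[start:])
    return []  # no frame reached the threshold: treat everything as silence


def trim_both_ends(
    samples: Sequence[float], frame_len: int = 4, threshold: float = 0.01
) -> List[float]:
    """Trim leading and trailing silence using only a front-only trimmer."""
    front_trimmed = trim_front(samples, frame_len, threshold)
    # Reverse so the trailing silence becomes leading silence, trim, reverse back.
    back_trimmed = trim_front(front_trimmed[::-1], frame_len, threshold)
    return back_trimmed[::-1]


if __name__ == "__main__":
    signal = [0.0] * 8 + [0.5, -0.5, 0.4, -0.4] + [0.0] * 8
    print(trim_both_ends(signal))
```

Trimming here happens in whole frames, so up to `frame_len - 1` quiet samples can remain at either edge. With the real transform the same reverse, trim, reverse sequence would be applied to the waveform tensor along the time axis.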

Also, a note on pyannote-audio: as of the latest version, the speaker-diarization pipeline is recommended over the segmentation model because it achieves a better diarization error rate.
