Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improving Performance on Shorter Audio Clips #5

Open
shawnbzhang opened this issue Sep 6, 2020 · 1 comment
Open

Improving Performance on Shorter Audio Clips #5

shawnbzhang opened this issue Sep 6, 2020 · 1 comment

Comments

@shawnbzhang
Copy link

shawnbzhang commented Sep 6, 2020

Using your GPVAD/VADC, I wish to process smaller chunks (i.e. ~200ms chunks) of audio files. However, when the duration is this low, the performance of the VAD is poor. What can I do to better the performance? I assume this must be done in the training side. Would you recommend downloading the datasets and splicing them into these smaller chunks, retraining from scratch?

Curious to hear your thoughts. Thank you!

@RicherMans
Copy link
Owner

Hey there,
well so far the proposed GPV is not "online" meaning that it does not directly output for each frame one probability.
Performance is dependent on the utterance length, due to the bidirectional GRU getting more information.

What can I do to better the performance? I assume this must be done in the training side. Would you recommend downloading the datasets and splicing them into these smaller chunks, retraining from scratch?

Well, the point of the entire project is just to show that VAD can be trained on clip-level using weak (here inexact and noisy) supervision.
If you have labels for every,e.g., 200ms, well just train a standard VAD model.
However, in reality, I doubt that you have this type of supervision available, its too costly.

However, when the duration is this low, the performance of the VAD is poor.

Well, how about during testing you just splice some short utterances together?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants