Student Hyperparameter Tuning in ASR Knowledge Distillation

The code of knowledge distillation hyperparameter tuning task for AINL AutoML Workshop 2024 paper "Size Matters: About Optimal Amount of Speech Data for Student Hyperparameter Tuning in ASR Knowledge Distillation".

Abstract:

The knowledge distillation (KD) methods of end-to-end automatic speech recognition (ASR) models are increasingly in demand nowadays, they are used to improve the performance of the models and to make their usage possible on devices with modest technical characteristics. However, it is still hard to find the right balance between the size, the perfomance and the inference quality of the compressed model. In this work we use the automated machine learning framework for the knowledge distillation hyperparameter tuning task to discover how much prepared audio data is necessary to get the best hyperparameters for the efficient KD of Wav2Vec2 model. The proposed approach can be also used for tuning of the compressed model's parameters, depending on the desired performance and speech recognition error rate indicators.

Conference: https://ainlconf.ru/

Workshop: https://ainlconf.ru/2024/automl

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
optuna_automl		optuna_automl
speaker_clusterisation		speaker_clusterisation
LICENSE		LICENSE
Presentation.pdf		Presentation.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Student Hyperparameter Tuning in ASR Knowledge Distillation

About

Releases

Packages

Languages

License

dangrebenkin/w2v2_distillation_hyperparameter_auto_ml

Folders and files

Latest commit

History

Repository files navigation

Student Hyperparameter Tuning in ASR Knowledge Distillation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages