A curated list of awesome speaker recognition/verification/identification papers, projects, datasets, and competitions.
- Fundamentals of Speaker Recognition by Beigi, Homayoon
- Machine Learning for Speaker Recognition by Jen-Tzung Chien and Man-Wai Mak
- Speaker Verification - The present and future of voiceprint based security by Professor Eliathamby Ambikairajah
- Identify Speaker Voice Machine learning model Neural Networks in Keras/TensorFlow
- X-vectors: Robust DNN embeddings for speaker recognition
- A brief Introduction to SincNet
- https://paperswithcode.com/task/speaker-recognition
- https://paperswithcode.com/task/speaker-verification
- Speech and Speaker Recognition from Raw Waveform with SincNet (CNN, speech + speaker)
- Deep Neural Network Embeddings for Text-Independent Speaker Verification (x-vector)
- How to train your speaker embeddings extractor (VAD + speaker embeddings)
- https://github.com/WeidiXie/VGG-Speaker-Recognition (Python 2 + TensorFlow 1.x)
- https://github.com/zabir-nabil/tf2-speaker-recognition (Python 3 + TensorFlow 2.x)
- https://github.com/mravanelli/SincNet (Python 3 + PyTorch)
- deep-speaker [softmax + triplet works best, clean audio]
- meta-SR [PyTorch, short utterances]
- VoxCeleb mirror
- CN-Celeb
- ST Chinese Mandarin Corpus
- AIF [not public]
- MLS [big + multi-lingual]
- AIF [not public]
- SdSV Challenge
- VoxSRC
- NIST SRE
- Kaldi Speech Recognition Toolkit - Extraction of x-vectors
- PLDA/LDA from enrollment using Kaldi - PLDA scoring
- Neural PLDA - Neural PLDA, Kaldi
Have anything in mind that you think is awesome and would fit in this list? Feel free to send a pull request.
To the extent possible under law, Zabir Al Nazi has waived all copyright and related or neighboring rights to this work.