Skip to content

✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

License

Notifications You must be signed in to change notification settings

weimeng23/speech-recognition-learning-resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Speech Recognition Learning Resources

This repo contains several learning resources for speech recognition, including courses, books, tutorials, papers and toolkits.(continuously updating)

Table of contens

Courses

  • (Recommended) Automatic Speech Recognition (ASR) 2018-2019 Lectures, School of Informatics, University of Edingburgh [Website]
  • Speech recognition, EECS E6870 - Spring 2016, Columbia University [Website]
  • CS224N: Natural Language Processing with Deep Learning, Stanford [Website] [Video(Winter 2021)] [Video(Winter 2017)]
  • CS224S: Spoken Language Processing (Winter 2021), Stanford [Website]
  • DLHLP: DEEP LEARNING FOR HUMAN LANGUAGE PROCESSING, 2020 SPRING, Hung-yi Lee [Website] [Video(Spring 2020)]
  • Microsoft DEV287x: Speech Recognition Systems, 2019 [Website]
  • 语音识别从入门到精通,2019,谢磊 (NOT FREE) [Website]
  • 數位語音處理概論,国立台湾大学,李琳山 [Website]

Books

  • Fundamentals of speech recognition, Lawrence Rabiner, Being-Hwang Juang, 1993 [Book]
  • Spoken language processing: A guide to theory, algorithm, and system levelopment, xuedong Huang, Alex acero, hsiao-wuen Hon, 2001 [Book]
  • Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Daniel Jurafsky & James H. Martin [Website] [Book 3rd Ed]
  • Automatic speech recognition: A Deep Learning Approach, Dong Yu and Li Deng, Springer, 2014 [Book]
  • Foundations of Statistical Natural Language Processing, Chris Manning and Hinrich Schütze, 1999 [Website] [Book]
  • 《解析深度学习:语音识别实践》,俞栋,邓力,电子工业出版社
  • 《Kaldi 语音识别实战》,陈果果,电子工业出版社
  • 《语音识别:原理与应用》,洪青阳,电子工业出版社
  • 《语音识别基本法》,汤志远,电子工业出版社
  • 《统计学习方法》李航,清华大学出版社
  • 《语音信号处理》韩继庆,清华大学出版社
  • 《语音信号处理》赵力,机械工业出版社

Papers

  • HMM: Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989, 77(2): 257-286. [Paper]
  • EM: Bilmes J A. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models[J]. International Computer Science Institute, 1998, 4(510): 126. [Paper]
  • CTC: Graves A, Fernández S, Gomez F, et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks[C]//Proceedings of the 23rd international conference on Machine learning. 2006: 369-376. [Paper]

Tutorials

  • WFST
    • An Introduction to Weighted Automata in Machine Learning, Awni Hannun, 2021. [PDF]
  • k2
    • Speech Recognition with Next-Generation Kaldi (K2, Lhotse, Icefall), Interspeech 2021. [Video]
    • Progress in ASR with Next-Gen Kaldi, BAAI 2022. [Video] [Slides]
    • Speech Recognition with Icefall + Lhotse, Interspeech 2023. [Slides]

Toolkits

listed in no particular order

Releases

No releases published

Packages

No packages published