└── README.md /README.md: -------------------------------------------------------------------------------- 1 | # Video 2 | [哥伦比亚大学ASR课程](http://www.ee.columbia.edu/~stanchen/fall12/e6870/outline.html) 3 | 4 | [爱丁堡ASR课程](http://www.inf.ed.ac.uk/teaching/courses/asr/index-2019.html) 5 | 6 | [CMU ASR课程](http://www.speech.cs.cmu.edu/15-492/) 7 | 8 | [台湾大学数位语音处理](http://ocw.aca.ntu.edu.tw/ntu-ocw/ocw/cou/104S204) 9 | 10 | [斯坦福224S](http://web.stanford.edu/class/cs224s/) 11 | 12 | [YORK university](https://wiki.eecs.yorku.ca/course_archive/2012-13/F/6328/start) 13 | 14 | [爱丁堡大学Simon King教授语音课](http://www.speech.zone/courses/) 15 | 16 | # Book 17 | [剑桥大学的HTK工具包的手册,HTK Book](http://htk.eng.cam.ac.uk/docs/docs.shtml) 18 | 19 | Spoken Language Processing: A Guide to Theory, Algorithm and System Development 20 | by Xuedong Huang (Author), Alex Acero (Author), Hsiao-Wuen Hon (Author) 21 | https://www.amazon.com/Spoken-Language-Processing-Algorithm-Development/dp/0130226165 22 | 23 | Fundamentals of Speech Recognition 24 | by Lawrence Rabiner (Author), Biing-Hwang Juang (Author) 25 | https://www.amazon.com/Fundamentals-Speech-Recognition-Lawrence-Rabiner/dp/0130151572 26 | 27 | Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) 2015th Edition 28 | by Dong Yu (Author), Li Deng (Author) 29 | https://www.amazon.com/Automatic-Speech-Recognition-Communication-Technology/dp/1447157788 30 | 31 | Speech and Language Processing, 2nd Edition 32 | by Daniel Jurafsky (Author), James H. Martin (Author) 33 | https://www.amazon.com/Speech-Language-Processing-Daniel-Jurafsky/dp/0131873210 34 | 35 | 36 | Pattern Recognition and Machine Learning (Information Science and Statistics) 37 | by Christopher M. Bishop (Author) 38 | https://www.amazon.com/Pattern-Recognition-Learning-Information-Statistics/dp/0387310738 39 | 40 | Wangd-kaldi-book 41 | http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Wangd-kaldi-book 42 | 43 | 解析深度学习:语音识别实践 44 | https://book.douban.com/subject/26820808/ 45 | 46 | # Toolkit 47 | Kaldi https://github.com/kaldi-asr 48 | 49 | Eesen https://github.com/srvk/eesen 50 | 51 | CNTK https://github.com/Microsoft/CNTK 52 | 53 | # Paper 54 | L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989 55 | 56 | A. Graves, S. Fern´andez, F. Gomez, and J. Schmidhuber, “Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks,” in International Conference on Machine Learning (ICML), ACM, 2006, pp. 369–376. 57 | 58 | Reading list from NCMMSC Speech group 59 | http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Reading_list_from_NCMMSC_Speech_group 60 | 61 | 62 | # Reference 63 | https://www.zhihu.com/question/65516424/answer/232899728 64 | https://www.zhihu.com/question/24342192/answer/225984574 65 | https://www.zhihu.com/question/39701966/answer/88084026 66 | https://www.msra.cn/zh-cn/news/features/book-recommendation-speech 67 | https://cloud.tencent.com/developer/article/1031646 68 | https://book.douban.com/review/8658211/ 69 | https://blog.csdn.net/chenghaoy/article/details/82761586 70 | http://ftli.farbox.com/post/automatic-speech-recognition-asr-courses 71 | http://zhaoshuaijiang.com/2019/02/15/end-to-end-asr/ 72 | https://antkillerfarm.github.io/speech/2018/04/16/speech.html 73 | --------------------------------------------------------------------------------