master2

Category: MATLAB programming
Development tool: MATLAB
File size: 29093 KB
Downloads: 73
Upload date: 2016-03-19 23:30:44
Uploader: ldw527033039
Description: A GMM-HMM-based speech recognition program written in MATLAB; training and test data are included.

File list:
Levenshtein.m (1997, 2014-04-08)
Testing (0, 2014-04-08)
Testing\TestingIDs1-15.txt (300, 2014-04-08)
Testing\hypotheses.txt (1664, 2014-04-08)
Testing\unkn_1.mfcc (63292, 2014-04-08)
Testing\unkn_1.phn (303, 2014-04-08)
Testing\unkn_1.txt (36, 2014-04-08)
Testing\unkn_1.wav (79712, 2014-04-08)
Testing\unkn_1.wrd (99, 2014-04-08)
Testing\unkn_10.mfcc (95715, 2014-04-08)
Testing\unkn_10.phn (580, 2014-04-08)
Testing\unkn_10.txt (64, 2014-04-08)
Testing\unkn_10.wav (120468, 2014-04-08)
Testing\unkn_10.wrd (170, 2014-04-08)
Testing\unkn_11.mfcc (65437, 2014-04-08)
Testing\unkn_11.phn (468, 2014-04-08)
Testing\unkn_11.txt (50, 2014-04-08)
Testing\unkn_11.wav (82170, 2014-04-08)
Testing\unkn_11.wrd (110, 2014-04-08)
Testing\unkn_12.mfcc (90163, 2014-04-08)
Testing\unkn_12.phn (503, 2014-04-08)
Testing\unkn_12.txt (55, 2014-04-08)
Testing\unkn_12.wav (113504, 2014-04-08)
Testing\unkn_12.wrd (138, 2014-04-08)
Testing\unkn_13.mfcc (54866, 2014-04-08)
Testing\unkn_13.phn (433, 2014-04-08)
Testing\unkn_13.txt (53, 2014-04-08)
Testing\unkn_13.wav (69062, 2014-04-08)
Testing\unkn_13.wrd (157, 2014-04-08)
Testing\unkn_14.mfcc (57012, 2014-04-08)
Testing\unkn_14.phn (355, 2014-04-08)
Testing\unkn_14.txt (49, 2014-04-08)
Testing\unkn_14.wav (71930, 2014-04-08)
Testing\unkn_14.wrd (119, 2014-04-08)
Testing\unkn_15.mfcc (85219, 2014-04-08)
Testing\unkn_15.phn (520, 2014-04-08)
Testing\unkn_15.txt (52, 2014-04-08)
Testing\unkn_15.wav (107360, 2014-04-08)
... ...

SpeechRecognition
=================

Speaker and speech recognition. The code depends on FullBNT, which is not included.

Speaker Identification (GMM)
----------------------------

Speaker identification is the task of correctly identifying the speaker s_c from among S possible speakers s_i, i = 1..S, given an input speech sequence X consisting of a succession of d-dimensional real vectors; here d = 14. This is a discrete classification task (choosing among several speakers) that uses continuous-valued data (the vectors of real numbers) as input. An M-component Gaussian mixture model (GMM) is used for each of the speakers in the Training data set. gmmClassify.m calculates and reports the likelihoods of the five most likely speakers for each test utterance. The output is written to the folder unkn as individual files.

Speech Recognition (HMM)
------------------------

Speech recognition is the task of correctly identifying a word sequence given an input speech sequence X. Typically this process involves language models, dictionaries, and grammars. This project considers only a small subset of the acoustic modelling component and uses the Bayes Net Toolbox. Word error rates are computed using the Levenshtein distance in Levenshtein.m. Additionally, pca.m contains code for PCA to reduce the dimensionality of the data.
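
To illustrate how a word error rate can be derived from the Levenshtein distance, here is a minimal Python sketch. It is not the contents of Levenshtein.m, which may differ in its costs and output format. The function names `levenshtein` and `word_error_rate` are hypothetical, and the sketch assumes unit cost for substitutions, insertions, and deletions, with words separated by whitespace.

```python
def levenshtein(ref, hyp):
    """Edit distance between two sequences, assuming unit cost per edit."""
    # prev[j] holds the distance between the first i-1 items of ref
    # and the first j items of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from the first i items of ref to an empty hyp
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of r
                curr[j - 1] + 1,     # insertion of h
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return levenshtein(ref, hyp) / len(ref)
```

For example, `word_error_rate("the cat sat on the mat", "the cat sit on mat")` counts one substitution and one deletion against six reference words. Because the distance is normalised by the reference length only, the rate can exceed 1 when the hypothesis contains many insertions.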
