baseline_CM

所属分类:人工智能/神经网络/深度学习
开发工具:matlab
文件大小:2851KB
下载次数:22
上传日期:2018-01-20 16:52:40
上 传 者longruo
说明:  声纹识别特征提取程序,防假冒录音的声纹识别主要特征
(CQCC feature extraction)

文件列表:
baseline_CM (0, 2017-01-20)
baseline_CM\baseline_CM.m (3755, 2017-01-20)
baseline_CM\bosaris_toolkit (0, 2016-12-23)
baseline_CM\utility (0, 2016-12-23)
baseline_CM\utility\logsumexp.m (261, 2016-12-23)
baseline_CM\utility\lgmmprob.m (343, 2016-12-23)
baseline_CM\utility\compute_llk.m (178, 2016-12-23)
baseline_CM\CQCC_v1.0 (0, 2016-12-23)
baseline_CM\CQCC_v1.0\D18_1000001.wav (128532, 2016-12-23)
baseline_CM\CQCC_v1.0\DEMO.m (934, 2016-12-23)
baseline_CM\CQCC_v1.0\._cqcc.m (4096, 2016-12-23)
baseline_CM\CQCC_v1.0\._DEMO.m (4096, 2016-12-23)
baseline_CM\CQCC_v1.0\._.DS_Store (4096, 2016-12-23)
baseline_CM\CQCC_v1.0\.DS_Store (6148, 2016-12-23)
baseline_CM\CQCC_v1.0\cqcc.m (5860, 2016-12-23)
baseline_CM\bosaris_toolkit\classes (0, 2016-12-23)
baseline_CM\bosaris_toolkit\demo (0, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_with_files.m (3675, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_make_score_files.m (1282, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_make_data_for_fusion.m (264, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_plot_det_for_fusion.m (983, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_fusion.m (1145, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_main.m (1831, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_plot_nber.m (1593, 2016-12-23)
baseline_CM\bosaris_toolkit\demo\demo_plot_det.m (1017, 2016-12-23)
baseline_CM\bosaris_toolkit\stats (0, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\prbep.m (413, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\fastEval.m (719, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\cllr.m (1564, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\fast_actDCF.m (3032, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\area_under_rocch.m (539, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\effective_prior.m (575, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\dcf_at_threshold.m (936, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\eer.m (341, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\fast_minDCF.m (2585, 2016-12-23)
baseline_CM\bosaris_toolkit\stats\min_cllr.m (1379, 2016-12-23)
baseline_CM\bosaris_toolkit\calibration (0, 2016-12-23)
... ...

ASVspoof 2017 CHALLENGE: Audio replay detection challenge for automatic speaker verification anti-spoofing http://www.spoofingchallenge.org/ =================================================================================================== Matlab implementation of the baseline system for replay detection based on constant Q cepstral coefficients (CQCC) features + Gaussian Mixture Models (GMMs) =================================================================================================== Contents of the package ======================= - CQCC_v1.0 ------------------ Matlab implementation of constant Q cepstral coefficients. For further details on CQCC, refer to the following publication: Todisco, M., Delgado, H., Evans, N., A new feature for automatic speaker verification anti-spoofing: Constant Q cepstral coefficients. ODYSSEY 2016, The Speaker and Language Recognition Workshop, June 21-24, 2016, Bilbao, Spain http://audio.eurecom.fr/content/software - vlfeat ----------------- VLFeat open source library that implements the GMMs. For further details, refer to: http://www.vlfeat.org/ - bosaris_toolkit ----------------- The BOSARIS Toolkit provides MATLAB code for evaluating scores. For further details, refer to: https://sites.google.com/site/bosaristoolkit/ - baseline_CM.m ---------------- This script is the baseline countermeasures system for ASVspoof 2017 CHALLENGE - Audio replay detection challenge for automatic speaker verification anti-spoofing. Front-end is based on CQCC features, while back-end is based on GMMs. The CQCC is applied with a maximum frequency of fs/2, where fs = 16kHz is the sampling frequency. The minimum frequency is set to fs/2/2^9 ~15Hz (9 being the number of octaves). The number of bins per octave is set to 96. Re-sampling is applied with a sampling period of 16. CQCC features dimension is set to 29 coefficients + 0th, with the static, delta and delta-delta coefficients. 2-class GMMs are trained on the genuine and spoofed speech utterances of the training dataset, respectively. We use 512-component models, trained with an expectation-maximisation (EM) algorithm with random initialisation. The score is computed as the log-likelihood ratio for the test utterance given the natural and the spoofed speech models. BASELINE PERFORMANCE UPDATE!! Using the originally distributed dataset (file "ASVspoof2017.tar.gz"), performance in terms of equal error rate (EER) on the development set should be close to 7%. However, with the newly distributed data (file "ASVspoof2017_train_dev_update.tar.gz") with the DTFM tones removed/reduced, EER should be around 11%. Contact information =================== For any query, please contact: - asvspoof2017 at cs.uef.fi - Hector Delgado (delgado at eurecom.fr) - Massimiliano Todisco (todisco at eurecom.fr) - Nicholas Evans (evans at eurecom.fr)

近期下载者

相关文件


收藏者