題名: Derivation of Robust Mel-Frequency Cepstral Coefficients Using a Weighting Discrete Cosine Transform
作者: Hung, Wei-Wen
Huang, Yu-Yung
Xsia, Chia-Hsiung
關鍵字: mel-frequency cepstral coefficient
MFFCC
discrete cosing transform
DCT
critical band filter
F-ratio measure
discrimination capability
期刊名/會議名稱: 1999 NCS會議
摘要: Mel-frequency cepstral coefficient (MFCC) is one of the most popular speech features used in an automatic speech recognition system. In order to improve its discrimination capability and robustness in various environments, a weighting discrete cosine transform (WDCT) is proposed in this paper and incorporated into the derivation of the conventional mel-frequency cepstral coefficients. The weighting function used in the discrete cosine transform can be easily calculated from the log-spectral amplitudes of each speech frame and by which we can adequately explore the relative reliabilities among different critical band filters. Experimental results for recognition of continuous telephone speech indicate that the syllable recognition rates of the WDCT-MFCC are 3.91%and 2.16% higher than those of the conventional MFCC in the cases of with and without compensation of channel distortions, respectively. Those results verify the robustness and effectiveness of the proposed WDCT-MFCC. Moreover, comparisons of F-ratio measures between the conventional MFCC and WDCT-MFCC also conclude that the WDCT-MFCC has superior discrimination capability in modeling a speech recognizer.
日期: 2006-11-13
分類:1999年 NCS 全國計算機會議

文件中的檔案:
檔案 描述 大小格式 
ce07ncs001999000120.pdf409.66 kBAdobe PDF檢視/開啟


在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。