Welcome to Journal of Beijing Institute of Technology
Volume 12, Issue 4
ZHAO Jun-hui, KUANG Jing-ming, HUANG Shi-lei. Comparative Study on Channel Compensation for Robust Speech Recognition[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2003, 12(4): 403-406.

Comparative Study on Channel Compensation for Robust Speech Recognition

Funds: the National Natural Science Foundation (60372089)
  • Received Date: 2003-06-17
  • Several channel compensation techniques that are integrated into the front end of a speech recognizer to improve channel robustness are described. These techniques include cepstral mean normalization, RASTA processing, and blind equalization. Two standard channel frequency characteristics, G.712 and MIRS, are introduced as channel distortion references, and a Mandarin digit string recognition task is performed to evaluate and compare the performance of these methods. The recognition results show that in the G.712 case blind equalization achieves the best recognition performance, while in the MIRS case cepstral mean normalization outperforms the other methods, reaching a word error rate of 3.96%.
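As a rough illustration of the first technique named in the abstract, cepstral mean normalization can be sketched as subtracting the per-utterance mean of each cepstral coefficient. The sketch assumes a time-invariant linear channel, which appears as a constant additive offset in the cepstral domain. It is not the authors' implementation; the function name `cepstral_mean_normalize` and the toy values are hypothetical.

```python
from typing import List


def cepstral_mean_normalize(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-coefficient mean taken over all frames of one utterance."""
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across the utterance.
    means = [sum(frame[k] for frame in frames) / n_frames for k in range(n_coeffs)]
    return [[frame[k] - means[k] for k in range(n_coeffs)] for frame in frames]


# Hypothetical toy cepstra: 3 frames with 2 coefficients each.
clean = [[1.0, 2.0], [3.0, 0.0], [5.0, 4.0]]

# Assumption: a fixed channel adds the same offset to every frame in the cepstral domain.
channel_offset = [0.5, -1.5]
distorted = [[c + h for c, h in zip(frame, channel_offset)] for frame in clean]

normalized_clean = cepstral_mean_normalize(clean)
normalized_distorted = cepstral_mean_normalize(distorted)
```

Under this assumption the normalized features of the clean and channel-distorted utterances coincide, which is the sense in which mean subtraction compensates for the channel. RASTA processing and blind equalization address the same offset with filtering and adaptive estimation respectively and are not shown here.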
