Welcome to Journal of Beijing Institute of Technology
Volume 21Issue 3
.
Turn off MathJax
Article Contents
WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.
Citation: WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.

Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate

  • Received Date:2011-06-21
  • Associated with ITU-T G 719, a post-processing method in frequency domain for enhancing the perceptual quality of the decoded transient audio is proposed only to the audio decoder with no side information from the encoder. The proposed post-filter is used to filter the frequency coefficients of the decoded transient frame and consists of a short-term post-filter and a spectral tilt compensation filter which are derived from linear predictive coding (LPC) predictor based on the discrete cosine transform (DCT) coefficients of the decoded transient frame. As a result, the post-filter in frequency domain shapes the temporal noise in time domain and controls the pre-echo noise effectively while enhancing the transient perception. Listening test results show that the preferring ratio of the post-processed transient signal is higher than that of the original decoded signal at a low bit rate of 32 kbit/s in G 719 and the post-processing module brings a complexity of 12.399 WMOPS to the decoder.
  • loading
  • [1]
    Int Telecomm Union. ITU-T G.719. Low-complexity full-band audio coding for high-quality conversational applications[S]. Geneva, Switzerland: Int Telecomm Union, 2008.
    [2]
    Moore, Brian C J. Characterization of simultaneous, forward and backward masking[J]. The Perception of Reproduced Sound, 1993, 12(3): 22-33.
    [3]
    Sugiyama A, Hazu F, Iwadare M, et al. Adaptive transform coding with an adaptive block size(ATC-ABS)[C]//Acoustics, Speech, and Signal Processing. Albuquerque: IEEE, 1990: 1093-1096.
    [4]
    Shlien S. The modulated lapped transform, its time varying forms, and its applications to audio coding standards[J]. Speech and Audio Processing, 1997, 5(4): 359-366.
    [5]
    Herre J, Johnston J D. Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS)[C]//Signal. Low Bit-Rate Audio Coding. Los Angeles,USA: AES, 1996: 9496-9499.
    [6]
    The International Organization for Standardization and the International Electro-Technical Commission. ISO/IEC 14496 (Part 3, Audio). Information Technology-coding of audiovisual objects[S]. USA: The International Organization for Standardization and the International Electro-Technical Commission, 1999.
    [7]
    Herre J. Perceptual noise shaping in the time domain via LPC prediction in the frequency domain: US Patient, 5781888. 1998-01-01.
    [8]
    Herre J. Temporal noise shaping, quantization and coding methods in perceptual audio coding: A tutorial introduction[C]//High-Quality Audio Coding. Florence, Italy: AES, 1999: 17-31.
    [9]
    Vafin R, Heusden R, Kleijn W B. Modifying transients for efficient coding of audio[C]//Acoustics, Speech, and Signal Processing. Salt Lake City, UT: ICASSP, 2001:3285-3288.
    [10]
    Ragot S, Kovesi B, Virette D etal. A 8-32.kbit/s scalable wideband speech and audio coding candidate for ITU-T G.729EV standardization[C]//Signal. Acoustics, Speech, and Signal Processing. Toulouse, France: ICASSP, 2006:1-4.
    [11]
    Zhang T, Wang W, He J. On the pre-echo control method in transient signal coding of AVS audio[C]//Audio, Language and Image Processing. Shanghai, China: ICALIP, 2008: 242-246.
    [12]
    Taddei H. Pre-echo reduction in the ITU-T G.729.1 Embedded Coder[C]//Multimodality Acquisition of Articulatory Data and Processing. Lausanne, Switzerland: EUSIPCO, 2008:529-532.
    [13]
    Chen J H, Gersho A. Adaptive postfiltering for quality enhancement of decoded speech[J]. Speech and Audio Processing, 1995, 3(1): 59-71.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (1032) PDF downloads(113) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map