Article Contents

Article Navigation> JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY> 2012> 21(3): 370-375

WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.

Citation:

WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.

WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.

Citation:

WANG Jing, JI Xuan, HE Hai-long, KUANG Jing-ming. Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2012, 21(3): 370-375.

PDF( 217 KB)

Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate

School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China

Received Date:2011-06-21

Abstract

Abstract

Associated with ITU-T G 719, a post-processing method in frequency domain for enhancing the perceptual quality of the decoded transient audio is proposed only to the audio decoder with no side information from the encoder. The proposed post-filter is used to filter the frequency coefficients of the decoded transient frame and consists of a short-term post-filter and a spectral tilt compensation filter which are derived from linear predictive coding (LPC) predictor based on the discrete cosine transform (DCT) coefficients of the decoded transient frame. As a result, the post-filter in frequency domain shapes the temporal noise in time domain and controls the pre-echo noise effectively while enhancing the transient perception. Listening test results show that the preferring ratio of the post-processed transient signal is higher than that of the original decoded signal at a low bit rate of 32 kbit/s in G 719 and the post-processing module brings a complexity of 12.399 WMOPS to the decoder.
- transient audio,
- discrete cosine transform (DCT),
- post-filter in frequency domain,
- short-term post-filter,
- spectral tilter compensation filter

FullText(HTML)

References (13)

References

[1]	Int Telecomm Union. ITU-T G.719. Low-complexity full-band audio coding for high-quality conversational applications[S]. Geneva, Switzerland: Int Telecomm Union, 2008.
[2]	Moore, Brian C J. Characterization of simultaneous, forward and backward masking[J]. The Perception of Reproduced Sound, 1993, 12(3): 22-33.
[3]	Sugiyama A, Hazu F, Iwadare M, et al. Adaptive transform coding with an adaptive block size(ATC-ABS)[C]//Acoustics, Speech, and Signal Processing. Albuquerque: IEEE, 1990: 1093-1096.
[4]	Shlien S. The modulated lapped transform, its time varying forms, and its applications to audio coding standards[J]. Speech and Audio Processing, 1997, 5(4): 359-366.
[5]	Herre J, Johnston J D. Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS)[C]//Signal. Low Bit-Rate Audio Coding. Los Angeles,USA: AES, 1996: 9496-9499.
[6]	The International Organization for Standardization and the International Electro-Technical Commission. ISO/IEC 14496 (Part 3, Audio). Information Technology-coding of audiovisual objects[S]. USA: The International Organization for Standardization and the International Electro-Technical Commission, 1999.
[7]	Herre J. Perceptual noise shaping in the time domain via LPC prediction in the frequency domain: US Patient, 5781888. 1998-01-01.
[8]	Herre J. Temporal noise shaping, quantization and coding methods in perceptual audio coding: A tutorial introduction[C]//High-Quality Audio Coding. Florence, Italy: AES, 1999: 17-31.
[9]	Vafin R, Heusden R, Kleijn W B. Modifying transients for efficient coding of audio[C]//Acoustics, Speech, and Signal Processing. Salt Lake City, UT: ICASSP, 2001:3285-3288.
[10]	Ragot S, Kovesi B, Virette D etal. A 8-32.kbit/s scalable wideband speech and audio coding candidate for ITU-T G.729EV standardization[C]//Signal. Acoustics, Speech, and Signal Processing. Toulouse, France: ICASSP, 2006:1-4.
[11]	Zhang T, Wang W, He J. On the pre-echo control method in transient signal coding of AVS audio[C]//Audio, Language and Image Processing. Shanghai, China: ICALIP, 2008: 242-246.
[12]	Taddei H. Pre-echo reduction in the ITU-T G.729.1 Embedded Coder[C]//Multimodality Acquisition of Articulatory Data and Processing. Lausanne, Switzerland: EUSIPCO, 2008:529-532.
[13]	Chen J H, Gersho A. Adaptive postfiltering for quality enhancement of decoded speech[J]. Speech and Audio Processing, 1995, 3(1): 59-71.

Relative Articles

Supplements (0)

Cited By

Proportional views

Proportional views

通讯作者:陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article Metrics

Article views (1032) PDF downloads(113)

Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Decoder-based transient signal post-processing for ITU-T G 719 at low bit rate

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content