Measurement of the effects of temporal clipping on speech quality

Thumbnail Image

Date

2006

Journal Title

Journal ISSN

Volume Title

Type

Article

Publisher

IEEE

Series Info

IEEE Transactions on Instrumentation and Measurement;Volume: 55 , Issue: 4 , Aug. 2006, Page(s): 1197 - 1203

Doi

Abstract

This paper investigates the effects of temporal clipping on perceived speech quality. Temporal clipping usually results from voice activity detection (VAD), or line echo canceller's nonlinear processor, and the clipped speech portions are replaced by comfort noise. A nonintrusive algorithm is proposed to predict speech quality based on the clipping statistics. Mean opinion score (MOS) is used as a metric for speech quality and is measured by perceptual evaluation of speech quality (PESQ). The impacts of speech frame size and noise spectrum on the algorithm are also investigated. The results show that the proposed algorithm can efficiently predict the speech quality. The correlation coefficient between the prediction and the measurement is about 0.975, and the root mean square error for the prediction is 0.20 MOS. The algorithm can be used as an integral part of a general speech quality assessment scheme in voice over Internet protocol (VoIP)

Description

MSA Google Scholar

Keywords

Internet telephony, Speech enhancement, Noise cancellation, Speech analysis, Speech codecs, Bandwidth, Echo cancellers, Nonlinear distortion, Distortion measurement, Testing

Citation

1. Pietro Paglierani, Dario Petri, "Uncertainty evaluation of speech quality measurement in VoIP systems", Advanced Methods for Uncertainty Estimation in Measurement 2007 IEEE International Workshop on, pp. 104-108, 2007. View Article Full Text: PDF (4178KB) Google Scholar 2. Lijing Ding, Ayman Radwan, Mohamed Samy El-Hennawey, Rafik A. Goubran, "Performance Study of Objective Voice Quality Measures in VoIP", Computers and Communications 2007. ISCC 2007. 12th IEEE Symposium on, pp. 197-202, 2007. View Article Full Text: PDF (4114KB) Google Scholar 3. Zhigang Sun, Ziwen Zhang, Dong Wang, Hua Zhang, "TBM: A High-Effective Monitoring Algorithm to the Quality of IPTV Transmission", Multimedia Information Networking and Security (MINES) 2010 International Conference on, pp. 5-9, 2010. View Article Full Text: PDF (613KB) Google Scholar 4. Samy El-Hennawey, "C23. Self-healing autonomic networking for voice quality in VoIP and wireless networks", Radio Science Conference (NRSC) 2015 32nd National, pp. 297-304, 2015. View Article Full Text: PDF (994KB) Google Scholar 5. Pietro Paglierani, Dario Petri, "Uncertainty Evaluation of Objective Speech Quality Measurement in VoIP Systems", Instrumentation and Measurement IEEE Transactions on, vol. 58, no. 1, pp. 46-51, 2009. View Article Full Text: PDF (176KB) Google Scholar 6. Kit Yan Chan, Siow Yong Low, Sven Nordholm, Ka Fai Cedric Yiu, "A Decision-Directed Adaptive Gain Equalizer for Assistive Hearing Instruments", Instrumentation and Measurement IEEE Transactions on, vol. 63, no. 8, pp. 1886-1895, 2014. View Article Full Text: PDF (1427KB) Google Scholar 7. Antonella Castellana, Alessio Carullo, Simone Corbellini, Arianna Astolfi, "Discriminating Pathological Voice From Healthy Voice Using Cepstral Peak Prominence Smoothed Distribution in Sustained Vowel", Instrumentation and Measurement IEEE Transactions on, vol. 67, no. 3, pp. 646-654, 2018. View Article Full Text: PDF (1729KB) Google Scholar 8. Hyewon Lee, Seongho Byeon, Byoungjin Kim, Kwang Bok Lee, Sunghyun Choi, "Enhancing Voice over WLAN via Rate Adaptation and Retry Scheduling", Mobile Computing IEEE Transactions on, vol. 13, no. 12, pp. 2791-2805, 2014. View Article Full Text: PDF (2242KB) Google Scholar 9. Juan Manuel Gorriz, Javier Ramirez, Elmar W. Lang, Carlos G. Puntonet, "Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection", Audio Speech and Language Processing IEEE Transactions on, vol. 16, no. 8, pp. 1565-1578, 2008. View Article Full Text: PDF (1287KB) Google Scholar Cited in Papers - Other Publishers (3) 1. Russell Ondusko, Matthew Marbach, Ravi P. Ramachandran, Linda M. Head, "Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion", Journal of Signal Processing Systems, 2016. CrossRef Google Scholar 2. Tiago H. Falk, Wai-Yip Chan, "Performance Study of Objective Speech Quality Measurement for Modern Wireless-VoIP Communications", EURASIP Journal on Audio, Speech, and Music Processing, vol. 2009, pp. 1, 2009. CrossRef Google Scholar 3. Guo Chen, Vijay Parsa, Speech, Audio, Image and Biomedical Signal Processing using Neural Networks, vol. 83, pp. 97, 2008. CrossRef