A SILENCE REMOVAL AND ENDPOINT DETECTION APPROACH FOR SPEECH PROCESSING

Muhammad Asadullah, Shibli Nisar

Abstract


This paper gives a brief overview of silence removal and voice activity detection and proposes a new method for silence removal. The objective of the proposed method is to delete the silence and unvoiced segments from the speech signal, which helps to increase the performance and accuracy of the system. Endpoint detection is used to remove the DC offset value from the signal after the silence removal process. Silence removal and endpoint detection are a main part of many applications, such as speaker and speech recognition. The proposed method uses the Root Mean Square (RMS) to delete the unvoiced segments from the speech signal. It showed better results for silence removal and endpoint detection than existing methods. The performance was evaluated using MATLAB, and an accuracy of 97.2% was achieved.
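
The abstract does not give the algorithm's details, so the following is only a minimal sketch of what RMS-based silence removal followed by DC offset removal could look like. It is not the authors' implementation. The function names, the frame length of 160 samples, and the threshold of 10% of the loudest frame's RMS are all assumptions chosen for illustration, and Python is used here in place of MATLAB.

```python
import math


def frame_rms(frame):
    """Root mean square of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def remove_silence(signal, frame_len=160, threshold_ratio=0.1):
    """Drop low-energy frames, then remove the DC offset.

    frame_len and threshold_ratio are illustrative assumptions,
    not values taken from the paper.
    """
    if not signal:
        return []

    # Split the signal into consecutive, non-overlapping frames.
    frames = [signal[i:i + frame_len] for i in range(0, len(signal), frame_len)]
    rms_values = [frame_rms(f) for f in frames]

    # Assumed rule: keep frames whose RMS exceeds a fraction of the peak RMS.
    threshold = threshold_ratio * max(rms_values)
    kept = [x for f, r in zip(frames, rms_values) if r > threshold for x in f]
    if not kept:
        return []

    # Remove the DC offset after silence removal by subtracting the mean.
    mean = sum(kept) / len(kept)
    return [x - mean for x in kept]


# Usage: 320 silent samples, 320 samples of a 200 Hz tone at 8 kHz, 320 silent.
tone = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(320)]
signal = [0.0] * 320 + tone + [0.0] * 320
voiced = remove_silence(signal)
```

A relative threshold is used so that the sketch does not depend on the recording level. A fixed threshold or an adaptive noise estimate would be equally plausible choices.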



