Frequency Based Filtering for Voice Activity Detection

V.Adlin Vini*
Post Graduate, Applied Electronics, C.S.I Institute of Technology, Thovalai, Tamil Nadu, India.
Periodicity:October - December'2017
DOI : https://doi.org/10.26634/jdp.5.4.14561

Abstract

Signal Processing is used to bring out the speech in a degraded signal. Amplitude of the signal is obtained by using the SFF (Single Frequency Filtering). Spectral and Temporal resolutions are compared by using three different methods, which are discussed in this paper. Voice Activity Detection is the process in which any noise or disturbance that are made to the speech signal is detected. In this paper, the author has proposed Voice Activity Detection system with the help of Frequency based filtering method. The experimental results show that it gives better results compared to the existing systems.

Keywords

Voice Activity Detection, Speech Coding, Recognition.

How to Cite this Article?

Vini, A, V. (2017). Frequency Based Filtering for Voice Activity Detection. i-manager's Journal on Digital Signal Processing, 5(4), 10-19. https://doi.org/10.26634/jdp.5.4.14561

References

[1]. Adiga, M. T., & Bhandarkar, R. (2016, October). Improving single frequency filtering based Voice Activity Detection (VAD) using spectral subtraction based noise cancellation. In Signal Processing, Communication, Power and Embedded System (SCOPES), 2016 International Conference on (pp. 18-23). IEEE.
[2]. Aneeja, G., & Yegnanarayana, B. (2015). Single Frequency Filtering approach for discriminating speech and nonspeech. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 23(4), 705-717.
[3]. Borin, R. G., & Silva, M. T. (2017, August). Voice Activity Detection using discriminative restricted Boltzmann machines. In Signal Processing Conference (EUSIPCO), 2017 25 European (pp. 523-527). IEEE.
[4]. Dhananjaya, N., & Yegnanarayana, B. (2010). Voiced/ nonvoiced detection based on robustness of voiced epochs. IEEE Signal Processing Letters, 17(3), 273-276.
[5]. Garg, V. (2015). Speech Signal Processing. Journal of VLSI Design and Signal Processing, 1(2), 1-5.
[6]. Haigh, J. A., & Mason, J. S. (1993, October). Robust voice activity detection using cepstral features. In TENCON'93. Proceedings. Computer, Communication, Control and Power Engineering. 1993 IEEE Region 10 Conference on (Vol. 3, pp. 321-324). IEEE.
[7]. Hung, C. Y., Yeh, T. L., & Tu, Y. C. (2015). U.S. Patent No. 8,972,252. Washington, DC: U.S. Patent and Trademark Office.
[8]. Kinnunen, T., Chernenko, E., Tuononen, M., Fränti, P., & Li, H. (2007, October). Voice Activity Detection using MFCC features and Support Vector Machine. In Int. Conf. on Speech and Computer (SPECOM07), Moscow, Russia (Vol. 2, pp. 556-561).
[9]. Mak, M. W., & Yu, H. B. (2014). A study of Voice Activity Detection techniques for NIST speaker recognition evaluations. Computer Speech & Language, 28(1), 295- 313.
[10]. McCowan, I., Dean, D., McLaren, M., Vogt, R., & Sridharan, S. (2011). The delta-phase spectrum with application to voice activity detection and speaker recognition. IEEE Transactions on Audio, Speech, and Language Processing, 19(7), 2026-2038.
[11]. Ong, W. Q., Tan, A. W. C., Vengadasalam, V. V., Tan, C. H., & Ooi, T. H. (2017). Real-time Robust Voice Activity Detection using the Upper Envelope Weighted Entropy Measure and the Dual-Rate Adaptive Nonlinear Filter. Entropy, 19(11), 487.
[12]. Pham, T. V., Stark, M., & Rank, E. (2010, October). Performance analysis of wavelet subband based voice activity detection in cocktail party environment. In Advanced Technologies for Communications (ATC), 2010
International Conference on (pp. 85-88). IEEE.
[13]. Qiu, L., Yang, H., & Koh, S. N. (1994, August). A fundamental frequency detector of speech signals based on short time Fourier transform. In TENCON'94. IEEE Region 10's Ninth Annual International Conference. Theme: Frontiers of Computer Technology. Proceedings of 1994 (pp. 526-530). IEEE.
[14]. Sehgal, A., & Kehtarnavaz, N. (2018). A Convolutional Neural Network Smartphone App for Real-Time Voice Activity Detection. IEEE Access, 6, 9017-9026.
[15]. Sharmida, A. H., & Athira, A. P. (2016). A robust observation model for automatic speech recognition with Adaptive Thresholding. International Research Journal of Engineering and Technology (IRJET), 3(8), 644-649.
[16]. Sohn, J., & Sung, W. (1998, May). A Voice Activity Detector employing soft decision based noise spectrum adaptation. In Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on (Vol. 1, pp. 365-368). IEEE.
[17]. Zhang, X. L., & Wang, D. (2016). Boosting contextual information for deep neural network based Voice Activity Detection. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 24(2), 252-264.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.