Voice Activity Detection Techniques - A Review

V.Adlin Vini*
Department of Applied Electronics, C.S.I Institute of Technology, Thovalai, Tamil Nadu, India.
Periodicity:July - December'2021
DOI : https://doi.org/10.26634/jdp.9.2.14396

Abstract

Voice Activity Detection (VAD), also known as speech activity detection or speech detection, is a technique used in speech processing in which the presence or absence of human speech is detected. The main uses of VAD are in speech coding and speech recognition. It can facilitate speech processing, and can also be used to deactivate some processes during a non-speech section of an audio session: it can avoid unnecessary coding/transmission of silence packets in Voice over Internet Protocol applications, saving on computation and network bandwidth. VAD is an important enabling technology for a variety of speech-based applications. Therefore, various VAD algorithms have been developed that provide varying features and compromises between latency, sensitivity, accuracy, and computational cost. Some VAD algorithms also provide further analysis, for example, whether the speech is voiced, unvoiced, or sustained. Voice activity detection is usually language-independent. This paper discusses various voice activity detection techniques, their results, and their projection towards how they are going through VAD.

Keywords

Speech, VAD, Voice.

How to Cite this Article?

Vini, V. A. (2021). Voice Activity Detection Techniques - A Review. i-manager's Journal on Digital Signal Processing, 9(2), 27-33. https://doi.org/10.26634/jdp.9.2.14396

References

[3]. Flood, J. E. (2011). Telecommunication Switching, Traffic and Networks. Pearson Education (pp. 1-336).
[6]. Krishnan, P. S. H., Padmanabhan, R., & Murthy, H. A. (2007). Voice activity detection using group delay processing on buffered short-term energy, Proceedings 13th National Conference on Communication, (pp.1-7).
[10]. Tsiartas, A., Chaspari, T., Katsamanis, N., Ghosh, P. K., Li, M., Van Segbroeck, M., ... & Narayanan, S. S. (2013, August). Multi-band long-term signal variability features for robust voice activity detection. In Interspeech (pp. 718-722).
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.