Text Dependent Speaker Verification with Neural Network

N. K. Kaphungkui*, Aditya Bihar Kandali **
* Department of Electronics and Communication Engineering, Dibrugarh University, Assam.
** Department of Electrical Engineering, Jorhat Engineering College, Assam.
Periodicity:October - December'2019
DOI : https://doi.org/10.26634/jdp.7.4.17615

Abstract

Speaker recognition system automatically recognizes who the speaker is by using the speaker's speech features included in speech signal. After verifying the speaker claimed to be, it allows and enable access control of various voice services. The main applications of speaker recognition are in the field of forensic and providing additional security layer where security is the primary concern. The aim of this work is to verify a speaker with the approach of MFCC and Back Propagation Neural Network. Training function Lavenberg-Marquardt is used to train the network. Voice samples from a group of ten people uttering the same sentence five times repeatedly are collected to train the neural network. The testing of the network for verifying the speaker is done with new data set with the same utterance spoken once. A specific target or speaker ID is assigned to each speakers and verification is based on how close the network output is to the assigned code for each speaker. Verification method depends on the minimum positive error generated between the code and the actual network output. If the error is below the threshold value, the speaker claimed to be is accepted otherwise rejected. The tool for simulation is MATLAB.

Keywords

Speaker Verification, MFCC, Text Dependent, Threshold Error, BPNN, Training, Testing.

How to Cite this Article?

Kaphungkui, N. K., and Kandali, A. B. (2019). Text Dependent Speaker Verification with Neural Network. i-manager's Journal on Digital Signal Processing, 7(4), 1-8. https://doi.org/10.26634/jdp.7.4.17615

References

[1]. Achkar, R., El-Halabi, M., Bassil, E., Fakhro, R., & Khalil, M. (2016). Voice identity finder using the back propagation algorithm of an artificial neural network. Procedia Computer Science, 95, 245-252. https://doi.org/10.1016/j. procs.2016.09.322
[2]. Ahmad, K. S., Thosar, A. S., Nirmal, J. H., & Pande, V. S. (2015, January). A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network. In 2015, Eighth International Conference on Advances in Pattern Recognition (ICAPR) (pp. 1-6). IEEE. https://doi.org/10.1109/ICAPR.2015.705066 9
[3]. Ayshwarya, N., Logeshwari, G., & Mala, G. S. A. (2014). Feed forward back propagation neural network for speaker independent speech recognition. International Journal of Industrial Electronics and Electrical Engineering, 2(8), 36-39.
[4]. Chakraborty, K., Talele, A., & Upadhya, S. (2014). Voice recognition using MFCC algorithm. International Journal of Innovative Research in Advanced Engineering (IJIRAE), 1(10), 158-161.
[5]. Hossain, M., Rahman, M., Prodhan, U. K., & Khan, M. (2013). Implementation of back-propagation neural network for isolated Bangla speech recognition. International Journal of Information Sciences and Techniques (IJIST), 3(4), 1-9. https://doi.org/10.5121/ijist.20 13.3401
[6]. Joshi, S. C., & Cheeran, A. N. (2014). MATLAB based back-propagation neural network for automatic speech recognition. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 3(7), 10498-10504. https://doi.org/10.15662/ijareeie.2014. 0307016
[7]. Mansour, A. H., Zen, G., Salh, A., Hayder, H., & Alabdeen, Z. (2015). Voice recognition Using back propagation algorithm in neural networks. International Journal of Computer Trends and Technology (IJCTT), 23(3), 132-139.
[8]. Martinez, J., Perez, H., Escamilla, E., & Suzuki, M. M. (2012, February). Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) techniques. In CONIELECOMP2012, 22nd International Conference on Electrical Communications and Computers (pp. 248-251). IEEE. https://doi.org/ 10.1109/CONIELECOMP.2012.6189918
[9]. Remya, A. R. (2014). Efficient watermarking schemes for speaker verification guaranteeing non repudiation (Doctoral dissertation), Department of Computer Applications, Cochin University of Science and Technology, Kochi, Kerala, India. Retrieved from https://shodhganga.in flibnet.ac.in/bitstream/10603/48131/14/14_chapter%205. pdf
[10]. Sharma, V., & Bansal, P. K. (2013). A review on speaker recognition approaches and challenges. International Journal of Engineering Research and Technology (IJERT), 2(5), 1581-1588.
[11]. Wang, Y., & Lawlor, B. (2017, June). Speaker recognition based on MFCC and BP neural networks. In 2017, 28th Irish Signals and Systems Conference (ISSC) (pp. 1-4). IEEE. https://doi.org/10.1109/ISSC.2017.7983644
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.