Multilingual Speech Processing through MFCCs feature extraction formultilingual speaker identification system

Vinay Kumar Jain*, Neeta Tripathi**
* Research Scholar, Chhattisgarh Swami Vivekanand Technical University, Bhilai, India.
** Principal, Shri Shankaracharya Institute of Technology and Management, Bhilai, India.
Periodicity:March - May'2016
DOI : https://doi.org/10.26634/jpr.3.1.8102

Abstract

The speaker identification systems work only in a single language environment using sufficient data. Many countries including India are multilingual and hence the effect of multiple languages on a speaker identification system needs to be investigated. Speaker identification system shows poor performance when training is done in one language and the testing in another language. This is a major problem in multilingual speaker identification system. The main objective of this research work is to observe the impact of the languages on multilingual speaker identification system and identifying the variation of MFCC feature vector values in multilingual environments, which will help to design multilingual speaker identification system. The present paper explores the experimental result carried out on collected database of multilingual speakers of three Indian languages. The speech database consists of speech data recorded from 100 speakers including male and female. The Mel Frequency Cepstral Coefficients (MFCC) as a front end feature vectors are extracted from the speech signals. The minimum, maximum and mean values of the feature vectors have been calculated for the analysis. It is observed that Rajasthani language has the larger values as compared to Hindi language and Marathi Language in minimum values of the feature vectors, where as Marathi Language has the larger values as compared to Hindi language and Rajasthani language in maximum values of feature vectors. The impact of the languages on multilingual speaker identification system has been evaluated.

Keywords

MFCC, Delta Coefficients, Multi-language, Impact of Language

How to Cite this Article?

Jain, V. K., and Tripathi, N. (2016). Multilingual Speech Processing through MFCCs feature extraction for multilingual speaker identification system. i-manager’s Journal on Pattern Recognition, 3(1), 1-6. https://doi.org/10.26634/jpr.3.1.8102

References

[1]. S. Sarkar, et al. (2013). “Multilingual speaker recognition on Indian languages”. 2013 Annual IEEE India Conference (Indicon), Mumbai. pp.1-5.
[2]. Nagaraja B.G. and H.S. Jayanna, (2013). “Combination of Features for Multilingual Speaker Identification with the Constraint of Limited Data”. International Journal of Computer Applications, (0975 - 8887), Vol.70, No.6, pp.1-6.
[3]. S. Agrawal, et al. (2010). “Prosodic feature based text dependent speaker recognition using Machine learning althorithms”. International Journal of Engineering Science and Technology, Vol.2, No.10, pp.5150-5157.
[4]. W. Bharti, et al. (2011). “Marathi Isolated Word Recognition System using MFCC and DTW Features”. ACEEE Int. J. on Information Technology, Vol.1, No.1, pp.21-24.
[5]. U. Bhattacharjee. and K. Sarmah, (2012). “A multilingual speech database for speaker recognition”. IEEE International Conference on Signal Processing, Computing and Control (ISPCC), Waknaghat Solan, pp.1- 5.
[6]. U. Bhattacharjee, and K. Sarmah, (2012). “Development of a Speech Corpus for Speaker Verification Research in Multilingual Environment”. International Journal of Soft Computing and Engineering (IJSCE). Vol.2, No.6, pp.443-446.
[7]. U. Bhattacharjee, and K. Sarmah. (2012). “GMM-UBM Based Speaker Verification in Multilingual Environments”. (IJCSI) International Journal of Computer Science, Vol.9, No.6, pp.373-380.
[8]. P. Kumar and S.L. Lahudkar, (2015). “Automatic Speaker Recognition using LPCC and MFCC ”. International Journal on Recent and Innovation Trends in Computing and Communication, Vol.3, No.4, ISSN: 2321-8169, pp.2106- 2109.
[9]. R Ranjan, et al,. (2010). “Text-Dependent Multilingual Speaker Identification for Indian Languages Using Artificial rd Neural Network”. 3 International Conference on Emerging Trends in Engineering and Technology, Goa, India, pp.632-635.
[10]. G. Kaur and H. Kaur, (2013). “Multi Lingual Speaker Identification on Foreign Languages Using Artificial Neural Network with Clustering”. International Journal of Advanced Research in Computer Science and Software Engineering, Vol.3, No.5, ISSN: 2277 128X, pp.14-20.
[11]. M. Ferras, et al,. (2010). “Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition”. IEEE Transaction on Audio, Speech, and Language Processing, Vol.18, No.6, pp.366-1378.
[12]. H.A. Patil, et al. (2006). “Design of Cross-lingual and Multilingual Corpora for Speaker Recognition Research and Evaluation in Indian Languages”. International Symposium on Chinese Spoken Languages Processing (ISCSLP 2006), Kent Ridge, Singapore.
[13]. V.K. Jain and N. Tripathi, (2016). “Multilingual Speaker Identification using analysis of Pitch and Formant frequencies”. Published in IJRITCC Journal, Vol.4, No.2, ISSN: 2321-8169, pp.296-298.
[14]. K.K. Sahu and V. Jain, (2014). “A Novel Language Identification System For Identifying Hindi, Chhattisgarhi and English Spoken Language”. International Journal of Engineering Research & Technology (IJERT), ISSN: 2278- 0181, Vol.3, No.12, pp.728-731.
[15]. N. Tripathi, et al. (2006). “Correlation Between Eyebrows Movement and Speech Acoustic Parameters”. PCEA-IFToMM International Conference on Recent Trends in Automation and its Adaptation to Industries, Nagpur.
[16]. N. Tripathi, et al. (2008). “A Close Correlation between Eyebrows Movement and Acoustic Parameter”. Journal of Acoustics Society of India, (ISSN No.0973- 3302), Vol.35, No.4, pp.158-162.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.