An Approach for Speech Recognition Technique

M. Suleman Basha*, Suantak Kamsonlian**, K. Rajendra Prasad***, B. Rama Subbaiah****
*-** Assistant Professor, Rajeev Gandhi Memorial College of Engineering and Technology, Nandyal, Andhra Pradesh, India.
*** Associate Professor, Rajeev Gandhi Memorial College of Engineering and Technology, Nandyal, Andhra Pradesh, India.
Periodicity:June - August'2014
DOI : https://doi.org/10.26634/jpr.1.2.2923

Abstract

The design of Speech Recognition system has careful attention in the following issues: classification of various types of speech classes, speech representation, and feature extraction techniques. The purpose of this paper is to summarize and compare some of the well known methods used in different stages of Speech Recognition system and identify research topic and applications which are at the front position of this stimulating and challenging field. Speech is the most famous and primary mode of communication among human beings. The communication among human and computer is called as human computer interface. Speech has been the important mode of interaction with computer . This paper gives an overview of major technological viewpoint and approval of the primary progress of speech recognition, and also gives an overview of the techniques developed in each stage of speech recognition.

Keywords

Speech Recognition, Recognition Techniques, Feature Extraction, Communication.

How to Cite this Article?

Basha, M. S., Subbaiah, B. R., and Prasad, K. R. (2014). An Approach for Speech Recognition Technique. i-manager’s Journal on Pattern Recognition, 1(2), 7-15. https://doi.org/10.26634/jpr.1.2.2923

References

[1]. Suleman Basha, B. Rama Subbaiah, (2014). “Review of Automatic speech recognition in Signal Processing”. International Journal of Engineering Research, Vol.3, No.2, pp.112-115.
[2]. R. Klevansand R. Rodman, (1997). “ Voice Recognition”, Artech House, Boston, London.
[3]. Samudravijaya K. “Speech and Speaker recognition tutorial”. TIFR, Mumbai.
[4]. K.H.Davis, R.Biddulph, and S.Balashek, (1952). “Automatic Recognition of spoken Digits”. The Journal of the Acoustical Society of America, Vol.24, No.6, pp.637- 642.
[5]. D.B.Fry, (1959). “Theoritical Aspects of Mechanical speech Recognition“, and P. Denes, “The design and Operation of the Mechanical Speech Recognizer” at Universtiy College London, J.British Inst. Radio Engr., Vol.19:4, pp.211-299.
[6]. J.W.Forgie and C.D.Forgie, (1959). “Results obtained from a vowel recognition computer program” J.A.S.A., Vo.31(11), pp.1480-1489.
[7]. Asghar .Taheri ,Mohammad Reza Trihi et.al, (2005). “Fuzzy Hidden Markov Models for speech recognition on based FEM Algorithm”. Transaction on engineering Computing and Technology, Vol.4.
[8]. K. Rajendra Prasad, and Dr. P. Govinda Rajulu, (2010). “A Survey on Clustering Technique for large databases using Efficient Graph Structures”. The Journal International Journal of Engineering Science and Technology(IJEST), Vol.2, No.7.
[9]. Dr. B. Eswara Reddy, and K. Rajendra Prasad, (2012). “Reducing runtime values in minimum spanning tree based clustering by visual access tendency”. Int. Journal of Data Mining & Knowledge Management Process (IJDKP), Vol.2, No.3.
[10]. K. Rajendra Prasad, and Dr. B. Eswara Reddy, (2013). “Assessment of clustering tendency through progressive random sampling and graph-based clustering results”, 3rd IEEE Int. Advanced Computing Conference, Ghaziabad, India.
[11]. M Senoussaoui, P Kenny, T Stafylakis, P Dumouchel, (2013). “A Study Of The Cosine Distance-Based Mean Shift For Telephone Speech Diarization”, IEEE/ACM Transactions On Audio, Speech, And Language Processing, pp. 99.
[12]. Martin Zelenák et. al, (2012). “Simultaneous Speech Detection With Spatial Features For Speaker Diarization”. IEEE Transactions on Audio, Speech, and Language Processing, Vol.20, No.2, pp.436-446.
[13]. Bowen Zhou et. al, (2005). “Efficient Audio Stream Segmentation Via The Combined T2 Statistic and Bayesian Information Criterion”. IEEE Transactions on Speech and Audio Processing, Vol.13, No.4, pp.467-474.
[14]. Sujeeth Bharadwaj et. al, (2013). “Sparse Hidden Markov Models For Purer Clusters”. IEEE Conf. 2013, pp.3098-3102
[15]. Yaodong Zhang and James R. Glass, (2009). “Unsupervised Spoken Keyword Spotting Via Segmental DTW On Gaussian Posteriorgrams”. IEEE Workshop on Automatic Speech Recognition & Understanding, pp.398-403.
[16]. Snani Cherifa et. al, (2013). “Performance Study Of Vector Quantization Methods (K-Means, GMM) For Arabic Isolated Word Recognition System Based On DHMM”. World Congress on Computer and Information Technology (WCCIT), pp.1-4.
[17]. Mahnoosh Mehrabani, John H.L. Hansen et. al. (2013). “Singing Speaker Clustering Based On Subspace Learning In The GMM Mean Supervector Space”. Journal Of Speech Communication, Elsevier, pp.653-666
[18]. Douglas A. Reynolds et. al, (1995). “Robust Text- Independent Speaker Identification Using Gaussian Mixture Models”. IEEE Trans. on Speech And Audio Processing, Vol.3, No.1, pp.72-83.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 35 35 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.