Tamil Optical Character Recognition System: A Survey and Comparative Study

R. Jagadeesh Kannan*, R. Prabhakar**
*Department of Computer Science & Engineering, RMK Engineering College, Kavaraipettai, Chennai, India.
**Department of Computer Science & Engineering, Coimbatore Institute of Technology, Coimbatore, India.
Periodicity:October - December'2009
DOI : https://doi.org/10.26634/jse.4.2.1070

Abstract

In the field of pattern recognition, Optical Character Recognition (OCR) has been a cutting edge research area for the last few decades. And for quite some time now, the recognition of Indian language characters has been a subject of attention. A number of approaches have been proposed by researchers for recognizing printed, handwritten and cursive Tamil scripts both off-line and on-line. This article presents a survey of the researches available for optical character recognition of Tamil characters, an Indian language, along with a comparative study of our approaches against the most significant approaches from the literature. In addition, a concise description about the OCR system and the Tamil Script is provided. The aim of this article is to assist the budding researchers in the field of Tamil Optical Character Recognition in understanding the available methods and to aid their research further.

Keywords

Optical Character Recognition (OCR), Printed Text, Handwriting Recognition, Cursive Text, Off-line Character Recognition, On-line Character Recognition, Tamil Script.

How to Cite this Article?

R. Jagadeesh Kannan, R. Prabhakar (2009). Tamil Optical Character Recognition System: A Survey and Comparative Study, i-manager’s Journal on Software Engineering, 4(2),33-46. https://doi.org/10.26634/jse.4.2.1070

References

[1]. B. B. Chaudhuri and U. Pal, (1997), “A complete printed Bangla OCR system”, Pattern Recognition, Vol. 31, No. 5, pp. 531-549.
[2]. Bharath A, Sriganesh Madhvanath, (2007), “Hidden Markov Models for Online Handwritten Tamil Word Recognition”, Proceedings of the Ninth International Conference on Document Analysis and Recognition, Vol. 01, pp. 506-510.
[3]. C. S. Sundaresan and S. S. Keerthi, (1999), “A Study of Representations for Pen based Handwriting Recognition of Tamil Characters”, Proceedings of the Fifth International Conference on Document Analysis and Recognition, ICDAR apos; pp. 422 - 425, 20-22 September.
[4]. C-H. Chang, (1996), “Simulated annealing clustering of Chinese words for contextual text recognition”, Pattern Recognition Letters, Vol. 17, No. 1, pp. 57-66.
[5]. Chinnuswamy, P., and S.G. Krishnamoorthy, (1980),“Recognition of Hand printed Tamil Characters”, Pattern Recognition, Vol. 12, pp. 141-152
[6]. D. De Coste and B. Schölkopf, (2002), “Training invariant support vector machines”, Machine Learning, Vol. 46, No. 1/3, pp. 161.
[7]. D. Deng, K. P. Chan, and Y. Yu, (1994), “Handwritten Chinese character recognition using spatial Gabor filters and self-organizing feature maps”, Proc. IEEE Inter. Confer. On Image Processing, Austin TX, Vol. 3, pp. 940- 944, June.
[8]. F. Samaria, F. Fallside, (1993), “Face Identification and Feature Extraction Using Hidden Markov Models”, in G. Vernazza, A.N. Venetsanopoulos, C. Braccini (editors): Image Processing: Theory and Applications, Elsevier Science publishers B.V., Pp.292-302.
[9]. G. Nagy, (1992), “On the Frontiers of OCR”, Proceedings of the IEEE, Vol. 40, No. 8, pp. 1093-1100, July.
[10]. G. Siromoney, R. Chandrasekaran and M. Chandrasekaran, (1978), “Computer Recognition of Printed Tamil Character”, Pattern Recognition, Vol. 10, pp. 243-247.
[11]. Govindan, V.K. and A.P. Shivaprasad, (1990), “Character Recognition-A Review”, Pattern Recognition, Vol. 23, No. 7, pp. 671-683.
[12]. H. Aparna, V. Subramanian, Kasirajan, V. Prakash, V. Chakravarthy, and S. Madhvanath, (2004), “Online Handwriting Recognition for Tamil”, Proceedings of the 9th International Workshop on Frontiers in Handwriting Recognition, pp. 438- 443.
[13]. H. Bunke, M. Roth, and E. G. Schukat-Talamazzini, (1995), “Offline Cursive Handwriting Recognition using Hidden Markov Models”, Pattern Recognition, Vol. 28, No. 9, pp. 1399-1413.
[14]. H. Yamada, K. Yamamoto, and T. Saito, (1990), “A non-linear normalization method for hand printed Kanji character recognitionline density equalization”, Pattern Recognition, Vol. 23, No. 9, pp. 1023-1029.
[15]. Hewavitharana, S, and H.C. Fernando, (2002), “A Two-Stage Classification Approach to Tamil Handwriting Recognition”, Tamil Internet 2002, California, USA, pp. 118-124.
[16]. Hu, M. K. Brown and W. Turin, (1996), “HMM based on-line handwriting recognition”, IEEE Trans. on Pattern Anal. Mach. Intell., Vol. 18, No. 10, pp. 1039-1045, October.
[17]. J. Cai and Z-Q. Liu, (1999), “Integration of structural and statistical information for unconstrained handwritten numeral recognition,” IEEE Trans. on Pattern Anal. Mach. Intell., Vol. 21, No. 3, pp. 263-270, March.
[18]. K. K. Biswas and S. Chatterjee, (1995), “Feature based Recognition of Hindi Characters” Indian Conference on Pattern Recognition, Image Processing and Computer Vision (ICPIC), December, pp 182-187.
[19]. K. Khatatneh, (2006), "Probabilistic Artificial Neural Network for Recognizing the Arabic. Hand Written Characters", Journal of Computer Science, Vol. 3, No. 12, pp. 881-886.
[20]. K.H. Aparna, Sumanth Jaganathan, P. Krishnan, V.S. Chakravarthy, (2005), "Document Image Analysis: with specific Application to Tamil Newsprint", In Proceedings of Neural Information Processing Systems (NIPS 2005), Whistler, Vancouver, Canada, 9 - 10 December.
[21]. Mantas, J., (1986), “An overview of character recognition methodologies”, Pattern recognition, Vol. 19, No. 6, pp. 425-430.
[22]. N. Cristianini and J. Shawe-Taylor, (2000), “Support Vector Machines”, Cambridge University Press.
[23]. N. Joshi, G. Sita, A. G. Ramakrishnan, and S. Madhvanath, (2004), “Comparison of Elastic Matching Algorithms for Online Tamil Handwritten Character Recognition”, Proceedings of the 9th International Workshop on Frontiers in Handwriting Recognition, pp. 444 - 449, 26-29 October.
[24]. N. Joshi, G. Sita, A. G. Ramakrishnan, and S. Madhvanath, (2004), “Tamil Handwriting Recognition Using Subspace and DTW Based Classifiers”, Lecture Notes in Computer Science, Springer Berlin / Heidelberg, Vol. 3316, pp. 806-813,.
[25]. Pal, U., and B.B. Chaudhuri, (2004), “Indian Script Character Recognition: a Survey”,Pattern Recognition, Vol. 37, pp. 1887-1899.
[26]. R. Bajaj and S. Chaudhary, (1995), “Devanagari Numeral Recognition using Multiple Neural Classifiers”, Indian Conference on Pattern Recognition, Image Processing and Computer Vision (ICPIC), December.
[27]. R. Jagadeesh kannan and R. Prabhakar, (2008), "Off-Line Cursive Handwritten Tamil Character Recognition", WSEAS Transactions on Signal Processing, Vol. 4, No. 6, pp. 351-360, June.
[28]. R. Jagadeesh kannan and R. Prabhakar, (2008), “An Improved Handwritten Tamil Character Recognition System using Octal Graph", Journal of Computer Science, Vol. 4, No. 7, pp. 509-516.
[29]. R. Jagadeesh Kannan, R. Prabhakar, (2008), "Accuracy Augmentation of Tamil OCR Using Algorithm Fusion", IJCSNS International Journal of Computer Science and Network Security, Vol.8 No.5, May.
[30]. R. M. Bozinovic and S. N. Srihari, (1989), “Off-line cursive script word recognition”, IEEE Transactions on Pattern Anal. Mach. Intell., Vol. 11, No. 1, pp. 68-83, January.
[31]. R. Plamondon and S. N. Srihari, (2000), “On-line and off-line handwritten recognition: a comprehensive survey”, IEEE Transactions on PAMI, Vol. 22, No.1, pp. 63- 84.
[32]. R. Plamondon, D. Lopresti, L.R.B. Shoemaker and R. Srihari, (1999), “On-line Handwriting Recognition,” Encyclopedia of Electrical and Electronics Eng., J.G. Webster, ed., New York: Wiley, Vol. 15, pp.123-146,.
[33]. S. Hewavitharana, H. C. Fernando and N.D. Kodikara, (2002), "Off-line Sinhala Handwriting Recognition using Hidden Markov Models", Proc. of Indian Conference on Computer Vision, Graphics & Image Processing (ICVGIP).
[34]. S. Palit and B. Chaudhary, (1995), “A Feature based Scheme for Machine Recognition of Printed Devanagari Script” Indian Conference on Pattern Recognition, Image Processing and Computer Vision, (ICPIC), December.
[35]. Sandor Szedmak, John Shawe-Taylor, (2005), “Learning Hierarchies at Two-class Complexity”, Kernel Methods and Structured domains, NIPS.
[36]. Seethalakshmi R., Sreeranjani T.R., Balachandar T., Abnikant Singh, Markandey Singh, Ritwaj Ratan, Sarvesh Kumar, (2005), "Optical Character Recognition for printed Tamil text using Unicode", Journal of Zhejiang University Science, Vol. 6A, No. 11.
[37]. Shanthi and K. Duraiswamy, (2007), “Performance Comparison of Different Image Sizes for Recognizing Unconstrained Handwritten Tamil Characters using SVM”, Journal of Computer Science, Vol. 3, No. 9, pp. 760-764.
[38]. Shivsubramani K, Loganathan R, Srinivasan CJ, Ajay V, Soman KP, (2007), “Multiclass Hierarchical SVM for Recognition of Printed Tamil Characters”, Proceedings of IJCAI-2007 Workshop on Analytics for Noisy Unstructured th Text Data, Hyderabad, India, 8 January, pp. 93,
[39]. Suresh et al., (1999), “Recognition of Hand printed Tamil Characters Using Classification Approach”, ICAPRDT' 99, pp. 63-84.
[40]. Sutha, J., Ramaraj. N., (2007), “Neural Network Based Offline Tamil Handwritten Character Recognition System”, Proceedings of the International Conference on computational Intelligence and Multimedia Applications (ICCIMA 2007), Vol. 02, pp. 446-450.
[41]. S-W. Lee, (1996), “Off-line recognition of totally unconstrained handwritten numerals using multiplayer cluster neural network”, IEEE Trans. on Pattern Anal. Mach. Intell., June, Vol. 18, No. 6, pp. 648-652.
[42]. V. Bansal and R.M.K. Sinha, (1999), “On how to describe shapes of Devanagari characters and use them th for recognition”, Proc. 5 Int. Conf. Document Analysis and Recognition, Bangalore, India, September, pp. 410- 413.
[43]. V. Deepu and S. Madhvanath, (2004), “Principal Component Analysis for Online Handwritten Character th Recognition”, Proceedings of the 17 International Conference on Pattern Recognition, Vol. 2, pp. 327-330.
[44]. Y. He, A. Kundu, (1991), “2-D shape classification Using Hidden Markov Model”, IEEE Trans. On PAMI, Vol.13, pp.1172-1184.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.