Two-Pass Segmentation for Marathi Character Extraction

Madhav A. Kankhar*, C. Namrata Mahender **
*-** Department of Computer Science & Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, Maharashtra, India.
Periodicity:July - December'2020
DOI : https://doi.org/10.26634/jpr.7.2.18148

Abstract

Handwriting recognition is very complex but a much needed system for various purposes. Work on English language can be seen in full swing as compared to Indian languages. Free style handwriting has various challenges due to the individual variation in writing style, along with half characters, joint characters, overlapped characters, etc. are major when considering written in Devanagari script. Computational studies of Indian languages written using Devanagari script, including Marathi, are in its primary stage. The present work focuses on proper character segmentation in Marathi words for better post-processing and recognition rate as segmentation is an important stage that affects post-processing performance. Data is collected from native speakers of the Marathi language, with 3000 samples handwritten words in Marathi language for numbers from one to ten, and this paper discusses the proposed two-pass segmentation approach to extract written Marathi script.

Keywords

Online Recognition, Offline Recognition, Segmentation, Handwriting Recognition, Preprocessing.

How to Cite this Article?

Kankhar, M. A., and Mahender, C. N. (2020). Two-Pass Segmentation for Marathi Character Extraction. i-manager's Journal on Pattern Recognition, 7(2), 1-6. https://doi.org/10.26634/jpr.7.2.18148

References

[1]. Bansal, V., & Sinha, R. M. K. (2002). Segmentation of touching and fused Devanagari characters. Pattern Recognition, 35(4), 875-893. https://doi.org/10.1016/S003 1-3203(01)00081-4
[2]. Casey, R. G., & Lecolinet, E. (1996). A survey of methods and strategies in character segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(7), 690-706. https://doi.org/10.1109/34.506792
[3]. Crane, R. (1996). Simplified approach to image processing: Classical and modern techniques in C. New Jersey: Prentice Hall PTR.
[4]. Dunn, C. E., & Wang, P. S. P. (1992, January). Character segmentation techniques for handwritten text-a survey. In 11th IAPR International Conference on Pattern Recognition, Conference B: Pattern Recognition Methodology and Systems (Vol. 1, pp. 577-578). IEEE Computer Society. https://doi.ieeecomputersociety.org/10.1109/ICPR.1992. 201844
[5]. Golait, S. S., Malik, L. G., & Thomas, A. (2017a). Feature extraction for Marathi compound character using edge map. International Journal of Scientific & Engineering Research, 8(2), 1095-1101.
[6]. Golait, S. S., Malik, L. G., & Thomas, A. (2017b). Handwritten Marathi compound character segmentation with morphological operation. International Journal of Innovations in Engineering and Science, 2(12), 8-12.
[7]. Gonzales, R. S. & Wintz, P. (1987). Digital Image Processing. Boston: Addison-wesley.
[8]. Kamble, P. M., & Hegadi, R. S. (2015). Handwritten Marathi character recognition using R-HOG feature. Procedia Computer Science, 45, 266-274. https://doi.org/ 10.1016/j.procs.2015.03.137
[9]. Kaur, A., Baghla, S., & Kumar, S. (2015). Study of various character segmentation techniques for handwritten offline cursive words: A review. International Journal of Advances in Science Engineering and Technology, 3(3), 154-158.
[10]. Khobragade, R. N., Koli, N. A., & Lanjewar, V. T. (2020). Challenges in recognition of online and off-line compound handwritten characters: A review. Smart Trends in Computing and Communications, 375-383. https://doi. org/10.1007/978-981-15-0077-0_38
[11]. Kim, G., & Govindaraju, V. (1997). A lexicon driven approach to handwritten word recognition for real-time applications. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(4), 366-379. https://doi.org/10.1109/34.588017
[12]. Lam, L., Lee, S. W., & Suen, C. Y. (1992). Thinning methodologies - A comprehensive survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(9), 869- 885. https://doi.org/10.1109/34.161346
[13]. Lee, J. S. (1983). Digital image smoothing and the sigma filter. Computer Vision, Graphics, and Image Processing, 24(2), 255-269. https://doi.org/10.1016/0734- 189X(83)90047-6
[14]. Limkar, S., Mudaliar, G., Kulkarni, S., Rathod, N., Gadakh, T., & Shah, S. (2020). Optical character recognition for Marathi language using deep convolutional neural network. Journal of University of Shanghai for Science and Technology, 22(12), 1201-1213.
[15]. Lu, Y. (1995). Machine printed character segmentation- An overview. Pattern Recognition, 28(1), 67-80. https://doi.org/10.1016/0031-3203(94)00068-W
[16]. Lu, Y., & Shridhar, M. (1996). Character segmentation in handwritten words - An overview. Pattern Recognition, 29(1), 77-96. https://doi.org/10.1016/0031-3203(95)00072-0
[17]. Malik, L. (2016). Handwritten Marathi compound character segmentation using minutiae detection algorithm. Procedia Computer Science, 87, 18-24. https:// doi.org/10.1016/j.procs.2016.05.120
[18]. Phamondon, R., & Shrihari, S. N. (2000). Online and off-line handwriting recognition: A comprehensive survey. IEEE Transaction on Pattern Analysis and Machine Intelligence, 22, 63-84. https://doi.org/10.1109/34.824821
[19]. Schalkoff, R. (1990). Digital image processing and computer vision. New Jersey, US: John Wiley and Sons.
[20]. Senior, A. W. (1994). Off-line cursive handwriting recognition using recurrent neural networks (Doctoral Dissertation). University of Cambridge, England.
[21]. Shelke, S., & Apte, S. (2011, February). A novel multistage classification and wavelet based kernel generation for handwritten Marathi compound character recognition. In 2011, International Conference on Communications and Signal Processing (pp. 193-197). IEEE.
[22]. Shrihari, S. N. & Law, S. W. (1995). Character recognition (Technical Report no. CEDAR-TR-95-1). New York: CEDAR.
[23]. Suen, C. Y., Legault, R., Nadal, C., Cheriet, M., & Lam, L. (1993). Building a new generation of handwriting recognition systems. Pattern Recognition Letters, 14(4), 303-315. https://doi.org/10.1016/0167-8655(93)90096-V
[24]. Vaidya, M., Joshi, Y., & Bhalerao, M. (2017). Segmentation based optical character recognition for handwritten Marathi characters. In 5th International Conference on Emerging Trends in Engineering, Technology, Science and Management (pp. 662-669).
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.