A Novel Meta-Heuristic Jellyfish Optimizer for Detection and Recognition of Text from Complex Images

T. Gnana Prakash*, Sumalatha Lingamgunta**, B. Sujatha***
*,** Department of Computer Science and Engineering, Jawaharlal Nehru Technological University Kakinada (JNTUK), Kakinada, Eastgodavarti, Andhra Pradesh, India.
*** Department of Computer Science and Engineering, Godavari Institute of Engineering and Technology, Rajahmundry, Eastgodavari, Andhra Pradesh, India.
Periodicity:July - September'2024

Abstract

Recognizing text in images presents significant challenges, particularly in complex backgrounds. This technology is essential for aiding visually impaired individuals and interpreting semantic content. This survey examines various techniques developed in recent years for handling text recognition in complex images. The paper provides an analysis of related works and evaluates the performance of these recognition methods. Although image complexity is not easily defined, it can be described through parameters such as background details, noise levels, lighting conditions, textures, and fonts. Additionally, this survey discusses several benchmark datasets used in the reviewed studies. By reviewing these works, challenges in this area can be identified and compared. Complex background images limit the accuracy achieved. To improve accuracy, convolutional neural networks (CNN) are employed. The proposed method comprises three parts: the text is detected from a complex background, the text is extracted from the image using Tesseract, and all identified words are saved in a text file. An audio file is then created from the text. The proposed system reads the text from the image with the aim of providing assistance to visually impaired individuals.

Keywords

Complex Images, Image Processing, Text Recognition, Jellyfish Optimizer, Chicken Swarm Optimizer.

How to Cite this Article?

Prakash, T. G., Lingamgunta, S., and Sujatha, B. (2024). A Novel Meta-Heuristic Jellyfish Optimizer for Detection and Recognition of Text from Complex Images. i-manager’s Journal on Image Processing, 11(3), 1-9.

References

[3]. Fan, H., Chang, X., Cheng, D., Yang, Y., Xu, D., & Hauptmann, A. G. (2017). Complex event detection by identifying reliable shots from untrimmed videos. In Proceedings of the IEEE International Conference on Computer Vision (pp. 736-744).
[5]. Hazra, T. K., & Pratapsingh, D. (2017). Nikunjdaga, Optical Character Recognition using KNN on Custom Image Dataset. IEEE Access.
[10]. Long, S., Ruan, J., Zhang, W., He, X., Wu, W., & Yao, C. (2018). Textsnake: A flexible representation for detecting text of arbitrary shapes. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 20-36).
[12]. Luo, J. H., Wu, J., & Lin, W. (2017). Thinet: A filter level pruning method for deep neural network compression. In Proceedings of the IEEE International Conference on Computer Vision (pp. 5058-5066).
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Online 15 15

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.