i-manager Publications

Real-Time Object Detector for the Visually Impaired with Voice Feedback using OpenCV

Rajeshwar Kumar Dewangan*, Siddharth Chaubey**

*-** Department of Computer Science & Engineering, Shri Shankaracharya Group of Institute, Bhilai, Junwani, Chhattisgarh, India.

Periodicity: January - June 2022
DOI: https://doi.org/10.26634/jdp.10.1.18580

Abstract

The goal of this paper is to create an object detector model that detects objects at a certain distance for visually impaired people and other commercial users. Existing object detection algorithms require a huge amount of training data, which makes training slow and extremely complex. To avoid this, the paper presents a computer vision approach that converts a detected object to text by importing a pre-trained CAFFEMODEL (a machine learning model created by Caffe), and the text is then converted to speech. The method detects multiple objects in the same frame and works in real time. The paper discusses the concept, methodology, and system architecture of the implementation, together with the intermediate results obtained, and analyzes the tools used in the proposed system. The system can be integrated into other systems, such as portable gadgets that detect objects at a certain distance from a visually impaired person and transmit a voice signal.
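The pipeline outlined in the abstract (pre-trained Caffe model, object labels as text, text to speech) could look roughly like the sketch below. It is not the authors' implementation. The class list and its order, the 300x300 input size, the scaling constants, and the 0.5 confidence threshold are assumptions taken from commonly distributed MobileNet SSD Caffe models. The use of pyttsx3 for speech, the file paths, and the helper names `labels_from_detections`, `describe`, and `run_camera` are hypothetical.

```python
from collections import Counter

# Assumed label order of a public MobileNet SSD Caffe model (PASCAL VOC classes).
CLASSES = ["background", "aeroplane", "bicycle", "bird", "boat", "bottle",
           "bus", "car", "cat", "chair", "cow", "diningtable", "dog", "horse",
           "motorbike", "person", "pottedplant", "sheep", "sofa", "train",
           "tvmonitor"]


def labels_from_detections(detections, min_confidence=0.5):
    """Turn SSD output rows into class names.

    Each row is [image_id, class_id, confidence, x1, y1, x2, y2].
    Rows below the threshold, or labeled as background, are dropped.
    """
    labels = []
    for row in detections:
        class_id = int(row[1])
        confidence = float(row[2])
        if confidence >= min_confidence and 0 < class_id < len(CLASSES):
            labels.append(CLASSES[class_id])
    return labels


def describe(labels):
    """Build the sentence that is handed to the speech engine."""
    if not labels:
        return "No objects detected"
    counts = Counter(labels)  # keeps first-seen order
    return "Detected " + ", ".join(f"{n} {name}" for name, n in counts.items())


def run_camera(prototxt_path, weights_path):
    """Detect objects on webcam frames and speak the result (hypothetical helper)."""
    import cv2      # OpenCV, imported lazily so the helpers above run without it
    import pyttsx3  # assumed text-to-speech library; the paper does not name one

    net = cv2.dnn.readNetFromCaffe(prototxt_path, weights_path)
    engine = pyttsx3.init()
    capture = cv2.VideoCapture(0)
    last_spoken = None
    try:
        while True:
            ok, frame = capture.read()
            if not ok:
                break
            # Assumed preprocessing for MobileNet SSD: 300x300, scaled to [-1, 1].
            blob = cv2.dnn.blobFromImage(cv2.resize(frame, (300, 300)),
                                         0.007843, (300, 300), 127.5)
            net.setInput(blob)
            detections = net.forward()[0, 0]  # rows of 7 values per detection
            sentence = describe(labels_from_detections(detections))
            if sentence != last_spoken:  # avoid repeating the same announcement
                engine.say(sentence)
                engine.runAndWait()
                last_spoken = sentence
    finally:
        capture.release()
```

Calling `run_camera("deploy.prototxt", "model.caffemodel")` with the two files of a pre-trained model (placeholder names) would start the loop. Keeping the label and sentence helpers free of OpenCV makes them easy to check without a camera.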

Keywords

Real-Time Object Detection, CAFFEMODEL Framework, MobileNet SSD, Deep Neural Network, Voice Output.

How to Cite this Article?

Dewangan, R. K., and Chaubey, S. (2022). Real-Time Object Detector for the Visually Impaired with Voice Feedback using OpenCV. i-manager’s Journal on Digital Signal Processing, 10(1), 29-33. https://doi.org/10.26634/jdp.10.1.18580

