State-of-the-Art Deep Learning Techniques for Object Identification in Practical Applications

Kanthirekha Miriyala*, Padmaja Rani B.**
*-** Department of Computer Science and Engineering, Jawaharlal Nehru Technological University, Hyderabad, India.
Periodicity:July - December'2024
DOI : https://doi.org/10.26634/jpr.11.2.21475

Abstract

Unprecedented advancements have been witnessed in deep learning, particularly in the domain of real-world object detection. The sophisticated capabilities of contemporary deep learning methodologies to extract and process features from intricate data have catalyzed their adoption across diverse fields, including computer vision, robotics, and autonomous systems. This comprehensive literature survey examines cutting-edge deep learning approaches employed for effective real-world object detection. The review encompasses various aspects of object identification, such as detection, recognition, and segmentation, while exploring diverse deep learning frameworks, including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Deep Belief Networks (DBNs). Furthermore, the survey provides a thorough examination of current research developments, addressing critical challenges and limitations, while identifying promising avenues for future research endeavors. The review offers valuable insights into state-of-the-art techniques and their potential real-world applications.

Keywords

Convolutional Neural Networks (CNNs), Region-based CNNs, SSD, YOLOv3, Faster R-CNN, Retina Net.

How to Cite this Article?

Miriyala, K., and Rani, B. P. (2024). State-of-the-Art Deep Learning Techniques for Object Identification in Practical Applications. i-manager’s Journal on Pattern Recognition, 11(2), 31-45. https://doi.org/10.26634/jpr.11.2.21475

References

[4]. Cai, Z., & Vasconcelos, N. (2018). Cascade R-CNN: Delving into high quality object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6154-6162).
[5]. Chen, L. C., Hermans, A., Papandreou, G., Schroff, F., Wang, P., & Adam, H. (2018). Masklab: Instance segmentation by refining object detection with semantic and direction features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4013-4022).
[7]. Deng, J., Xuan, X., Wang, W., Li, Z., Yao, H., & Wang, Z. (2020, November). A review of research on object detection based on deep learning. In Journal of Physics: Conference Series, 1684 (1), 012028. IOP Publishing.
[11]. Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 580-587).
[14]. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778).
[15]. Howard, A. G. (2017). Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv preprint arXiv:1704.04861.
[16]. Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4700-4708).
[24]. Kishore, G., Gnanasundar, G., & Harikrishnan, S. (2019). A survey on object detection using deep learning techniques. International Research Journal of Engineering and Technology (IRJET), 6 (2), 2140-2143.
[25]. Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 25.
[27]. Lin, T. (2017). Focal Loss for Dense Object Detection. arXiv preprint arXiv:1708.02002.
[28]. Lin, T. Y., Dollár, P., Girshick, R., He, K., Hariharan, B., & Belongie, S. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2117-2125).
[35]. Redmon, J., & Farhadi, A. (2017). YOLO9000: better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7263-7271).
[39]. Shen, Z., Liu, Z., Li, J., Jiang, Y. G., Chen, Y., & Xue, X. (2017). Dsod: Learning deeply supervised object detectors from scratch. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1919-1927).
[44]. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., & Rabinovich, A. (2015). Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1-9).
[45]. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., & Wojna, Z. (2016). Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2818-2826).
[46]. Tan, M., & Le, Q. (2019, May). Efficientnet: Rethinking model scaling for convolutional neural networks. In International Conference on Machine Learning (pp. 6105-6114). PMLR.
[47]. Wang, R. J., Li, X., & Ling, C. X. (2018). Pelee: A real-time object detection system on mobile devices. Advances in Neural Information Processing Systems, 31.
[49]. Wu, X., Sahoo, D., & Hoi, S. C. (2020). Recent advances in deep learning for object detection. Neurocomputing, 396, 39-64.
[53]. Zhang, S., Wen, L., Bian, X., Lei, Z., & Li, S. Z. (2018b). Single-shot refinement neural network for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4203-4212).
[54]. Zhang, X., Zhou, X., Lin, M., & Sun, J. (2018a). Shufflenet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6848-6856).
[55]. Ziou, D., & Tabbone, S. (1998). Edge detection techniques-an overview. Pattern Recognition and Image Analysis: Advances in Mathematical Theory and Applications, 8(4), 537-559.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 15 15 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.