Swin-Transformer Based Recognition of Diabetic Retinopathy Grade

Sanjay Gandhi Gundabatini*, Sai Sindhu Manne**, Sunkara Likhit Babu***, Vangapandu Bhargava Rao****, Sanka Tejaswi*****
*-***** Department of Computer Science and Engineering, Vasireddy Venkatadri Institute of Technology, Guntur, Andhra Pradesh, India.
Periodicity: January - June 2025
DOI : https://doi.org/10.26634/jpr.12.1.21927

Abstract

Diabetic Retinopathy (DR), a common complication of diabetes, is a leading cause of blindness worldwide. Early detection and accurate staging are essential for effective management and vision preservation. This study applies the Swin Transformer, a deep learning architecture with a hierarchical design and a shifted-window self-attention mechanism, to build an automated tool for DR stage assessment. Trained on the APTOS 2019 Blindness Detection dataset, the system accurately identifies subtle retinal lesions such as microaneurysms as well as more pronounced features such as hemorrhages. Enhanced preprocessing, including image augmentation and normalization, improves the model's generalizability. Results indicate that this approach outperforms traditional Convolutional Neural Networks (CNNs) in accuracy, computational efficiency, and scalability, achieving a test accuracy of 99.57% and a test loss of 0.0220.
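
As a concrete illustration of the pipeline described above, the following is a minimal sketch of a Swin-Transformer-based DR grade classifier for APTOS-style fundus images. It assumes PyTorch with the timm and torchvision libraries; the model variant, image size, augmentations, and hyperparameters are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch: Swin-Transformer DR grading on APTOS-style fundus images.
# Assumes PyTorch + timm + torchvision; settings below are illustrative only,
# not the paper's reported configuration.
import torch
import torch.nn as nn
import timm
from torchvision import transforms

NUM_CLASSES = 5  # APTOS 2019 grades: 0 = No DR ... 4 = Proliferative DR

# Preprocessing: resizing, light augmentation, and normalization stand in for
# the image enhancement/normalization steps described in the abstract.
train_transform = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(15),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Hierarchical Swin backbone with shifted-window attention; the classifier
# head is replaced to output the five DR grades.
model = timm.create_model(
    "swin_tiny_patch4_window7_224", pretrained=True, num_classes=NUM_CLASSES
)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=1e-2)

def train_step(images: torch.Tensor, labels: torch.Tensor) -> float:
    """One optimization step on a batch of (fundus images, grade labels)."""
    model.train()
    optimizer.zero_grad()
    logits = model(images)            # (batch, 5) grade scores
    loss = criterion(logits, labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```

The shifted-window design restricts self-attention to local windows and shifts the window partition between successive layers, which provides cross-window connections while keeping computation roughly linear in image size; this is what makes the hierarchical Swin backbone practical for full-resolution retinal images compared with global-attention Vision Transformers.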

Keywords

Diabetic Retinopathy, Swin Transformer, Automated Staging, Retinal Analysis, Deep Learning Technology.

How to Cite this Article?

Gundabatini, S. G., Manne, S. S., Babu, S. L., Rao, V. B., and Tejaswi, S. (2025). Swin-Transformer Based Recognition of Diabetic Retinopathy Grade. i-manager’s Journal on Pattern Recognition, 12(1), 26-34. https://doi.org/10.26634/jpr.12.1.21927
