Skin Disease Detection using Vision Transformers

Ramya Asalatha.Busi*, Velaga Naga Bhavana**, Revathi Navuluri***, Vamsi Donkada****, Shaik Faizaan Ahmed*****
*-***** Department of Computer Science and Engineering, Vasireddy Venkatadri Institute of Technology, Nambur, Guntur, Andhra Pradesh, India.
Periodicity:January - June'2025
DOI : https://doi.org/10.26634/jpr.12.1.21946

Abstract

Skin diseases are prevalent health issues that significantly impact individual's quality of life. Early and accurate diagnosis is crucial for timely treatment, leading to faster recovery. With advancements in machine learning and computer vision, Vision Transformers (ViTs) have emerged as a powerful alternative to Convolutional Neural Networks (CNNs) for automatic skin disease detection. This study explores the application of Vision Transformers in diagnosing skin diseases, highlighting their potential to support dermatologists and healthcare professionals. The proposed method utilizes the HAM10000 image dataset comprising various skin conditions, including melanoma, benign keratosis, basal carcinoma and other common ailments. Vision Transformers, known for their ability to capture long-range dependencies and global context in images, are employed to extract high-level features from input images. These features are then fed into a classification layer for disease detection. The ViT model learns to identify patterns associated with different skin diseases through training on an extensive dataset of skin images. When presented with a new image, the model extracts relevant features, enabling it to accurately classify the disease. The test accuracy and val loss are 93.36% and 0.2181. This study demonstrates the effectiveness of Vision Transformers in skin disease detection, offering a promising tool for improving diagnostic accuracy and supporting early intervention in dermatology.

Keywords

Skin Diseases, Vision Transformers, CNN, Image Analysis, Image Embedding.

How to Cite this Article?

Busi, R. A., Bhavana, V. N., Navuluri, R., Donkada, V., and Ahmed, S. F. (2025). Skin Disease Detection using Vision Transformers. i-manager’s Journal on Pattern Recognition, 12(1), 45-52. https://doi.org/10.26634/jpr.12.1.21946

References

[4]. Dagnaw, G. H., El Mouhtadi, M., & Mustapha, M. (2024). Skin cancer classification using vision transformers and explainable artificial intelligence. Journal of Medical Artificial Intelligence, 7, 1-17.
[8]. Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., & Guo, B. (2021). Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 10012-10022).
[9]. Touvron, H., Cord, M., Douze, M., Massa, F., Sablayrolles, A., & Jégou, H. (2021). Training data-efficient image transformers & distillation through attention. In International Conference on Machine Learning (pp. 10347-10357). PMLR.
[10]. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.
If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Single Article

North Americas,UK,
Middle East,Europe
India Rest of world
USD EUR INR USD-ROW
Pdf 35 35 200 20
Online 15 15 200 15
Pdf & Online 35 35 400 25

Options for accessing this content:
  • If you would like institutional access to this content, please recommend the title to your librarian.
    Library Recommendation Form
  • If you already have i-manager's user account: Login above and proceed to purchase the article.
  • New Users: Please register, then proceed to purchase the article.