RESEARCH ARTICLE


DEEPSCAN: Integrating Vision Transformers for Advanced Skin Lesion Diagnostics



Jenefa A1, Edward Naveen V2, Vinayakumar Ravi5, *, Punitha S3, Tahani Jaser Alahmadi6, *, Thompson Stephan3, Prabhishek Singh4, Manoj Diwakar3
1 Department of Computer Science, Karunya Institute of Technology and Science, Coimbatore, India
2 Department of Computer Science, Sri Shakthi Institute of Engineering and Technology, Coimbatore, India
3 Department of Computer Science, Graphic Era Deemed to be University, Dehradun, Uttarakhand, India
4 School of Computer Science Engineering and Technology, Bennett University, Greater Noida, India
5 Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, Saudi Arabia
6 Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia


Article Metrics

CrossRef Citations:
0
Total Statistics:

Full-Text HTML Views: 313
Abstract HTML Views: 180
PDF Downloads: 68
ePub Downloads: 54
Total Views/Downloads: 615
Unique Statistics:

Full-Text HTML Views: 206
Abstract HTML Views: 94
PDF Downloads: 64
ePub Downloads: 51
Total Views/Downloads: 415



Creative Commons License
© 2024 The Author(s). Published by Bentham Open.

open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

* Address correspondence to these authors at the Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar, Saudi Arabia and Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia; E-mails: vinayakumarr77@gmail.com, tjalahmadi@pnu.edu.sa


Abstract

Introduction/Background

The rise in dermatological conditions, especially skin cancers, highlights the urgency for accurate diagnostics. Traditional imaging methods face challenges in capturing complex skin lesion patterns, risking misdiagnoses. Classical CNNs, though effective, often miss intricate patterns and contextual nuances.

Materials and Methods

Our research investigates the adoption of Vision Transformers (ViTs) in diagnosing skin lesions, capitalizing on their attention mechanisms and global contextual insights. Utilizing the fictional Dermatological Vision Dataset (DermVisD) with over 15,000 annotated images, we compare ViTs against traditional CNNs. This approach aims to assess the potential benefits of ViTs in dermatology.

Results

Initial experiments showcase an 18% improvement in diagnostic accuracy using ViTs over CNNs, with ViTs achieving a remarkable 97.8% accuracy on the validation set. These findings suggest that ViTs are significantly more adept at recognizing complex lesion patterns.

Discussion

The integration of Vision Transformers into dermatological imaging marks a promising shift towards more accurate diagnostics. By leveraging global contextual understanding and attention mechanisms, ViTs offer a nuanced approach that could surpass traditional methods. This advancement indicates a potential for setting new accuracy benchmarks in skin lesion diagnostics.

Conclusion

ViTs present a significant advancement in the field of dermatological imaging, potentially redefining accuracy and reliability standards. This study underscores the transformative impact of ViTs on the detection and diagnosis of skin conditions, advocating for their broader adoption in clinical settings.

Keywords: Vision transformers (ViTs), Skin lesion diagnostics, Deep learning, Medical image analysis, Human and disease, Health system.