Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System

Skin cancer is a significant health concern worldwide, and early and accurate diagnosis plays a crucial role in improving patient outcomes. In recent years, deep learning models have shown remarkable success in various computer vision tasks, including image classification. In this research study, we...

Full description

Saved in:

Bibliographic Details
Main Authors:	Galib Muhammad Shahriar Himel, Md. Masudul Islam, Kh. Abdullah Al-Aff, Shams Ibne Karim, Md. Kabir Uddin Sikder
Format:	Article
Language:	English
Published:	Wiley 2024-01-01
Series:	International Journal of Biomedical Imaging
Online Access:	http://dx.doi.org/10.1155/2024/3022192
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832545267491012608
author	Galib Muhammad Shahriar Himel Md. Masudul Islam Kh. Abdullah Al-Aff Shams Ibne Karim Md. Kabir Uddin Sikder
author_facet	Galib Muhammad Shahriar Himel Md. Masudul Islam Kh. Abdullah Al-Aff Shams Ibne Karim Md. Kabir Uddin Sikder
author_sort	Galib Muhammad Shahriar Himel
collection	DOAJ
description	Skin cancer is a significant health concern worldwide, and early and accurate diagnosis plays a crucial role in improving patient outcomes. In recent years, deep learning models have shown remarkable success in various computer vision tasks, including image classification. In this research study, we introduce an approach for skin cancer classification using vision transformer, a state-of-the-art deep learning architecture that has demonstrated exceptional performance in diverse image analysis tasks. The study utilizes the HAM10000 dataset; a publicly available dataset comprising 10,015 skin lesion images classified into two categories: benign (6705 images) and malignant (3310 images). This dataset consists of high-resolution images captured using dermatoscopes and carefully annotated by expert dermatologists. Preprocessing techniques, such as normalization and augmentation, are applied to enhance the robustness and generalization of the model. The vision transformer architecture is adapted to the skin cancer classification task. The model leverages the self-attention mechanism to capture intricate spatial dependencies and long-range dependencies within the images, enabling it to effectively learn relevant features for accurate classification. Segment Anything Model (SAM) is employed to segment the cancerous areas from the images; achieving an IOU of 96.01% and Dice coefficient of 98.14% and then various pretrained models are used for classification using vision transformer architecture. Extensive experiments and evaluations are conducted to assess the performance of our approach. The results demonstrate the superiority of the vision transformer model over traditional deep learning architectures in skin cancer classification in general with some exceptions. Upon experimenting on six different models, ViT-Google, ViT-MAE, ViT-ResNet50, ViT-VAN, ViT-BEiT, and ViT-DiT, we found out that the ML approach achieves 96.15% accuracy using Google’s ViT patch-32 model with a low false negative ratio on the test dataset, showcasing its potential as an effective tool for aiding dermatologists in the diagnosis of skin cancer.
format	Article
id	doaj-art-d5f5abe3e2e7482bb7a883a3b21a94c8
institution	Kabale University
issn	1687-4196
language	English
publishDate	2024-01-01
publisher	Wiley
record_format	Article
series	International Journal of Biomedical Imaging
spelling	doaj-art-d5f5abe3e2e7482bb7a883a3b21a94c82025-02-03T07:26:21ZengWileyInternational Journal of Biomedical Imaging1687-41962024-01-01202410.1155/2024/3022192Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital SystemGalib Muhammad Shahriar Himel0Md. Masudul Islam1Kh. Abdullah Al-Aff2Shams Ibne Karim3Md. Kabir Uddin Sikder4Jahangirnagar UniversityJahangirnagar UniversityBangladesh University of Health SciencesBangabandhu Sheikh Mujib Medical UniversityJahangirnagar UniversitySkin cancer is a significant health concern worldwide, and early and accurate diagnosis plays a crucial role in improving patient outcomes. In recent years, deep learning models have shown remarkable success in various computer vision tasks, including image classification. In this research study, we introduce an approach for skin cancer classification using vision transformer, a state-of-the-art deep learning architecture that has demonstrated exceptional performance in diverse image analysis tasks. The study utilizes the HAM10000 dataset; a publicly available dataset comprising 10,015 skin lesion images classified into two categories: benign (6705 images) and malignant (3310 images). This dataset consists of high-resolution images captured using dermatoscopes and carefully annotated by expert dermatologists. Preprocessing techniques, such as normalization and augmentation, are applied to enhance the robustness and generalization of the model. The vision transformer architecture is adapted to the skin cancer classification task. The model leverages the self-attention mechanism to capture intricate spatial dependencies and long-range dependencies within the images, enabling it to effectively learn relevant features for accurate classification. Segment Anything Model (SAM) is employed to segment the cancerous areas from the images; achieving an IOU of 96.01% and Dice coefficient of 98.14% and then various pretrained models are used for classification using vision transformer architecture. Extensive experiments and evaluations are conducted to assess the performance of our approach. The results demonstrate the superiority of the vision transformer model over traditional deep learning architectures in skin cancer classification in general with some exceptions. Upon experimenting on six different models, ViT-Google, ViT-MAE, ViT-ResNet50, ViT-VAN, ViT-BEiT, and ViT-DiT, we found out that the ML approach achieves 96.15% accuracy using Google’s ViT patch-32 model with a low false negative ratio on the test dataset, showcasing its potential as an effective tool for aiding dermatologists in the diagnosis of skin cancer.http://dx.doi.org/10.1155/2024/3022192
spellingShingle	Galib Muhammad Shahriar Himel Md. Masudul Islam Kh. Abdullah Al-Aff Shams Ibne Karim Md. Kabir Uddin Sikder Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System International Journal of Biomedical Imaging
title	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
title_full	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
title_fullStr	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
title_full_unstemmed	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
title_short	Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System
title_sort	skin cancer segmentation and classification using vision transformer for automatic analysis in dermatoscopy based noninvasive digital system
url	http://dx.doi.org/10.1155/2024/3022192
work_keys_str_mv	AT galibmuhammadshahriarhimel skincancersegmentationandclassificationusingvisiontransformerforautomaticanalysisindermatoscopybasednoninvasivedigitalsystem AT mdmasudulislam skincancersegmentationandclassificationusingvisiontransformerforautomaticanalysisindermatoscopybasednoninvasivedigitalsystem AT khabdullahalaff skincancersegmentationandclassificationusingvisiontransformerforautomaticanalysisindermatoscopybasednoninvasivedigitalsystem AT shamsibnekarim skincancersegmentationandclassificationusingvisiontransformerforautomaticanalysisindermatoscopybasednoninvasivedigitalsystem AT mdkabiruddinsikder skincancersegmentationandclassificationusingvisiontransformerforautomaticanalysisindermatoscopybasednoninvasivedigitalsystem

Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-Based Noninvasive Digital System

Similar Items