Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning

Document classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten docume...

Full description

Saved in:
Bibliographic Details
Main Authors: K. S. Koushik, B. J. Bipin Nair, N. Shobha Rani, Mohammed Javed
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10807224/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583997115334656
author K. S. Koushik
B. J. Bipin Nair
N. Shobha Rani
Mohammed Javed
author_facet K. S. Koushik
B. J. Bipin Nair
N. Shobha Rani
Mohammed Javed
author_sort K. S. Koushik
collection DOAJ
description Document classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten document images into three classes. The model is devised based on the pre-trained weights transferred from XceptionNet and the additional convolutional layers for feature extraction focused on high-frequency details from document images. A set of synthetically generated datasets concerning all three classes with two to three levels of degradation severity is considered for training the proposed deep learning model along with the real images labelled based on reference datasets. The dataset comprises blur and low illumination images with three levels of severity, and shadow images with two levels of severity. About 2,841 images from classes with 783 for blur, 828 for low illumination, and 1,230 for shadow were collected. The proposed model is compared with nine state-of-the-art deep-learning classification models. The proposed model showcases robust performance under varying levels of degradation severity levels by achieving 100% sensitivity towards shadow documents, while maintaining a high F1 score of 98.79% for Level 1. In case of low illumination conditions, again the model produces 100% precision, sensitivity, and specificity at Level 3 towards testsets. The results emphasize the model’s robustness to perform reliably, particularly under shadow and low illumination conditions.
format Article
id doaj-art-b3734a2a1f144d56be23b431fcf61641
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-b3734a2a1f144d56be23b431fcf616412025-01-28T00:01:30ZengIEEEIEEE Access2169-35362025-01-0113149221494810.1109/ACCESS.2024.352032710807224Robust Classification of Smartphone Captured Handwritten Document Images Using Deep LearningK. S. Koushik0https://orcid.org/0000-0002-6392-9947B. J. Bipin Nair1https://orcid.org/0000-0003-4592-4947N. Shobha Rani2https://orcid.org/0000-0003-4882-1919Mohammed Javed3https://orcid.org/0000-0002-3019-7401Department of Computer Science, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, IndiaDepartment of Computer Science, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, IndiaDepartment of Artificial Intelligence and Data Science, GITAM School of Technology, Bengaluru, GITAM (Deemed to be University), Visakhapatnam, IndiaDepartment of IT, Indian Institute of Information Technology, Allahabad (IIIT-Allahabad), Allahabad, IndiaDocument classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten document images into three classes. The model is devised based on the pre-trained weights transferred from XceptionNet and the additional convolutional layers for feature extraction focused on high-frequency details from document images. A set of synthetically generated datasets concerning all three classes with two to three levels of degradation severity is considered for training the proposed deep learning model along with the real images labelled based on reference datasets. The dataset comprises blur and low illumination images with three levels of severity, and shadow images with two levels of severity. About 2,841 images from classes with 783 for blur, 828 for low illumination, and 1,230 for shadow were collected. The proposed model is compared with nine state-of-the-art deep-learning classification models. The proposed model showcases robust performance under varying levels of degradation severity levels by achieving 100% sensitivity towards shadow documents, while maintaining a high F1 score of 98.79% for Level 1. In case of low illumination conditions, again the model produces 100% precision, sensitivity, and specificity at Level 3 towards testsets. The results emphasize the model’s robustness to perform reliably, particularly under shadow and low illumination conditions.https://ieeexplore.ieee.org/document/10807224/Document image classificationdeep learningdegradation issueseducational technologymodel generalization
spellingShingle K. S. Koushik
B. J. Bipin Nair
N. Shobha Rani
Mohammed Javed
Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
IEEE Access
Document image classification
deep learning
degradation issues
educational technology
model generalization
title Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_full Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_fullStr Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_full_unstemmed Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_short Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_sort robust classification of smartphone captured handwritten document images using deep learning
topic Document image classification
deep learning
degradation issues
educational technology
model generalization
url https://ieeexplore.ieee.org/document/10807224/
work_keys_str_mv AT kskoushik robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning
AT bjbipinnair robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning
AT nshobharani robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning
AT mohammedjaved robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning