Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning

Document classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten docume...

Full description

Saved in:

Bibliographic Details
Main Authors:	K. S. Koushik, B. J. Bipin Nair, N. Shobha Rani, Mohammed Javed
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Document image classification deep learning degradation issues educational technology model generalization
Online Access:	https://ieeexplore.ieee.org/document/10807224/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832583997115334656
author	K. S. Koushik B. J. Bipin Nair N. Shobha Rani Mohammed Javed
author_facet	K. S. Koushik B. J. Bipin Nair N. Shobha Rani Mohammed Javed
author_sort	K. S. Koushik
collection	DOAJ
description	Document classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten document images into three classes. The model is devised based on the pre-trained weights transferred from XceptionNet and the additional convolutional layers for feature extraction focused on high-frequency details from document images. A set of synthetically generated datasets concerning all three classes with two to three levels of degradation severity is considered for training the proposed deep learning model along with the real images labelled based on reference datasets. The dataset comprises blur and low illumination images with three levels of severity, and shadow images with two levels of severity. About 2,841 images from classes with 783 for blur, 828 for low illumination, and 1,230 for shadow were collected. The proposed model is compared with nine state-of-the-art deep-learning classification models. The proposed model showcases robust performance under varying levels of degradation severity levels by achieving 100% sensitivity towards shadow documents, while maintaining a high F1 score of 98.79% for Level 1. In case of low illumination conditions, again the model produces 100% precision, sensitivity, and specificity at Level 3 towards testsets. The results emphasize the model’s robustness to perform reliably, particularly under shadow and low illumination conditions.
format	Article
id	doaj-art-b3734a2a1f144d56be23b431fcf61641
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-b3734a2a1f144d56be23b431fcf616412025-01-28T00:01:30ZengIEEEIEEE Access2169-35362025-01-0113149221494810.1109/ACCESS.2024.352032710807224Robust Classification of Smartphone Captured Handwritten Document Images Using Deep LearningK. S. Koushik0https://orcid.org/0000-0002-6392-9947B. J. Bipin Nair1https://orcid.org/0000-0003-4592-4947N. Shobha Rani2https://orcid.org/0000-0003-4882-1919Mohammed Javed3https://orcid.org/0000-0002-3019-7401Department of Computer Science, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, IndiaDepartment of Computer Science, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, IndiaDepartment of Artificial Intelligence and Data Science, GITAM School of Technology, Bengaluru, GITAM (Deemed to be University), Visakhapatnam, IndiaDepartment of IT, Indian Institute of Information Technology, Allahabad (IIIT-Allahabad), Allahabad, IndiaDocument classification is a challenging research problem in Document image analysis, majorly the presence of degradations like low illumination, blur, and shadows. In this paper, a deep learning-based two-level hierarchical approach is proposed for classifying smartphone-captured handwritten document images into three classes. The model is devised based on the pre-trained weights transferred from XceptionNet and the additional convolutional layers for feature extraction focused on high-frequency details from document images. A set of synthetically generated datasets concerning all three classes with two to three levels of degradation severity is considered for training the proposed deep learning model along with the real images labelled based on reference datasets. The dataset comprises blur and low illumination images with three levels of severity, and shadow images with two levels of severity. About 2,841 images from classes with 783 for blur, 828 for low illumination, and 1,230 for shadow were collected. The proposed model is compared with nine state-of-the-art deep-learning classification models. The proposed model showcases robust performance under varying levels of degradation severity levels by achieving 100% sensitivity towards shadow documents, while maintaining a high F1 score of 98.79% for Level 1. In case of low illumination conditions, again the model produces 100% precision, sensitivity, and specificity at Level 3 towards testsets. The results emphasize the model’s robustness to perform reliably, particularly under shadow and low illumination conditions.https://ieeexplore.ieee.org/document/10807224/Document image classificationdeep learningdegradation issueseducational technologymodel generalization
spellingShingle	K. S. Koushik B. J. Bipin Nair N. Shobha Rani Mohammed Javed Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning IEEE Access Document image classification deep learning degradation issues educational technology model generalization
title	Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_full	Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_fullStr	Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_full_unstemmed	Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_short	Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning
title_sort	robust classification of smartphone captured handwritten document images using deep learning
topic	Document image classification deep learning degradation issues educational technology model generalization
url	https://ieeexplore.ieee.org/document/10807224/
work_keys_str_mv	AT kskoushik robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning AT bjbipinnair robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning AT nshobharani robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning AT mohammedjaved robustclassificationofsmartphonecapturedhandwrittendocumentimagesusingdeeplearning

Robust Classification of Smartphone Captured Handwritten Document Images Using Deep Learning

Similar Items