Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data

Handwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization o...

Full description

Saved in:
Bibliographic Details
Main Authors: Nadim Ahmed, Mirza Fuad Adnan, Ahmad Shafiullah, Hayder Jahan Parash, Md. Saifur Rahman, Irfan Chowdhury Akib, Golam Sarowar
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340925000472
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583854496415744
author Nadim Ahmed
Mirza Fuad Adnan
Ahmad Shafiullah
Hayder Jahan Parash
Md. Saifur Rahman
Irfan Chowdhury Akib
Golam Sarowar
author_facet Nadim Ahmed
Mirza Fuad Adnan
Ahmad Shafiullah
Hayder Jahan Parash
Md. Saifur Rahman
Irfan Chowdhury Akib
Golam Sarowar
author_sort Nadim Ahmed
collection DOAJ
description Handwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization of handwritten circuit diagrams remains underexplored. Automating this process would enable the development of tools capable of converting handwritten circuit diagrams into machine-readable formats that can be instantly interpreted by circuit simulation software. This will offer significant benefits to both students and industry professionals. However, the lack of publicly available datasets focused on the digitization of handwritten circuit diagrams has slowed progress in this area. To address this gap, we developed the Digitize - Handwritten Circuit Diagram (HCD) Dataset, a comprehensive collection of 1,277 handwritten circuit diagrams contributed by 176 volunteers. The dataset includes detailed annotations for multiple aspects of handwritten circuit diagrams, such as component symbols, text labels, and port locations. It contains 18,602 annotated instances across 17 distinct classes of circuit component symbols and 11,936 annotated text labels associated with these components. For the preparation of ground-truth data for component port localization, we developed an annotation tool, which is publicly available for reuse. The Digitize-HCD dataset has the potential to accelerate research on digitization of handwritten circuit diagrams and contribute to the development of advanced end-to-end tools capable of transforming these diagrams into machine-readable formats.
format Article
id doaj-art-917f7b61275d4d8ca513f1b320a894cd
institution Kabale University
issn 2352-3409
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-917f7b61275d4d8ca513f1b320a894cd2025-01-28T04:14:42ZengElsevierData in Brief2352-34092025-04-0159111315Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley DataNadim Ahmed0Mirza Fuad Adnan1Ahmad Shafiullah2Hayder Jahan Parash3Md. Saifur Rahman4Irfan Chowdhury Akib5Golam Sarowar6Department of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, Bangladesh; Corresponding author.Department of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology, Dhaka, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshHandwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization of handwritten circuit diagrams remains underexplored. Automating this process would enable the development of tools capable of converting handwritten circuit diagrams into machine-readable formats that can be instantly interpreted by circuit simulation software. This will offer significant benefits to both students and industry professionals. However, the lack of publicly available datasets focused on the digitization of handwritten circuit diagrams has slowed progress in this area. To address this gap, we developed the Digitize - Handwritten Circuit Diagram (HCD) Dataset, a comprehensive collection of 1,277 handwritten circuit diagrams contributed by 176 volunteers. The dataset includes detailed annotations for multiple aspects of handwritten circuit diagrams, such as component symbols, text labels, and port locations. It contains 18,602 annotated instances across 17 distinct classes of circuit component symbols and 11,936 annotated text labels associated with these components. For the preparation of ground-truth data for component port localization, we developed an annotation tool, which is publicly available for reuse. The Digitize-HCD dataset has the potential to accelerate research on digitization of handwritten circuit diagrams and contribute to the development of advanced end-to-end tools capable of transforming these diagrams into machine-readable formats.http://www.sciencedirect.com/science/article/pii/S2352340925000472Computer visionDiagram digitizationHandwritten circuit diagramCircuit component detectionText recognition
spellingShingle Nadim Ahmed
Mirza Fuad Adnan
Ahmad Shafiullah
Hayder Jahan Parash
Md. Saifur Rahman
Irfan Chowdhury Akib
Golam Sarowar
Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
Data in Brief
Computer vision
Diagram digitization
Handwritten circuit diagram
Circuit component detection
Text recognition
title Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
title_full Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
title_fullStr Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
title_full_unstemmed Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
title_short Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
title_sort digitize hcd a dataset for digitization of handwritten circuit diagramsmendeley data
topic Computer vision
Diagram digitization
Handwritten circuit diagram
Circuit component detection
Text recognition
url http://www.sciencedirect.com/science/article/pii/S2352340925000472
work_keys_str_mv AT nadimahmed digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT mirzafuadadnan digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT ahmadshafiullah digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT hayderjahanparash digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT mdsaifurrahman digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT irfanchowdhuryakib digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata
AT golamsarowar digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata