Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data
Handwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization o...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-04-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340925000472 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832583854496415744 |
---|---|
author | Nadim Ahmed Mirza Fuad Adnan Ahmad Shafiullah Hayder Jahan Parash Md. Saifur Rahman Irfan Chowdhury Akib Golam Sarowar |
author_facet | Nadim Ahmed Mirza Fuad Adnan Ahmad Shafiullah Hayder Jahan Parash Md. Saifur Rahman Irfan Chowdhury Akib Golam Sarowar |
author_sort | Nadim Ahmed |
collection | DOAJ |
description | Handwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization of handwritten circuit diagrams remains underexplored. Automating this process would enable the development of tools capable of converting handwritten circuit diagrams into machine-readable formats that can be instantly interpreted by circuit simulation software. This will offer significant benefits to both students and industry professionals. However, the lack of publicly available datasets focused on the digitization of handwritten circuit diagrams has slowed progress in this area. To address this gap, we developed the Digitize - Handwritten Circuit Diagram (HCD) Dataset, a comprehensive collection of 1,277 handwritten circuit diagrams contributed by 176 volunteers. The dataset includes detailed annotations for multiple aspects of handwritten circuit diagrams, such as component symbols, text labels, and port locations. It contains 18,602 annotated instances across 17 distinct classes of circuit component symbols and 11,936 annotated text labels associated with these components. For the preparation of ground-truth data for component port localization, we developed an annotation tool, which is publicly available for reuse. The Digitize-HCD dataset has the potential to accelerate research on digitization of handwritten circuit diagrams and contribute to the development of advanced end-to-end tools capable of transforming these diagrams into machine-readable formats. |
format | Article |
id | doaj-art-917f7b61275d4d8ca513f1b320a894cd |
institution | Kabale University |
issn | 2352-3409 |
language | English |
publishDate | 2025-04-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj-art-917f7b61275d4d8ca513f1b320a894cd2025-01-28T04:14:42ZengElsevierData in Brief2352-34092025-04-0159111315Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley DataNadim Ahmed0Mirza Fuad Adnan1Ahmad Shafiullah2Hayder Jahan Parash3Md. Saifur Rahman4Irfan Chowdhury Akib5Golam Sarowar6Department of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, Bangladesh; Corresponding author.Department of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshDepartment of Electrical, Electronic and Communication Engineering, Military Institute of Science and Technology, Dhaka, BangladeshDepartment of Electrical and Electronic Engineering, Islamic University of Technology, Gazipur 1704, BangladeshHandwritten and legacy engineering diagrams are still commonly used in many industries and academic settings, but the lack of digitization limits their utility in modern workflows. While significant effort has been made to digitize handwritten content in other engineering domains, the digitization of handwritten circuit diagrams remains underexplored. Automating this process would enable the development of tools capable of converting handwritten circuit diagrams into machine-readable formats that can be instantly interpreted by circuit simulation software. This will offer significant benefits to both students and industry professionals. However, the lack of publicly available datasets focused on the digitization of handwritten circuit diagrams has slowed progress in this area. To address this gap, we developed the Digitize - Handwritten Circuit Diagram (HCD) Dataset, a comprehensive collection of 1,277 handwritten circuit diagrams contributed by 176 volunteers. The dataset includes detailed annotations for multiple aspects of handwritten circuit diagrams, such as component symbols, text labels, and port locations. It contains 18,602 annotated instances across 17 distinct classes of circuit component symbols and 11,936 annotated text labels associated with these components. For the preparation of ground-truth data for component port localization, we developed an annotation tool, which is publicly available for reuse. The Digitize-HCD dataset has the potential to accelerate research on digitization of handwritten circuit diagrams and contribute to the development of advanced end-to-end tools capable of transforming these diagrams into machine-readable formats.http://www.sciencedirect.com/science/article/pii/S2352340925000472Computer visionDiagram digitizationHandwritten circuit diagramCircuit component detectionText recognition |
spellingShingle | Nadim Ahmed Mirza Fuad Adnan Ahmad Shafiullah Hayder Jahan Parash Md. Saifur Rahman Irfan Chowdhury Akib Golam Sarowar Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data Data in Brief Computer vision Diagram digitization Handwritten circuit diagram Circuit component detection Text recognition |
title | Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data |
title_full | Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data |
title_fullStr | Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data |
title_full_unstemmed | Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data |
title_short | Digitize-HCD: A dataset for digitization of handwritten circuit diagramsMendeley Data |
title_sort | digitize hcd a dataset for digitization of handwritten circuit diagramsmendeley data |
topic | Computer vision Diagram digitization Handwritten circuit diagram Circuit component detection Text recognition |
url | http://www.sciencedirect.com/science/article/pii/S2352340925000472 |
work_keys_str_mv | AT nadimahmed digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT mirzafuadadnan digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT ahmadshafiullah digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT hayderjahanparash digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT mdsaifurrahman digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT irfanchowdhuryakib digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata AT golamsarowar digitizehcdadatasetfordigitizationofhandwrittencircuitdiagramsmendeleydata |