End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
Recently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-01-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/15/2/698 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832589232144646144 |
---|---|
author | Wei Shen Han Li Youbo Jin Chase Q. Wu |
author_facet | Wei Shen Han Li Youbo Jin Chase Q. Wu |
author_sort | Wei Shen |
collection | DOAJ |
description | Recently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for current automated recognition models to handle such images. In this paper, we presented an end-to-end trainable neural network model based on feature enhancement, SwFB, capable of achieving end-to-end conversion from raw images to structured text information. We constructed our feature enhancement module, Co-G-Ma, based on a convolutional neural network (CNN), gated recurrent unit (GRU), and multi-head attention. We collected real cross-border logistics courier order images from a postal company in Zhejiang province, China, to build our dataset, COFIE, and conducted a series of experiments to explore the impact of hyperparameters on the extraction of key field text. Comparative experiments were also performed with other models on publicly available datasets CORD and SROIE. The experimental results demonstrate that our model achieves advanced performance in extracting visual text information and exhibits strong generalization. |
format | Article |
id | doaj-art-03f5f9f8ec4243eab42c73dd1b9a7479 |
institution | Kabale University |
issn | 2076-3417 |
language | English |
publishDate | 2025-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj-art-03f5f9f8ec4243eab42c73dd1b9a74792025-01-24T13:20:30ZengMDPI AGApplied Sciences2076-34172025-01-0115269810.3390/app15020698End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature EnhancementWei Shen0Han Li1Youbo Jin2Chase Q. Wu3School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaSchool of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaSchool of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaDepartment of Data Science, New Jersey Institute of Technology, Newark, NJ 07102, USARecently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for current automated recognition models to handle such images. In this paper, we presented an end-to-end trainable neural network model based on feature enhancement, SwFB, capable of achieving end-to-end conversion from raw images to structured text information. We constructed our feature enhancement module, Co-G-Ma, based on a convolutional neural network (CNN), gated recurrent unit (GRU), and multi-head attention. We collected real cross-border logistics courier order images from a postal company in Zhejiang province, China, to build our dataset, COFIE, and conducted a series of experiments to explore the impact of hyperparameters on the extraction of key field text. Comparative experiments were also performed with other models on publicly available datasets CORD and SROIE. The experimental results demonstrate that our model achieves advanced performance in extracting visual text information and exhibits strong generalization.https://www.mdpi.com/2076-3417/15/2/698end-to-endvisual text extractioncourier order imageneural network |
spellingShingle | Wei Shen Han Li Youbo Jin Chase Q. Wu End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement Applied Sciences end-to-end visual text extraction courier order image neural network |
title | End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement |
title_full | End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement |
title_fullStr | End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement |
title_full_unstemmed | End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement |
title_short | End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement |
title_sort | end to end information extraction from courier order images using a neural network model with feature enhancement |
topic | end-to-end visual text extraction courier order image neural network |
url | https://www.mdpi.com/2076-3417/15/2/698 |
work_keys_str_mv | AT weishen endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement AT hanli endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement AT youbojin endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement AT chaseqwu endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement |