End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement

Recently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for...

Full description

Saved in:
Bibliographic Details
Main Authors: Wei Shen, Han Li, Youbo Jin, Chase Q. Wu
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/2/698
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589232144646144
author Wei Shen
Han Li
Youbo Jin
Chase Q. Wu
author_facet Wei Shen
Han Li
Youbo Jin
Chase Q. Wu
author_sort Wei Shen
collection DOAJ
description Recently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for current automated recognition models to handle such images. In this paper, we presented an end-to-end trainable neural network model based on feature enhancement, SwFB, capable of achieving end-to-end conversion from raw images to structured text information. We constructed our feature enhancement module, Co-G-Ma, based on a convolutional neural network (CNN), gated recurrent unit (GRU), and multi-head attention. We collected real cross-border logistics courier order images from a postal company in Zhejiang province, China, to build our dataset, COFIE, and conducted a series of experiments to explore the impact of hyperparameters on the extraction of key field text. Comparative experiments were also performed with other models on publicly available datasets CORD and SROIE. The experimental results demonstrate that our model achieves advanced performance in extracting visual text information and exhibits strong generalization.
format Article
id doaj-art-03f5f9f8ec4243eab42c73dd1b9a7479
institution Kabale University
issn 2076-3417
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-03f5f9f8ec4243eab42c73dd1b9a74792025-01-24T13:20:30ZengMDPI AGApplied Sciences2076-34172025-01-0115269810.3390/app15020698End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature EnhancementWei Shen0Han Li1Youbo Jin2Chase Q. Wu3School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaSchool of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaSchool of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, ChinaDepartment of Data Science, New Jersey Institute of Technology, Newark, NJ 07102, USARecently, cross-border logistics has experienced rapid development. Cross-border logistics courier orders come in various formats, featuring diverse layouts. Additionally, there is no standardized format for the writing of address and other information on these courier orders. It is challenging for current automated recognition models to handle such images. In this paper, we presented an end-to-end trainable neural network model based on feature enhancement, SwFB, capable of achieving end-to-end conversion from raw images to structured text information. We constructed our feature enhancement module, Co-G-Ma, based on a convolutional neural network (CNN), gated recurrent unit (GRU), and multi-head attention. We collected real cross-border logistics courier order images from a postal company in Zhejiang province, China, to build our dataset, COFIE, and conducted a series of experiments to explore the impact of hyperparameters on the extraction of key field text. Comparative experiments were also performed with other models on publicly available datasets CORD and SROIE. The experimental results demonstrate that our model achieves advanced performance in extracting visual text information and exhibits strong generalization.https://www.mdpi.com/2076-3417/15/2/698end-to-endvisual text extractioncourier order imageneural network
spellingShingle Wei Shen
Han Li
Youbo Jin
Chase Q. Wu
End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
Applied Sciences
end-to-end
visual text extraction
courier order image
neural network
title End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
title_full End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
title_fullStr End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
title_full_unstemmed End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
title_short End-to-End Information Extraction from Courier Order Images Using a Neural Network Model with Feature Enhancement
title_sort end to end information extraction from courier order images using a neural network model with feature enhancement
topic end-to-end
visual text extraction
courier order image
neural network
url https://www.mdpi.com/2076-3417/15/2/698
work_keys_str_mv AT weishen endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement
AT hanli endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement
AT youbojin endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement
AT chaseqwu endtoendinformationextractionfromcourierorderimagesusinganeuralnetworkmodelwithfeatureenhancement