A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing

Abstract Academic data processing is crucial in scientometrics and bibliometrics, such as research trending analysis and citation recommendation. Existing datasets in this domain have predominantly concentrated on textual data, overlooking the importance of visual elements. To bridge this gap, we in...

Full description

Saved in:
Bibliographic Details
Main Authors: Haitao Song, Hongyi Xu, Zikai Wang, Yifan Wang, Jiajia Li
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04415-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832572029110321152
author Haitao Song
Hongyi Xu
Zikai Wang
Yifan Wang
Jiajia Li
author_facet Haitao Song
Hongyi Xu
Zikai Wang
Yifan Wang
Jiajia Li
author_sort Haitao Song
collection DOAJ
description Abstract Academic data processing is crucial in scientometrics and bibliometrics, such as research trending analysis and citation recommendation. Existing datasets in this domain have predominantly concentrated on textual data, overlooking the importance of visual elements. To bridge this gap, we introduce a multidisciplinary multimodal aligned dataset (MMAD) specifically designed for academic data processing. This dataset encompasses over 1.1 million peer-reviewed scholarly articles, enhanced with metadata and visuals that are aligned with the text. We assess the representativeness of MMAD by comparing its country/region distribution against benchmarks from SCImago. Furthermore, we propose an innovative quality validation method for MMAD, leveraging Language Model-based techniques. Utilizing carefully crafted prompts, this approach enhances multimodal processing capabilities to evaluate the accuracy of text-to-visual alignments. We also outline prospective applications for MMAD, providing the way for novel research endeavors, including automated caption generation and analysis of trends in figures. Thus, this work signals new research prospects and provides a fertile ground for advances in academic data processing.
format Article
id doaj-art-bfd19b9b9bb44fbcba023833d6ac582c
institution Kabale University
issn 2052-4463
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-bfd19b9b9bb44fbcba023833d6ac582c2025-02-02T12:08:01ZengNature PortfolioScientific Data2052-44632025-01-0112111210.1038/s41597-025-04415-zA Multidisciplinary Multimodal Aligned Dataset for Academic Data ProcessingHaitao Song0Hongyi Xu1Zikai Wang2Yifan Wang3Jiajia Li4Shanghai Artificial Intelligence Research Institute Co., Ltd.Shanghai Artificial Intelligence Research Institute Co., Ltd.Shanghai Artificial Intelligence Research Institute Co., Ltd.University of Texas MD Anderson Cancer Center, Department of Radiation OncologyShanghai Artificial Intelligence Research Institute Co., Ltd.Abstract Academic data processing is crucial in scientometrics and bibliometrics, such as research trending analysis and citation recommendation. Existing datasets in this domain have predominantly concentrated on textual data, overlooking the importance of visual elements. To bridge this gap, we introduce a multidisciplinary multimodal aligned dataset (MMAD) specifically designed for academic data processing. This dataset encompasses over 1.1 million peer-reviewed scholarly articles, enhanced with metadata and visuals that are aligned with the text. We assess the representativeness of MMAD by comparing its country/region distribution against benchmarks from SCImago. Furthermore, we propose an innovative quality validation method for MMAD, leveraging Language Model-based techniques. Utilizing carefully crafted prompts, this approach enhances multimodal processing capabilities to evaluate the accuracy of text-to-visual alignments. We also outline prospective applications for MMAD, providing the way for novel research endeavors, including automated caption generation and analysis of trends in figures. Thus, this work signals new research prospects and provides a fertile ground for advances in academic data processing.https://doi.org/10.1038/s41597-025-04415-z
spellingShingle Haitao Song
Hongyi Xu
Zikai Wang
Yifan Wang
Jiajia Li
A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
Scientific Data
title A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
title_full A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
title_fullStr A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
title_full_unstemmed A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
title_short A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
title_sort multidisciplinary multimodal aligned dataset for academic data processing
url https://doi.org/10.1038/s41597-025-04415-z
work_keys_str_mv AT haitaosong amultidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT hongyixu amultidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT zikaiwang amultidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT yifanwang amultidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT jiajiali amultidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT haitaosong multidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT hongyixu multidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT zikaiwang multidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT yifanwang multidisciplinarymultimodalaligneddatasetforacademicdataprocessing
AT jiajiali multidisciplinarymultimodalaligneddatasetforacademicdataprocessing