A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing
Abstract: Academic data processing is crucial to scientometric and bibliometric tasks such as research trend analysis and citation recommendation. Existing datasets in this domain have predominantly concentrated on textual data, overlooking the importance of visual elements. To bridge this gap, we in...
Main Authors: | Haitao Song, Hongyi Xu, Zikai Wang, Yifan Wang, Jiajia Li |
---|---|
Format: | Article |
Language: | English |
Published: | Nature Portfolio, 2025-01-01 |
Series: | Scientific Data |
Online Access: | https://doi.org/10.1038/s41597-025-04415-z |
_version_ | 1832572029110321152 |
---|---|
author | Haitao Song, Hongyi Xu, Zikai Wang, Yifan Wang, Jiajia Li |
collection | DOAJ |
description | Abstract: Academic data processing is crucial to scientometric and bibliometric tasks such as research trend analysis and citation recommendation. Existing datasets in this domain have predominantly concentrated on textual data, overlooking the importance of visual elements. To bridge this gap, we introduce a multidisciplinary multimodal aligned dataset (MMAD) specifically designed for academic data processing. This dataset encompasses over 1.1 million peer-reviewed scholarly articles, enhanced with metadata and visuals that are aligned with the text. We assess the representativeness of MMAD by comparing its country/region distribution against benchmarks from SCImago. Furthermore, we propose an innovative quality validation method for MMAD, leveraging language model-based techniques. Utilizing carefully crafted prompts, this approach enhances multimodal processing capabilities to evaluate the accuracy of text-to-visual alignments. We also outline prospective applications for MMAD, paving the way for novel research endeavors, including automated caption generation and analysis of trends in figures. Thus, this work signals new research prospects and provides fertile ground for advances in academic data processing. |
format | Article |
id | doaj-art-bfd19b9b9bb44fbcba023833d6ac582c |
institution | Kabale University |
issn | 2052-4463 |
language | English |
publishDate | 2025-01-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Data |
spelling | doaj-art-bfd19b9b9bb44fbcba023833d6ac582c; 2025-02-02T12:08:01Z; eng; Nature Portfolio; Scientific Data; 2052-4463; 2025-01-01; 121112; 10.1038/s41597-025-04415-z; A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing; Haitao Song (Shanghai Artificial Intelligence Research Institute Co., Ltd.); Hongyi Xu (Shanghai Artificial Intelligence Research Institute Co., Ltd.); Zikai Wang (Shanghai Artificial Intelligence Research Institute Co., Ltd.); Yifan Wang (University of Texas MD Anderson Cancer Center, Department of Radiation Oncology); Jiajia Li (Shanghai Artificial Intelligence Research Institute Co., Ltd.); https://doi.org/10.1038/s41597-025-04415-z |
title | A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing |
url | https://doi.org/10.1038/s41597-025-04415-z |