Enterprise chart question and answer method based on multi modal cross fusion

Abstract To enhance enterprises’ interactive exploration capabilities for unstructured chart data, this paper proposes a multimodal chart question-answering method. Facing the challenge of recognizing curved and irregular text in charts, we introduce Gaussian heatmap encoding technology to achieve c...

Full description

Saved in:
Bibliographic Details
Main Authors: Xinxin Wang, Liang Chen, Changhong Liu, Jinyu Liu
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-024-83652-5
Tags: Add Tag
No Tags, Be the first to tag this record!