A system for automatic construction of knowledge graphs of mathematical documents

This article outlines the process of creating an automated system for knowledge graph construction from collections of mathematical documents in LATEX format. The MathCollectionOntology, which defines the types of objects and relationships in knowledge graphs, was developed. The introduced toolkit i...

Full description

Saved in:
Bibliographic Details
Main Authors: O. A. Nevzorova, B. T. Gizatullin
Format: Article
Language:English
Published: Kazan Federal University 2024-01-01
Series:Учёные записки Казанского университета: Серия Физико-математические науки
Subjects:
Online Access:https://uzakufismat.elpub.ru/jour/article/view/15
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832543001246695424
author O. A. Nevzorova
B. T. Gizatullin
author_facet O. A. Nevzorova
B. T. Gizatullin
author_sort O. A. Nevzorova
collection DOAJ
description This article outlines the process of creating an automated system for knowledge graph construction from collections of mathematical documents in LATEX format. The MathCollectionOntology, which defines the types of objects and relationships in knowledge graphs, was developed. The introduced toolkit includes methods for extracting mathematical terms, browsing and identifying document topics, extracting entities from LATEX code, and calculating statistical parameters of the graph. The parsed entities are mathematical terms, topics generated through the Latent Dirichlet Allocation, UDC codes, used formulas, author affiliations, cited literature, and others. The knowledge graph captures each extracted object using specific types of relationships defined in the MathCollectionOntology. Here, a knowledge graph was coined for a collection of articles published in Izvestiya VUZov. Matematika journal (1114 Russian-language documents in LATEX format). The thematic terms of the document topics were described. The quantitative parameters of the constructed knowledge graph were obtained.
format Article
id doaj-art-307be8941985408598c3b75d40ffa080
institution Kabale University
issn 2541-7746
2500-2198
language English
publishDate 2024-01-01
publisher Kazan Federal University
record_format Article
series Учёные записки Казанского университета: Серия Физико-математические науки
spelling doaj-art-307be8941985408598c3b75d40ffa0802025-02-03T12:00:35ZengKazan Federal UniversityУчёные записки Казанского университета: Серия Физико-математические науки2541-77462500-21982024-01-01165326428110.26907/2541-7746.2023.3.264-28114A system for automatic construction of knowledge graphs of mathematical documentsO. A. Nevzorova0B. T. Gizatullin1Kazan Federal UniversityKazan Federal UniversityThis article outlines the process of creating an automated system for knowledge graph construction from collections of mathematical documents in LATEX format. The MathCollectionOntology, which defines the types of objects and relationships in knowledge graphs, was developed. The introduced toolkit includes methods for extracting mathematical terms, browsing and identifying document topics, extracting entities from LATEX code, and calculating statistical parameters of the graph. The parsed entities are mathematical terms, topics generated through the Latent Dirichlet Allocation, UDC codes, used formulas, author affiliations, cited literature, and others. The knowledge graph captures each extracted object using specific types of relationships defined in the MathCollectionOntology. Here, a knowledge graph was coined for a collection of articles published in Izvestiya VUZov. Matematika journal (1114 Russian-language documents in LATEX format). The thematic terms of the document topics were described. The quantitative parameters of the constructed knowledge graph were obtained.https://uzakufismat.elpub.ru/jour/article/view/15knowledge graph constructionlinked open datatopic modelingmathematical articletext processing
spellingShingle O. A. Nevzorova
B. T. Gizatullin
A system for automatic construction of knowledge graphs of mathematical documents
Учёные записки Казанского университета: Серия Физико-математические науки
knowledge graph construction
linked open data
topic modeling
mathematical article
text processing
title A system for automatic construction of knowledge graphs of mathematical documents
title_full A system for automatic construction of knowledge graphs of mathematical documents
title_fullStr A system for automatic construction of knowledge graphs of mathematical documents
title_full_unstemmed A system for automatic construction of knowledge graphs of mathematical documents
title_short A system for automatic construction of knowledge graphs of mathematical documents
title_sort system for automatic construction of knowledge graphs of mathematical documents
topic knowledge graph construction
linked open data
topic modeling
mathematical article
text processing
url https://uzakufismat.elpub.ru/jour/article/view/15
work_keys_str_mv AT oanevzorova asystemforautomaticconstructionofknowledgegraphsofmathematicaldocuments
AT btgizatullin asystemforautomaticconstructionofknowledgegraphsofmathematicaldocuments
AT oanevzorova systemforautomaticconstructionofknowledgegraphsofmathematicaldocuments
AT btgizatullin systemforautomaticconstructionofknowledgegraphsofmathematicaldocuments