Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach

Analysis of molecular mechanisms that lead to the development of various types of tumors is essential for biology and medicine, because it may help to find new therapeutic opportunities for cancer treatment and cure including personalized treatment approaches. One of the pathways known to be importa...

Full description

Saved in:
Bibliographic Details
Main Authors: Nadezhda Yu. Biziukova, Sergey M. Ivanov, Olga A. Tarasova
Format: Article
Language:English
Published: Tsinghua University Press 2024-03-01
Series:Big Data Mining and Analytics
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/BDMA.2023.9020007
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832568886432628736
author Nadezhda Yu. Biziukova
Sergey M. Ivanov
Olga A. Tarasova
author_facet Nadezhda Yu. Biziukova
Sergey M. Ivanov
Olga A. Tarasova
author_sort Nadezhda Yu. Biziukova
collection DOAJ
description Analysis of molecular mechanisms that lead to the development of various types of tumors is essential for biology and medicine, because it may help to find new therapeutic opportunities for cancer treatment and cure including personalized treatment approaches. One of the pathways known to be important for the development of neoplastic diseases and pathological processes is the Hedgehog signaling pathway that normally controls human embryonic development. Systematic accumulation of various types of biological data, including interactions between proteins, regulation of genes transcription, proteomics, and metabolomics experiments results, allows the application of computational analysis of these big data for identification of key molecular mechanisms of certain diseases and pathologies and promising therapeutic targets. The aim of this study is to develop a computational approach for revealing associations between human proteins and genes interacting with the Hedgehog pathway components, as well as for identifying their roles in the development of various types of tumors. We automatically collect sets of abstract texts from the NCBI PubMed bibliographic database. For recognition of the Hedgehog pathway proteins and genes and neoplastic diseases we use a dictionary-based named entity recognition approach, while for all other proteins and genes machine learning method is used. For association extraction, we develop a set of semantic rules. We complete the results of the text analysis with the gene set enrichment analysis. The identified key pathways that may influence the Hedgehog pathway and their roles in tumor development are then verified using the information in the literature.
format Article
id doaj-art-ff80b30a7c7f43b4958a3c8283f68e02
institution Kabale University
issn 2096-0654
language English
publishDate 2024-03-01
publisher Tsinghua University Press
record_format Article
series Big Data Mining and Analytics
spelling doaj-art-ff80b30a7c7f43b4958a3c8283f68e022025-02-03T00:17:03ZengTsinghua University PressBig Data Mining and Analytics2096-06542024-03-017110713010.26599/BDMA.2023.9020007Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining ApproachNadezhda Yu. Biziukova0Sergey M. Ivanov1Olga A. Tarasova2Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, RussiaDepartment of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia, and also with Department of Bioinformatics, Pirogov Russian National Research Medical University, Moscow 117997, RussiaDepartment of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, RussiaAnalysis of molecular mechanisms that lead to the development of various types of tumors is essential for biology and medicine, because it may help to find new therapeutic opportunities for cancer treatment and cure including personalized treatment approaches. One of the pathways known to be important for the development of neoplastic diseases and pathological processes is the Hedgehog signaling pathway that normally controls human embryonic development. Systematic accumulation of various types of biological data, including interactions between proteins, regulation of genes transcription, proteomics, and metabolomics experiments results, allows the application of computational analysis of these big data for identification of key molecular mechanisms of certain diseases and pathologies and promising therapeutic targets. The aim of this study is to develop a computational approach for revealing associations between human proteins and genes interacting with the Hedgehog pathway components, as well as for identifying their roles in the development of various types of tumors. We automatically collect sets of abstract texts from the NCBI PubMed bibliographic database. For recognition of the Hedgehog pathway proteins and genes and neoplastic diseases we use a dictionary-based named entity recognition approach, while for all other proteins and genes machine learning method is used. For association extraction, we develop a set of semantic rules. We complete the results of the text analysis with the gene set enrichment analysis. The identified key pathways that may influence the Hedgehog pathway and their roles in tumor development are then verified using the information in the literature.https://www.sciopen.com/article/10.26599/BDMA.2023.9020007text-miningdata mininghedgehog pathwayneoplastic processesenrichment analysispathology molecular mechanisms
spellingShingle Nadezhda Yu. Biziukova
Sergey M. Ivanov
Olga A. Tarasova
Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
Big Data Mining and Analytics
text-mining
data mining
hedgehog pathway
neoplastic processes
enrichment analysis
pathology molecular mechanisms
title Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
title_full Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
title_fullStr Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
title_full_unstemmed Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
title_short Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach
title_sort identification of proteins and genes associated with hedgehog signaling pathway involved in neoplasm formation using text mining approach
topic text-mining
data mining
hedgehog pathway
neoplastic processes
enrichment analysis
pathology molecular mechanisms
url https://www.sciopen.com/article/10.26599/BDMA.2023.9020007
work_keys_str_mv AT nadezhdayubiziukova identificationofproteinsandgenesassociatedwithhedgehogsignalingpathwayinvolvedinneoplasmformationusingtextminingapproach
AT sergeymivanov identificationofproteinsandgenesassociatedwithhedgehogsignalingpathwayinvolvedinneoplasmformationusingtextminingapproach
AT olgaatarasova identificationofproteinsandgenesassociatedwithhedgehogsignalingpathwayinvolvedinneoplasmformationusingtextminingapproach