Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach

A new Euclidean distance based algorithm is used for analysis of congruence and combining molecular genetic data. This approach is called geometric, since Euclidean distance satisfies all metric axioms and the points representing the sequences can be placed in a geometric space without distorting th...

Full description

Saved in:
Bibliographic Details
Main Authors: V. M. Efimov, V. Yu. Kovaleva, Yu. N. Litvinov
Format: Article
Language:English
Published: Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders 2017-02-01
Series:Вавиловский журнал генетики и селекции
Subjects:
Online Access:https://vavilov.elpub.ru/jour/article/view/615
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832575242513416192
author V. M. Efimov
V. Yu. Kovaleva
Yu. N. Litvinov
author_facet V. M. Efimov
V. Yu. Kovaleva
Yu. N. Litvinov
author_sort V. M. Efimov
collection DOAJ
description A new Euclidean distance based algorithm is used for analysis of congruence and combining molecular genetic data. This approach is called geometric, since Euclidean distance satisfies all metric axioms and the points representing the sequences can be placed in a geometric space without distorting the mutual distances and can be endowed with the coordinates in this space. Geometricness of Euclidean distances allows to apply to molecular data methods of multivariate analysis, which are relevant for intra- and interspecies variability investigating, visualization of possible directions of evolution, combining data and evaluation of the congruence of phylogenetic signals. The algorithm is used for the analysis of more than 1500 nucleotide sequences of two nuclear (apoB, brca1) and two mitochondrial (co1, cytb) genes of 15 Palaearctic and Nearctic shrews species of genus Sorex (Soricidae, Eulipotyphla). All sequences of each gene are represented as a set of points in Euclidean space. Centroids of a set of points belonging to the same species are calculated. The matrix of Euclidean distances between the species centroids is calculated for each gene. Mantel test is applied to estimate pairwise similarity (congruence) of interspecies distances matrices relating to different genes. nDNA genes congruence is equal 0.961, mtDNA – 0.748. All matrices of the interspecies distances are combined into a joint matrix by weighing. Joint genetic space for all species is built by principal coordinate method from the joint matrix. Several variability directions reflecting evolutionary events of different scale are visualized in a joint genetic space. In addition, the joint matrix of interspecies distances is used for building a phylogenetic tree which is consistent with the zoological systematics accepted for today. This confirms the efficiency of our proposed method.
format Article
id doaj-art-8f404a319b2e4b81a481b131afeabbbb
institution Kabale University
issn 2500-3259
language English
publishDate 2017-02-01
publisher Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders
record_format Article
series Вавиловский журнал генетики и селекции
spelling doaj-art-8f404a319b2e4b81a481b131afeabbbb2025-02-01T09:58:03ZengSiberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and BreedersВавиловский журнал генетики и селекции2500-32592017-02-0120681682210.18699/VJ16.153476Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approachV. M. Efimov0V. Yu. Kovaleva1Yu. N. Litvinov2Institute of Cytology and Genetics SB RAS; Novosibirsk State University; Tomsk State UniversityInstitute of Systematics and Ecology of Animals SB RASInstitute of Systematics and Ecology of Animals SB RASA new Euclidean distance based algorithm is used for analysis of congruence and combining molecular genetic data. This approach is called geometric, since Euclidean distance satisfies all metric axioms and the points representing the sequences can be placed in a geometric space without distorting the mutual distances and can be endowed with the coordinates in this space. Geometricness of Euclidean distances allows to apply to molecular data methods of multivariate analysis, which are relevant for intra- and interspecies variability investigating, visualization of possible directions of evolution, combining data and evaluation of the congruence of phylogenetic signals. The algorithm is used for the analysis of more than 1500 nucleotide sequences of two nuclear (apoB, brca1) and two mitochondrial (co1, cytb) genes of 15 Palaearctic and Nearctic shrews species of genus Sorex (Soricidae, Eulipotyphla). All sequences of each gene are represented as a set of points in Euclidean space. Centroids of a set of points belonging to the same species are calculated. The matrix of Euclidean distances between the species centroids is calculated for each gene. Mantel test is applied to estimate pairwise similarity (congruence) of interspecies distances matrices relating to different genes. nDNA genes congruence is equal 0.961, mtDNA – 0.748. All matrices of the interspecies distances are combined into a joint matrix by weighing. Joint genetic space for all species is built by principal coordinate method from the joint matrix. Several variability directions reflecting evolutionary events of different scale are visualized in a joint genetic space. In addition, the joint matrix of interspecies distances is used for building a phylogenetic tree which is consistent with the zoological systematics accepted for today. This confirms the efficiency of our proposed method.https://vavilov.elpub.ru/jour/article/view/615sorexmtdnanuclear dnadj-methodphylogeneticseuclidean space
spellingShingle V. M. Efimov
V. Yu. Kovaleva
Yu. N. Litvinov
Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
Вавиловский журнал генетики и селекции
sorex
mtdna
nuclear dna
dj-method
phylogenetics
euclidean space
title Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
title_full Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
title_fullStr Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
title_full_unstemmed Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
title_short Combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
title_sort combining and congruence evaluation of phylogenetic signals from different genes based on geometric approach
topic sorex
mtdna
nuclear dna
dj-method
phylogenetics
euclidean space
url https://vavilov.elpub.ru/jour/article/view/615
work_keys_str_mv AT vmefimov combiningandcongruenceevaluationofphylogeneticsignalsfromdifferentgenesbasedongeometricapproach
AT vyukovaleva combiningandcongruenceevaluationofphylogeneticsignalsfromdifferentgenesbasedongeometricapproach
AT yunlitvinov combiningandcongruenceevaluationofphylogeneticsignalsfromdifferentgenesbasedongeometricapproach