A novel approach to analyzing the evolution of SARS-CoV-2 based on visualization and clustering of large genetic data compactly represented in operative memory

SARS-CoV-2 is a virus for which an exceptionally large number of genome variants has been collected, sequenced, and stored from sources around the world. The raw data in FASTA format comprise 16.8 million genomes, each ≈29,900 nt (nucleotides) long, with a total size of ≈500 × 10⁹ nt, or 465 GB. We suggest an approac...
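The size estimate in the abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes one byte per nucleotide character (as in uncompressed FASTA sequence text, ignoring headers and line breaks) and binary gigabytes (2³⁰ bytes). Neither assumption is stated in this record, and the variable names are hypothetical.

```python
# Figures quoted in the abstract.
genome_count = 16_800_000   # 16.8 million genomes
genome_length_nt = 29_900   # approximate length of one genome, in nucleotides

# Assumption (not stated in the record): one byte per nucleotide character,
# with FASTA header lines and line breaks ignored.
BYTES_PER_NT = 1

# Exact product of the two quoted figures.
total_nt = genome_count * genome_length_nt

# The abstract rounds the total to 500 * 10^9 nt before converting.
rounded_total_nt = 500 * 10**9

# Assumption: the quoted size uses binary gigabytes (2**30 bytes each).
size_gib = rounded_total_nt * BYTES_PER_NT / 2**30

print(f"exact product: {total_nt:.3e} nt")
print(f"rounded total in binary gigabytes: {size_gib:.1f}")
```

Under these assumptions the rounded total of 500 × 10⁹ nt comes to about 465.7 binary gigabytes, which matches the quoted figure of 465.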

Bibliographic Details
Main Authors: A. Yu. Palyanov, N. V. Palyanova
Format: Article
Language: English
Published: Siberian Branch of the Russian Academy of Sciences, Federal Research Center Institute of Cytology and Genetics, The Vavilov Society of Geneticists and Breeders 2025-01-01
Series: Вавиловский журнал генетики и селекции (Vavilov Journal of Genetics and Breeding)
Online Access: https://vavilov.elpub.ru/jour/article/view/4406