MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy

ABSTRACT Considering that the human microbiota plays a critical role in health and disease, an accurate and high-resolution taxonomic classification is thus essential for meaningful microbiome analysis. In this study, we developed an automatic system, named MultiTax pipeline, for generating de novo...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhiwei Bao, Bin Zhang, Jianhua Yao, Ming D. Li
Format: Article
Language:English
Published: American Society for Microbiology 2025-02-01
Series:Microbiology Spectrum
Subjects:
Online Access:https://journals.asm.org/doi/10.1128/spectrum.01312-24
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540846202814464
author Zhiwei Bao
Bin Zhang
Jianhua Yao
Ming D. Li
author_facet Zhiwei Bao
Bin Zhang
Jianhua Yao
Ming D. Li
author_sort Zhiwei Bao
collection DOAJ
description ABSTRACT Considering that the human microbiota plays a critical role in health and disease, an accurate and high-resolution taxonomic classification is thus essential for meaningful microbiome analysis. In this study, we developed an automatic system, named MultiTax pipeline, for generating de novo taxonomy from full-length 16S rRNA sequences using the Genome Taxonomy Database and other existing reference databases. We first constructed the MultiTax-human database, a high-resolution resource specifically designed for human microbiome research and clinical applications. The database includes 842,649 high-quality full-length 16S rRNA sequences, extracted from multiple public repositories and human-related studies, offering a comprehensive and accurate portrayal of the human microbiome. To validate the MultiTax-human database, we profiled the human microbiome across various body sites, identified core microbial taxa, and tested its performance using an independent data set. Additionally, the database is equipped with a user-friendly web interface for easy querying and data exploration. The MultiTax-human database is poised to serve as a valuable tool for researchers, enhancing the precision of human microbiome studies and advancing our understanding of its impact on human health and diseases.IMPORTANCEUnderstanding the human microbiome, the collection of microorganisms in and on our bodies, is essential for advancing health research. Current methods often lack precision and consistency, hindering our ability to study these microorganisms effectively. Our study presents the MultiTax-human database, a high-resolution reference tool specifically designed for human microbiome research. By integrating data from multiple sources and employing advanced classification techniques, this database offers an accurate and detailed map of the human microbiome. This resource enhances the ability of researchers and clinicians to explore the roles of microorganisms in health and disease, potentially leading to improved diagnostics, treatments, and insights into various medical conditions.
format Article
id doaj-art-c3aca05ea9864f62be6b61281e84fae5
institution Kabale University
issn 2165-0497
language English
publishDate 2025-02-01
publisher American Society for Microbiology
record_format Article
series Microbiology Spectrum
spelling doaj-art-c3aca05ea9864f62be6b61281e84fae52025-02-04T14:03:40ZengAmerican Society for MicrobiologyMicrobiology Spectrum2165-04972025-02-0113210.1128/spectrum.01312-24MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomyZhiwei Bao0Bin Zhang1Jianhua Yao2Ming D. Li3State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, ChinaState Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, ChinaJoint Institute of Tobacco and Health, Kunming, Yunnan, ChinaState Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, ChinaABSTRACT Considering that the human microbiota plays a critical role in health and disease, an accurate and high-resolution taxonomic classification is thus essential for meaningful microbiome analysis. In this study, we developed an automatic system, named MultiTax pipeline, for generating de novo taxonomy from full-length 16S rRNA sequences using the Genome Taxonomy Database and other existing reference databases. We first constructed the MultiTax-human database, a high-resolution resource specifically designed for human microbiome research and clinical applications. The database includes 842,649 high-quality full-length 16S rRNA sequences, extracted from multiple public repositories and human-related studies, offering a comprehensive and accurate portrayal of the human microbiome. To validate the MultiTax-human database, we profiled the human microbiome across various body sites, identified core microbial taxa, and tested its performance using an independent data set. Additionally, the database is equipped with a user-friendly web interface for easy querying and data exploration. The MultiTax-human database is poised to serve as a valuable tool for researchers, enhancing the precision of human microbiome studies and advancing our understanding of its impact on human health and diseases.IMPORTANCEUnderstanding the human microbiome, the collection of microorganisms in and on our bodies, is essential for advancing health research. Current methods often lack precision and consistency, hindering our ability to study these microorganisms effectively. Our study presents the MultiTax-human database, a high-resolution reference tool specifically designed for human microbiome research. By integrating data from multiple sources and employing advanced classification techniques, this database offers an accurate and detailed map of the human microbiome. This resource enhances the ability of researchers and clinicians to explore the roles of microorganisms in health and disease, potentially leading to improved diagnostics, treatments, and insights into various medical conditions.https://journals.asm.org/doi/10.1128/spectrum.01312-24human microbiome16S rRNAGTDBtaxonomyreference database
spellingShingle Zhiwei Bao
Bin Zhang
Jianhua Yao
Ming D. Li
MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
Microbiology Spectrum
human microbiome
16S rRNA
GTDB
taxonomy
reference database
title MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
title_full MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
title_fullStr MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
title_full_unstemmed MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
title_short MultiTax-human: an extensive and high-resolution human-related full-length 16S rRNA reference database and taxonomy
title_sort multitax human an extensive and high resolution human related full length 16s rrna reference database and taxonomy
topic human microbiome
16S rRNA
GTDB
taxonomy
reference database
url https://journals.asm.org/doi/10.1128/spectrum.01312-24
work_keys_str_mv AT zhiweibao multitaxhumananextensiveandhighresolutionhumanrelatedfulllength16srrnareferencedatabaseandtaxonomy
AT binzhang multitaxhumananextensiveandhighresolutionhumanrelatedfulllength16srrnareferencedatabaseandtaxonomy
AT jianhuayao multitaxhumananextensiveandhighresolutionhumanrelatedfulllength16srrnareferencedatabaseandtaxonomy
AT mingdli multitaxhumananextensiveandhighresolutionhumanrelatedfulllength16srrnareferencedatabaseandtaxonomy