Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes

Human-associated microbial communities are a complex mixture of bacterial species and diverse strains prevalent at varying abundances. Due to the inherent limitations of metagenomic assemblers and genome binning tools in recovering low-abundance species (<1 %) and strains, we lack comprehensive i...

Full description

Saved in:
Bibliographic Details
Main Authors: Hajra Qayyum, Muhammad Sarfraz Talib, Amjad Ali, Masood Ur Rehman Kayani
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844025003184
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832573075600703488
author Hajra Qayyum
Muhammad Sarfraz Talib
Amjad Ali
Masood Ur Rehman Kayani
author_facet Hajra Qayyum
Muhammad Sarfraz Talib
Amjad Ali
Masood Ur Rehman Kayani
author_sort Hajra Qayyum
collection DOAJ
description Human-associated microbial communities are a complex mixture of bacterial species and diverse strains prevalent at varying abundances. Due to the inherent limitations of metagenomic assemblers and genome binning tools in recovering low-abundance species (<1 %) and strains, we lack comprehensive insight into these communities. Although many bioinformatics approaches are available for recovering metagenome-assembled genomes, their effectiveness in recovering low-abundance species and strains is often questioned. Moreover, each tool has its trade-offs, making selecting the right tools challenging. In this study, we investigated the combinatory effect of various assemblers and binning tools on the recovery of low-abundance species and strain-resolved genomes from real and simulated human metagenomes. We evaluated the performance of nine combinations of metagenome assemblers and genome binning tools for their potential to recover genomes of useable quality. Our results revealed that the metaSPAdes-MetaBAT2 combination is highly effective in recovering low-abundance species, while MEGAHIT-MetaBAT2 excels in recovering strain-resolved genomes. These findings highlight the significant variation in the performance of different combinations, even when aiming for the same objective. This suggests the profound impact of selecting the right assembler-binner combination for metagenome analyses. We believe this study will be a cornerstone for the scientific community, guiding the choice of tools by highlighting their complementary effects. Furthermore, it underscores the potential of existing tools to address the current challenges in the field improving the recovery of information from metagenomes.
format Article
id doaj-art-531f426fffe14b7e874f631adc04d9dd
institution Kabale University
issn 2405-8440
language English
publishDate 2025-01-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj-art-531f426fffe14b7e874f631adc04d9dd2025-02-02T05:28:36ZengElsevierHeliyon2405-84402025-01-01112e41938Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomesHajra Qayyum0Muhammad Sarfraz Talib1Amjad Ali2Masood Ur Rehman Kayani3Integrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan; Capital University of Science &amp; Technology, Islamabad Expressway, Kahuta Road Zone-V Sihala, Islamabad, PakistanIntegrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, PakistanIntegrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan; Corresponding author.Metagenomics Discovery Lab, School of Interdisciplinary Engineering and Sciences (SINES), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan; Corresponding author.Human-associated microbial communities are a complex mixture of bacterial species and diverse strains prevalent at varying abundances. Due to the inherent limitations of metagenomic assemblers and genome binning tools in recovering low-abundance species (<1 %) and strains, we lack comprehensive insight into these communities. Although many bioinformatics approaches are available for recovering metagenome-assembled genomes, their effectiveness in recovering low-abundance species and strains is often questioned. Moreover, each tool has its trade-offs, making selecting the right tools challenging. In this study, we investigated the combinatory effect of various assemblers and binning tools on the recovery of low-abundance species and strain-resolved genomes from real and simulated human metagenomes. We evaluated the performance of nine combinations of metagenome assemblers and genome binning tools for their potential to recover genomes of useable quality. Our results revealed that the metaSPAdes-MetaBAT2 combination is highly effective in recovering low-abundance species, while MEGAHIT-MetaBAT2 excels in recovering strain-resolved genomes. These findings highlight the significant variation in the performance of different combinations, even when aiming for the same objective. This suggests the profound impact of selecting the right assembler-binner combination for metagenome analyses. We believe this study will be a cornerstone for the scientific community, guiding the choice of tools by highlighting their complementary effects. Furthermore, it underscores the potential of existing tools to address the current challenges in the field improving the recovery of information from metagenomes.http://www.sciencedirect.com/science/article/pii/S2405844025003184Gut microbiomeDe novo assemblyGenome binningMetagenome assembled genomes (MAGs)Low abundance speciesStrain-resolved analysis
spellingShingle Hajra Qayyum
Muhammad Sarfraz Talib
Amjad Ali
Masood Ur Rehman Kayani
Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
Heliyon
Gut microbiome
De novo assembly
Genome binning
Metagenome assembled genomes (MAGs)
Low abundance species
Strain-resolved analysis
title Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
title_full Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
title_fullStr Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
title_full_unstemmed Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
title_short Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes
title_sort evaluating the potential of assembler binner combinations in recovering low abundance and strain resolved genomes from human metagenomes
topic Gut microbiome
De novo assembly
Genome binning
Metagenome assembled genomes (MAGs)
Low abundance species
Strain-resolved analysis
url http://www.sciencedirect.com/science/article/pii/S2405844025003184
work_keys_str_mv AT hajraqayyum evaluatingthepotentialofassemblerbinnercombinationsinrecoveringlowabundanceandstrainresolvedgenomesfromhumanmetagenomes
AT muhammadsarfraztalib evaluatingthepotentialofassemblerbinnercombinationsinrecoveringlowabundanceandstrainresolvedgenomesfromhumanmetagenomes
AT amjadali evaluatingthepotentialofassemblerbinnercombinationsinrecoveringlowabundanceandstrainresolvedgenomesfromhumanmetagenomes
AT masoodurrehmankayani evaluatingthepotentialofassemblerbinnercombinationsinrecoveringlowabundanceandstrainresolvedgenomesfromhumanmetagenomes