Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization

ABSTRACT Absolute bacterial biomass estimation in the human gut is crucial for understanding microbiome dynamics and host-microbe interactions. Current methods for quantifying bacterial biomass in stool, such as flow cytometry, quantitative polymerase chain reaction (qPCR), or spike-ins, can be labo...

Full description

Saved in:
Bibliographic Details
Main Authors: Gechlang Tang, Alex V. Carr, Crystal Perez, Katherine Ramos Sarmiento, Lisa Levy, Johanna W. Lampe, Christian Diener, Sean M. Gibbons
Format: Article
Language:English
Published: American Society for Microbiology 2025-08-01
Series:mSystems
Subjects:
Online Access:https://journals.asm.org/doi/10.1128/msystems.00984-25
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849233714142248960
author Gechlang Tang
Alex V. Carr
Crystal Perez
Katherine Ramos Sarmiento
Lisa Levy
Johanna W. Lampe
Christian Diener
Sean M. Gibbons
author_facet Gechlang Tang
Alex V. Carr
Crystal Perez
Katherine Ramos Sarmiento
Lisa Levy
Johanna W. Lampe
Christian Diener
Sean M. Gibbons
author_sort Gechlang Tang
collection DOAJ
description ABSTRACT Absolute bacterial biomass estimation in the human gut is crucial for understanding microbiome dynamics and host-microbe interactions. Current methods for quantifying bacterial biomass in stool, such as flow cytometry, quantitative polymerase chain reaction (qPCR), or spike-ins, can be labor-intensive, costly, and confounded by factors like water content, DNA extraction efficiency, PCR inhibitors, and other technical challenges that add bias and noise. We propose a simple, cost-effective approach that circumvents some of these technical challenges: directly estimating bacterial biomass from metagenomes using bacterial-to-host (B:H) read count ratios. We compared B:H ratios to the standard methods outlined above, demonstrating that B:H ratios are useful proxies for bacterial biomass in stool and possibly in other host-associated substrates. B:H ratios in stool were correlated with bacterial-to-diet (B:D) read count ratios, but B:D ratios exhibited a substantial number of outlier points. Host read depletion methods reduced the total number of human reads in a given sample, but B:H ratios were strongly correlated before and after host read depletion, indicating that host read depletion did not reduce the utility of B:H ratios. B:H ratios showed expected variation between health and disease states and were generally stable in healthy individuals over time. Finally, we showed how B:H and B:D ratios can be used to track antibiotic treatment response and recovery. B:H ratios offer a convenient alternative to other absolute biomass quantification methods, without the need for additional measurements, experimental design considerations, or machine learning, enabling robust absolute biomass estimates directly from stool metagenomic data.IMPORTANCEIn this study, we asked whether normalization by host reads alone was sufficient to estimate absolute bacterial biomass directly from stool metagenomic data, without the need for synthetic spike-ins, additional experimental biomass measurements, or training data. The approach assumes that the contribution of host DNA to stool is more constant or stable than biologically relevant fluctuations in bacterial biomass. We find that host read normalization is an effective method for detecting variation in gut bacterial biomass. Absolute bacterial biomass is a key metric that often gets left out of gut microbiome studies, and empowering researchers to include this measure more broadly in their metagenomic analyses should serve to improve our understanding of host-microbiota interactions.
format Article
id doaj-art-06e27c123e0b43e6b5dba9c46e58efc5
institution Kabale University
issn 2379-5077
language English
publishDate 2025-08-01
publisher American Society for Microbiology
record_format Article
series mSystems
spelling doaj-art-06e27c123e0b43e6b5dba9c46e58efc52025-08-20T04:03:26ZengAmerican Society for MicrobiologymSystems2379-50772025-08-0110810.1128/msystems.00984-25Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalizationGechlang Tang0Alex V. Carr1Crystal Perez2Katherine Ramos Sarmiento3Lisa Levy4Johanna W. Lampe5Christian Diener6Sean M. Gibbons7Institute for Systems Biology, Seattle, Washington, USAInstitute for Systems Biology, Seattle, Washington, USAInstitute for Systems Biology, Seattle, Washington, USAInstitute for Systems Biology, Seattle, Washington, USAFred Hutchinson Cancer Center, Seattle, Washington, USAFred Hutchinson Cancer Center, Seattle, Washington, USADiagnostic and Research Institute of Hygiene, Microbiology and Environmental Medicine, Medical University of Graz, Graz, AustriaInstitute for Systems Biology, Seattle, Washington, USAABSTRACT Absolute bacterial biomass estimation in the human gut is crucial for understanding microbiome dynamics and host-microbe interactions. Current methods for quantifying bacterial biomass in stool, such as flow cytometry, quantitative polymerase chain reaction (qPCR), or spike-ins, can be labor-intensive, costly, and confounded by factors like water content, DNA extraction efficiency, PCR inhibitors, and other technical challenges that add bias and noise. We propose a simple, cost-effective approach that circumvents some of these technical challenges: directly estimating bacterial biomass from metagenomes using bacterial-to-host (B:H) read count ratios. We compared B:H ratios to the standard methods outlined above, demonstrating that B:H ratios are useful proxies for bacterial biomass in stool and possibly in other host-associated substrates. B:H ratios in stool were correlated with bacterial-to-diet (B:D) read count ratios, but B:D ratios exhibited a substantial number of outlier points. Host read depletion methods reduced the total number of human reads in a given sample, but B:H ratios were strongly correlated before and after host read depletion, indicating that host read depletion did not reduce the utility of B:H ratios. B:H ratios showed expected variation between health and disease states and were generally stable in healthy individuals over time. Finally, we showed how B:H and B:D ratios can be used to track antibiotic treatment response and recovery. B:H ratios offer a convenient alternative to other absolute biomass quantification methods, without the need for additional measurements, experimental design considerations, or machine learning, enabling robust absolute biomass estimates directly from stool metagenomic data.IMPORTANCEIn this study, we asked whether normalization by host reads alone was sufficient to estimate absolute bacterial biomass directly from stool metagenomic data, without the need for synthetic spike-ins, additional experimental biomass measurements, or training data. The approach assumes that the contribution of host DNA to stool is more constant or stable than biologically relevant fluctuations in bacterial biomass. We find that host read normalization is an effective method for detecting variation in gut bacterial biomass. Absolute bacterial biomass is a key metric that often gets left out of gut microbiome studies, and empowering researchers to include this measure more broadly in their metagenomic analyses should serve to improve our understanding of host-microbiota interactions.https://journals.asm.org/doi/10.1128/msystems.00984-25metagenomicshuman microbiomegut microbiomeabsolute biomasshost DNAdiet DNA
spellingShingle Gechlang Tang
Alex V. Carr
Crystal Perez
Katherine Ramos Sarmiento
Lisa Levy
Johanna W. Lampe
Christian Diener
Sean M. Gibbons
Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
mSystems
metagenomics
human microbiome
gut microbiome
absolute biomass
host DNA
diet DNA
title Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
title_full Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
title_fullStr Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
title_full_unstemmed Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
title_short Metagenomic estimation of absolute bacterial biomass in the mammalian gut through host-derived read normalization
title_sort metagenomic estimation of absolute bacterial biomass in the mammalian gut through host derived read normalization
topic metagenomics
human microbiome
gut microbiome
absolute biomass
host DNA
diet DNA
url https://journals.asm.org/doi/10.1128/msystems.00984-25
work_keys_str_mv AT gechlangtang metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT alexvcarr metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT crystalperez metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT katherineramossarmiento metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT lisalevy metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT johannawlampe metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT christiandiener metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization
AT seanmgibbons metagenomicestimationofabsolutebacterialbiomassinthemammaliangutthroughhostderivedreadnormalization