Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy

ABSTRACT With the increasing severity of bacterial drug resistance, there is a growing need for phages with well-defined genetic backgrounds to combat drug-resistant infections. Mycobacteriophages constitute the largest genome-sequenced phage group; however, the vast majority of these phage proteins...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiao Guo, Zheng-Guo He
Format: Article
Language:English
Published: American Society for Microbiology 2025-08-01
Series:mSystems
Subjects:
Online Access:https://journals.asm.org/doi/10.1128/msystems.00508-25
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849338706979192832
author Xiao Guo
Zheng-Guo He
author_facet Xiao Guo
Zheng-Guo He
author_sort Xiao Guo
collection DOAJ
description ABSTRACT With the increasing severity of bacterial drug resistance, there is a growing need for phages with well-defined genetic backgrounds to combat drug-resistant infections. Mycobacteriophages constitute the largest genome-sequenced phage group; however, the vast majority of these phage proteins have not yet been effectively annotated. In this study, we employed a structure-based similarity search approach to improve protein annotation. Through the application of this approach to 240,754 proteins from 2,169 mycobacteriophage genomes, we increased the protein annotation rate from 34% to 52.11%. Additionally, we identified a series of predicted counter-defense proteins, including anti-CRISPR proteins and antitoxins, and inferred the potential interaction network of phage-encoded proteins involved in replication, transcription, and translation with host-associated molecular machinery. This study addresses a substantial gap in the current knowledge of the potential function of phage proteins and provides key insights into the interactions between mycobacteriophages and their hosts.IMPORTANCEMycobacteriophages constitute the largest group of phages with sequenced genomes. However, a significant portion of these phage proteins have not yet been effectively annotated, seriously hindering our understanding of the basic biological processes of phage–host interactions and their practical applications. This study utilized a structure-based similarity search approach to enhance phage protein annotation. This approach led to the identification of novel predicted protein folds, structural domain fusion phenomena, and putative new enzymes. Additionally, the study identified a series of phage-encoded proteins that may play a role in hijacking host-associated replication, transcription, and translation processes, providing insights into the molecular mechanisms underlying mycobacteriophage interactions with host machinery. This study addresses a critical knowledge gap regarding the potential function of phage proteins and provides key insights into the interactions between mycobacteriophages and their hosts.
format Article
id doaj-art-9bff061d128c4e40a74154fba922b489
institution Kabale University
issn 2379-5077
language English
publishDate 2025-08-01
publisher American Society for Microbiology
record_format Article
series mSystems
spelling doaj-art-9bff061d128c4e40a74154fba922b4892025-08-20T03:44:19ZengAmerican Society for MicrobiologymSystems2379-50772025-08-0110810.1128/msystems.00508-25Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategyXiao Guo0Zheng-Guo He1College of Life Science and Technology, Guangxi University, Nanning, ChinaCollege of Life Science and Technology, Guangxi University, Nanning, ChinaABSTRACT With the increasing severity of bacterial drug resistance, there is a growing need for phages with well-defined genetic backgrounds to combat drug-resistant infections. Mycobacteriophages constitute the largest genome-sequenced phage group; however, the vast majority of these phage proteins have not yet been effectively annotated. In this study, we employed a structure-based similarity search approach to improve protein annotation. Through the application of this approach to 240,754 proteins from 2,169 mycobacteriophage genomes, we increased the protein annotation rate from 34% to 52.11%. Additionally, we identified a series of predicted counter-defense proteins, including anti-CRISPR proteins and antitoxins, and inferred the potential interaction network of phage-encoded proteins involved in replication, transcription, and translation with host-associated molecular machinery. This study addresses a substantial gap in the current knowledge of the potential function of phage proteins and provides key insights into the interactions between mycobacteriophages and their hosts.IMPORTANCEMycobacteriophages constitute the largest group of phages with sequenced genomes. However, a significant portion of these phage proteins have not yet been effectively annotated, seriously hindering our understanding of the basic biological processes of phage–host interactions and their practical applications. This study utilized a structure-based similarity search approach to enhance phage protein annotation. This approach led to the identification of novel predicted protein folds, structural domain fusion phenomena, and putative new enzymes. Additionally, the study identified a series of phage-encoded proteins that may play a role in hijacking host-associated replication, transcription, and translation processes, providing insights into the molecular mechanisms underlying mycobacteriophage interactions with host machinery. This study addresses a critical knowledge gap regarding the potential function of phage proteins and provides key insights into the interactions between mycobacteriophages and their hosts.https://journals.asm.org/doi/10.1128/msystems.00508-25Mycobacterium tuberculosisphagestructural homologysequence clustering
spellingShingle Xiao Guo
Zheng-Guo He
Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
mSystems
Mycobacterium tuberculosis
phage
structural homology
sequence clustering
title Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
title_full Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
title_fullStr Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
title_full_unstemmed Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
title_short Pangenome-scale annotation of mycobacteriophages for dissecting phage–host interactions based on a sequence clustering and structural homology analysis strategy
title_sort pangenome scale annotation of mycobacteriophages for dissecting phage host interactions based on a sequence clustering and structural homology analysis strategy
topic Mycobacterium tuberculosis
phage
structural homology
sequence clustering
url https://journals.asm.org/doi/10.1128/msystems.00508-25
work_keys_str_mv AT xiaoguo pangenomescaleannotationofmycobacteriophagesfordissectingphagehostinteractionsbasedonasequenceclusteringandstructuralhomologyanalysisstrategy
AT zhengguohe pangenomescaleannotationofmycobacteriophagesfordissectingphagehostinteractionsbasedonasequenceclusteringandstructuralhomologyanalysisstrategy