Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
Abstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but ne...
Saved in:
Main Authors: | , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2025-01-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12864-025-11267-9 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832572037137170432 |
---|---|
author | Semeh Bejaoui Sofie Holtsmark Nielsen Astrid Rasmussen John Eugenio Coia Dorte Terp Andersen Tobias Bruun Pedersen Martin Vad Møller Marc Trunjer Kusk Nielsen Dorte Frees Søren Persson |
author_facet | Semeh Bejaoui Sofie Holtsmark Nielsen Astrid Rasmussen John Eugenio Coia Dorte Terp Andersen Tobias Bruun Pedersen Martin Vad Møller Marc Trunjer Kusk Nielsen Dorte Frees Søren Persson |
author_sort | Semeh Bejaoui |
collection | DOAJ |
description | Abstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but need to balance accuracy and rapid turnaround time of the results. While Illumina short-read sequencing has been adopted as the gold standard to investigate C. difficile virulence and transmission routes, little is known about the potential of Nanopore long-read sequencing in this field. The goal of our study was to compare sequencing and assembly quality of 37 C. difficile isolates using Illumina (SPAdes assembled) and Nanopore (Flye and Unicycler assembled) data alone, along with hybrid assemblies obtained with short-read polishing of long reads. Results Illumina sequencing produced reads with an average quality of 99.68% (Q25), while Nanopore sequencing produced reads reaching an average quality of 96.84% (Q15), showing a tenfold difference in quality. Sequence type (ST) designation from Nanopore assemblies failed to detect ST5, ST7, ST8, ST13 and ST49, while ST designation based on unpolished Nanopore reads using Krocus was successful for all STs. Nanopore sequences exhibited an average of 640 base errors per genome (~ 0.015% substitution rate), which was reflected by the incorrect assignment of over 180 alleles in core genome multilocus sequence typing (cgMLST) analysis. As a result, Nanopore-derived phylogenies were not as accurate as the Illumina reference, and therefore inadequate for precise investigation of transmission events. Both sequencing platforms provided comparable, satisfactory results for the detection of virulence genes tcdA, tcdB, cdtAB and in-frame deletions in tcdC. Conclusion Compared to Illumina, Nanopore has higher error rate, which limits its application for high-resolution epidemiological surveillance. However, the short analysis time, lower cost and more simple procedure combined with correctly identified STs and virulence genes, makes it an alternative when fast and less detailed analyses are preferred. |
format | Article |
id | doaj-art-e4d12d12f2be4d92bbd8562bfad81c7d |
institution | Kabale University |
issn | 1471-2164 |
language | English |
publishDate | 2025-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj-art-e4d12d12f2be4d92bbd8562bfad81c7d2025-02-02T12:10:10ZengBMCBMC Genomics1471-21642025-01-0126111510.1186/s12864-025-11267-9Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillanceSemeh Bejaoui0Sofie Holtsmark Nielsen1Astrid Rasmussen2John Eugenio Coia3Dorte Terp Andersen4Tobias Bruun Pedersen5Martin Vad Møller6Marc Trunjer Kusk Nielsen7Dorte Frees8Søren Persson9Department of Veterinary and Animal Science, University of CopenhagenDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutDepartment of Regional Health Research (Esbjerg), University of Southern DenmarkDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Veterinary and Animal Science, University of CopenhagenDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutAbstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but need to balance accuracy and rapid turnaround time of the results. While Illumina short-read sequencing has been adopted as the gold standard to investigate C. difficile virulence and transmission routes, little is known about the potential of Nanopore long-read sequencing in this field. The goal of our study was to compare sequencing and assembly quality of 37 C. difficile isolates using Illumina (SPAdes assembled) and Nanopore (Flye and Unicycler assembled) data alone, along with hybrid assemblies obtained with short-read polishing of long reads. Results Illumina sequencing produced reads with an average quality of 99.68% (Q25), while Nanopore sequencing produced reads reaching an average quality of 96.84% (Q15), showing a tenfold difference in quality. Sequence type (ST) designation from Nanopore assemblies failed to detect ST5, ST7, ST8, ST13 and ST49, while ST designation based on unpolished Nanopore reads using Krocus was successful for all STs. Nanopore sequences exhibited an average of 640 base errors per genome (~ 0.015% substitution rate), which was reflected by the incorrect assignment of over 180 alleles in core genome multilocus sequence typing (cgMLST) analysis. As a result, Nanopore-derived phylogenies were not as accurate as the Illumina reference, and therefore inadequate for precise investigation of transmission events. Both sequencing platforms provided comparable, satisfactory results for the detection of virulence genes tcdA, tcdB, cdtAB and in-frame deletions in tcdC. Conclusion Compared to Illumina, Nanopore has higher error rate, which limits its application for high-resolution epidemiological surveillance. However, the short analysis time, lower cost and more simple procedure combined with correctly identified STs and virulence genes, makes it an alternative when fast and less detailed analyses are preferred.https://doi.org/10.1186/s12864-025-11267-9Clostridioides difficileWhole genome sequencing (WGS)IlluminaOxford NanoporeEpidemiological surveillanceGenotyping |
spellingShingle | Semeh Bejaoui Sofie Holtsmark Nielsen Astrid Rasmussen John Eugenio Coia Dorte Terp Andersen Tobias Bruun Pedersen Martin Vad Møller Marc Trunjer Kusk Nielsen Dorte Frees Søren Persson Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance BMC Genomics Clostridioides difficile Whole genome sequencing (WGS) Illumina Oxford Nanopore Epidemiological surveillance Genotyping |
title | Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance |
title_full | Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance |
title_fullStr | Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance |
title_full_unstemmed | Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance |
title_short | Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance |
title_sort | comparison of illumina and oxford nanopore sequencing data quality for clostridioides difficile genome analysis and their application for epidemiological surveillance |
topic | Clostridioides difficile Whole genome sequencing (WGS) Illumina Oxford Nanopore Epidemiological surveillance Genotyping |
url | https://doi.org/10.1186/s12864-025-11267-9 |
work_keys_str_mv | AT semehbejaoui comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT sofieholtsmarknielsen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT astridrasmussen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT johneugeniocoia comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT dorteterpandersen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT tobiasbruunpedersen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT martinvadmøller comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT marctrunjerkusknielsen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT dortefrees comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance AT sørenpersson comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance |