Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance

Abstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but ne...

Full description

Saved in:
Bibliographic Details
Main Authors: Semeh Bejaoui, Sofie Holtsmark Nielsen, Astrid Rasmussen, John Eugenio Coia, Dorte Terp Andersen, Tobias Bruun Pedersen, Martin Vad Møller, Marc Trunjer Kusk Nielsen, Dorte Frees, Søren Persson
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-025-11267-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832572037137170432
author Semeh Bejaoui
Sofie Holtsmark Nielsen
Astrid Rasmussen
John Eugenio Coia
Dorte Terp Andersen
Tobias Bruun Pedersen
Martin Vad Møller
Marc Trunjer Kusk Nielsen
Dorte Frees
Søren Persson
author_facet Semeh Bejaoui
Sofie Holtsmark Nielsen
Astrid Rasmussen
John Eugenio Coia
Dorte Terp Andersen
Tobias Bruun Pedersen
Martin Vad Møller
Marc Trunjer Kusk Nielsen
Dorte Frees
Søren Persson
author_sort Semeh Bejaoui
collection DOAJ
description Abstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but need to balance accuracy and rapid turnaround time of the results. While Illumina short-read sequencing has been adopted as the gold standard to investigate C. difficile virulence and transmission routes, little is known about the potential of Nanopore long-read sequencing in this field. The goal of our study was to compare sequencing and assembly quality of 37 C. difficile isolates using Illumina (SPAdes assembled) and Nanopore (Flye and Unicycler assembled) data alone, along with hybrid assemblies obtained with short-read polishing of long reads. Results Illumina sequencing produced reads with an average quality of 99.68% (Q25), while Nanopore sequencing produced reads reaching an average quality of 96.84% (Q15), showing a tenfold difference in quality. Sequence type (ST) designation from Nanopore assemblies failed to detect ST5, ST7, ST8, ST13 and ST49, while ST designation based on unpolished Nanopore reads using Krocus was successful for all STs. Nanopore sequences exhibited an average of 640 base errors per genome (~ 0.015% substitution rate), which was reflected by the incorrect assignment of over 180 alleles in core genome multilocus sequence typing (cgMLST) analysis. As a result, Nanopore-derived phylogenies were not as accurate as the Illumina reference, and therefore inadequate for precise investigation of transmission events. Both sequencing platforms provided comparable, satisfactory results for the detection of virulence genes tcdA, tcdB, cdtAB and in-frame deletions in tcdC. Conclusion Compared to Illumina, Nanopore has higher error rate, which limits its application for high-resolution epidemiological surveillance. However, the short analysis time, lower cost and more simple procedure combined with correctly identified STs and virulence genes, makes it an alternative when fast and less detailed analyses are preferred.
format Article
id doaj-art-e4d12d12f2be4d92bbd8562bfad81c7d
institution Kabale University
issn 1471-2164
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj-art-e4d12d12f2be4d92bbd8562bfad81c7d2025-02-02T12:10:10ZengBMCBMC Genomics1471-21642025-01-0126111510.1186/s12864-025-11267-9Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillanceSemeh Bejaoui0Sofie Holtsmark Nielsen1Astrid Rasmussen2John Eugenio Coia3Dorte Terp Andersen4Tobias Bruun Pedersen5Martin Vad Møller6Marc Trunjer Kusk Nielsen7Dorte Frees8Søren Persson9Department of Veterinary and Animal Science, University of CopenhagenDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutDepartment of Regional Health Research (Esbjerg), University of Southern DenmarkDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Clinical Diagnostics, Esbjerg and Grindsted Hospital, University Hospital of SouthernDepartment of Veterinary and Animal Science, University of CopenhagenDepartment of Bacteria, Parasites and Fungi, Statens Serum InstitutAbstract Background The burden of Clostridioides difficile as a nosocomial- and community-acquired pathogen has been increasing over the recent decades, including reports of severe outbreaks. Molecular and virulence genotyping are central for the epidemiological surveillance of this pathogen, but need to balance accuracy and rapid turnaround time of the results. While Illumina short-read sequencing has been adopted as the gold standard to investigate C. difficile virulence and transmission routes, little is known about the potential of Nanopore long-read sequencing in this field. The goal of our study was to compare sequencing and assembly quality of 37 C. difficile isolates using Illumina (SPAdes assembled) and Nanopore (Flye and Unicycler assembled) data alone, along with hybrid assemblies obtained with short-read polishing of long reads. Results Illumina sequencing produced reads with an average quality of 99.68% (Q25), while Nanopore sequencing produced reads reaching an average quality of 96.84% (Q15), showing a tenfold difference in quality. Sequence type (ST) designation from Nanopore assemblies failed to detect ST5, ST7, ST8, ST13 and ST49, while ST designation based on unpolished Nanopore reads using Krocus was successful for all STs. Nanopore sequences exhibited an average of 640 base errors per genome (~ 0.015% substitution rate), which was reflected by the incorrect assignment of over 180 alleles in core genome multilocus sequence typing (cgMLST) analysis. As a result, Nanopore-derived phylogenies were not as accurate as the Illumina reference, and therefore inadequate for precise investigation of transmission events. Both sequencing platforms provided comparable, satisfactory results for the detection of virulence genes tcdA, tcdB, cdtAB and in-frame deletions in tcdC. Conclusion Compared to Illumina, Nanopore has higher error rate, which limits its application for high-resolution epidemiological surveillance. However, the short analysis time, lower cost and more simple procedure combined with correctly identified STs and virulence genes, makes it an alternative when fast and less detailed analyses are preferred.https://doi.org/10.1186/s12864-025-11267-9Clostridioides difficileWhole genome sequencing (WGS)IlluminaOxford NanoporeEpidemiological surveillanceGenotyping
spellingShingle Semeh Bejaoui
Sofie Holtsmark Nielsen
Astrid Rasmussen
John Eugenio Coia
Dorte Terp Andersen
Tobias Bruun Pedersen
Martin Vad Møller
Marc Trunjer Kusk Nielsen
Dorte Frees
Søren Persson
Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
BMC Genomics
Clostridioides difficile
Whole genome sequencing (WGS)
Illumina
Oxford Nanopore
Epidemiological surveillance
Genotyping
title Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
title_full Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
title_fullStr Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
title_full_unstemmed Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
title_short Comparison of Illumina and Oxford Nanopore sequencing data quality for Clostridioides difficile genome analysis and their application for epidemiological surveillance
title_sort comparison of illumina and oxford nanopore sequencing data quality for clostridioides difficile genome analysis and their application for epidemiological surveillance
topic Clostridioides difficile
Whole genome sequencing (WGS)
Illumina
Oxford Nanopore
Epidemiological surveillance
Genotyping
url https://doi.org/10.1186/s12864-025-11267-9
work_keys_str_mv AT semehbejaoui comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT sofieholtsmarknielsen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT astridrasmussen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT johneugeniocoia comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT dorteterpandersen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT tobiasbruunpedersen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT martinvadmøller comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT marctrunjerkusknielsen comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT dortefrees comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance
AT sørenpersson comparisonofilluminaandoxfordnanoporesequencingdataqualityforclostridioidesdifficilegenomeanalysisandtheirapplicationforepidemiologicalsurveillance