Different orthology inference algorithms generate similar predicted orthogroups among Brassicaceae species

Abstract Premise Orthology inference is crucial for comparative genomics, and multiple algorithms have been developed to identify putative orthologs for downstream analyses. Despite the abundance of proposed solutions, including publicly available benchmarks, it is difficult to assess which tool is...

Full description

Saved in:
Bibliographic Details
Main Authors: Irene T. Liao, Karen E. Sears, Lena C. Hileman, Lachezar A. Nikolov
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:Applications in Plant Sciences
Subjects:
Online Access:https://doi.org/10.1002/aps3.11627
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Premise Orthology inference is crucial for comparative genomics, and multiple algorithms have been developed to identify putative orthologs for downstream analyses. Despite the abundance of proposed solutions, including publicly available benchmarks, it is difficult to assess which tool is most suitable for plant species, which commonly have complex genomic histories. Methods We explored the performance of four orthology inference algorithms—OrthoFinder, SonicParanoid, Broccoli, and OrthNet—on eight Brassicaceae genomes in two groups: one group comprising only diploids and another set comprising the diploids, two mesopolyploids, and one recent hexaploid genome. Results The composition of the orthogroups reflected the species' ploidy and genomic histories, with the diploid set having a higher proportion of identical orthogroups. While the diploid + higher ploidy set had a lower proportion of orthogroups with identical compositions, the average degree of similarity between the orthogroups was not different from the diploid set. Discussion Three algorithms—OrthoFinder, SonicParanoid, and Broccoli—are helpful for initial orthology predictions. Results produced using OrthNet were generally outliers but could still provide detailed information about gene colinearity. With our Brassicaceae dataset, slight discrepancies were found across the orthology inference algorithms, necessitating additional analyses such as tree inference to fine‐tune results.
ISSN:2168-0450