Species determination using AI machine-learning algorithms: Hebeloma as a case study

Abstract The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but a...

Full description

Saved in:
Bibliographic Details
Main Authors: Peter Bartlett, Ursula Eberhardt, Nicole Schütz, Henry J. Beker
Format: Article
Language:English
Published: BMC 2022-06-01
Series:IMA Fungus
Subjects:
Online Access:https://doi.org/10.1186/s43008-022-00099-x
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832564621664321536
author Peter Bartlett
Ursula Eberhardt
Nicole Schütz
Henry J. Beker
author_facet Peter Bartlett
Ursula Eberhardt
Nicole Schütz
Henry J. Beker
author_sort Peter Bartlett
collection DOAJ
description Abstract The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.
format Article
id doaj-art-8c014a7648cd41d5b9c0c2e67fe74f00
institution Kabale University
issn 2210-6359
language English
publishDate 2022-06-01
publisher BMC
record_format Article
series IMA Fungus
spelling doaj-art-8c014a7648cd41d5b9c0c2e67fe74f002025-02-03T01:10:39ZengBMCIMA Fungus2210-63592022-06-0113112010.1186/s43008-022-00099-xSpecies determination using AI machine-learning algorithms: Hebeloma as a case studyPeter Bartlett0Ursula Eberhardt1Nicole Schütz2Henry J. BekerLa BarakaStaatliches Museum für Naturkunde StuttgartStaatliches Museum für Naturkunde StuttgartAbstract The genus Hebeloma is renowned as difficult when it comes to species determination. Historically, many dichotomous keys have been published and used with varying success rate. Over the last 20 years the authors have built a database of Hebeloma collections containing not only metadata but also parametrized morphological descriptions, where for about a third of the cases micromorphological characters have been analysed and are included, as well as DNA sequences for almost every collection. The database now has about 9000 collections including nearly every type collection worldwide and represents over 120 different taxa. Almost every collection has been analysed and identified to species using a combination of the available molecular and morphological data in addition to locality and habitat information. Based on these data an Artificial Intelligence (AI) machine-learning species identifier has been developed that takes as input locality data and a small number of the morphological parameters. Using a random test set of more than 600 collections from the database, not utilized within the set of collections used to train the identifier, the species identifier was able to identify 77% correctly with its highest probabilistic match, 96% within its three most likely determinations and over 99% of collections within its five most likely determinations.https://doi.org/10.1186/s43008-022-00099-xAgaricalesEctomycorrhizal fungiIdentification keysTaxonomyNeural networks
spellingShingle Peter Bartlett
Ursula Eberhardt
Nicole Schütz
Henry J. Beker
Species determination using AI machine-learning algorithms: Hebeloma as a case study
IMA Fungus
Agaricales
Ectomycorrhizal fungi
Identification keys
Taxonomy
Neural networks
title Species determination using AI machine-learning algorithms: Hebeloma as a case study
title_full Species determination using AI machine-learning algorithms: Hebeloma as a case study
title_fullStr Species determination using AI machine-learning algorithms: Hebeloma as a case study
title_full_unstemmed Species determination using AI machine-learning algorithms: Hebeloma as a case study
title_short Species determination using AI machine-learning algorithms: Hebeloma as a case study
title_sort species determination using ai machine learning algorithms hebeloma as a case study
topic Agaricales
Ectomycorrhizal fungi
Identification keys
Taxonomy
Neural networks
url https://doi.org/10.1186/s43008-022-00099-x
work_keys_str_mv AT peterbartlett speciesdeterminationusingaimachinelearningalgorithmshebelomaasacasestudy
AT ursulaeberhardt speciesdeterminationusingaimachinelearningalgorithmshebelomaasacasestudy
AT nicoleschutz speciesdeterminationusingaimachinelearningalgorithmshebelomaasacasestudy
AT henryjbeker speciesdeterminationusingaimachinelearningalgorithmshebelomaasacasestudy