Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types

Abstract Tylosema esculentum (marama bean), an underutilized orphan legume native to southern Africa, holds significant potential for domestication as a rescue crop to enhance local food security. Well-adapted to harsh desert environments, it offers valuable insights into plant resilience to extreme...

Full description

Saved in:
Bibliographic Details
Main Authors: Jin Li, Christopher Cullis
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-86023-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585866598416384
author Jin Li
Christopher Cullis
author_facet Jin Li
Christopher Cullis
author_sort Jin Li
collection DOAJ
description Abstract Tylosema esculentum (marama bean), an underutilized orphan legume native to southern Africa, holds significant potential for domestication as a rescue crop to enhance local food security. Well-adapted to harsh desert environments, it offers valuable insights into plant resilience to extreme drought and high temperatures. In this study, k-mer analysis indicated marama as an ancient allotetraploid legume. Using 21.5 Gb of PacBio HiFi data, the genome was assembled with two assemblers, HiCanu and Hifiasm, followed by scaffolding with Omni-C data from Dovetail Genomics (Cantata Bio) using HiRise, resulting in a 558.78 Mb assembly with near chromosome-level continuity (N50 = 22.68 Mb, L50 = 8). Repeats accounted for 58.43% of the genome. Phylogenetic analysis indicated a close relationship with Bauhinia variegata and Cercis canadensis, diverging approximately 27.22 and 31.68 million years ago (Ma), respectively. Whole-genome duplication (WGD) analysis revealed an ancient duplication event in marama. Gene family analysis revealed expanded families enriched in pathways related to stress adaptation, energy metabolism, and environmental signaling, including the spliceosome, citrate cycle, and carbon fixation pathways. These findings highlight marama’s resilience to arid environments. In contrast, contracted gene families associated with secondary metabolite biosynthesis and defense pathways suggest a trade-off, potentially due to reduced pathogen pressure. Marama-specific genes were enriched in amino acid catabolism pathways, potentially playing roles in stress signaling and energy regulation. Core gene families shared with other legumes were enriched in conserved pathways, such as photosynthesis and hormone signaling, which are fundamental for plant growth and survival. Population analysis of geographically diverse samples revealed two distinct clusters, though phenotypic differences remain unclear. Overall, this study presents the first high-quality genome assembly of marama bean, offering a valuable genomic reference for understanding its unique biology and highlighting its potential for crop improvement in challenging environments.
format Article
id doaj-art-b0844691ceeb4a3ca387ecaf0eac2d60
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-b0844691ceeb4a3ca387ecaf0eac2d602025-01-26T12:28:26ZengNature PortfolioScientific Reports2045-23222025-01-0115111610.1038/s41598-025-86023-wGenome assembly and population analysis of tetraploid marama bean reveal two distinct genome typesJin Li0Christopher Cullis1Department of Biology, Case Western Reserve UniversityDepartment of Biology, Case Western Reserve UniversityAbstract Tylosema esculentum (marama bean), an underutilized orphan legume native to southern Africa, holds significant potential for domestication as a rescue crop to enhance local food security. Well-adapted to harsh desert environments, it offers valuable insights into plant resilience to extreme drought and high temperatures. In this study, k-mer analysis indicated marama as an ancient allotetraploid legume. Using 21.5 Gb of PacBio HiFi data, the genome was assembled with two assemblers, HiCanu and Hifiasm, followed by scaffolding with Omni-C data from Dovetail Genomics (Cantata Bio) using HiRise, resulting in a 558.78 Mb assembly with near chromosome-level continuity (N50 = 22.68 Mb, L50 = 8). Repeats accounted for 58.43% of the genome. Phylogenetic analysis indicated a close relationship with Bauhinia variegata and Cercis canadensis, diverging approximately 27.22 and 31.68 million years ago (Ma), respectively. Whole-genome duplication (WGD) analysis revealed an ancient duplication event in marama. Gene family analysis revealed expanded families enriched in pathways related to stress adaptation, energy metabolism, and environmental signaling, including the spliceosome, citrate cycle, and carbon fixation pathways. These findings highlight marama’s resilience to arid environments. In contrast, contracted gene families associated with secondary metabolite biosynthesis and defense pathways suggest a trade-off, potentially due to reduced pathogen pressure. Marama-specific genes were enriched in amino acid catabolism pathways, potentially playing roles in stress signaling and energy regulation. Core gene families shared with other legumes were enriched in conserved pathways, such as photosynthesis and hormone signaling, which are fundamental for plant growth and survival. Population analysis of geographically diverse samples revealed two distinct clusters, though phenotypic differences remain unclear. Overall, this study presents the first high-quality genome assembly of marama bean, offering a valuable genomic reference for understanding its unique biology and highlighting its potential for crop improvement in challenging environments.https://doi.org/10.1038/s41598-025-86023-wMarama beanGenome assemblyTetraploidPlant adaptationPhylogeneticsWhole genome duplication (WGD)
spellingShingle Jin Li
Christopher Cullis
Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
Scientific Reports
Marama bean
Genome assembly
Tetraploid
Plant adaptation
Phylogenetics
Whole genome duplication (WGD)
title Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
title_full Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
title_fullStr Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
title_full_unstemmed Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
title_short Genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
title_sort genome assembly and population analysis of tetraploid marama bean reveal two distinct genome types
topic Marama bean
Genome assembly
Tetraploid
Plant adaptation
Phylogenetics
Whole genome duplication (WGD)
url https://doi.org/10.1038/s41598-025-86023-w
work_keys_str_mv AT jinli genomeassemblyandpopulationanalysisoftetraploidmaramabeanrevealtwodistinctgenometypes
AT christophercullis genomeassemblyandpopulationanalysisoftetraploidmaramabeanrevealtwodistinctgenometypes