Accelerated enzyme engineering by machine-learning guided cell-free expression
Abstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expres...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2025-01-01
|
Series: | Nature Communications |
Online Access: | https://doi.org/10.1038/s41467-024-55399-0 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832585544536686592 |
---|---|
author | Grant M. Landwehr Jonathan W. Bogart Carol Magalhaes Eric G. Hammarlund Ashty S. Karim Michael C. Jewett |
author_facet | Grant M. Landwehr Jonathan W. Bogart Carol Magalhaes Eric G. Hammarlund Ashty S. Karim Michael C. Jewett |
author_sort | Grant M. Landwehr |
collection | DOAJ |
description | Abstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel. |
format | Article |
id | doaj-art-0e787d1dfc844c9e93cb02901d01f8b2 |
institution | Kabale University |
issn | 2041-1723 |
language | English |
publishDate | 2025-01-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Nature Communications |
spelling | doaj-art-0e787d1dfc844c9e93cb02901d01f8b22025-01-26T12:41:37ZengNature PortfolioNature Communications2041-17232025-01-0116111310.1038/s41467-024-55399-0Accelerated enzyme engineering by machine-learning guided cell-free expressionGrant M. Landwehr0Jonathan W. Bogart1Carol Magalhaes2Eric G. Hammarlund3Ashty S. Karim4Michael C. Jewett5Department of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityAbstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel.https://doi.org/10.1038/s41467-024-55399-0 |
spellingShingle | Grant M. Landwehr Jonathan W. Bogart Carol Magalhaes Eric G. Hammarlund Ashty S. Karim Michael C. Jewett Accelerated enzyme engineering by machine-learning guided cell-free expression Nature Communications |
title | Accelerated enzyme engineering by machine-learning guided cell-free expression |
title_full | Accelerated enzyme engineering by machine-learning guided cell-free expression |
title_fullStr | Accelerated enzyme engineering by machine-learning guided cell-free expression |
title_full_unstemmed | Accelerated enzyme engineering by machine-learning guided cell-free expression |
title_short | Accelerated enzyme engineering by machine-learning guided cell-free expression |
title_sort | accelerated enzyme engineering by machine learning guided cell free expression |
url | https://doi.org/10.1038/s41467-024-55399-0 |
work_keys_str_mv | AT grantmlandwehr acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression AT jonathanwbogart acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression AT carolmagalhaes acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression AT ericghammarlund acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression AT ashtyskarim acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression AT michaelcjewett acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression |