Accelerated enzyme engineering by machine-learning guided cell-free expression

Abstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expres...

Full description

Saved in:
Bibliographic Details
Main Authors: Grant M. Landwehr, Jonathan W. Bogart, Carol Magalhaes, Eric G. Hammarlund, Ashty S. Karim, Michael C. Jewett
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-024-55399-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585544536686592
author Grant M. Landwehr
Jonathan W. Bogart
Carol Magalhaes
Eric G. Hammarlund
Ashty S. Karim
Michael C. Jewett
author_facet Grant M. Landwehr
Jonathan W. Bogart
Carol Magalhaes
Eric G. Hammarlund
Ashty S. Karim
Michael C. Jewett
author_sort Grant M. Landwehr
collection DOAJ
description Abstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel.
format Article
id doaj-art-0e787d1dfc844c9e93cb02901d01f8b2
institution Kabale University
issn 2041-1723
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj-art-0e787d1dfc844c9e93cb02901d01f8b22025-01-26T12:41:37ZengNature PortfolioNature Communications2041-17232025-01-0116111310.1038/s41467-024-55399-0Accelerated enzyme engineering by machine-learning guided cell-free expressionGrant M. Landwehr0Jonathan W. Bogart1Carol Magalhaes2Eric G. Hammarlund3Ashty S. Karim4Michael C. Jewett5Department of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityDepartment of Chemical and Biological Engineering, Northwestern UniversityAbstract Enzyme engineering is limited by the challenge of rapidly generating and using large datasets of sequence-function relationships for predictive design. To address this challenge, we develop a machine learning (ML)-guided platform that integrates cell-free DNA assembly, cell-free gene expression, and functional assays to rapidly map fitness landscapes across protein sequence space and optimize enzymes for multiple, distinct chemical reactions. We apply this platform to engineer amide synthetases by evaluating substrate preference for 1217 enzyme variants in 10,953 unique reactions. We use these data to build augmented ridge regression ML models for predicting amide synthetase variants capable of making 9 small molecule pharmaceuticals. Over these nine compounds, ML-predicted enzyme variants demonstrate 1.6- to 42-fold improved activity relative to the parent. Our ML-guided, cell-free framework promises to accelerate enzyme engineering by enabling iterative exploration of protein sequence space to build specialized biocatalysts in parallel.https://doi.org/10.1038/s41467-024-55399-0
spellingShingle Grant M. Landwehr
Jonathan W. Bogart
Carol Magalhaes
Eric G. Hammarlund
Ashty S. Karim
Michael C. Jewett
Accelerated enzyme engineering by machine-learning guided cell-free expression
Nature Communications
title Accelerated enzyme engineering by machine-learning guided cell-free expression
title_full Accelerated enzyme engineering by machine-learning guided cell-free expression
title_fullStr Accelerated enzyme engineering by machine-learning guided cell-free expression
title_full_unstemmed Accelerated enzyme engineering by machine-learning guided cell-free expression
title_short Accelerated enzyme engineering by machine-learning guided cell-free expression
title_sort accelerated enzyme engineering by machine learning guided cell free expression
url https://doi.org/10.1038/s41467-024-55399-0
work_keys_str_mv AT grantmlandwehr acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression
AT jonathanwbogart acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression
AT carolmagalhaes acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression
AT ericghammarlund acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression
AT ashtyskarim acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression
AT michaelcjewett acceleratedenzymeengineeringbymachinelearningguidedcellfreeexpression