A Deep Learning-Based Approach for Predicting Michaelis Constants from Enzymatic Reactions

The Michaelis constant (Km) is defined as the substrate concentration at which an enzymatic reaction reaches half of its maximum reaction velocity. The determination of Km can be applied to the construction and optimization of metabolic networks. Conventional determinations of Km values based on in...

Full description

Saved in:
Bibliographic Details
Main Authors: Yulong Li, Kai Wang
Format: Article
Language:English
Published: MDPI AG 2025-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/7/4017
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The Michaelis constant (Km) is defined as the substrate concentration at which an enzymatic reaction reaches half of its maximum reaction velocity. The determination of Km can be applied to the construction and optimization of metabolic networks. Conventional determinations of Km values based on in vitro experiments are time-consuming and expensive. Although there are a series of computational approaches of determining Km values based on deep learning, the complex biological information in enzymatic reactions still makes it challenging to achieve accurate predictions. In this study, we develop a novel deep learning approach called DLERKm for predicting Km by combining the features of enzymatic reactions including products. We constructed a new enzymatic reaction dataset from the Sabio-RK and UniProt databases for the training and testing of DLERKm, which include the information on substrates, products, enzyme sequences, and Km values. DLERKm utilizes pre-trained language models (ESM-2 and RXNFP), molecular fingerprints, and attention mechanisms to extract enzymatic reaction features for the prediction of Km values. To evaluate the performance of DLERKm, we compared it with a state-of-the-art model (UniKP) on the constructed enzymatic reaction datasets. The model prediction results demonstrate that DLERKm achieved superior prediction performances in terms of the evaluated metrics on the benchmark datasets, where the relative improvements of four metrics (RMSE, MAE, PCC, and R<sup>2</sup>) were 16.3%, 16.5%, 27.7%, and 14.9%, respectively. Ablation experiments and interpretability analysis demonstrate the importance of considering product information when predicting Km values. In addition, DLERKm exhibits reliable predictive performances for different types of enzymatic reactions.
ISSN:2076-3417