FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning

To optimize the control of fan coil unit (FCU) systems under model-free conditions, researchers have integrated reinforcement learning (RL) into the control processes of system pumps and fans. However, traditional RL methods can lead to significant fluctuations in the flow of pumps and fans, posing...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chenyang Li, Qiming Fu, Jianping Chen, You Lu, Yunzhe Wang, Hongjie Wu
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Buildings
Subjects:	safe reinforcement learning fan coil unit system model-free control building energy efficiency
Online Access:	https://www.mdpi.com/2075-5309/15/2/226
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832588840241463296
author	Chenyang Li Qiming Fu Jianping Chen You Lu Yunzhe Wang Hongjie Wu
author_facet	Chenyang Li Qiming Fu Jianping Chen You Lu Yunzhe Wang Hongjie Wu
author_sort	Chenyang Li
collection	DOAJ
description	To optimize the control of fan coil unit (FCU) systems under model-free conditions, researchers have integrated reinforcement learning (RL) into the control processes of system pumps and fans. However, traditional RL methods can lead to significant fluctuations in the flow of pumps and fans, posing a safety risk. To address this issue, we propose a novel FCU control method, Fluctuation Suppression–Deep Deterministic Policy Gradient (FS-DDPG). The key innovation lies in applying a constrained Markov decision process to model the FCU control problem, where a penalty term for process constraints is incorporated into the reward function, and constraint tightening is introduced to limit the action space. In addition, to validate the performance of the proposed method, we established a variable operating conditions FCU simulation platform based on the parameters of the actual FCU system and ten years of historical weather data. The platform’s correctness and effectiveness were verified from three aspects: heat transfer, the air side and the water side, under different dry and wet operating conditions. The experimental results show that compared with DDPG, FS-DDPG avoids 98.20% of the pump flow and 95.82% of the fan flow fluctuations, ensuring the safety of the equipment. Compared with DDPG and RBC, FS-DDPG achieves 11.9% and 51.76% energy saving rates, respectively, and also shows better performance in terms of operational performance and satisfaction. In the future, we will further improve the scalability and apply the method to more complex FCU systems in variable environments.
format	Article
id	doaj-art-66fcb40a137f4d8dbeafa9e5444d3f57
institution	Kabale University
issn	2075-5309
language	English
publishDate	2025-01-01
publisher	MDPI AG
record_format	Article
series	Buildings
spelling	doaj-art-66fcb40a137f4d8dbeafa9e5444d3f572025-01-24T13:26:15ZengMDPI AGBuildings2075-53092025-01-0115222610.3390/buildings15020226FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement LearningChenyang Li0Qiming Fu1Jianping Chen2You Lu3Yunzhe Wang4Hongjie Wu5School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaSchool of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaSchool of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaSchool of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaSchool of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaSchool of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, ChinaTo optimize the control of fan coil unit (FCU) systems under model-free conditions, researchers have integrated reinforcement learning (RL) into the control processes of system pumps and fans. However, traditional RL methods can lead to significant fluctuations in the flow of pumps and fans, posing a safety risk. To address this issue, we propose a novel FCU control method, Fluctuation Suppression–Deep Deterministic Policy Gradient (FS-DDPG). The key innovation lies in applying a constrained Markov decision process to model the FCU control problem, where a penalty term for process constraints is incorporated into the reward function, and constraint tightening is introduced to limit the action space. In addition, to validate the performance of the proposed method, we established a variable operating conditions FCU simulation platform based on the parameters of the actual FCU system and ten years of historical weather data. The platform’s correctness and effectiveness were verified from three aspects: heat transfer, the air side and the water side, under different dry and wet operating conditions. The experimental results show that compared with DDPG, FS-DDPG avoids 98.20% of the pump flow and 95.82% of the fan flow fluctuations, ensuring the safety of the equipment. Compared with DDPG and RBC, FS-DDPG achieves 11.9% and 51.76% energy saving rates, respectively, and also shows better performance in terms of operational performance and satisfaction. In the future, we will further improve the scalability and apply the method to more complex FCU systems in variable environments.https://www.mdpi.com/2075-5309/15/2/226safe reinforcement learningfan coil unit systemmodel-free controlbuilding energy efficiency
spellingShingle	Chenyang Li Qiming Fu Jianping Chen You Lu Yunzhe Wang Hongjie Wu FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning Buildings safe reinforcement learning fan coil unit system model-free control building energy efficiency
title	FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning
title_full	FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning
title_fullStr	FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning
title_full_unstemmed	FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning
title_short	FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning
title_sort	fs ddpg optimal control of a fan coil unit system based on safe reinforcement learning
topic	safe reinforcement learning fan coil unit system model-free control building energy efficiency
url	https://www.mdpi.com/2075-5309/15/2/226
work_keys_str_mv	AT chenyangli fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning AT qimingfu fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning AT jianpingchen fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning AT youlu fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning AT yunzhewang fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning AT hongjiewu fsddpgoptimalcontrolofafancoilunitsystembasedonsafereinforcementlearning

FS-DDPG: Optimal Control of a Fan Coil Unit System Based on Safe Reinforcement Learning

Similar Items