Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning

Thermostatic Radiator Valves (TRVs) are a widely used technology for regulating room heating in Europe countries. Smart TRVs can provide significant energy savings, often ranging from 20–40% compared to conventional heating systems. They use sensors and algorithms to learn user behavior and optimize...

Full description

Saved in:
Bibliographic Details
Main Authors: Payam Fatehi Karjou, Fabian Stupperich, Phillip Stoffel, Drk Müller
Format: Article
Language:English
Published: Elsevier 2025-05-01
Series:Energy and AI
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666546825000229
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850219560334000128
author Payam Fatehi Karjou
Fabian Stupperich
Phillip Stoffel
Drk Müller
author_facet Payam Fatehi Karjou
Fabian Stupperich
Phillip Stoffel
Drk Müller
author_sort Payam Fatehi Karjou
collection DOAJ
description Thermostatic Radiator Valves (TRVs) are a widely used technology for regulating room heating in Europe countries. Smart TRVs can provide significant energy savings, often ranging from 20–40% compared to conventional heating systems. They use sensors and algorithms to learn user behavior and optimize heating schedules accordingly. They can often be easily retrofitted to existing heating systems, making them a practical option for enhancing energy efficiency in present buildings, especially in office buildings due to their highly dynamic operational patterns. This work presents a novel human-in-the-loop control strategy for Internet of Things (IoT)-based TRVs using Deep Reinforcement Learning (DRL). A key focus of this research is enhancing the adaptability of agents’ behavior by implementing a more generic and flexible Markov Decision Process (MDP) to promote policy generalization across diverse scenarios. The study explores the challenges of transferring control behaviors from simulation environments to real-world settings, examining the performance across different thermal zones and evaluating the integration flexibility of the control strategy within building systems. Real-world occupant behavior is incorporated, including dynamic comfort preferences and occupancy predictions, to better align thermostat operation with user preferences. Furthermore, this paper discusses the practical challenges encountered during implementation, including battery consumption of IoT devices, integration of occupancy detection and prediction systems, and maintenance requirements. By addressing these issues, the proposed control strategy seeks to improve the scalability and feasibility of IoT-based TRVs, thereby providing a viable solution for their widespread deployment in buildings.
format Article
id doaj-art-84c0b4e3dc53473fbf05dcacd1dd389d
institution OA Journals
issn 2666-5468
language English
publishDate 2025-05-01
publisher Elsevier
record_format Article
series Energy and AI
spelling doaj-art-84c0b4e3dc53473fbf05dcacd1dd389d2025-08-20T02:07:20ZengElsevierEnergy and AI2666-54682025-05-012010049010.1016/j.egyai.2025.100490Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement LearningPayam Fatehi Karjou0Fabian Stupperich1Phillip Stoffel2Drk Müller3Corresponding author.; RWTH Aachen University, Institute for Energy Efficient Buildings and Indoor Climate, Aachen, GermanyRWTH Aachen University, Institute for Energy Efficient Buildings and Indoor Climate, Aachen, GermanyRWTH Aachen University, Institute for Energy Efficient Buildings and Indoor Climate, Aachen, GermanyRWTH Aachen University, Institute for Energy Efficient Buildings and Indoor Climate, Aachen, GermanyThermostatic Radiator Valves (TRVs) are a widely used technology for regulating room heating in Europe countries. Smart TRVs can provide significant energy savings, often ranging from 20–40% compared to conventional heating systems. They use sensors and algorithms to learn user behavior and optimize heating schedules accordingly. They can often be easily retrofitted to existing heating systems, making them a practical option for enhancing energy efficiency in present buildings, especially in office buildings due to their highly dynamic operational patterns. This work presents a novel human-in-the-loop control strategy for Internet of Things (IoT)-based TRVs using Deep Reinforcement Learning (DRL). A key focus of this research is enhancing the adaptability of agents’ behavior by implementing a more generic and flexible Markov Decision Process (MDP) to promote policy generalization across diverse scenarios. The study explores the challenges of transferring control behaviors from simulation environments to real-world settings, examining the performance across different thermal zones and evaluating the integration flexibility of the control strategy within building systems. Real-world occupant behavior is incorporated, including dynamic comfort preferences and occupancy predictions, to better align thermostat operation with user preferences. Furthermore, this paper discusses the practical challenges encountered during implementation, including battery consumption of IoT devices, integration of occupancy detection and prediction systems, and maintenance requirements. By addressing these issues, the proposed control strategy seeks to improve the scalability and feasibility of IoT-based TRVs, thereby providing a viable solution for their widespread deployment in buildings.http://www.sciencedirect.com/science/article/pii/S2666546825000229Human-in-the-loop controlAITRVRLCIoT
spellingShingle Payam Fatehi Karjou
Fabian Stupperich
Phillip Stoffel
Drk Müller
Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
Energy and AI
Human-in-the-loop control
AI
TRV
RLC
IoT
title Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
title_full Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
title_fullStr Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
title_full_unstemmed Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
title_short Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
title_sort human in the loop control strategy for iot based smart thermostats with deep reinforcement learning
topic Human-in-the-loop control
AI
TRV
RLC
IoT
url http://www.sciencedirect.com/science/article/pii/S2666546825000229
work_keys_str_mv AT payamfatehikarjou humanintheloopcontrolstrategyforiotbasedsmartthermostatswithdeepreinforcementlearning
AT fabianstupperich humanintheloopcontrolstrategyforiotbasedsmartthermostatswithdeepreinforcementlearning
AT phillipstoffel humanintheloopcontrolstrategyforiotbasedsmartthermostatswithdeepreinforcementlearning
AT drkmuller humanintheloopcontrolstrategyforiotbasedsmartthermostatswithdeepreinforcementlearning