Dynamic Task Planning for Multi-Arm Harvesting Robots Under Multiple Constraints Using Deep Reinforcement Learning

Global fruit production costs are increasing amid intensified labor shortages, driving heightened interest in robotic harvesting technologies. Although multi-arm coordination in harvesting robots is considered a highly promising solution to this issue, it introduces technical challenges in achieving...

Bibliographic Details
Main Authors:	Feng Xie, Zhengwei Guo, Tao Li, Qingchun Feng, Chunjiang Zhao
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Horticulturae
Subjects:	multi-arm harvesting robots; target planning; multiple constraints; deep reinforcement learning
Online Access:	https://www.mdpi.com/2311-7524/11/1/88

author	Feng Xie, Zhengwei Guo, Tao Li, Qingchun Feng, Chunjiang Zhao
author_sort	Feng Xie
collection	DOAJ
description	Global fruit production costs are increasing amid intensified labor shortages, driving heightened interest in robotic harvesting technologies. Although multi-arm coordination in harvesting robots is considered a highly promising solution to this issue, it introduces technical challenges in achieving effective coordination. These challenges include mutual interference among multi-arm mechanical structures, task allocation across multiple arms, and dynamic operating conditions. They impose higher demands on task coordination for multi-arm harvesting robots, requiring collision-free collaboration, optimization of task sequences, and dynamic re-planning. In this work, we propose a framework that models the task planning problem of multi-arm operation as a Markov game. First, considering multi-arm cooperative movement and picking sequence optimization, we employ a two-agent Markov game framework to model the multi-arm harvesting robot task planning problem. Second, we introduce a self-attention mechanism and a centralized training and execution strategy in the design and training of our deep reinforcement learning (DRL) model, thereby enhancing the model’s adaptability in dynamic and uncertain environments and improving decision accuracy. Finally, we conduct extensive numerical simulations in static environments; when the harvesting targets are set to 25 and 50, the execution time is reduced by 10.7% and 3.1%, respectively, compared to traditional methods. Additionally, in dynamic environments, both operational efficiency and robustness are superior to traditional approaches. The results suggest that our approach offers a more adaptive and efficient task planning solution for multi-arm harvesting robots. In future work, we will improve the positioning accuracy of fruits so that this framework can be applied to real robots.
format	Article
id	doaj-art-5435ee0123b548929f08f24065f07b1c
institution	Kabale University
issn	2311-7524
language	English
publishDate	2025-01-01
publisher	MDPI AG
record_format	Article
series	Horticulturae
spelling	doaj-art-5435ee0123b548929f08f24065f07b1c; 2025-01-24T13:34:44Z; eng; MDPI AG; Horticulturae; 2311-7524; 2025-01-01; 11(1):88; 10.3390/horticulturae11010088; Dynamic Task Planning for Multi-Arm Harvesting Robots Under Multiple Constraints Using Deep Reinforcement Learning; Feng Xie [0], Zhengwei Guo [1], Tao Li [2], Qingchun Feng [3], Chunjiang Zhao [4]; [0] School of Agricultural Engineering, Jiangsu University, Zhenjiang 212000, China; [1] Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; [2] Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; [3] Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; [4] School of Mechanical Engineering, Guangxi University, Nanning 530000, China; https://www.mdpi.com/2311-7524/11/1/88; multi-arm harvesting robots; target planning; multiple constraints; deep reinforcement learning
title	Dynamic Task Planning for Multi-Arm Harvesting Robots Under Multiple Constraints Using Deep Reinforcement Learning
topic	multi-arm harvesting robots; target planning; multiple constraints; deep reinforcement learning
url	https://www.mdpi.com/2311-7524/11/1/88
