Dynamic Task Planning for Multi-Arm Harvesting Robots Under Multiple Constraints Using Deep Reinforcement Learning

Bibliographic Details
Main Authors: Feng Xie, Zhengwei Guo, Tao Li, Qingchun Feng, Chunjiang Zhao
Format: Article
Language: English
Published: MDPI AG 2025-01-01
Series: Horticulturae
Subjects: multi-arm harvesting robots; target planning; multiple constraints; deep reinforcement learning
Online Access: https://www.mdpi.com/2311-7524/11/1/88
_version_ 1832588342333538304
author Feng Xie
Zhengwei Guo
Tao Li
Qingchun Feng
Chunjiang Zhao
collection DOAJ
description Global fruit production costs are rising amid intensifying labor shortages, driving interest in robotic harvesting technologies. Multi-arm harvesting robots are considered a highly promising solution, but coordinating the arms effectively raises technical challenges: mutual interference among the arms' mechanical structures, task allocation across multiple arms, and dynamic operating conditions. Task coordination must therefore provide collision-free collaboration, optimized task sequences, and dynamic re-planning. In this work, we propose a framework that models the multi-arm task planning problem as a Markov game. First, considering cooperative multi-arm movement and picking sequence optimization, we formulate task planning for the multi-arm harvesting robot as a two-agent Markov game. Second, we introduce a self-attention mechanism and a centralized training and execution strategy in the design and training of our deep reinforcement learning (DRL) model, which enhances the model’s adaptability in dynamic and uncertain environments and improves decision accuracy. Finally, we conduct extensive numerical simulations. In static environments with 25 and 50 harvesting targets, execution time is reduced by 10.7% and 3.1%, respectively, compared to traditional methods. In dynamic environments, both operational efficiency and robustness are superior to those of traditional approaches. The results underscore the potential of our approach to provide a more adaptive and efficient task planning solution for multi-arm harvesting robots. Future work will focus on improving fruit positioning accuracy, which will make it possible to apply this framework to real robots.
format Article
id doaj-art-5435ee0123b548929f08f24065f07b1c
institution Kabale University
issn 2311-7524
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Horticulturae
spelling doaj-art-5435ee0123b548929f08f24065f07b1c
2025-01-24T13:34:44Z
eng
MDPI AG
Horticulturae
2311-7524
2025-01-01
Volume 11, Issue 1, Article 88
10.3390/horticulturae11010088
Feng Xie (School of Agricultural Engineering, Jiangsu University, Zhenjiang 212000, China)
Zhengwei Guo (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)
Tao Li (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)
Qingchun Feng (Intelligent Equipment Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China)
Chunjiang Zhao (School of Mechanical Engineering, Guangxi University, Nanning 530000, China)
title Dynamic Task Planning for Multi-Arm Harvesting Robots Under Multiple Constraints Using Deep Reinforcement Learning
topic multi-arm harvesting robots
target planning
multiple constraints
deep reinforcement learning
url https://www.mdpi.com/2311-7524/11/1/88