Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance

In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle...

Full description

Saved in:
Bibliographic Details
Main Authors: Luman Zhao, Guoyuan Li, Houxiang Zhang
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Open Journal of Intelligent Transportation Systems
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10587203/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590335403884544
author Luman Zhao
Guoyuan Li
Houxiang Zhang
author_facet Luman Zhao
Guoyuan Li
Houxiang Zhang
author_sort Luman Zhao
collection DOAJ
description In this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.
format Article
id doaj-art-738f0f9baccc4d098738efe34eccf784
institution Kabale University
issn 2687-7813
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Open Journal of Intelligent Transportation Systems
spelling doaj-art-738f0f9baccc4d098738efe34eccf7842025-01-24T00:02:41ZengIEEEIEEE Open Journal of Intelligent Transportation Systems2687-78132024-01-01542243210.1109/OJITS.2024.342458710587203Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision AvoidanceLuman Zhao0https://orcid.org/0000-0003-1555-5183Guoyuan Li1https://orcid.org/0000-0001-7553-0899Houxiang Zhang2https://orcid.org/0000-0003-0122-0964Department of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology, Ålesund, NorwayDepartment of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology, Ålesund, NorwayDepartment of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology, Ålesund, NorwayIn this research, we focus on developing an autonomous system for multiship collision avoidance. The proposed approach combines global path planning based on deep reinforcement learning (DRL) and local motion control to improve computational efficiency and alleviate the sensitivity to heading angle changes. To achieve this, firstly, DRL is used to learn a policy that maps observable states of target ships to a sequence of predicted waypoints. This learning task aims to generate a specific trajectory while avoiding collision with target ships complying with the international regulations for preventing collisions at sea (COLREGs). The learned policy is used as a global path planner during navigation. Secondly, the line-of-sight (LOS) guidance system is applied to calculate the desired course command based on the collision-free trajectory generated according to the policy. Lastly, a model-based control strategy is implemented to control the ship to the specific goal in collision-free space while satisfying the desired commands. We demonstrate the performance of the approach using an example of an autonomous surface vehicle. In comparison to other methods, our proposed control can provide a more stable and smoother maneuvering effect.https://ieeexplore.ieee.org/document/10587203/Control application of autonomous systemsdeep reinforcement learningmulti-ship collision avoidanceonline path followingthe international regulations for preventing collisions at sea (COLREGs)
spellingShingle Luman Zhao
Guoyuan Li
Houxiang Zhang
Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
IEEE Open Journal of Intelligent Transportation Systems
Control application of autonomous systems
deep reinforcement learning
multi-ship collision avoidance
online path following
the international regulations for preventing collisions at sea (COLREGs)
title Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
title_full Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
title_fullStr Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
title_full_unstemmed Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
title_short Global and Local Awareness: Combine Reinforcement Learning and Model-Based Control for Collision Avoidance
title_sort global and local awareness combine reinforcement learning and model based control for collision avoidance
topic Control application of autonomous systems
deep reinforcement learning
multi-ship collision avoidance
online path following
the international regulations for preventing collisions at sea (COLREGs)
url https://ieeexplore.ieee.org/document/10587203/
work_keys_str_mv AT lumanzhao globalandlocalawarenesscombinereinforcementlearningandmodelbasedcontrolforcollisionavoidance
AT guoyuanli globalandlocalawarenesscombinereinforcementlearningandmodelbasedcontrolforcollisionavoidance
AT houxiangzhang globalandlocalawarenesscombinereinforcementlearningandmodelbasedcontrolforcollisionavoidance