UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement

Deep reinforcement learning (DRL) has significantly advanced online path planning for unmanned aerial vehicles (UAVs). Nonetheless, DRL-based methods often encounter reduced performance when dealing with unfamiliar scenarios. This decline is mainly caused by the presence of non-causal and domain-spe...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhun Fan, Zihao Xia, Che Lin, Gaofei Han, Wenji Li, Dongliang Wang, Yindong Chen, Zhifeng Hao, Ruichu Cai, Jiafan Zhuang
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Drones
Subjects:
Online Access:https://www.mdpi.com/2504-446X/9/1/10
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588617541746688
author Zhun Fan
Zihao Xia
Che Lin
Gaofei Han
Wenji Li
Dongliang Wang
Yindong Chen
Zhifeng Hao
Ruichu Cai
Jiafan Zhuang
author_facet Zhun Fan
Zihao Xia
Che Lin
Gaofei Han
Wenji Li
Dongliang Wang
Yindong Chen
Zhifeng Hao
Ruichu Cai
Jiafan Zhuang
author_sort Zhun Fan
collection DOAJ
description Deep reinforcement learning (DRL) has significantly advanced online path planning for unmanned aerial vehicles (UAVs). Nonetheless, DRL-based methods often encounter reduced performance when dealing with unfamiliar scenarios. This decline is mainly caused by the presence of non-causal and domain-specific elements within visual representations, which negatively impact the learning of policies. Present techniques generally depend on predefined augmentation or regularization methods intended to direct the model toward identifying causal and domain-invariant components, thereby enhancing the model’s ability to generalize. However, these manually crafted approaches are intrinsically constrained in their coverage and do not consider the entire spectrum of possible scenarios, resulting in less effective performance in new environments. Unlike prior studies, this work prioritizes representation learning and presents a novel method for causal representation disentanglement. The approach successfully distinguishes between causal and non-causal elements in visual data. By concentrating on causal aspects during the policy learning phase, the impact of non-causal factors is minimized, thereby improving the generalizability of DRL models. Experimental results demonstrate that our technique achieves reliable navigation and effective collision avoidance in unseen scenarios, surpassing state-of-the-art deep reinforcement learning algorithms.
format Article
id doaj-art-43ea04e4308342acb2a07eaeabe965bb
institution Kabale University
issn 2504-446X
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Drones
spelling doaj-art-43ea04e4308342acb2a07eaeabe965bb2025-01-24T13:29:38ZengMDPI AGDrones2504-446X2024-12-01911010.3390/drones9010010UAV Collision Avoidance in Unknown Scenarios with Causal Representation DisentanglementZhun Fan0Zihao Xia1Che Lin2Gaofei Han3Wenji Li4Dongliang Wang5Yindong Chen6Zhifeng Hao7Ruichu Cai8Jiafan Zhuang9Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen 518110, ChinaCollege of Engineering, Shantou University, Shantou 515063, ChinaCollege of Engineering, Shantou University, Shantou 515063, ChinaCollege of Engineering, Shantou University, Shantou 515063, ChinaInternational Cooperation Base of Evolutionary Intelligence and Robotics, Shantou 515041, ChinaInternational Cooperation Base of Evolutionary Intelligence and Robotics, Shantou 515041, ChinaCollege of Engineering, Shantou University, Shantou 515063, ChinaCollege of Engineering, Shantou University, Shantou 515063, ChinaSchool of Computer Science, Guangdong University of Technology, Guangzhou 510006, ChinaInternational Cooperation Base of Evolutionary Intelligence and Robotics, Shantou 515041, ChinaDeep reinforcement learning (DRL) has significantly advanced online path planning for unmanned aerial vehicles (UAVs). Nonetheless, DRL-based methods often encounter reduced performance when dealing with unfamiliar scenarios. This decline is mainly caused by the presence of non-causal and domain-specific elements within visual representations, which negatively impact the learning of policies. Present techniques generally depend on predefined augmentation or regularization methods intended to direct the model toward identifying causal and domain-invariant components, thereby enhancing the model’s ability to generalize. However, these manually crafted approaches are intrinsically constrained in their coverage and do not consider the entire spectrum of possible scenarios, resulting in less effective performance in new environments. Unlike prior studies, this work prioritizes representation learning and presents a novel method for causal representation disentanglement. The approach successfully distinguishes between causal and non-causal elements in visual data. By concentrating on causal aspects during the policy learning phase, the impact of non-causal factors is minimized, thereby improving the generalizability of DRL models. Experimental results demonstrate that our technique achieves reliable navigation and effective collision avoidance in unseen scenarios, surpassing state-of-the-art deep reinforcement learning algorithms.https://www.mdpi.com/2504-446X/9/1/10deep reinforcement learningunmanned aerial vehiclesgeneralization capabilitypolicy learningcausal representation disentanglement
spellingShingle Zhun Fan
Zihao Xia
Che Lin
Gaofei Han
Wenji Li
Dongliang Wang
Yindong Chen
Zhifeng Hao
Ruichu Cai
Jiafan Zhuang
UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
Drones
deep reinforcement learning
unmanned aerial vehicles
generalization capability
policy learning
causal representation disentanglement
title UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
title_full UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
title_fullStr UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
title_full_unstemmed UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
title_short UAV Collision Avoidance in Unknown Scenarios with Causal Representation Disentanglement
title_sort uav collision avoidance in unknown scenarios with causal representation disentanglement
topic deep reinforcement learning
unmanned aerial vehicles
generalization capability
policy learning
causal representation disentanglement
url https://www.mdpi.com/2504-446X/9/1/10
work_keys_str_mv AT zhunfan uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT zihaoxia uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT chelin uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT gaofeihan uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT wenjili uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT dongliangwang uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT yindongchen uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT zhifenghao uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT ruichucai uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement
AT jiafanzhuang uavcollisionavoidanceinunknownscenarioswithcausalrepresentationdisentanglement