Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling
With the rise of low-cost launches, miniaturized space technology, and commercialization, the cost of space missions has dropped, leading to a surge in flexible Earth observation satellites. This increased demand for complex and diverse imaging products requires addressing multi-objective optimizati...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2025-12-01
|
Series: | International Journal of Digital Earth |
Subjects: | |
Online Access: | https://www.tandfonline.com/doi/10.1080/17538947.2025.2458024 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832557708502368256 |
---|---|
author | He Wang Weiquan Huang Sindri Magnússon Tony Lindgren Chen Chen Junyu Wu Yanjie Song |
author_facet | He Wang Weiquan Huang Sindri Magnússon Tony Lindgren Chen Chen Junyu Wu Yanjie Song |
author_sort | He Wang |
collection | DOAJ |
description | With the rise of low-cost launches, miniaturized space technology, and commercialization, the cost of space missions has dropped, leading to a surge in flexible Earth observation satellites. This increased demand for complex and diverse imaging products requires addressing multi-objective optimization in practice. To this end, we propose a multi-objective agile Earth observation satellite scheduling problem (MOAEOSSP) model and introduce a reinforcement learning-based multi-objective grey wolf optimization (RLMOGWO) algorithm. It aims to maximize observation efficiency while minimizing energy consumption. During population initialization, the algorithm uses chaos mapping and opposition-based learning to enhance diversity and global search, reducing the risk of local optima. It integrates Q-learning into an improved multi-objective grey wolf optimization framework, designing state-action combinations that balance exploration and exploitation. Dynamic parameter adjustments guide position updates, boosting adaptability across different optimization stages. Moreover, the algorithm introduces a reward mechanism based on the crowding distance and inverted generational distance (IGD) to maintain Pareto front diversity and distribution, ensuring a strong multi-objective optimization performance. The experimental results show that the algorithm excels at solving the MOAEOSSP, outperforming competing algorithms across several metrics and demonstrating its effectiveness for complex optimization problems. |
format | Article |
id | doaj-art-700fb57e75d640e89a61be0f1ce224a4 |
institution | Kabale University |
issn | 1753-8947 1753-8955 |
language | English |
publishDate | 2025-12-01 |
publisher | Taylor & Francis Group |
record_format | Article |
series | International Journal of Digital Earth |
spelling | doaj-art-700fb57e75d640e89a61be0f1ce224a42025-02-03T03:08:34ZengTaylor & Francis GroupInternational Journal of Digital Earth1753-89471753-89552025-12-0118110.1080/17538947.2025.2458024Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite schedulingHe Wang0Weiquan Huang1Sindri Magnússon2Tony Lindgren3Chen Chen4Junyu Wu5Yanjie Song6College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, People's Republic of ChinaCollege of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, People's Republic of ChinaDepartment of Computer and Systems Sciences, Stockholm University, Stockholm, SwedenDepartment of Computer and Systems Sciences, Stockholm University, Stockholm, SwedenCollege of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, People's Republic of ChinaCollege of Mechanical and Electrical Engineering, Harbin Institute of Technology, Harbin, People's Republic of ChinaNational Engineering Research Center of Maritime Navigation System, Dalian Maritime University, Dalian, People's Republic of ChinaWith the rise of low-cost launches, miniaturized space technology, and commercialization, the cost of space missions has dropped, leading to a surge in flexible Earth observation satellites. This increased demand for complex and diverse imaging products requires addressing multi-objective optimization in practice. To this end, we propose a multi-objective agile Earth observation satellite scheduling problem (MOAEOSSP) model and introduce a reinforcement learning-based multi-objective grey wolf optimization (RLMOGWO) algorithm. It aims to maximize observation efficiency while minimizing energy consumption. During population initialization, the algorithm uses chaos mapping and opposition-based learning to enhance diversity and global search, reducing the risk of local optima. It integrates Q-learning into an improved multi-objective grey wolf optimization framework, designing state-action combinations that balance exploration and exploitation. Dynamic parameter adjustments guide position updates, boosting adaptability across different optimization stages. Moreover, the algorithm introduces a reward mechanism based on the crowding distance and inverted generational distance (IGD) to maintain Pareto front diversity and distribution, ensuring a strong multi-objective optimization performance. The experimental results show that the algorithm excels at solving the MOAEOSSP, outperforming competing algorithms across several metrics and demonstrating its effectiveness for complex optimization problems.https://www.tandfonline.com/doi/10.1080/17538947.2025.2458024Earth observation satellitereinforcement learningQ-learningschedulingmulti-objective optimizationgrey wolf algorithm |
spellingShingle | He Wang Weiquan Huang Sindri Magnússon Tony Lindgren Chen Chen Junyu Wu Yanjie Song Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling International Journal of Digital Earth Earth observation satellite reinforcement learning Q-learning scheduling multi-objective optimization grey wolf algorithm |
title | Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling |
title_full | Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling |
title_fullStr | Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling |
title_full_unstemmed | Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling |
title_short | Crowding distance and IGD-driven grey wolf reinforcement learning approach for multi-objective agile earth observation satellite scheduling |
title_sort | crowding distance and igd driven grey wolf reinforcement learning approach for multi objective agile earth observation satellite scheduling |
topic | Earth observation satellite reinforcement learning Q-learning scheduling multi-objective optimization grey wolf algorithm |
url | https://www.tandfonline.com/doi/10.1080/17538947.2025.2458024 |
work_keys_str_mv | AT hewang crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT weiquanhuang crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT sindrimagnusson crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT tonylindgren crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT chenchen crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT junyuwu crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling AT yanjiesong crowdingdistanceandigddrivengreywolfreinforcementlearningapproachformultiobjectiveagileearthobservationsatellitescheduling |