Deep deterministic policy gradient algorithm based on dung beetle optimization and priority experience replay mechanism

Abstract Reinforcement learning algorithms that handle continuous action spaces have the problem of slow convergence and local optimality. Hence, we propose a deep deterministic policy gradient algorithm based on the dung beetle optimization algorithm (DBOP–DDPG) and priority experience replay mecha...

Full description

Saved in:
Bibliographic Details
Main Authors: Hengwei Zhu, Chuiting Rong, Haorui Liu
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-99213-3
Tags: Add Tag
No Tags, Be the first to tag this record!