A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved state-of-the-art performance in many challenging strategy games. Because these games have complicated rules, an action sampled from the full discrete action distribution predicted by the learned policy is likely to be invali...
Saved in:
| Main Authors: | Shengyi Huang, Santiago Ontañón |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
LibraryPress@UF
2022-05-01
|
| Series: | Proceedings of the International Florida Artificial Intelligence Research Society Conference |
| Subjects: | |
| Online Access: | https://journals.flvc.org/FLAIRS/article/view/130584 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Parallel Task Offloading and Trajectory Optimization for UAV-Assisted Mobile Edge Computing via Hierarchical Reinforcement Learning
by: Tuo Wang, et al.
Published: (2025-05-01) -
Multi-Agent Reinforcement Learning With Action Masking for UAV-Enabled Mobile Communications
by: Danish Rizvi, et al.
Published: (2025-01-01) -
Intelligent Predetermination of Generator Tripping Scheme: Knowledge Fusion-based Deep Reinforcement Learning Framework
by: Lingkang Zeng, et al.
Published: (2024-01-01) -
Deep Learning-Based Invalid Point Removal Method for Fringe Projection Profilometry
by: Nan He, et al.
Published: (2024-11-01) -
Short video preloading via domain knowledge assisted deep reinforcement learning
by: Yuhong Xie, et al.
Published: (2024-12-01)