A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved state-of-the-art performance in many challenging strategy games. Because these games have complicated rules, an action sampled from the full discrete action distribution predicted by the learned policy is likely to be invali...

Full description

Saved in:

Bibliographic Details
Main Authors:	Shengyi Huang, Santiago Ontañón
Format:	Article
Language:	English
Published:	LibraryPress@UF 2022-05-01
Series:	Proceedings of the International Florida Artificial Intelligence Research Society Conference
Subjects:	reinforcement learning deep learning deep reinforcement learning real-time strategy games implementation details invalid action masking
Online Access:	https://journals.flvc.org/FLAIRS/article/view/130584
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Parallel Task Offloading and Trajectory Optimization for UAV-Assisted Mobile Edge Computing via Hierarchical Reinforcement Learning
by: Tuo Wang, et al.
Published: (2025-05-01)

Multi-Agent Reinforcement Learning With Action Masking for UAV-Enabled Mobile Communications
by: Danish Rizvi, et al.
Published: (2025-01-01)

Intelligent Predetermination of Generator Tripping Scheme: Knowledge Fusion-based Deep Reinforcement Learning Framework
by: Lingkang Zeng, et al.
Published: (2024-01-01)

Deep Learning-Based Invalid Point Removal Method for Fringe Projection Profilometry
by: Nan He, et al.
Published: (2024-11-01)

Short video preloading via domain knowledge assisted deep reinforcement learning
by: Yuhong Xie, et al.
Published: (2024-12-01)

Dynamic Path Planning for Vehicles Based on Causal State-Masking Deep Reinforcement Learning
by: Xia Hua, et al.
Published: (2025-03-01)

Biasing Exploration towards Positive Error for Efficient Reinforcement Learning
by: Adam Parker, et al.
Published: (2025-05-01)

Economic Evaluation of Losses From Invalidism of the Population in Russia: Approaches and Methods
by: Olga I. Goleva
Published: (2017-11-01)

Orthogonal Adversarial Deep Reinforcement Learning for Discrete- and Continuous-Action Problems
by: Kohei Ohashi, et al.
Published: (2024-01-01)

Towards fair lights: A multi-agent masked deep reinforcement learning for efficient corridor-level traffic signal control
by: Xiaocai Zhang, et al.
Published: (2025-12-01)

Survey on reinforcement learning based adaptive bit rate algorithm for mobile video streaming services
by: Li’na DU, et al.
Published: (2021-09-01)

Survey on reinforcement learning based adaptive bit rate algorithm for mobile video streaming services
by: Li’na DU, et al.
Published: (2021-09-01)

TEMPO: Timestep Explanations for Modeling Preferences in Online Preference-Based RL
by: Jakob Karlaus, et al.
Published: (2025-01-01)

Machine Learning for Decision Support and Automation in Games: A Study on Vehicle Optimal Path
by: Gonçalo Penelas, et al.
Published: (2025-02-01)

Machine Learning Applications in Energy Harvesting Internet of Things Networks: A Review
by: Olumide Alamu, et al.
Published: (2025-01-01)

Visual Explanation With Action Query Transformer in Deep Reinforcement Learning and Visual Feedback via Augmented Reality
by: Hidenori Itaya, et al.
Published: (2025-01-01)

AlphaRouter: Bridging the Gap Between Reinforcement Learning and Optimization for Vehicle Routing with Monte Carlo Tree Searches
by: Won-Jun Kim, et al.
Published: (2025-02-01)

Marriage Invalidity – A Comparison of English and Hungarian Rules
by: Sarolta Molnár
Published: (2024-12-01)

Reinforcement learning:toward action-knowledge merged intelligent mechanisms and algorithms
by: Fei-Yue WANG, et al.
Published: (2020-06-01)

A survey of reinforcement and deep reinforcement learning for coordination in intelligent traffic light control
by: Aicha Saadi, et al.
Published: (2025-04-01)

Solving Action Semantic Conflict in Physically Heterogeneous Multi-Agent Reinforcement Learning with Generalized Action-Prediction Optimization
by: Xiaoyang Yu, et al.
Published: (2025-02-01)

AI in game intelligence—from multi-role game to parallel game
by: Yu SHEN, et al.
Published: (2020-09-01)

Causes of invalidity in patients with juvenile idiopathic arthritis
by: T A Slielepina
Published: (2005-04-01)

OTFS-Assisted Sensing Adaptive Cruise Control for Highways: A Reinforcement Learning Approach
by: Yulin Liu, et al.
Published: (2025-01-01)

Multi-Step Quality-Oriented Training for Cross-Dataset Offline Iterative Speech Enhancement
by: Shih-Chuan Chu, et al.
Published: (2025-01-01)

Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training
by: Shichao Xu, et al.
Published: (2025-03-01)

Deep reinforcement learning applications and prospects in industrial scenarios
by: JING TAN, et al.
Published: (2025-04-01)

Contrastive Mask Learning for Self-Supervised 3D Skeleton-Based Action Recognition
by: Haoyuan Zhang
Published: (2025-02-01)

Guided Reinforcement Learning with Twin Delayed Deep Deterministic Policy Gradient for a Rotary Flexible-Link System
by: Carlos Saldaña Enderica, et al.
Published: (2025-05-01)

Electric Bus Charging Schedules Relying on Real Data-Driven Targets Based on Hierarchical Deep Reinforcement Learning
by: Jiaju Qi, et al.
Published: (2025-01-01)

Vital node searcher: find out critical node measure with deep reinforcement learning
by: Guanting Du, et al.
Published: (2022-12-01)

QMIX-GNN: A Graph Neural Network-Based Heterogeneous Multi-Agent Reinforcement Learning Model for Improved Collaboration and Decision-Making
by: Taiyin Zhao, et al.
Published: (2025-03-01)

Expert-Trajectory-Based Features for Apprenticeship Learning via Inverse Reinforcement Learning for Robotic Manipulation
by: Francisco J. Naranjo-Campos, et al.
Published: (2024-11-01)

Detecting heavy trucks from mobile phone trajectories using image-based behavioral representations and deep learning models
by: Franco Basso, et al.
Published: (2025-07-01)

Optimizing navigation and chemical application in precision agriculture with deep reinforcement learning and conditional action tree
by: Mahsa Khosravi, et al.
Published: (2025-12-01)

Hybrid Machine Learning and Reinforcement Learning Framework for Adaptive UAV Obstacle Avoidance
by: Wojciech Skarka, et al.
Published: (2024-10-01)

A Generalized Deep Reinforcement Learning Model for Distribution Network Reconfiguration with Power Flow-Based Action-Space Sampling
by: Nastaran Gholizadeh, et al.
Published: (2024-10-01)

An automatic and unsupervised image mask acquisition method based on generative adversarial networks
by: Hao Wu, et al.
Published: (2024-12-01)

ReMAV: Reward Modeling of Autonomous Vehicles for Finding Likely Failure Events
by: Aizaz Sharif, et al.
Published: (2024-01-01)

Deep reinforcement learning based resource provisioning for federated edge learning
by: Xingyun Chen, et al.
Published: (2025-06-01)