Multilevel Constrained Bandits: A Hierarchical Upper Confidence Bound Approach with Safety Guarantees

The multi-armed bandit (MAB) problem is a foundational model for sequential decision-making under uncertainty. While MAB has proven valuable in applications such as clinical trials and online advertising, traditional formulations have limitations; specifically, they struggle to handle three key real...

Bibliographic Details
Main Author:	Ali Baheri
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Mathematics
Subjects:	multi-armed bandit; constrained optimization; decision making under uncertainty
Online Access:	https://www.mdpi.com/2227-7390/13/1/149

Similar Items

Constrained Restless Bandits for Dynamic Scheduling in Cyber-Physical Systems
by: Kesav Ram Kaza, et al.
Published: (2024-01-01)

Multi-Dimensional Arms for Combinatorial Multi-Armed Bandit
by: Qi Li, et al.
Published: (2025-01-01)

Thompson Sampling for Non-Stationary Bandit Problems
by: Han Qi, et al.
Published: (2025-01-01)

Multi-Armed Bandit Approaches for Location Planning with Dynamic Relief Supplies Allocation Under Disaster Uncertainty
by: Jun Liang, et al.
Published: (2024-12-01)

Gaussian Process with Vine Copula-Based Context Modeling for Contextual Multi-Armed Bandits
by: Jong-Min Kim
Published: (2025-06-01)

Adaptive Noise Exploration for Neural Contextual Multi-Armed Bandits
by: Chi Wang, et al.
Published: (2025-01-01)

Causal contextual bandits with one-shot data integration
by: Chandrasekar Subramanian, et al.
Published: (2024-12-01)

Fair Probabilistic Multi-Armed Bandit With Applications to Network Optimization
by: Zhiwu Guo, et al.
Published: (2024-01-01)

Modified Index Policies for Multi-Armed Bandits with Network-like Markovian Dependencies
by: Abdalaziz Sawwan, et al.
Published: (2025-01-01)

Optimistic Algorithms for Safe Linear Bandits Under General Constraints
by: Spencer Hutchinson, et al.
Published: (2025-01-01)

Neural Network-Based Bandit: A Medium Access Control for the IIoT Alarm Scenario
by: Prasoon Raghuwanshi, et al.
Published: (2024-01-01)

Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits
by: Maximilian Egger, et al.
Published: (2025-05-01)

Designing digital health interventions with causal inference and multi-armed bandits: a review
by: Radoslava Švihrová, et al.
Published: (2025-06-01)

Bandit-Based Multiple Access Approach for Multi-Link Operation in Heterogeneous Dynamic Networks
by: Mingqi Han, et al.
Published: (2025-01-01)

Client Selection for Generalization in Accelerated Federated Learning: A Multi-Armed Bandit Approach
by: Dan Ben Ami, et al.
Published: (2025-01-01)

Multi armed bandit based resource allocation in Near Memory Processing architectures
by: Shubhang Pandey, et al.
Published: (2025-12-01)

Reducing Computational Time in Pixel-Based Path Planning for GMA-DED by Using Multi-Armed Bandit Reinforcement Learning Algorithm
by: Rafael P. Ferreira, et al.
Published: (2025-03-01)

Context-aware Multi-stakeholder Recommender Systems
by: Tahereh Arabghalizi, et al.
Published: (2022-05-01)

A Hybrid Proactive Caching System in Vehicular Networks Based on Contextual Multi-Armed Bandit Learning
by: Qiao Wang, et al.
Published: (2023-01-01)

THE ROLE OF INFORMANTS IN THE ACCENTUATION OF ARMED BANDITRY IN NORTH-WESTERN NIGERIA: A CASE STUDY OF ZAMFARA STATE
by: TUKUR ABDULKADIR, et al.
Published: (2024-07-01)

Cooperate or Not Cooperate: Transfer Learning With Multi-Armed Bandit for Spatial Reuse in Wi-Fi
by: Pedro Enrique Iturria-Rivera, et al.
Published: (2024-01-01)

Le stéréotype du bandit catalan dans la littérature espagnole du Siècle d’Or
by: Mathias Ledroit
Published: (2009-12-01)

AI-Driven Nudge Optimization: Integrating Two-Tower Networks and Multi-Armed Bandit With Behavioral Economics for Digital Banking Campaign
by: Idha Kristiana, et al.
Published: (2025-01-01)

Deciphering algorithmic collusion: Insights from bandit algorithms and implications for antitrust enforcement
by: Frédéric Marty, et al.
Published: (2025-11-01)

NeIL: Intelligent Replica Selection for Distributed Applications
by: Faraz Ahmed, et al.
Published: (2024-01-01)

MAB-Based Online Client Scheduling for Decentralized Federated Learning in the IoT
by: Zhenning Chen, et al.
Published: (2025-04-01)

Forest, Bandits, and State: Some Measures Taken against the Use of Forests as Illegal Activity Areas in the Ottoman Empire (16th-18th centuries)
by: Yusuf Alperen Aydın
Published: (2024-10-01)

Nonstationary Stochastic Bandits: UCB Policies and Minimax Regret
by: Lai Wei, et al.
Published: (2024-01-01)

The Planning of Business Processes in Undefined Condition of Oligopolistic Market
by: ALEXAKHIN A.V., et al.
Published: (2018-10-01)

Navigating Uncertainty: The Role of Mood and Confidence in Decision-Making Flexibility and Performance
by: Claudio Lavín, et al.
Published: (2024-11-01)

Features of constrained radial form turning
by: D. V. Moiseev, et al.
Published: (2018-06-01)

Model-based exploration is measurable across tasks but not linked to personality and psychiatric assessments
by: Kristin Witte, et al.
Published: (2025-07-01)

Probabilistic inference and Bayesian‐like estimation in animals: Empirical evidence
by: Thomas J. Valone
Published: (2024-07-01)

LLM-Guided Ensemble Learning for Contextual Bandits with Copula and Gaussian Process Models
by: Jong-Min Kim
Published: (2025-08-01)

Bandit Algorithms for Efficient Toxicity Detection in Competitive Online Video Games
by: Jacob Morrier, et al.
Published: (2025-01-01)

Balancing Efficiency and Efficacy: A Contextual Bandit-Driven Framework for Multi-Tier Cyber Threat Detection
by: Ibrahim Mutambik, et al.
Published: (2025-06-01)

Selective Reviews of Bandit Problems in AI via a Statistical View
by: Pengjie Zhou, et al.
Published: (2025-02-01)

An Intelligent Client Selection Algorithm of Federated Learning for Class-imbalance
by: ZHU Suxia, et al.
Published: (2024-04-01)

MAB-RSP: Data pricing based on Stackelberg game in MCS
by: Yongjiao Sun, et al.
Published: (2025-07-01)

Evaluating Implementation Uncertainties and Defining Safe Operating Spaces for Deeply Uncertain Cooperative Multi‐City Water Supply Investment Pathways
by: Lillian B. Lau, et al.
Published: (2023-07-01)