COM-MABs: From Users' Feedback to Recommendation

Recently, the COMbinatorial Multi-Armed Bandits (COM-MAB) problem has arisen as an active research field. In systems interacting with humans, those reinforcement learning approaches use a feedback strategy as their reward function. On the study of those strategies, this paper present three contribut...

Full description

Saved in:
Bibliographic Details
Main Authors: Alexandre Letard, Tassadit Amghar, Olivier Camp, Nicolas Gutowski
Format: Article
Language:English
Published: LibraryPress@UF 2022-05-01
Series:Proceedings of the International Florida Artificial Intelligence Research Society Conference
Online Access:https://journals.flvc.org/FLAIRS/article/view/130560
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items