COM-MABs: From Users' Feedback to Recommendation
Recently, the COMbinatorial Multi-Armed Bandits (COM-MAB) problem has arisen as an active research field. In systems interacting with humans, those reinforcement learning approaches use a feedback strategy as their reward function. On the study of those strategies, this paper present three contribut...
Saved in:
| Main Authors: | Alexandre Letard, Tassadit Amghar, Olivier Camp, Nicolas Gutowski |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
LibraryPress@UF
2022-05-01
|
| Series: | Proceedings of the International Florida Artificial Intelligence Research Society Conference |
| Online Access: | https://journals.flvc.org/FLAIRS/article/view/130560 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Non-stationary MAB based access method for heterogeneous users in LEO satellite systems
by: LIN Min, et al.
Published: (2025-03-01) -
Mutations in the transcriptional regulator MAB_2885 confer tedizolid and linezolid resistance through the MmpS-MmpL efflux pump MAB_2302-MAB_2303 in Mycobacterium abscessus.
by: Huiyun Zhang, et al.
Published: (2025-05-01) -
Novel identification of mAbs by Raman spectroscopy
by: Maoqin Duan, et al.
Published: (2024-12-01) -
MabTera. Kratkoe rukovodstvo po primeneniyu
by: - -
Published: (2008-02-01) -
MAB-RSP: Data pricing based on Stackelberg game in MCS
by: Yongjiao Sun, et al.
Published: (2025-07-01)