Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond.
Xutong Liu, Siwei Wang, Jinhang Zuo, Han Zhong, Xuchuang Wang, Zhiyong Wang, Shuai Li, Mohammad Hajiesmaili, John C. S. Lui, Wei Chen
Published in: CoRR (2024)
Keyphrases