​
Login / Signup
Xingzhou Lou
ORCID
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 11
Top Topics
Cooperative
Language Model
Actor Critic
Reinforcement Learning
Top Venues
CoRR
AAMAS
Inf. Sci.
AAAI
</>
Publications
</>
Xingzhou Lou
,
Junge Zhang
,
Yali Du
,
Chao Yu
,
Zhaofeng He
,
Kaiqi Huang
Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games.
IEEE Trans. Games
16 (2) (2024)
Xingzhou Lou
,
Junge Zhang
,
Timothy J. Norman
,
Kaiqi Huang
,
Yali Du
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient.
AAAI
(2024)
Xiaoqian Liu
,
Xingzhou Lou
,
Jianbin Jiao
,
Junge Zhang
Position: Foundation Agents as the Paradigm Shift for Decision Making.
CoRR
(2024)
Xingzhou Lou
,
Junge Zhang
,
Ziyan Wang
,
Kaiqi Huang
,
Yali Du
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models.
CoRR
(2024)
Xingzhou Lou
,
Junge Zhang
,
Jian Xie
,
Lifeng Liu
,
Dong Yan
,
Kaiqi Huang
SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling.
CoRR
(2024)
Xingzhou Lou
,
Junge Zhang
,
Ziyan Wang
,
Kaiqi Huang
,
Yali Du
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models.
AAMAS
(2024)
Xingzhou Lou
,
Jiaxian Guo
,
Junge Zhang
,
Jun Wang
,
Kaiqi Huang
,
Yali Du
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination.
CoRR
(2023)
Xue Yan
,
Jiaxian Guo
,
Xingzhou Lou
,
Jun Wang
,
Haifeng Zhang
,
Yali Du
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination.
NeurIPS
(2023)
Xingzhou Lou
,
Jiaxian Guo
,
Junge Zhang
,
Jun Wang
,
Kaiqi Huang
,
Yali Du
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination.
AAMAS
(2023)
Xingzhou Lou
,
Junge Zhang
,
Timothy J. Norman
,
Kaiqi Huang
,
Yali Du
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient.
CoRR
(2023)
Xingzhou Lou
,
Qiyue Yin
,
Junge Zhang
,
Chao Yu
,
Zhaofeng He
,
Nengjie Cheng
,
Kaiqi Huang
Offline reinforcement learning with representations for actions.
Inf. Sci.
610 (2022)