Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts.

Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu
Published in: CoRR (2024)