Login / Signup
A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication.
Maryam Zare
Alan R. Wagner
Rebecca J. Passonneau
Published in:
EMNLP (Findings) (2022)
Keyphrases
</>
reinforcement learning
model free reinforcement learning
learning algorithm
learning tasks
learning process
human computer
sensor networks
learning systems
partially observable
mixed initiative
hidden state