Prompt-Based Monte-Carlo Tree Search for Goal-oriented Dialogue Policy Planning.
Xiao Yu, Maximillian Chen, Zhou Yu
Published in: EMNLP (2023)
Keyphrases
- goal oriented
- monte carlo tree search
- bayesian reinforcement learning
- monte carlo
- monte carlo search
- evaluation function
- optimal policy
- tree search algorithm
- requirements analysis
- mixed initiative
- temporal difference
- reinforcement learning
- learning process
- heuristic search
- markov decision process
- temporal difference learning