Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning.
Xiao Yu, Maximillian Chen, Zhou Yu
Published in: CoRR (2023)
Keyphrases
- markov chain
- goal oriented
- monte carlo tree search
- monte carlo
- bayesian reinforcement learning
- tree search algorithm
- monte carlo search
- state space
- optimal policy
- mixed initiative
- requirements analysis
- dialogue system
- game tree
- evaluation function
- temporal difference learning
- policy iteration
- learning algorithm
- alpha beta search