Preference learning for guiding the tree search in continuous POMDPs.

Jiyong Ahn Sanghyeon Son Dongryung Lee Jisu Han Dongwon Son Beomjoon Kim

Published in: CoRL (2023)

Keyphrases

tree search
preference learning
branch and bound
search algorithm
ordinal regression
constraint propagation
gaussian processes
pairwise comparison
state space
search tree
reinforcement learning
mathematical programming
active learning
ranking functions
dynamic programming
recommender systems
search space
upper bound
constraint programming
heuristic search
markov chain
special case