Login / Signup
DPO: Differential reinforcement learning with application to optimal configuration search.
Chandrajit Bajaj
Minh Nguyen
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
optimal configuration
search algorithm
database
machine learning
learning algorithm
search space
search strategies
data mining
genetic algorithm
lower bound
dynamic programming
mobile robot
function approximation
query formulation