Hindsight Optimization for Hybrid State and Action MDPs.
Aswin Raghavan
Scott Sanner
Roni Khardon
Prasad Tadepalli
Alan Fern
Published in:
AAAI (2017)
Keyphrases
state space
Markov decision processes
initial state
action space
optimization method
reinforcement learning
neural network
transition model
search space
Markov chain
optimization algorithm
constrained optimization