Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations.
Finale Doshi-Velez, George Dimitri Konidaris
Published in: IJCAI (2016)
Keyphrases
- markov decision processes
- semi parametric
- policy evaluation
- regression model
- linear model
- finite state
- regression problems
- optimal policy
- transition matrices
- density estimation
- statistical inference
- least squares
- rainfall forecasting
- reinforcement learning
- state space
- dynamic programming
- policy iteration
- constrained optimization
- average reward
- partially observable
- neural network
- decision theoretic planning
- infinite horizon
- parametric models
- latent variables
- average cost
- markov decision process
- evolutionary algorithm
- genetic programming
- markov chain
- image sequences
- linear regression
- learning algorithm