Sequential Bayesian Optimisation as a POMDP for Environment Monitoring with UAVs.
Philippe MorereRomán MarchantFabio RamosPublished in: CoRR (2017)
Keyphrases
- real time
- dynamic environments
- reinforcement learning
- mobile robot
- monitoring system
- partially observable markov decision process
- state space
- finite state
- maximum likelihood
- environmental data
- virtual world
- path planning
- posterior probability
- human operators
- markov decision process
- model free reinforcement learning