Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark.
Aurore LoisyRobin A. HeinonenPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- search space
- markov decision processes
- optimal policy
- search algorithm
- state space
- continuous state
- partially observable
- machine learning
- multi agent
- policy search
- dynamic programming
- search strategy
- search strategies
- belief space
- reinforcement learning methods
- markov decision process
- model free reinforcement learning
- partially observable markov decision processes
- reinforcement learning algorithms
- temporal difference
- action selection
- model free
- dynamical systems
- monte carlo
- learning algorithm