The mirror descent control algorithm for weakly regular homogeneous finite Markov chains with unknown mean losses.
Alexander V. NazinBoris M. MillerPublished in: CDC/ECC (2011)
Keyphrases
- control algorithm
- markov chain
- steady state
- finite state
- control system
- control method
- probabilistic automata
- adaptive fuzzy
- monte carlo
- stationary distribution
- transition probabilities
- markov process
- random walk
- control strategy
- markov model
- control law
- stochastic process
- finite automata
- temperature control
- markov processes
- matlab simulink
- fuzzy logic controller
- state space
- transition matrix
- wheeled mobile robot
- nonlinear systems
- fuzzy controller
- neural network controller
- markov models
- relative entropy
- machine learning
- numerical simulations
- reinforcement learning