Reinforcement learning algorithms for the Untangling of Braids.
Abdullah KhanAlexei VernitskiAlexei LisitsaPublished in: FLAIRS (2022)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- markov decision processes
- state space
- model free
- reinforcement learning problems
- temporal difference
- eligibility traces
- reinforcement learning methods
- learning algorithm
- function approximation
- dynamic environments
- reward function
- stochastic games
- partially observable environments
- optimal policy
- policy search
- multiagent reinforcement learning
- neural network
- least squares
- hidden markov models
- prior knowledge