Piano Fingering with Reinforcement Learning.
Pedro Ramoneda, Marius Miron, Xavier Serra
Published in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- learning algorithm
- model free
- reinforcement learning algorithms
- multi agent
- state space
- robotic control
- optimal policy
- Markov decision processes
- control problems
- website
- temporal difference learning
- learning classifier systems
- machine learning
- temporal difference
- direct policy search
- active exploration
- multi agent reinforcement learning
- function approximators
- learning agent
- action selection
- search engine
- learning problems
- transfer learning
- supervised learning
- knowledge base