Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration.
Juan Del Aguila FerrandisJoão MouraSethu VijayakumarPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- exploration exploitation
- action selection
- model based reinforcement learning
- function approximation
- autonomous learning
- multimodal interaction
- categorical data
- exploration exploitation tradeoff
- multi modal
- machine learning
- optimal policy
- model free
- state space
- ground plane
- robotic control
- multimedia
- reinforcement learning algorithms
- temporal difference learning
- markov decision processes
- data mining
- transition model
- manipulation tasks
- dynamic programming
- curved surfaces
- single agent
- temporal difference
- real robot
- evolutionary algorithm
- policy search
- numerical data
- learning algorithm
- genetic algorithm
- supervised learning
- line drawings