Benchmarking Potential Based Rewards for Learning Humanoid Locomotion.
Se Hwan JeonSteve HeimCharles KhazoomSangbae KimPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- learning process
- learning systems
- knowledge acquisition
- real time
- supervised learning
- prior knowledge
- mobile robot
- online learning
- artificial intelligence
- learning scheme
- expert systems
- case study
- vision system
- database
- learning problems
- incremental learning
- inductive inference
- multi armed bandits