Learning to Score Behaviors for Guided Policy Optimization.
Aldo PacchianoJack Parker-HolderYunhao TangKrzysztof ChoromanskiAnna ChoromanskaMichael I. JordanPublished in: ICML (2020)
Keyphrases
- learning algorithm
- reinforcement learning
- active learning
- learning systems
- optimization problems
- learning process
- supervised learning
- machine learning
- learning problems
- online learning
- data sets
- search engine
- learning community
- global optimization
- optimization method
- constrained optimization
- mobile learning
- background knowledge
- learning experience
- optimization algorithm
- unsupervised learning
- knowledge acquisition
- training data
- knowledge base