Learning Multimodal Rewards from Rankings.
Vivek MyersErdem BiyikNima AnariDorsa SadighPublished in: CoRL (2021)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- learning tasks
- online learning
- learning problems
- prior knowledge
- knowledge acquisition
- learning systems
- rank aggregation
- data mining
- learning scheme
- mobile learning
- background knowledge
- hidden markov models
- artificial neural networks
- training set
- genetic algorithm