Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks.
Yuqian Jiang, Suda Bharadwaj, Bo Wu, Rishi Shah, Ufuk Topcu, Peter Stone
Published in: AAAI (2021)
Keyphrases
- learning tasks
- reward shaping
- reinforcement learning
- transfer learning
- supervised learning
- learning agent
- complex domains
- learning algorithm
- machine learning
- reinforcement learning algorithms
- function approximation
- learning experience
- machine learning algorithms
- multi label
- knowledge representation
- markov decision problems
- kernel methods
- neural network
- state space
- learning process
- feature space
- model free
- reward function
- knowledge base
- feature selection