Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets.
Anirudhan Badrinath, Yannis Flet-Berliac, Allen Nie, Emma Brunskill
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- supervised learning
- temporal difference
- function approximation
- learning algorithm
- learning problems
- model free
- kernel based learning
- reinforcement learning algorithms
- unsupervised learning
- state space
- fuzzy logic
- machine learning
- supervised classification
- markov decision processes
- training examples
- optimal policy
- policy search
- training data
- multi agent
- target detection
- statistical learning
- learning tasks
- fault diagnosis
- power system
- active learning
- unlabeled data
- multiple instance learning
- high voltage
- training set
- target recognition
- supervised machine learning
- temporal difference learning
- artificial intelligence
- power transformers
- distribution network
- class labels
- training samples
- support vector