Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control.
Marta SarricoKai ArulkumaranAndrea AgostinelliPierre RichemondAnil Anthony BharathPublished in: CoRR (2019)
Keyphrases
- maximum entropy
- reinforcement learning
- markov models
- maximum entropy principle
- maximum entropy model
- random fields
- minimum cross entropy
- probabilistic logic
- optimal control
- conditional random fields
- state space
- class conditional
- optimal policy
- transformation based learning
- principle of maximum entropy
- iterative scaling
- training set