Filter dates
Overview
- reinforcement learning
- markov decision processes
- bucket elimination
- variational bayesian
- regret bounds
Publications
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization.
ICML
Probabilistic Inference in Reinforcement Learning Done Right.
NeurIPS
Efficient exploration via epistemic-risk-seeking policy optimization.
CoRR
Optimistic Meta-Gradients.
NeurIPS
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs.
ICML
Probabilistic Inference in Reinforcement Learning Done Right.
CoRR