EWRL
Publications
2012
Sergiu Goschin, Ari Weinstein, Michael L. Littman, Erick Chastain: Planning in Reward-Rich Domains via PAC Bandits. EWRL (2012)
Cosmin Paduraru, Doina Precup, Joelle Pineau, Gheorghe Comanici: An Empirical Analysis of Off-policy Learning in Discrete MDPs. EWRL (2012)
Marc Peter Deisenroth, Csaba Szepesvári, Jan Peters: Preface. EWRL (2012)
Yevgeny Seldin, Csaba Szepesvári, Peter Auer, Yasin Abbasi-Yadkori: Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments. EWRL (2012)
Mayank Daswani, Peter Sunehag, Marcus Hutter: Feature Reinforcement Learning using Looping Suffix Trees. EWRL (2012)
Ari Weinstein, Michael L. Littman, Sergiu Goschin: Rollout-based Game-tree Search Outprunes Traditional Alpha-beta. EWRL (2012)
Michael Castronovo, Francis Maes, Raphael Fonteneau, Damien Ernst: Learning Exploration/Exploitation Strategies for Single Trajectory Reinforcement Learning. EWRL (2012)
Jan Hendrik Metzen: Online Skill Discovery using Graph-based Clustering. EWRL (2012)
Nicolas Heess, David Silver, Yee Whye Teh: Actor-Critic Reinforcement Learning with Energy-Based Policies. EWRL (2012)
Michal Valko, Mohammad Ghavamzadeh, Alessandro Lazaric: Semi-Supervised Apprenticeship Learning. EWRL (2012)
Timothy A. Mann, Yoonsuck Choe: Directed Exploration in Reinforcement Learning with Transferred Knowledge. EWRL (2012)
David Silver: Gradient Temporal Difference Networks. EWRL (2012)
Andreas Vlachos: An investigation of imitation learning algorithms for structured prediction. EWRL (2012)
Recent Advances in Reinforcement Learning - 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised Selected Papers. EWRL, volume 7188 (2012)
Proceedings of the Tenth European Workshop on Reinforcement Learning, EWRL 2012, Edinburgh, Scotland, UK, June, 2012. EWRL, volume 24 (2012)
2011
Anestis Fachantidis, Ioannis Partalas, Matthew E. Taylor, Ioannis P. Vlahavas: Transfer Learning via Multiple Inter-task Mappings. EWRL (2011)
Munu Sairamesh, Balaraman Ravindran: Options with Exceptions. EWRL (2011)
Cosmin Paduraru, Doina Precup, Joelle Pineau: A Framework for Computing Bounds for the Return of a Policy. EWRL (2011)
Matthijs Snel, Shimon Whiteson: Multi-Task Reinforcement Learning: Shaping and Feature Selection. EWRL (2011)
Nikolaos Tziortziotis, Konstantinos Blekas: Value Function Approximation through Sparse Bayesian Modeling. EWRL (2011)
Francis Maes, Louis Wehenkel, Damien Ernst: Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits. EWRL (2011)
Kyriakos C. Chatzidimitriou, Ioannis Partalas, Pericles A. Mitkas, Ioannis P. Vlahavas: Transferring Evolved Reservoir Features in Reinforcement Learning Tasks. EWRL (2011)
Matthieu Geist, Bruno Scherrer: ℓ1-Penalized Projected Bellman Residual. EWRL (2011)
Francis Maes, Louis Wehenkel, Damien Ernst: Optimized Look-ahead Tree Search Policies. EWRL (2011)
Kristian Kersting: Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning. EWRL (2011)
Christos Dimitrakakis: Robust Bayesian Reinforcement Learning through Tight Lower Bounds. EWRL (2011)
Kfir Y. Levy, Nahum Shimkin: Unified Inter and Intra Options Learning Using Policy Gradient Methods. EWRL (2011)
Peter Stone: Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality. EWRL (2011)
Mauricio Araya-López, Olivier Buffet, Vincent Thomas, François Charpillet: Active Learning of MDP Models. EWRL (2011)
Bruno Scherrer, Matthieu Geist: Recursive Least-Squares Learning with Eligibility Traces. EWRL (2011)
Matthew W. Hoffman, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos: Regularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization. EWRL (2011)
Christos Dimitrakakis, Constantin A. Rothkopf: Bayesian Multitask Inverse Reinforcement Learning. EWRL (2011)
Charles Elkan: Reinforcement Learning with a Bilinear Q Function. EWRL (2011)
Kazuteru Miyazaki, Masaaki Ida: Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning. EWRL (2011)
Yuxi Li, Dale Schuurmans: MapReduce for Parallel Reinforcement Learning. EWRL (2011)
Tohgoroh Matsui, Takashi Goto, Kiyoshi Izumi, Yu Chen: Compound Reinforcement Learning: Theory and an Application to Finance. EWRL (2011)
Pablo Samuel Castro, Doina Precup: Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics. EWRL (2011)
Seiya Kuroda, Kazuteru Miyazaki, Hiroaki Kobayashi: Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot. EWRL (2011)
Phuong Minh Nguyen, Peter Sunehag, Marcus Hutter: Feature Reinforcement Learning in Practice. EWRL (2011)
Georgios Boutsioukis, Ioannis Partalas, Ioannis P. Vlahavas: Transfer Learning in Multi-Agent Reinforcement Learning Domains. EWRL (2011)
Matthew W. Robards, Peter Sunehag: Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control. EWRL (2011)
Peter Auer: Invited Talk: UCRL and Autonomous Exploration. EWRL (2011)
Csaba Szepesvári: Invited Talk: Towards Robust Reinforcement Learning Algorithms. EWRL (2011)
Ioannis Lambrou, Vassilis Vassiliades, Chris Christodoulou: An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings. EWRL (2011)
Sylvie C. W. Ong, Yuri Grinberg, Joelle Pineau: Goal-Directed Online Learning of Predictive Models. EWRL (2011)
Edouard Klein, Matthieu Geist, Olivier Pietquin: Batch, Off-Policy and Model-Free Apprenticeship Learning. EWRL (2011)
Boris Lesner, Bruno Zanuttini: Handling Ambiguous Effects in Action Learning. EWRL (2011)