Login / Signup

Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization.

Gabriel Dulac-ArnoldLudovic DenoyerPhilippe PreuxPatrick Gallinari
Published in: ECML/PKDD (2) (2012)
Keyphrases