Improving Optimality of Neural Rewards Regression for Data-Efficient Batch Near-Optimal Policy Identification.

Daniel Schneegaß, Steffen Udluft, Thomas Martinetz
Published in: ICANN (1) (2007)