Login / Signup
Efficient exploration for optimizing immediate reward.
Dale Schuurmans
Lloyd G. Greenwald
Published in:
AAAI/IAAI (1999)
Keyphrases
</>
reinforcement learning
artificial intelligence
multi agent
long run
reward function
real world
machine learning
search engine
social networks
computer vision
image segmentation
video sequences
least squares
medical images
average reward