Login / Signup
Calculating Transient Distributions of Cumulative Reward.
Edmundo de Souza e Silva
H. Richard Gail
Reinaldo Vallejos Campos
Published in:
SIGMETRICS (1995)
Keyphrases
</>
reinforcement learning
probability distribution
steady state
random variables
power law
databases
statistical distributions
heavy tailed
real time
mobile robot
computer vision
artificial intelligence
joint distribution
long run
normal distribution
machine learning
data mining
log normal