Login / Signup
A New Methodology for Calculating Distributions of Reward Accumulated During a Finite Interval.
Muhammad A. Qureshi
William H. Sanders
Published in:
FTCS (1996)
Keyphrases
</>
reinforcement learning
long run
probability distribution
random variables
artificial intelligence
website
artificial neural networks
finite number
design methodology
joint distribution
kullback leibler divergence
real numbers
real valued functions
type fuzzy logic systems