Login / Signup
Fixed point iteration using stochastic reward nets.
Varsha Mainkar
Kishor S. Trivedi
Published in:
PNPM (1995)
Keyphrases
</>
fixed point
reinforcement learning
dynamical systems
belief propagation
bargaining solution
monte carlo
floating point
objective function
sufficient conditions
computer vision
policy iteration
constraint databases
higher order
variational inequalities
average reward