A reinforcement learning recommender system using bi-clustering and Markov Decision Process.
Arta IftikharMustansar Ali GhazanfarMubbashir AyubSaad Ali AlahmariNadeem QaziJulie A. WallPublished in: Expert Syst. Appl. (2024)
Keyphrases
- markov decision process
- reinforcement learning
- recommender systems
- markov decision processes
- state space
- optimal policy
- temporal difference learning
- finite horizon
- infinite horizon
- policy iteration
- markov games
- partial observability
- k means
- reinforcement learning algorithms
- function approximation
- initial state
- action space
- state action
- transition probabilities
- partially observable
- control problems
- optimal control
- evaluation function
- sufficient conditions
- markov decision problems
- dynamic programming