Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling.
Jie WangAlexandros KaratzoglouIoannis ArapakisJoemon M. JosePublished in: CoRR (2024)
Keyphrases
- language model
- reinforcement learning
- recommender systems
- state action
- language modeling
- state space
- action space
- speech recognition
- agent learns
- information retrieval
- probabilistic model
- n gram
- document retrieval
- language modelling
- statistical language modeling
- action selection
- query expansion
- collaborative filtering
- test collection
- agent receives
- ad hoc information retrieval
- reward signal
- statistical language models
- markov decision processes
- retrieval model
- reinforcement learning algorithms
- reward shaping
- context sensitive
- learning algorithm
- machine learning
- discounted reward
- smoothing methods
- user profiles
- error rate
- pseudo relevance feedback
- optimal policy