Learning to Control Renewal Processes with Bandit Feedback.
Semih Cayci, Atilla Eryilmaz, R. Srikant
Published in: Proc. ACM Meas. Anal. Comput. Syst. (2019)
Keyphrases
- learning process
- learning tasks
- prior knowledge
- learning algorithm
- reinforcement learning
- control rules
- case study
- learning environment
- learning community
- incremental learning
- supervised learning
- motor skills
- random sampling
- learning analytics
- learning systems
- Markov chain
- data sets
- support vector
- Bayesian networks
- neural network