Online Bandit Learning with Offline Preference Data.
Akhil AgnihotriRahul JainDeepak RamachandranZheng WenPublished in: CoRR (2024)
Keyphrases
- data sets
- online learning
- prior knowledge
- knowledge discovery
- database
- high quality
- data analysis
- synthetic data
- data processing
- learning models
- learning systems
- real time
- data quality
- original data
- human experts
- data collection
- input data
- learning process
- data structure
- learning algorithm
- knowledge acquisition
- small number
- end users
- sensor data
- metadata