Value Function Dynamic Estimation in Reinforcement Learning based on Data Adequacy.
Huifan GaoYinghui PanJing TangYifeng ZengPeihua ChaiLangcai CaoPublished in: HPCCT/BDAI (2020)
Keyphrases
- reinforcement learning
- synthetic data
- training data
- data sources
- original data
- data structure
- data analysis
- data collection
- data sets
- feature selection
- high quality
- prior knowledge
- probability density
- database
- complex data
- raw data
- supervised learning
- knowledge discovery
- search engine
- machine learning
- data mining techniques
- data processing
- small number
- sensor data
- end users
- density estimation
- neural network