DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging.
Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen
Published in: CoRR (2024)
Keyphrases
- knowledge sources
- domain knowledge
- domain models
- probabilistic model
- statistical model
- hybrid model
- modeling method
- prior knowledge
- experimental data
- statistical models
- accurate models
- computational model
- neural network model
- mathematical model
- classification models
- linear models
- management system
- modeling framework
- parametric models
- linear model
- mathematical models
- analytical model
- learning models
- metamodel
- linear regression
- computational models
- goodness of fit
- similarity measure
- multiple models
- learned models
- social networks
- model validation
- hierarchical model
- predictive model
- prediction model
- petri net
- generative model
- model selection
- maximum likelihood