Counterfactual Off-Policy Training for Neural Response Generation.
Qingfu ZhuWeinan ZhangTing LiuWilliam Yang WangPublished in: CoRR (2020)
Keyphrases
- neural network
- network architecture
- training examples
- recurrent networks
- training phase
- training set
- supervised learning
- training process
- nonlinear predictive control
- data sets
- artificial neural
- neural model
- generation process
- bio inspired
- serious games
- training samples
- case study
- artificial intelligence
- genetic algorithm