The use of rating and Likert scales in Natural Language Generation human evaluation tasks: A review and some recommendations.
Jacopo AmideiPaul PiwekAlistair WillisPublished in: INLG (2019)
Keyphrases
- natural language generation
- recommender systems
- dialogue system
- dialog systems
- natural language processing
- natural language
- human users
- human operators
- online reviews
- text generation
- user preferences
- machine translation
- rating prediction
- collaborative filtering
- human computer
- word order
- aggregated search
- dialogue management
- personalized recommendation
- ground truth
- xml retrieval
- text classification