BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation.
Tianxiang Sun, Junliang He, Xipeng Qiu, Xuanjing Huang
Published in: EMNLP (2022)
Keyphrases
- text generation
- natural language generation
- natural language
- social interaction
- average reward reinforcement learning
- social media
- social networks
- theorem prover
- programming language
- evaluation metrics
- social relationships
- natural language processing
- electronic commerce
- software engineering
- machine translation
- online communities
- language processing
- social behavior
- relational databases