TOCAB: A Dataset for Chinese Abusive Language Processing.
I ChungChuan-Jie LinPublished in: IRI (2021)
Keyphrases
- language processing
- natural language processing
- human language technology
- natural language
- human language
- machine translation
- spoken language
- knowledge representation
- benchmark datasets
- language understanding
- unknown words
- lexical information
- information retrieval
- grammar induction
- data mining
- artificial intelligence
- database
- natural language understanding
- training dataset