TyDiP: A Dataset for Politeness Classification in Nine Typologically Diverse Languages.
Anirudh Srinivasan, Eunsol Choi
Published in: EMNLP (Findings) (2022)
Keyphrases
- classification accuracy
- benchmark datasets
- pattern recognition
- uci datasets
- training dataset
- machine learning
- wide variety
- feature set
- preprocessing
- training set
- feature vectors
- classification algorithm
- unsupervised learning
- supervised learning
- database
- learning algorithm
- automatic classification
- pattern classification
- image classification
- real world
- feature extraction
- databases
- query language
- face recognition
- object classification
- decision trees
- classification process
- target language
- feature selection