TyDiP: A Dataset for Politeness Classification in Nine Typologically Diverse Languages.
Anirudh Srinivasan, Eunsol Choi
Published in: CoRR (2022)
Keyphrases
- benchmark datasets
- classification accuracy
- UCI datasets
- pattern recognition
- pattern classification
- classification systems
- decision trees
- support vector machine
- machine learning methods
- classification method
- feature set
- class labels
- databases
- training samples
- support vector machine (SVM)
- classification process
- object classification
- language independent
- classification rules
- text classification
- image classification
- supervised learning
- machine learning
- natural language processing
- feature space
- preprocessing
- classification models
- automatic classification
- classification scheme
- support vector
- feature extraction
- real world
- neural network