LSDC - A comprehensive dataset for Low Saxon Dialect Classification.
Janine SiewertYves ScherrerMartijn WielingJörg TiedemannPublished in: VarDial@COLING (2020)
Keyphrases
- benchmark datasets
- classification accuracy
- classification systems
- classification scheme
- classification method
- feature set
- automatic classification
- pattern recognition
- pattern classification
- uci datasets
- classification process
- decision trees
- supervised learning
- text classification
- support vector machine svm
- feature extraction
- training data
- document classification
- feature space
- classification models
- benchmark data sets
- database
- false negative
- feature selection
- preprocessing
- classification algorithm
- decision rules
- cross validation
- class labels
- training set
- machine learning algorithms
- high dimensional