A data-based classification of Slavic languages: Indices of qualitative variation applied to grapheme frequencies.
Michaela KoscováJán MacutekEmmerich KelihPublished in: CoRR (2015)
Keyphrases
- data sets
- database
- synthetic data
- classification accuracy
- feature extraction
- data analysis
- data quality
- image data
- original data
- raw data
- spatial data
- statistical analysis
- data processing
- pattern classification
- knowledge discovery
- classification models
- quantitative and qualitative
- high quality
- decision trees
- feature subset
- quantitative data
- data collection
- text classification
- image classification
- high dimensional
- preprocessing
- support vector