A Data-based Classification of Slavic Languages: Indices of Qualitative Variation Applied to Grapheme Frequencies.
Michaela KoscováJán MacutekEmmerich KelihPublished in: J. Quant. Linguistics (2016)
Keyphrases
- data sets
- data collection
- data sources
- data analysis
- data processing
- data quality
- image data
- data distribution
- classification accuracy
- database
- synthetic data
- databases
- classification method
- pattern recognition
- data structure
- high quality
- data points
- labeled data
- statistical analysis
- model selection
- quantitative and qualitative
- knowledge discovery
- data mining
- xml documents
- high dimensional
- preprocessing
- neural network
- support vector
- feature selection
- machine learning