Keyphrases
- text classification
- data sets
- data collection
- data cleaning
- data quality
- small number
- knowledge discovery
- training data
- data sources
- original data
- statistical analysis
- feature selection
- data mining techniques
- data analysis
- prior knowledge
- text data
- labeled data
- missing values
- synthetic data
- missing data
- database
- text mining
- databases
- experimental data
- bag of words
- high dimensional data
- end users
- image data
- input data
- high quality