Using Kernel Density Classifier with Topic Model and Cost Sensitive Learning for Automatic Text Categorization.
Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
Published in: ICDAR (2009)
Keyphrases
- cost sensitive learning
- topic models
- text documents
- cost sensitive
- missing values
- class imbalance
- active learning
- text mining
- probabilistic model
- generative model
- decision trees
- co-occurrence
- information retrieval
- class distribution
- text categorization
- data sets
- mean shift
- natural language processing
- similarity measure
- machine learning
- data mining