Text Representations for Text Categorization: A Case Study in Biomedical Domain.
Man LanChew Lim TanJian SuHwee-Boon LowPublished in: IJCNN (2007)
Keyphrases
- multi instance multi label learning
- text categorization
- text documents
- text collections
- document categorization
- cross domain
- automatic categorization
- text classifiers
- text classification
- textual data
- text mining
- feature selection
- text clustering
- naive bayes
- k nearest neighbor
- reuters corpus
- multi label
- information gain
- knn
- information extraction
- semi supervised learning
- text data
- automated text categorization
- information retrieval
- feature selections
- term frequency
- tf idf
- word frequency
- knowledge discovery
- similarity measure
- decision trees
- learning algorithm