Features of Latinate Etymologies on the Tasks of Text Categorization.
Wanyin Li, Alex Chengyu Fang
Published in: Int. J. Comput. Linguistics Appl. (2011)
Keyphrases
- text categorization
- feature generation
- information gain
- feature weighting
- feature selection
- text classification
- multi label
- knn
- feature reduction
- linear svm
- semi supervised learning
- feature extraction
- text documents
- term frequency
- training documents
- text classifiers
- feature vectors
- k nearest neighbor
- tf idf
- feature set
- feature selections
- automated text categorization
- feature space
- multi instance multi label learning
- automatic text categorization
- term weighting
- feature selection and classifier
- reuters corpus
- semi supervised
- natural language processing
- co occurrence
- object recognition
- prior knowledge
- multiple features
- classification accuracy
- machine learning
- naive bayes