Feature selection for text classification based on part of speech filter and synonym merge.
Sijun Qin, Jia Song, Pengzhou Zhang, Yue Tan
Published in: FSKD (2015)
Keyphrases
- text classification
- part of speech
- n gram
- feature selection
- bag of words
- text documents
- training corpus
- text categorization
- pos tagging
- wordnet
- word sense disambiguation
- natural language processing
- unsupervised grammar induction
- k nearest neighbor
- text mining
- knn
- tf idf
- co occurrence
- word segmentation
- sentiment classification
- unsupervised learning
- language modeling
- machine learning
- labeled data
- support vector machine
- syntactic categories
- multiword
- feature set
- query expansion
- semantic features
- feature extraction
- support vector
- document retrieval
- language model
- feature vectors
- bayesian networks
- relation extraction
- feature reduction