Rigorous dimensionality reduction through linguistically motivated feature selection for text categorization.
Hans Friedrich Witschel, Chris Biemann
Published in: NODALIDA (2005)
Keyphrases
- linguistically motivated
- feature selection for text categorization
- dimensionality reduction
- co-occurrence
- linguistic knowledge
- text categorization
- feature selection
- noun phrases
- natural language
- high dimensional
- information gain
- principal component analysis
- low dimensional
- feature extraction
- unsupervised learning
- data points
- feature space
- principal components
- mutual information
- WordNet
- artificial intelligence
- user queries
- natural language processing
- supervised learning
- feature vectors