Text Classification in an Under-Resourced Language via Lexical Normalization and Feature Pooling.
Omayya SohailInam ElahiAhsan IjazAsim KarimFaisal KamiranPublished in: PACIS (2018)
Keyphrases
- text classification
- lexical information
- bag of words
- feature selection
- naive bayes
- supervised feature selection
- text mining
- programming language
- context sensitive
- n gram
- text data
- machine learning
- natural language
- feature set
- feature vectors
- knn
- text categorization
- syntactic categories
- spatial pyramid matching
- sentiment analysis
- domain specific
- linguistic analysis
- text classifiers
- semantic features
- natural language processing
- preprocessing
- text documents
- neural network
- cross lingual
- word sense disambiguation
- data cleaning
- normalization method
- knowledge base
- wordnet
- spatial pooling