Feature Selection on Noisy Twitter Short Text Messages for Language Identification.
Mohd Zeeshan AnsariTanvir AhmadAna FatimaPublished in: CoRR (2020)
Keyphrases
- short text
- language identification
- topic detection
- feature selection
- text classification
- short text classification
- text categorization
- speaker identification
- document images
- noisy environments
- feature extraction
- text mining
- text data
- support vector
- feature set
- knn
- document clustering
- machine learning
- feature vectors
- multi modal
- feature space