Efficient Social Network Multilingual Classification using Character, POS n-grams and Dynamic Normalization.
Carlos-Emiliano González-Gallardo, Juan-Manuel Torres-Moreno, Azucena Montes Rendón, Gerardo Sierra
Published in: CoRR (2017)
Keyphrases
- word level
- n-gram
- language independent
- text classification
- part of speech
- social networks
- language model
- language specific
- word segmentation
- bag of words
- variable length
- Viterbi algorithm
- classification accuracy
- machine learning
- language modeling
- cross lingual
- language modelling
- text categorization
- image classification
- feature extraction
- decision trees
- feature selection
- social network analysis
- document retrieval
- web documents
- parallel corpora
- document collections
- information retrieval