Text Classification for Monolingual Political Manifestos with Words Out of Vocabulary.
Arsenii RasovIlya ObabkovEckehard OlbrichIvan P. YamshchikovPublished in: COMPLEXIS (2020)
Keyphrases
- out of vocabulary
- text classification
- cross lingual
- n gram
- chinese english
- word segmentation
- english chinese
- cross language information retrieval
- term frequency
- spoken document retrieval
- cross language
- bag of words
- parallel corpora
- text categorization
- language modeling
- language independent
- text documents
- language specific
- sentiment classification
- language model
- feature selection
- query translation
- machine translation
- text mining
- text classifiers
- linguistic features
- machine learning
- knn
- parallel corpus
- word alignment
- hand crafted
- translation model
- part of speech
- transfer learning
- speech recognition
- semantic features