Multilingual Text Classification for Dravidian Languages.
Xiaotian LinNankai LinKanoksak WattanachoteShengyi JiangLianxi WangPublished in: CoRR (2021)
Keyphrases
- text classification
- language independent
- cross lingual
- n gram
- multi lingual
- language specific
- cross lingual information retrieval
- bag of words
- text categorization
- text mining
- naive bayes
- feature selection
- cross language
- labeled data
- multilingual information retrieval
- text classifiers
- sentiment analysis
- machine learning
- semantic features
- text data
- multilingual documents
- text documents
- multi label
- language modeling
- parallel corpus
- indian languages
- language resources
- knn
- machine translation
- statistical machine translation
- query expansion
- comparable corpora
- training data