Establishing Baselines for Text Classification in Low-Resource Languages.
Jan Christian Blaise CruzCharibeth ChengPublished in: CoRR (2020)
Keyphrases
- text classification
- language independent
- cross lingual
- naive bayes
- sentiment analysis
- text categorization
- bag of words
- expressive power
- feature selection
- text mining
- machine learning
- n gram
- text documents
- resource management
- labeled data
- text classifiers
- sentiment classification
- semantic features
- multi label
- language modeling
- databases
- knn
- data cleaning
- text summarization
- training data