Multilingual and cross-lingual document classification: A meta-learning approach.
Niels van der Heijden, Helen Yannakoudakis, Pushkar Mishra, Ekaterina Shutova
Published in: CoRR (2021)
Keyphrases
- cross-lingual
- document classification
- meta-learning
- text classification
- feature selection
- cross-lingual information retrieval
- learning tasks
- text categorization
- machine translation
- inductive learning
- machine learning
- text mining
- model selection
- language modeling
- transfer learning
- decision trees
- text documents
- data mining
- classification algorithm
- bag of words
- web documents
- machine learning algorithms
- naive bayes
- document clustering
- news articles
- information extraction
- information retrieval
- word alignment
- n-gram
- language model
- kNN
- training set
- topic modeling
- databases