Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification.
Alejandro Moreo, Andrea Pedrotti, Fabrizio Sebastiani
Published in: ACM Trans. Inf. Syst. (2023)
Keyphrases
- cross lingual
- text classification
- ensemble learning
- text documents
- unlabeled data
- text classifiers
- feature selection
- labeled data
- generalization ability
- text categorization
- language modeling
- document clustering
- ensemble methods
- naive bayes
- bag of words
- text mining
- information retrieval
- base classifiers
- machine learning
- keywords
- random forest
- information retrieval systems
- document collections
- news articles
- multi label
- retrieval systems
- vector space
- decision trees
- low dimensional
- k nearest neighbor
- transfer learning
- semi supervised learning
- multi class
- n gram
- data mining
- dimensionality reduction
- information extraction
- test collection
- data analysis
- training data