Generalized Funnelling: Ensemble Learning and Heterogeneous Document Embeddings for Cross-Lingual Text Classification.
Andrea Pedrotti, Fabrizio Sebastiani, Alejandro Moreo
Published in: IIR (2021)
Keyphrases
- cross lingual
- text classification
- ensemble learning
- text documents
- unlabeled data
- text classifiers
- feature selection
- text categorization
- generalization ability
- labeled data
- naive bayes
- language modeling
- text mining
- bag of words
- document clustering
- ensemble methods
- base classifiers
- knn
- vector space
- information retrieval
- random forest
- keywords
- information retrieval systems
- machine learning
- n gram
- concept drift
- transfer learning
- multi label
- k nearest neighbor
- unsupervised learning
- active learning
- document collections
- image classification
- natural language processing
- support vector
- artificial intelligence
- user queries