Cross-lingual and generic text categorization (Apprentissage d'une classification thématique générique et cross-langue à partir des catégories de la Wikipédia) [in French].
François-Régis ChaumartinPublished in: TALN (2) (2013)
Keyphrases
- text categorization
- text classification
- cross lingual
- feature selection
- document classification
- cross language
- text classifiers
- naive bayes
- mono lingual
- knn
- language modeling
- bag of words
- image classification
- k nearest neighbor
- machine learning
- text documents
- text mining
- feature extraction
- n gram
- transfer learning
- semi supervised learning
- machine translation
- labeled data
- unsupervised learning
- support vector
- document clustering
- supervised learning
- probabilistic model
- artificial intelligence