Cross-lingual Dataless Classification for Languages with Small Wikipedia Presence.
Yangqiu Song, Stephen Mayhew, Dan Roth
Published in: CoRR (2016)
Keyphrases
- cross lingual
- text classification
- language independent
- machine translation
- language modeling
- multi lingual
- cross lingual information retrieval
- european languages
- cross language
- language specific
- query translation
- information retrieval
- feature vectors
- feature space
- bag of words
- linguistic resources
- image classification
- language model
- training set
- machine learning
- query expansion
- decision trees
- kNN
- knowledge base
- parallel corpus
- text categorization
- document clustering
- retrieval model
- transfer learning
- document collections