An index-based joint multilingual/cross-lingual text categorization using topic expansion via BabelNet.
Eniafe Festus Ayetiran
Published in: Turkish J. Electr. Eng. Comput. Sci. (2020)
Keyphrases
- text categorization
- cross-lingual
- text classification
- cross-language
- news articles
- text documents
- cross-lingual information retrieval
- language-independent
- transfer learning
- language modeling
- machine translation
- text mining
- kNN
- text classifiers
- feature selection
- translation model
- bag of words
- labeled data
- k-nearest neighbor
- machine learning
- term frequency
- semi-supervised learning
- unlabeled data
- topic models
- document clustering
- language model
- clustering algorithm
- unsupervised learning
- similarity search
- information extraction
- natural language
- n-gram
- statistical machine translation