Scalable Cross-lingual Document Similarity through Language-specific Concept Hierarchies.
Carlos Badenes-OlmedoJosé Luis Redondo GarcíaÓscar CorchoPublished in: CoRR (2021)
Keyphrases
- cross lingual
- language specific
- concept hierarchy
- machine translation
- document clustering
- language independent
- language modeling
- document representation
- co occurrence
- text corpora
- text classification
- wordnet
- keywords
- user profiles
- background knowledge
- text mining
- information retrieval
- transfer learning
- news articles
- n gram
- information extraction
- text documents
- language model