Research on Multilingual News Clustering Based on Cross-Language Word Embeddings.
Lin WuRui LiWong-Hing LamPublished in: CoRR (2023)
Keyphrases
- cross language
- spoken document retrieval
- character n grams
- question answering
- text retrieval
- cross language information retrieval
- language independent
- bilingual lexicon
- document retrieval
- text categorization
- information access
- document collections
- cross lingual
- translation model
- query translation
- co occurrence
- low dimensional
- vector space
- n gram
- artificial intelligence
- word segmentation
- bilingual dictionaries
- parallel corpora
- query words
- textual and visual information
- source language
- information retrieval systems
- retrieval model
- dimensionality reduction
- natural language processing
- language specific
- digital libraries
- multimedia
- feature selection
- information retrieval
- machine learning