A Closer Look at Clustering Bilingual Comparable Corpora.
Anna Laskina, Éric Gaussier, Gaëlle Calvary
Published in: LREC/COLING (2024)
Keyphrases
- comparable corpora
- cross language information retrieval
- parallel corpora
- bilingual lexicon
- news articles
- terminology extraction
- machine translation
- language modeling
- clustering algorithm
- cross lingual
- word pairs
- clustering method
- k-means
- text documents
- document clustering
- text corpora
- cross language
- query translation
- bidirectional
- language independent
- bilingual dictionaries
- information retrieval
- labor intensive
- query expansion
- parallel corpus
- linguistic resources
- machine learning