Constructing Cross-lingual Consumer Health Vocabulary with Word-Embedding from Comparable User Generated Content.
Chia-Hsuan Chang, Lei Wang, Christopher C. Yang
Published in: CoRR (2022)
Keyphrases
- cross lingual
- user generated content
- out of vocabulary
- translation model
- parallel corpus
- word segmentation
- word sense
- social media
- machine translation
- language modeling
- word alignment
- cross lingual information retrieval
- language independent
- parallel corpora
- cross language
- statistical machine translation
- indian languages
- recommender systems
- language model
- chinese english
- machine translation system
- bilingual dictionaries
- word sense disambiguation
- sentiment classification
- n gram
- co occurrence
- query translation
- source language
- keywords
- information retrieval
- text classification
- cross language information retrieval
- transfer learning
- information extraction
- image retrieval
- vector space