SexWEs: Domain-Aware Word Embeddings via Cross-lingual Semantic Specialisation for Chinese Sexism Detection in Social Media.
Aiqi Jiang, Arkaitz Zubiaga
Published in: CoRR (2022)
Keyphrases
- cross lingual
- word segmentation
- social media
- event extraction
- word sense
- translation model
- mono lingual
- machine translation
- parallel corpus
- language modeling
- cross lingual information retrieval
- language independent
- word alignment
- Chinese English
- statistical machine translation
- cross language
- text classification
- unknown words
- transfer learning
- n gram
- language model
- co occurrence
- word sense disambiguation
- document clustering
- Indian languages
- machine translation system
- semantic similarity
- natural language
- parallel corpora
- target language
- online news
- bag of words
- query translation
- information retrieval
- news articles
- learning algorithm