Cross-lingual hate speech detection based on multilingual domain-specific word embeddings.
Aymé ArangoJorge PérezBarbara PobletePublished in: CoRR (2021)
Keyphrases
- cross lingual
- domain specific
- multi lingual
- parallel corpus
- translation model
- language specific
- machine translation
- out of vocabulary
- cross lingual information retrieval
- cross language
- language modeling
- word sense
- language independent
- word alignment
- word segmentation
- indian languages
- text classification
- speech recognition
- n gram
- word sense disambiguation
- statistical machine translation
- query translation
- machine translation system
- parallel corpora
- transfer learning
- language model
- monolingual and cross lingual
- co occurrence
- bilingual dictionaries
- cross language information retrieval
- document clustering
- vector space
- web news
- keywords
- retrieval model
- text categorization
- comparable corpora
- query expansion
- information retrieval systems