XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment.
Ahmed El-KishkyAdithya RenduchintalaJames CrossFrancisco GuzmánPhilipp KoehnPublished in: EMNLP (1) (2021)
Keyphrases
- cross lingual
- word alignment
- machine translation
- parallel corpus
- language independent
- text classification
- cross language
- statistical machine translation
- language modeling
- text mining
- data mining
- knowledge base
- machine translation system
- news articles
- translation model
- language model
- parallel corpora
- query translation
- markov networks
- target language
- cross language information retrieval
- natural language
- document classification
- semi supervised
- named entities
- transfer learning