XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment.
Ahmed El-KishkyAdi RenduchintalaJames CrossFrancisco GuzmánPhilipp KoehnPublished in: CoRR (2021)
Keyphrases
- cross lingual
- word alignment
- machine translation
- language modeling
- cross language
- parallel corpus
- language independent
- text classification
- statistical machine translation
- data mining
- machine translation system
- translation model
- markov networks
- knowledge base
- graphical models
- knowledge discovery
- information retrieval
- language model
- probabilistic model
- query translation
- news articles
- text mining
- named entities
- parallel corpora