Robust Transliteration Mining from Comparable Corpora with Bilingual Topic Models.
John RichardsonToshiaki NakazawaSadao KurohashiPublished in: IJCNLP (2013)
Keyphrases
- comparable corpora
- topic models
- cross language information retrieval
- text corpora
- text mining
- word pairs
- text documents
- parallel corpora
- machine translation
- bilingual lexicon
- topic modeling
- latent dirichlet allocation
- cross language
- bilingual dictionaries
- query translation
- language independent
- news articles
- cross lingual
- co occurrence
- named entities
- probabilistic model
- linguistic resources
- computational linguistics
- knowledge discovery
- language modeling
- relevance model
- translation model
- artificial intelligence
- machine translation system
- statistical machine translation
- information extraction
- sentence level
- natural language processing
- text analysis
- text classification
- generative model