Identifying Misaligned Spans in Parallel Corpora Using Change Point Detection.
Andrea PagottoPatrick LittellYunli WangCyril GouttePublished in: Canadian Conference on AI (2019)
Keyphrases
- parallel corpora
- change point detection
- cross language information retrieval
- language independent
- machine translation
- sequential data
- non stationary
- cross lingual
- labor intensive
- normalized maximum likelihood
- cross language
- query translation
- outlier detection
- word pairs
- machine translation system
- text retrieval
- sentence level
- statistical machine translation
- semi automatic
- keywords
- information retrieval