MT Detection in Web-Scraped Parallel Corpora.
Spencer RarrickChris QuirkWilliam LewisPublished in: MTSummit (2011)
Keyphrases
- parallel corpora
- machine translation
- cross language information retrieval
- query translation
- statistical machine translation
- web pages
- web documents
- cross lingual
- language independent
- link analysis
- information extraction
- information sources
- anomaly detection
- cross language
- text retrieval
- web data
- machine learning
- user experience
- web mining
- information retrieval