Lattice Score Based Data Cleaning for Phrase-Based Statistical Machine Translation.
Jie Jiang, Julie Carson-Berndsen, Andy Way
Published in: EAMT (2010)
Keyphrases
- statistical machine translation
- data cleaning
- data integration
- machine translation
- record linkage
- data quality
- text classification
- outlier detection
- word alignment
- language model
- database
- data processing
- web usage mining
- fraud detection
- Chinese English
- missing values
- data warehousing
- machine translation system
- information extraction
- cross lingual
- data warehouse
- cross language information retrieval
- multiword
- translation model
- high dimensional
- machine learning
- information retrieval
- target language
- data model
- social networks
- database systems
- search engine
- user behavior
- text mining