Bilingual Data Cleaning for SMT using Graph-based Random Walk.
Lei CuiDongdong ZhangShujie LiuMu LiMing ZhouPublished in: ACL (2) (2013)
Keyphrases
- random walk
- data cleaning
- statistical machine translation
- data integration
- record linkage
- machine translation
- data quality
- text classification
- outlier detection
- cross lingual
- markov chain
- data processing
- database
- missing values
- data warehouse
- data warehousing
- fraud detection
- information extraction
- semi supervised
- cross language information retrieval
- web usage mining
- databases
- link prediction
- data model
- query translation
- data analysis
- learning algorithm