Factored Translation with Unsupervised Word Clusters.
Christian Rishøj, Anders Søgaard
Published in: WMT@EMNLP (2011)
Keyphrases
- translation model
- statistical machine translation
- clustering algorithm
- machine translation system
- unsupervised clustering
- pointwise mutual information
- cluster validation
- English words
- agglomerative clustering
- bilingual dictionaries
- possibilistic clustering
- hierarchical clustering
- syntactic categories
- co-occurrence
- cross-language information retrieval
- n-gram
- self-organizing maps
- semi-supervised
- language model
- grammar induction
- unsupervised learning
- data clustering
- fuzzy clustering
- cluster analysis
- query translation
- state space
- word level
- document clustering
- word alignment
- parallel corpus
- word sense disambiguation
- English-Chinese
- cross-lingual
- cross-language
- target language
- syntactic analysis
- source language
- word segmentation
- machine translation
- multiword