Publication: Pre-processing English-Hindi Corpus for Statistical Machine Translation.