Bi-Text Alignment of Movie Subtitles for Spoken English-Arabic Statistical Machine Translation.
Fahad Al-Obaidli, Stephen Cox, Preslav Nakov
Published in: CoRR (2016)
Keyphrases
- statistical machine translation
- word alignment
- machine translation
- Arabic language
- word level
- machine translation system
- grammar induction
- training corpus
- phrase-based SMT
- multiword
- source language
- parallel texts
- parallel corpus
- language model
- minimum error rate
- Chinese-English
- cross lingual
- language processing
- natural language generation
- speech recognition
- cross language information retrieval
- language independent
- translation model
- parallel corpora
- text retrieval
- text mining
- information extraction
- information retrieval
- machine learning
- target language
- natural language
- keywords
- text clustering
- text summarization