Leveraging Arabic-English Bilingual Corpora with Crowd Sourcing-Based Annotation for Arabic-Hebrew SMT.
Manish GauravGuruprasad SaikumarAmit SrivastavaPremkumar NatarajanShankar AnanthakrishnanSpyros MatsoukasPublished in: CICLing (2) (2013)
Keyphrases
- arabic language
- crowd sourcing
- language identification
- statistical machine translation
- machine translation
- mt evaluation
- arabic documents
- genetic algorithm
- social networking
- active learning
- data sets
- document images
- optical character recognition
- computational model
- natural language processing
- natural language
- crowd sourced
- word forms
- metadata