Computational and Linguistic Issues in Designing a Syntactically Annotated Parallel Corpus of Indo-European Languages.
Dag T. HaugMarius L. JøhndalHanne M. EckhoffEirik WeloMari J. B. HertzenbergAngelika MüthPublished in: Trait. Autom. des Langues (2009)
Keyphrases
- european languages
- cross lingual
- parallel corpus
- language independent
- cross language
- machine translation
- text classification
- cross language information retrieval
- natural language
- parallel corpora
- machine translation system
- language modeling
- text retrieval
- document clustering
- natural language processing
- news articles
- transfer learning
- query translation
- bag of words
- n gram
- semi supervised learning
- target language
- language model
- information extraction
- learning algorithm