Training Part-of-Speech Taggers to build Machine Translation Systems for Less-Resourced Language Pairs.
Felipe Sánchez-MartínezCarme Armentano-OllerJuan Antonio Pérez-OrtizMikel L. ForcadaPublished in: Proces. del Leng. Natural (2007)
Keyphrases
- part of speech
- machine translation system
- pos tagging
- machine translation
- target language
- lexical information
- natural language processing
- n gram
- word sense disambiguation
- syntactic categories
- source language
- multiword
- bilingual dictionaries
- query translation
- statistical machine translation
- pos taggers
- cross language information retrieval
- parse tree
- cross lingual
- word alignment
- training set
- parallel corpora
- supervised learning
- natural language
- language model
- information extraction
- tf idf
- translation model
- text documents