Parallel corpus alignment at the document, sentence and vocabulary levels.
Rogelio NazarPublished in: Proces. del Leng. Natural (2011)
Keyphrases
- parallel corpus
- sentence pairs
- word alignment
- word level
- language independent
- document classification
- cross lingual
- machine translation
- source language
- document clustering
- keywords
- cross language information retrieval
- machine translation system
- document images
- target language
- latent semantic analysis
- statistical machine translation
- information retrieval systems
- information retrieval
- web documents
- n gram
- document collections
- query translation
- markov networks
- text documents
- tf idf
- document representation
- relevant documents
- retrieval systems
- semantic space
- text categorization
- query terms
- semantic information