Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain.
Zdenka UresováJan HajicPavel PecinaOndrej DusekPublished in: LREC (2014)
Keyphrases
- test set
- cross lingual information retrieval
- search queries
- machine translation
- cross lingual
- chinese english
- search engine
- web search
- information retrieval systems
- language independent
- parallel corpora
- web search engines
- information extraction
- training set
- cross language information retrieval
- natural language processing
- query translation
- user queries
- statistical machine translation
- cross language
- query logs
- relevant documents
- target language
- domain specific
- parallel corpus
- training data
- translation model
- machine translation system
- linguistic resources
- query terms
- keyword search
- knowledge representation
- information retrieval
- comparable corpora
- word alignment
- natural language
- text mining
- machine learning