Combination Approaches in Information Retrieval: Words vs. N-grams and Query Translation vs. Document Translation.
In-Su KangSeung-Hoon NaJong-Hyeok LeePublished in: NTCIR (2004)
Keyphrases
- n gram
- query translation
- language model
- information retrieval
- cross language information retrieval
- query terms
- query expansion
- translation model
- language modeling
- character n grams
- retrieve documents
- translation probabilities
- cross language
- machine translation
- document retrieval
- source language
- word level
- language independent
- cross lingual
- english chinese
- document collections
- text classification
- bag of words
- vector space model
- document representation
- out of vocabulary
- information retrieval systems
- parallel corpora
- bilingual dictionaries
- monolingual information retrieval
- retrieval model
- web documents
- relevance ranking
- parallel corpus
- test collection
- relevant documents
- retrieval systems
- tf idf
- text retrieval
- word segmentation
- search engine
- text mining
- text documents
- passage retrieval
- information extraction
- machine translation system
- document clustering
- chinese english
- feature selection
- information seeking