Handling Orthographic Varieties in Japanese IR: Fusion of Word-, N-Gram-, and Yomi-Based Indices Across Different Document Collections.
Nina KummerChrista Womser-HackerNoriko KandoPublished in: AIRS (2005)
Keyphrases
- n gram
- document collections
- information retrieval
- information retrieval systems
- language model
- text retrieval
- ad hoc retrieval
- document retrieval
- text classification
- retrieval model
- language independent
- bag of words
- language modeling
- relevant documents
- test collection
- query expansion
- data fusion
- xml retrieval
- retrieval effectiveness
- part of speech
- digital libraries
- cross language
- scatter gather
- word segmentation
- document representation
- text mining
- document clustering
- passage retrieval
- character n grams
- information access
- information extraction
- question answering
- machine learning
- search engine
- retrieval systems