Spoken Term Detection by N-gram Index with Exact Distance for NTCIR-SpokenDoc2.
Nagisa Sakamoto, Seiichi Nakagawa
Published in: NTCIR (2013)
Keyphrases
- n gram
- spoken term detection
- language model
- test collection
- out of vocabulary
- language modeling
- language independent
- bag of words
- part of speech
- text classification
- retrieval model
- probabilistic model
- Viterbi algorithm
- document retrieval
- word segmentation
- information retrieval
- retrieval effectiveness
- relevant documents
- average precision
- text mining
- feature selection
- search engine