Sopoken Term Detection Based on a Syllable N-gram Index at the NTCIR-11 SpokenQuery&Doc Task.
Nagisa SakamotoKazumasa YamamotoSeiichi NakagawaPublished in: NTCIR (2014)
Keyphrases
- n gram
- language model
- text classification
- language independent
- test collection
- variable length
- bag of words
- language modeling
- language modelling
- viterbi algorithm
- part of speech
- out of vocabulary
- document representation
- text categorization
- language specific
- inside outside algorithm
- query terms
- bayesian networks
- term frequency
- word level
- character n grams