Spoken Term Detection Based on a Syllable N-gram Index at the NTCIR-11 SpokenQuery&Doc Task.

Nagisa Sakamoto, Kazumasa Yamamoto, Seiichi Nakagawa

Published in: NTCIR (2014)

Keyphrases

n gram
language model
text classification
language independent
test collection
variable length
bag of words
language modeling
language modelling
viterbi algorithm
part of speech
out of vocabulary
document representation
text categorization
language specific
inside outside algorithm
query terms
bayesian networks
term frequency
word level
character n grams