From digital library to n-grams: NB N-gram.
Magnus Breder BirkenesLars G. JohnsenArne Martinus LindstadJohanne OstadPublished in: NODALIDA (2015)
Keyphrases
- n gram
- digital libraries
- text classification
- naive bayes
- language model
- bag of words
- language independent
- metadata
- variable length
- viterbi algorithm
- language modelling
- information access
- part of speech
- language modeling
- text categorization
- multimedia
- word segmentation
- data mining
- statistical language modeling
- decision trees
- machine learning
- bayesian networks