Computing n-Gram Statistics in MapReduce
Klaus BerberichSrikanta J. BedathurPublished in: CoRR (2012)
Keyphrases
- n gram
- language model
- language independent
- text classification
- bag of words
- language modelling
- variable length
- language modeling
- word segmentation
- web documents
- inside outside algorithm
- databases
- word level
- viterbi algorithm
- cloud computing
- knn
- classification accuracy
- bayesian networks
- artificial intelligence
- information retrieval