A Forensic Authorship Classification in SMS Messages: A Likelihood Ratio Based Approach Using N-gram.
Shunichi Ishihara
Published in: ALTA (2011)
Keyphrases
- n-gram
- likelihood ratio
- text classification
- hypothesis testing
- language model
- language modeling
- classification accuracy
- feature selection
- variable length
- language independent
- feature space
- decision trees
- feature vectors
- natural language
- part of speech
- information retrieval
- Viterbi algorithm
- language modelling
- inside-outside algorithm
- image classification
- machine learning methods
- Bayesian networks
- word segmentation
- statistical language modeling