N-Gram Based Approach for Text Authorship Classification: Metric Selection.
Elena MikhailovaPolina DiurdevaDmitry S. ShalymovPublished in: Int. J. Embed. Real Time Commun. Syst. (2017)
Keyphrases
- n gram
- text classification
- language model
- character n grams
- classification accuracy
- language modeling
- variable length
- image classification
- text documents
- language independent
- bag of words
- machine learning
- text mining
- neural network
- language specific
- support vector machine
- part of speech
- distance measure
- language modelling
- word level
- text classifiers
- text retrieval
- text categorization
- information retrieval
- decision trees
- naive bayes classifier
- cross lingual
- document retrieval