N-gram Based Text Classification According To Authorship.
Andelka Zecevic
Published in: RANLP Student Research Workshop (2011)
Keyphrases
- text classification
- n-gram
- bag of words
- text categorization
- text mining
- feature selection
- naive bayes
- sentiment analysis
- text documents
- variable length
- labeled data
- text classifiers
- machine learning
- k nearest neighbor
- knn
- multi label
- document classification
- text data
- language modeling
- data cleaning
- text classification tasks
- semantic features
- learning algorithm
- databases
- writing style
- authorship attribution
- database