Quasi Error-free Text Classification and Authorship Recognition in a large Corpus of English Literature based on a Novel Feature Set.
Arthur M. Jacobs, Annette Kinder
Published in: CoRR (2020)
Keyphrases
- feature set
- error free
- text classification
- linguistic features
- feature selection
- feature extraction
- semantic features
- feature reduction
- feature vectors
- classification accuracy
- random forest
- syntactic features
- error prone
- machine learning
- bag of words
- feature extraction and selection
- text categorization
- feature subset
- feature space
- text mining
- n gram
- natural language
- neural network
- texture features
- multi label
- error resilience
- data sets
- learning algorithm