A New Approach to Automated Text Readability Classification based on Concept Indexing with Integrated Part-of-Speech n-gram Features.
Abigail R. Razon, John A. Barnden
Published in: RANLP (2015)
Keyphrases
- n-gram
- part of speech
- syntactic features
- text classification
- classification accuracy
- language model
- noun phrases
- feature extraction
- feature set
- feature vectors
- feature space
- language independent
- syntactic categories
- bag of words
- information retrieval
- language modeling
- text documents
- POS tagging
- web documents
- POS taggers
- feature selection
- decision trees
- machine learning
- word segmentation
- image classification
- language specific
- text mining
- document analysis
- training data
- training set
- natural language processing