Beyond content: discriminatory power of function words in text type classification.
Klára Venglarová, Vladimír Matlach
Published in: Digit. Scholarsh. Humanit. (2024)
Keyphrases
- discriminatory power
- classification accuracy
- textual features
- keywords
- text documents
- document content
- feature space
- support vector
- information retrieval
- feature selection
- text recognition
- text corpus
- text content
- web documents
- text classification
- recognition rate
- English words
- metadata
- semantic content
- short text
- related words
- punctuation marks
- linguistic information
- word pairs
- data mining
- textual content
- word segmentation
- data sets
- semantic information
- training set
- decision trees
- Chinese text
- syntactic categories