Accurate Stemming of Dutch for Text Classification.
Tanja GaustadGosse BoumaPublished in: CLIN (2001)
Keyphrases
- text classification
- n gram
- bag of words
- naive bayes
- machine learning
- information retrieval
- computationally efficient
- text documents
- text categorization
- high accuracy
- high quality
- text mining
- feature selection
- sentiment analysis
- text data
- supervised learning
- information extraction
- knowledge discovery
- decision trees
- highly accurate
- document classification
- semantic features