Feature Extraction in Subject Classification of Text Documents in Polish.
Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
Published in: ICAISC (2) (2018)
Keyphrases
- text documents
- feature extraction
- text classification
- document classification
- text mining
- image classification
- feature selection
- pattern recognition
- text categorization
- feature vectors
- automatic text categorization
- document clustering
- bag of words
- support vector machine (SVM)
- information extraction
- classification accuracy
- support vector
- feature space
- text clustering
- feature set
- machine learning
- wordnet
- image processing
- text data
- data mining
- training set
- news articles
- keywords
- topic models
- support vector machine
- named entities
- classification algorithm
- unsupervised learning
- supervised learning
- decision trees
- information retrieval
- maximum likelihood
- natural language processing
- nearest neighbor
- data sets