Open Set Subject Classification of Text Documents in Polish by Doc-to-Vec and Local Outlier Factor.
Tomasz WalkowiakSzymon DatkoHenryk MaciejewskiPublished in: ICAISC (2) (2019)
Keyphrases
- text documents
- text classification
- document classification
- text mining
- automatic text categorization
- text data
- information extraction
- document clustering
- keywords
- text clustering
- classification accuracy
- text categorization
- bag of words
- machine learning
- news articles
- feature vectors
- classification algorithm
- feature extraction
- support vector machine
- decision trees
- image classification
- wordnet
- named entities
- text collections
- data sets
- databases
- topic models
- artificial intelligence
- object recognition