Bag-of-Words, Bag-of-Topics and Word-to-Vec Based Subject Classification of Text Documents in Polish - A Comparative Study.
Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
Published in: DepCoS-RELCOMEX (2018)
Keyphrases
- bag of words
- text documents
- text classification
- image classification
- document classification
- n-gram
- latent topics
- term frequency
- visual words
- document representation
- image representation
- text data
- text categorization
- tf-idf
- text mining
- action recognition
- feature selection
- news articles
- text collections
- keywords
- machine learning
- document clustering
- feature extraction
- text classifiers
- natural language text
- text representation
- sentiment analysis
- labeled data
- co-occurrence
- supervised learning
- support vector machine
- knn
- decision trees
- image features
- topic modeling
- training set
- data analysis
- topic models
- wordnet
- unsupervised learning
- language model
- multiscale
- computer vision