Word Embedding Based Extension of Text Categorization Topic Taxonomies.
Tobias Eljasik-Swoboda, Felix Engel, Michael Kaufmann, Matthias L. Hemmje
Published in: CERC (2019)
Keyphrases
- text categorization
- word frequency
- term frequency
- term weighting
- distributional clustering
- text classification
- document frequency
- feature selection
- multi label
- knn
- text documents
- k nearest neighbor
- information gain
- document classification
- automatic text categorization
- document set
- n gram
- text classifiers
- vector space
- automated text categorization
- word sense disambiguation
- information theoretic
- tf idf
- reuters corpus
- text collections
- bag of words
- topic models
- automatic summarization
- semi supervised learning
- labeled data
- learning algorithm
- information retrieval
- data sets