Bag-of-concepts: Comprehending document representation through clustering words in distributed representation.
Han Kyul KimHyunjoong KimSungzoon ChoPublished in: Neurocomputing (2017)
Keyphrases
- document representation
- bag of words
- document clustering
- vector representation
- text representation
- image representation
- text documents
- vector space model
- index terms
- data fusion
- document collections
- clustering algorithm
- text classification
- clustering method
- document content
- n gram
- image classification
- k means
- background knowledge
- document space
- web documents
- text mining
- semantic information
- vector space
- language model
- action recognition
- cluster analysis
- text data
- data points
- unsupervised learning
- natural language
- keywords
- metadata
- computer vision
- data mining