Multivariate Gaussian Document Representation from Word Embeddings for Text Categorization.
Giannis NikolentzosPolykarpos MeladianosFrançois RousseauYannis StavrakasMichalis VazirgiannisPublished in: EACL (2) (2017)
Keyphrases
- document representation
- text categorization
- document frequency
- vector space
- text documents
- term frequency
- document categorization
- bag of words
- text classification
- term weighting
- vector space model
- document clustering
- language model
- n gram
- text data
- knn
- document collections
- feature selection
- web documents
- k nearest neighbor
- tf idf
- semantic information
- data fusion
- distance measure
- action recognition
- co occurrence
- similarity search
- information retrieval
- retrieval model
- unlabeled data
- background knowledge
- neural network
- dimensionality reduction
- text mining
- keywords
- bayesian networks
- similarity measure
- learning algorithm