Word Vector Enrichment of Low Frequency Words in the Bag-of-Words Model for Short Text Multi-class Classification Problems.
Bradford Heap, Michael Bain, Wayne Wobcke, Alfred Krzywicki, Susanne Schmeidl
Published in: CoRR (2017)
Keyphrases
- short text
- low frequency
- high frequency
- latent topics
- short texts
- short text classification
- frequency domain
- wavelet transform
- topic models
- query words
- n-gram
- subband
- topic detection
- co-occurrence
- multi-class
- feature vectors
- wavelet coefficients
- latent Dirichlet allocation
- topic modeling
- error correcting output codes
- text documents
- multiresolution
- high resolution
- keywords
- bag of words
- text classification
- prior knowledge
- machine learning
- WordNet
- text categorization