Text Classification Research Based on Improved Word2vec and CNN.
Mengyuan Gao, Tinghui Li, Peifang Huang
Published in: ICSOC Workshops (2018)
Keyphrases
- text classification
- n-gram
- training corpus
- term frequency
- bag of words
- text mining
- machine learning
- semantic features
- word segmentation
- naive bayes
- text categorization
- feature selection
- knn
- cellular neural networks
- probabilistic model
- text data
- data cleaning
- distributional clustering
- language modeling
- text documents
- multi-label
- data analysis
- information retrieval