Text Classification with Document Embeddings.
Chaochao Huang, Xipeng Qiu, Xuanjing Huang
Published in: CCL (2014)
Keyphrases
- text classification
- text documents
- document classification
- text classifiers
- term frequency
- training documents
- text categorization
- automatic text classification
- document categorization
- topic discovery
- bag of words
- feature selection
- text mining
- document images
- naive bayes
- text data
- vector space
- labeled data
- classify documents
- document representation
- n gram
- document collections
- data cleaning
- information retrieval systems
- semantic features
- tf idf
- sentiment classification
- dimensionality reduction
- document retrieval
- knn
- information retrieval
- document clustering
- sentiment analysis
- data analysis
- retrieval systems
- web documents
- training data
- digital libraries
- euclidean space
- neural network