Text Classification Using Label Names Only: A Language Model Self-Training Approach.
Yu Meng, Yunyi Zhang, Jiaxin Huang, Chenyan Xiong, Heng Ji, Chao Zhang, Jiawei Han
Published in: CoRR (2020)
Keyphrases
- language model
- text classification
- co-training
- multi-label
- language modeling
- n-gram
- text categorization
- document retrieval
- semi-supervised learning
- bag of words
- labeled data
- text mining
- retrieval model
- probabilistic model
- feature selection
- speech recognition
- language modelling
- naive Bayes
- cost-sensitive
- query expansion
- information retrieval
- class labels
- statistical language modeling
- unlabeled data
- cross-lingual
- statistical language models
- machine learning
- kNN
- text classifiers
- label propagation
- text documents
- test collection
- context-sensitive
- ad hoc information retrieval
- named entities
- mixture model
- query terms
- smoothing methods
- data mining
- semantic features
- keywords
- vector space model
- training set
- relevance model
- statistical machine translation
- query-specific
- image annotation