Semi-Supervised Discriminative Language Modeling with Out-of-Domain Text Data.
Arda ÇelebiMurat SaraçlarPublished in: HLT-NAACL (2013)
Keyphrases
- language modeling
- text data
- semi supervised
- text classification
- language model
- labeled data
- information retrieval
- text mining
- n gram
- query expansion
- unlabeled data
- probabilistic model
- retrieval model
- semi supervised learning
- text documents
- high dimensional
- feature selection
- text categorization
- text clustering
- machine learning
- document collections
- structured data
- high dimensional data
- active learning
- feature extraction
- generative model
- pairwise
- dimensionality reduction
- unsupervised learning
- supervised learning
- bag of words
- document retrieval
- vector space