Constructing informative prior distributions from domain knowledge in text classification.
Aynur A. DayanikDavid D. LewisDavid MadiganVladimir MenkovAlexander GenkinPublished in: SIGIR (2006)
Keyphrases
- text classification
- domain knowledge
- text categorization
- bag of words
- machine learning
- feature selection
- labeled data
- sentiment analysis
- text documents
- domain experts
- naive bayes
- text mining
- text data
- data cleaning
- text classifiers
- knn
- semantic features
- n gram
- data sets
- document classification
- multi label
- language model
- information content
- knowledge sources
- databases
- domain ontology
- text classification tasks
- background knowledge
- prior knowledge
- prior domain knowledge
- data integration
- image classification
- active learning
- artificial intelligence
- neural network