Sprinkled Latent Semantic Indexing for Text Classification with Background Knowledge.
Haiqin Yang, Irwin King
Published in: ICONIP (2) (2008)
Keyphrases
- background knowledge
- latent semantic indexing
- text classification
- labeled data
- document representation
- bag of words
- text retrieval
- vector space
- singular value decomposition
- unlabeled data
- text categorization
- feature selection
- domain knowledge
- information retrieval
- logic programs
- text documents
- machine learning
- text mining
- vector space model
- semi-supervised learning
- text data
- n-gram
- semantic information
- knowledge base
- kNN
- prior knowledge
- data cleaning
- similarity search
- document collections
- contextual information
- context aware
- pairwise
- databases
- data sets