Protecting Sensitive Topics in Text Documents with PROTEXTOR.
Chad M. CumbyPublished in: ECML/PKDD (2) (2009)
Keyphrases
- text documents
- topic models
- text mining
- text categorization
- information extraction
- text classification
- keywords
- tf idf
- text data
- news articles
- document classification
- text collections
- wordnet
- document clustering
- text analysis
- topic modeling
- latent topics
- latent dirichlet allocation
- bag of words
- named entities
- text corpora
- term frequency
- probabilistic model
- data sets
- automatic text categorization
- co occurrence
- feature selection
- natural language processing
- object recognition
- search engine