Towards a private vector space model for confidential documents.
Daniel Abril, Guillermo Navarro-Arribas, Vicenç Torra
Published in: SAC (2013)
Keyphrases
- vector space model
- information retrieval
- document representation
- index terms
- web documents
- document clustering
- vector space
- retrieval model
- language model
- cosine similarity
- semantic information
- tf-idf
- semantic similarity
- document categorization
- latent semantic indexing
- document similarity
- document space
- information retrieval systems
- text representation
- data mining
- relevance model
- retrieval systems
- relevant documents
- database
- test collection
- privacy preserving
- document collections
- text mining
- information extraction
- knowledge discovery
- digital libraries
- machine learning
- databases