Provenance of exposure: Identifying sources of leaked documents.
Christian S. Collberg, Aaron Gibson, Sam Martin, Nitin Shinde, Amir Herzberg, Haya Shulman
Published in: CNS (2013)
Keyphrases
- metadata
- information retrieval
- document collections
- text documents
- web documents
- xml documents
- information sources
- document clustering
- information retrieval systems
- legal documents
- cross references
- document retrieval
- multiple sources
- document classification
- vector space model
- free text
- knowledge sources
- sensitive information
- databases
- user queries
- fine grained
- text mining
- information extraction
- keywords
- website
- multimedia
- document set
- textual content
- document content
- search engine
- database