Masquerading as a Trustworthy Entity through Portable Document File (PDF) Format.
Gundeep Singh BindraPublished in: SocialCom/PASSAT (2011)
Keyphrases
- information retrieval
- document images
- web documents
- entity linking
- database
- document analysis
- information retrieval systems
- retrieval systems
- cross document
- document collections
- document retrieval
- signature file
- document clustering
- named entities
- lightweight
- document classification
- electronic documents
- keywords
- xml format
- knowledge base
- search engine
- semantic information
- file system
- vector space model
- tf idf
- document representation
- image retrieval
- coreference resolution
- entity identification