Identification of Original Document by Using Textual Similarities.
Prasha ShresthaThamar SolorioPublished in: CICLing (2) (2015)
Keyphrases
- keywords
- document images
- information retrieval
- document structure
- free text
- multimedia
- similarity measure
- information retrieval systems
- document collections
- web documents
- structured data
- document clustering
- document retrieval
- text documents
- retrieval systems
- database
- natural language
- website
- metadata
- vector space model
- document representation
- textual information
- automatic identification
- search engine
- textual contents