Identifying Repeated Sections within Documents.
Girish K. Palshikar, Sachin Pawar, Rajiv Srivastava, Mahek Shah
Published in: Computación y Sistemas (2019)
Keyphrases
- information retrieval
- document collections
- document retrieval
- web documents
- XML documents
- information retrieval systems
- relevant documents
- legal documents
- metadata
- XML format
- keywords
- document clustering
- free text
- document classification
- vector space model
- retrieval systems
- multi-document summarization
- text analysis
- text documents
- structured documents
- retrieved documents
- multimedia documents
- highly relevant
- plagiarism detection
- information extraction
- machine learning