Detecting Document Versions and Their Ordering in a Collection.
Natwar ModaniAnurag MauryaGaurav VermaInderjeet NairVaidehi PatilAnirudh KanfadePublished in: WISE (2) (2021)
Keyphrases
- document collections
- information retrieval
- information retrieval systems
- document retrieval
- trec genomics
- document set
- document images
- relevant documents
- pdf files
- text collections
- document classification
- web documents
- document clustering
- vector space model
- text documents
- data sets
- semantic information
- document content
- related documents
- text corpus
- text retrieval
- database
- information extraction
- image retrieval
- keywords
- metadata
- multimedia documents
- document analysis
- partial order
- document structure
- document corpus
- text files
- machine learning
- test collection