Semantic Document Distance Measures and Unsupervised Document Revision Detection.
Xiaofeng ZhuDiego KlabjanPatrick N. BlessPublished in: CoRR (2017)
Keyphrases
- distance measure
- information retrieval
- information retrieval systems
- document images
- euclidean distance
- document collections
- cosine similarity
- document clustering
- distance function
- web documents
- semantic information
- keywords
- text documents
- similarity measure
- edit distance
- dynamic time warping
- user queries
- document retrieval
- proximity measures
- tf idf
- vector representation
- nearest neighbor classification
- kullback leibler
- vector space model
- vector space
- cost function
- computer vision