Semantic Document Distance Measures and Unsupervised Document Revision Detection.
Xiaofeng ZhuDiego KlabjanPatrick N. BlessPublished in: IJCNLP(1) (2017)
Keyphrases
- distance measure
- semantic information
- web documents
- document retrieval
- cosine similarity
- document images
- information retrieval systems
- text documents
- document collections
- keywords
- supervised learning
- high dimensional
- euclidean distance
- image sequences
- information retrieval
- semantic similarity
- document clustering
- kullback leibler
- document representation
- vector space
- relevant documents
- co occurrence
- text mining
- semi supervised
- similarity measure
- social networks