Identifying document similarity using a fast estimation of the Levenshtein Distance based on compression and signatures.
Peter Coates
Frank Breitinger
Published in:
CoRR (2023)
Keyphrases
document similarity
graph theory
distance measure
document clustering
vector space model
document representation
cosine similarity
index terms
information retrieval
latent Dirichlet allocation