An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance.
Edward RaffCharles K. NicholasPublished in: KDD (2017)
Keyphrases
- lempel ziv
- data compression
- compression scheme
- approximate string matching
- edit distance
- lossless compression
- source coding
- distance measure
- similarity measure
- suffix tree
- distance function
- euclidean distance
- n gram
- arithmetic coding
- information theoretic
- compression algorithm
- similarity metric
- quadtree
- channel coding
- suffix array
- compression ratio
- information extraction
- multiresolution