Computing Burrows-Wheeler Similarity Distributions for String Collections.
Felipe A. Louza, Guilherme P. Telles, Simon Gog, Liang Zhao
Published in: SPIRE (2018)
Keyphrases
- edit distance
- string similarity
- similarity measure
- data structure
- Levenshtein distance
- Hamming distance
- data sets
- random variables
- distance measure
- information retrieval
- probability distribution
- Gaussian distribution
- probability measure
- similarity join
- data collections
- structural similarity
- power law
- joint distribution
- similarity metric
- regular expressions
- semantic similarity
- distance metric
- Euclidean distance
- document collections
- databases