String Distances for Near-duplicate Detection.
Iulia DanailaLiviu P. DinuVlad NiculaeOctavia-Maria SuleaPublished in: Polibits (2012)
Keyphrases
- hamming distance
- pattern matching
- distance measure
- distance function
- data structure
- euclidean distance
- string matching
- edit distance
- decision trees
- dissimilarity measure
- databases
- binary strings
- distance map
- variable length
- regular expressions
- real time
- query processing
- evolutionary algorithm
- suffix array
- neural network
- geometrical properties
- string edit distance
- city block