Similarity of Names Across Scripts: Edit Distance Using Learned Costs of N-Grams.
Bruno Pouliquen
Published in: GoTAL (2008)
Keyphrases
- edit distance
- n-gram
- similarity measure
- language model
- approximate string matching
- Levenshtein distance
- graph matching
- distance measure
- distance function
- text classification
- string matching
- language independent
- edit operations
- string similarity
- string edit distance
- longest common subsequence
- part of speech
- dissimilarity measure
- bag of words
- distance computation
- tree edit distance
- dynamic programming
- named entities
- similarity join
- triangle inequality
- keywords
- similarity function
- web documents
- neural network
- context sensitive
- computer vision
- pattern recognition
- approximate matching
- word level