Using Clustering and Edit Distance Techniques for Automatic Web Data Extraction.
Manuel ÁlvarezAlberto PanJuan RaposoFernando BellasFidel CachedaPublished in: WISE (2007)
Keyphrases
- edit distance
- web data extraction
- dissimilarity measure
- string similarity
- edit operations
- distance computation
- graph matching
- string matching
- data extraction
- similarity measure
- distance measure
- graph edit distance
- string edit distance
- clustering method
- levenshtein distance
- clustering algorithm
- distance function
- tree edit distance
- approximate matching
- semi structured
- hierarchical clustering
- dynamic programming
- k means
- web pages
- data points
- pattern recognition
- nearest neighbor
- finite alphabet
- neural network
- normalized edit distance