Adaptive Edit-Distance and Regression Approach for Post-OCR Text Correction.
Thi-Tuyet-Hai NguyenMickaël CoustatyAntoine DoucetAdam JatowtNhu-Van NguyenPublished in: ICADL (2018)
Keyphrases
- edit distance
- string matching
- similarity measure
- approximate string matching
- edit operations
- graph matching
- graph edit distance
- distance measure
- levenshtein distance
- approximate matching
- distance function
- string edit distance
- string similarity
- subgraph isomorphism
- pattern matching
- character recognition
- optical character recognition
- document analysis
- tree edit distance
- keywords
- normalized edit distance
- adjacency matrix
- distance computation
- dissimilarity measure
- document images
- information retrieval
- similarity join
- suffix array
- text documents
- text mining
- dynamic programming
- finite alphabet
- feature vectors
- machine learning