Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance.
Siim OrasmaaReina KäärikJaak ViloTiit HennostePublished in: LREC (2010)
Keyphrases
- edit distance
- spoken language
- information retrieval
- string edit distance
- string matching
- language processing
- edit operations
- linguistic knowledge
- graph matching
- similarity measure
- string similarity
- approximate string matching
- natural language processing
- dialogue system
- search engine
- distance measure
- approximate matching
- levenshtein distance
- distance function
- co occurrence
- information retrieval systems
- information extraction
- pattern matching
- natural language
- retrieval model
- language modeling
- semantic analysis
- point sets
- n gram