Unsupervised Context-Sensitive Spelling Correction of English and Dutch Clinical Free-Text with Word and Character N-Gram Embeddings.
Pieter FivezSimon SusterWalter DaelemansPublished in: CoRR (2017)
Keyphrases
- free text
- spelling correction
- context sensitive
- character n grams
- n gram
- cross language
- language model
- language specific
- natural language
- information extraction
- cross language information retrieval
- natural language processing
- english text
- structured data
- optical character recognition
- language modeling
- machine translation
- multiword
- document retrieval
- question answering
- word sense disambiguation
- information retrieval
- supervised learning
- text retrieval
- semi supervised
- document collections
- cross lingual
- language independent
- part of speech
- machine learning
- query translation
- vector space
- text classification
- information access
- co occurrence
- word level
- test collection
- text categorization
- text mining
- language identification
- speech recognition
- metadata