Lexical Normalization of Spanish Tweets with Preprocessing Rules, Domain-specific Edit Distances, and Language Models.
Pablo Ruiz FaboMontse CuadrosThierry EtchegoyhenPublished in: Tweet-Norm@SEPLN (2013)
Keyphrases
- language model
- domain specific
- preprocessing
- edit distance
- language modeling
- context sensitive
- n gram
- probabilistic model
- document retrieval
- speech recognition
- information retrieval
- language modelling
- statistical language models
- test collection
- graph matching
- query expansion
- retrieval model
- vector space model
- similarity measure
- smoothing methods
- pseudo relevance feedback
- document ranking
- relevance model
- language models for information retrieval
- feature extraction
- question answering
- passage retrieval
- distance measure
- natural language processing
- named entities
- graph kernels
- distance function
- pattern recognition