Analogy-based Text Normalization : the case of unknowns words (Normalisation de textes par analogie: le cas des mots inconnus) [in French].
Marion BaranesBenoît SagotPublished in: TALN (1) (2014)
Keyphrases
- keywords
- text documents
- text recognition
- english words
- syntactic categories
- text databases
- n gram
- chinese text
- related words
- natural language text
- text corpora
- linguistic information
- word pairs
- multiword
- short text
- linguistic analysis
- textual features
- lexical information
- proper nouns
- lexical features
- information retrieval
- historical manuscripts
- syntactic analysis
- word level
- training corpus
- semantically related
- document analysis
- noun phrases
- case base
- text mining
- search engine