Text Normalization Infrastructure that Scales to Hundreds of Language Varieties.
Mason ChuaDaan van EschNoah CoccaroEunjoon ChoSujeet BhandariLibin JiaPublished in: LREC (2018)
Keyphrases
- language generation
- english text
- text to speech synthesis
- computational linguistics
- english language
- human language
- text to speech
- text understanding
- language learning
- information retrieval
- programming language
- native language
- text generation
- web documents
- language specific
- text retrieval
- language processing
- text mining
- semantic representations
- linguistic analysis
- natural language
- semantic structure
- natural language generation
- preprocessing
- key concepts
- text documents
- information exchange
- normalization method
- lexical information
- character n grams
- keywords
- syntactic categories
- multiscale