Unsupervised Creation of Normalization Dictionaries for Micro-Blogs in Arabic, French and English.
Amal HtaitSébastien FournierPatrice BellotPublished in: Computación y Sistemas (2018)
Keyphrases
- arabic language
- language identification
- language resources
- grammar induction
- foreign language
- bilingual dictionaries
- social media
- machine translation
- sparse representation
- natural language
- unknown words
- unsupervised learning
- social networks
- english language
- mono lingual
- machine readable dictionaries
- supervised learning
- semi supervised
- preprocessing
- arabic documents
- monolingual retrieval
- pos tagging
- cross language
- language processing
- arabic text
- morphological analysis
- keywords
- word forms
- information retrieval
- multilingual retrieval
- pos taggers
- cross language retrieval
- multiword
- answer questions
- word segmentation
- cross language information retrieval
- character recognition
- language learning
- feature space