MANorm: A Normalization Dictionary for Moroccan Arabic Dialect Written in Latin Script.
Randa Zarnoufi, Walid Bachri, Hamid Jaafar, Mounia Abik
Published in: CoRR (2022)
Keyphrases
- language identification
- Arabic documents
- optical character recognition
- word spotting
- document images
- sparse representation
- speaker identification
- normalization method
- document level
- handwritten documents
- real time
- natural language processing
- preprocessing
- Indian languages
- genetic algorithm
- handwriting recognition
- natural images
- character n-grams
- word forms
- machine learning