The MADAR Arabic Dialect Corpus and Lexicon.
Houda Bouamor, Nizar Habash, Mohammad Salameh, Wajdi Zaghouani, Owen Rambow, Dana Abdulrahim, Ossama Obeid, Salam Khalifa, Fadhl Eryani, Alexander Erdmann, Kemal Oflazer
Published in: LREC (2018)
Keyphrases
- semantic lexicon
- handwritten words
- manually annotated
- domain specific
- handwritten word recognition
- natural language
- bilingual lexicon
- open domain
- text corpus
- lexical units
- information extraction systems
- writing style
- unknown words
- machine learning
- handwriting recognition
- information retrieval
- training corpus
- coreference resolution
- test set
- text classification