YouDACC: the Youtube Dialectal Arabic Comment Corpus.
Ahmed SalamaHouda BouamorBehrang MohitKemal OflazerPublished in: LREC (2014)
Keyphrases
- user comments
- social media
- manually annotated
- unknown words
- supervised machine learning
- coreference resolution
- spanish language
- arabic language
- user generated
- text corpora
- action recognition
- test set
- language identification
- open domain
- hidden markov models
- arabic text
- data sets
- sentence level
- co occurrence
- search engine
- neural network