Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System.
Ankush KhandelwalSahil SwamiSyed Sarfaraz AkhtarManish ShrivastavaPublished in: LREC (2018)
Keyphrases
- statistical machine translation
- machine translation
- language identification
- link grammar
- social media content
- proper names
- person names
- security informatics
- parallel corpus
- broad coverage
- open domain
- comparable corpora
- cross lingual
- social media
- natural language
- spoken language
- parallel corpora
- noun phrases
- penn treebank
- machine learning
- multiword
- indian languages
- english words
- contextual features
- query translation
- user generated content