A Dataset of Hindi-English Code-Mixed Social Media Text for Hate Speech Detection.
Aditya Bohra, Deepanshu Vijay, Vinay Singh, Syed Sarfaraz Akhtar, Manish Shrivastava
Published in: PEOPLES@NAACL-HTL (2018)
Keyphrases
- english text
- text to speech
- social media
- language identification
- proper names
- spoken language
- indian languages
- text recognition
- machine translation
- text to speech synthesis
- optical character recognition
- english language
- multi lingual
- natural language generation
- speaker identification
- cross lingual
- speech recognition
- object detection
- statistical machine translation
- broad coverage
- broadcast news
- conversational speech
- lexical features
- natural language
- named entity recognizer
- document images
- linguistic analysis
- text input
- information extraction
- keywords
- social networks
- source code
- word level
- document analysis
- noun phrases
- cross language
- named entity recognition
- text retrieval
- source language
- reading comprehension
- machine translation system
- information retrieval
- social media data
- text documents
- target language
- language learning